Insights into the phylogeny and coding potential of microbial dark matterGenome sequencing enhances our understanding of the biological world by providing blueprints for the evolutionary and functional diversity that shapes the biosphere. However, microbial genomes that are currently available are of limited phylogenetic breadth, owing to our historical inability to cultivate most microorganisms in the laboratory. We apply single-cell genomics to target and sequence 201 uncultivated archaeal and bacterial cells from nine diverse habitats belonging to 29 major mostly uncharted branches of the tree of life, so-called ‘microbial dark matter’. With this additional genomic information, we are able to resolve many intra- and inter-phylum-level relationships and to propose two new superphyla. We uncover unexpected metabolic features that extend our understanding of biology and challenge established boundaries between the three domains of life. These include a novel amino acid use for the opal stop codon, an archaeal-type purine synthesis in Bacteria and complete sigma factors in Archaea similar to those in Bacteria. The single-cell genomes also served to phylogenetically anchor up to 20% of metagenomic reads in some habitats, facilitating organism-level interpretation of ecosystem function. This study greatly expands the genomic representation of the tree of life and provides a systematic step towards a better understanding of biological evolution on our planet. Uncultivated archaeal and bacterial cells of major uncharted branches of the tree of life are targeted and sequenced using single-cell genomics; this enables resolution of many intra- and inter-phylum-level relationships, uncovers unexpected metabolic features that challenge established boundaries between the three domains of life, and leads to the proposal of two new superphyla. Currently available genome sequences give us a narrow view of the remarkable diversity of microorganisms because the vast majority of them have never been cultivated in pure culture. Here Tanja Woyke and colleagues use single-cell genomics to target and sequence 201 uncultivated archaeal and bacterial cells from nine diverse habitats. This information reveals numerous intra- and inter-phylum relationships and a number of unexpected metabolic features. On the basis of the new data the authors propose taxonomic revisions to the archaeal and bacterial domains, including a proposal to reorganizing the Archaea into three superphyla.
Prevalent genome streamlining and latitudinal divergence of planktonic bacteria in the surface oceanBrandon K. Swan, Ben Tupper, Alexander Sczyrba et al.|Proceedings of the National Academy of Sciences|2013 Planktonic bacteria dominate surface ocean biomass and influence global biogeochemical processes, but remain poorly characterized owing to difficulties in cultivation. Using large-scale single cell genomics, we obtained insight into the genome content and biogeography of many bacterial lineages inhabiting the surface ocean. We found that, compared with existing cultures, natural bacterioplankton have smaller genomes, fewer gene duplications, and are depleted in guanine and cytosine, noncoding nucleotides, and genes encoding transcription, signal transduction, and noncytoplasmic proteins. These findings provide strong evidence that genome streamlining and oligotrophy are prevalent features among diverse, free-living bacterioplankton, whereas existing laboratory cultures consist primarily of copiotrophs. The apparent ubiquity of metabolic specialization and mixotrophy, as predicted from single cell genomes, also may contribute to the difficulty in bacterioplankton cultivation. Using metagenome fragment recruitment against single cell genomes, we show that the global distribution of surface ocean bacterioplankton correlates with temperature and latitude and is not limited by dispersal at the time scales required for nucleotide substitution to exceed the current operational definition of bacterial species. Single cell genomes with highly similar small subunit rRNA gene sequences exhibited significant genomic and biogeographic variability, highlighting challenges in the interpretation of individual gene surveys and metagenome assemblies in environmental microbiology. Our study demonstrates the utility of single cell genomics for gaining an improved understanding of the composition and dynamics of natural microbial assemblages.
UGA is an additional glycine codon in uncultured SR1 bacteria from the human microbiotaJames H. Campbell, Patrick O’Donoghue, Alisha G. Campbell et al.|Proceedings of the National Academy of Sciences|2013 The composition of the human microbiota is recognized as an important factor in human health and disease. Many of our cohabitating microbes belong to phylum-level divisions for which there are no cultivated representatives and are only represented by small subunit rRNA sequences. For one such taxon (SR1), which includes bacteria with elevated abundance in periodontitis, we provide a single-cell genome sequence from a healthy oral sample. SR1 bacteria use a unique genetic code. In-frame TGA (opal) codons are found in most genes (85%), often at loci normally encoding conserved glycine residues. UGA appears not to function as a stop codon and is in equilibrium with the canonical GGN glycine codons, displaying strain-specific variation across the human population. SR1 encodes a divergent tRNA(Gly)UCA with an opal-decoding anticodon. SR1 glycyl-tRNA synthetase acylates tRNA(Gly)UCA with glycine in vitro with similar activity compared with normal tRNA(Gly)UCC. Coexpression of SR1 glycyl-tRNA synthetase and tRNA(Gly)UCA in Escherichia coli yields significant β-galactosidase activity in vivo from a lacZ gene containing an in-frame TGA codon. Comparative genomic analysis with Human Microbiome Project data revealed that the human body harbors a striking diversity of SR1 bacteria. This is a surprising finding because SR1 is most closely related to bacteria that live in anoxic and thermal environments. Some of these bacteria share common genetic and metabolic features with SR1, including UGA to glycine reassignment and an archaeal-type ribulose-1,5-bisphosphate carboxylase (RubisCO) involved in AMP recycling. UGA codon reassignment renders SR1 genes untranslatable by other bacteria, which impacts horizontal gene transfer within the human microbiota.
Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populationsMultiple models describe the formation and evolution of distinct microbial phylogenetic groups. These evolutionary models make different predictions regarding how adaptive alleles spread through populations and how genetic diversity is maintained. Processes predicted by competing evolutionary models, for example, genome-wide selective sweeps vs gene-specific sweeps, could be captured in natural populations using time-series metagenomics if the approach were applied over a sufficiently long time frame. Direct observations of either process would help resolve how distinct microbial groups evolve. Here, from a 9-year metagenomic study of a freshwater lake (2005-2013), we explore changes in single-nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in 30 bacterial populations. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied by >1000-fold among populations. SNP allele frequencies also changed dramatically over time within some populations. Interestingly, nearly all SNP variants were slowly purged over several years from one population of green sulfur bacteria, while at the same time multiple genes either swept through or were lost from this population. These patterns were consistent with a genome-wide selective sweep in progress, a process predicted by the 'ecotype model' of speciation but not previously observed in nature. In contrast, other populations contained large, SNP-free genomic regions that appear to have swept independently through the populations prior to the study without purging diversity elsewhere in the genome. Evidence for both genome-wide and gene-specific sweeps suggests that different models of bacterial speciation may apply to different populations coexisting in the same environment.
Ecology and evolution of viruses infecting uncultivated SUP05 bacteria as revealed by single-cell- and meta-genomicsViruses modulate microbial communities and alter ecosystem functions. However, due to cultivation bottlenecks, specific virus-host interaction dynamics remain cryptic. In this study, we examined 127 single-cell amplified genomes (SAGs) from uncultivated SUP05 bacteria isolated from a model marine oxygen minimum zone (OMZ) to identify 69 viral contigs representing five new genera within dsDNA Caudovirales and ssDNA Microviridae. Infection frequencies suggest that ∼1/3 of SUP05 bacteria is viral-infected, with higher infection frequency where oxygen-deficiency was most severe. Observed Microviridae clonality suggests recovery of bloom-terminating viruses, while systematic co-infection between dsDNA and ssDNA viruses posits previously unrecognized cooperation modes. Analyses of 186 microbial and viral metagenomes revealed that SUP05 viruses persisted for years, but remained endemic to the OMZ. Finally, identification of virus-encoded dissimilatory sulfite reductase suggests SUP05 viruses reprogram their host's energy metabolism. Together, these results demonstrate closely coupled SUP05 virus-host co-evolutionary dynamics with the potential to modulate biogeochemical cycling in climate-critical and expanding OMZs.