Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaeaWe present two standards developed by the Genomic Standards Consortium (GSC) for reporting bacterial and archaeal genome sequences. Both are extensions of the Minimum Information about Any (x) Sequence (MIxS). The standards are the Minimum Information about a Single Amplified Genome (MISAG) and the Minimum Information about a Metagenome-Assembled Genome (MIMAG), including, but not limited to, assembly quality, and estimates of genome completeness and contamination. These standards can be used in combination with other GSC checklists, including the Minimum Information about a Genome Sequence (MIGS), Minimum Information about a Metagenomic Sequence (MIMS), and Minimum Information about a Marker Gene Sequence (MIMARKS). Community-wide adoption of MISAG and MIMAG will facilitate more robust comparative genomic analyses of bacterial and archaeal diversity.
Insights into the phylogeny and coding potential of microbial dark matterGenome sequencing enhances our understanding of the biological world by providing blueprints for the evolutionary and functional diversity that shapes the biosphere. However, microbial genomes that are currently available are of limited phylogenetic breadth, owing to our historical inability to cultivate most microorganisms in the laboratory. We apply single-cell genomics to target and sequence 201 uncultivated archaeal and bacterial cells from nine diverse habitats belonging to 29 major mostly uncharted branches of the tree of life, so-called ‘microbial dark matter’. With this additional genomic information, we are able to resolve many intra- and inter-phylum-level relationships and to propose two new superphyla. We uncover unexpected metabolic features that extend our understanding of biology and challenge established boundaries between the three domains of life. These include a novel amino acid use for the opal stop codon, an archaeal-type purine synthesis in Bacteria and complete sigma factors in Archaea similar to those in Bacteria. The single-cell genomes also served to phylogenetically anchor up to 20% of metagenomic reads in some habitats, facilitating organism-level interpretation of ecosystem function. This study greatly expands the genomic representation of the tree of life and provides a systematic step towards a better understanding of biological evolution on our planet. Uncultivated archaeal and bacterial cells of major uncharted branches of the tree of life are targeted and sequenced using single-cell genomics; this enables resolution of many intra- and inter-phylum-level relationships, uncovers unexpected metabolic features that challenge established boundaries between the three domains of life, and leads to the proposal of two new superphyla. Currently available genome sequences give us a narrow view of the remarkable diversity of microorganisms because the vast majority of them have never been cultivated in pure culture. Here Tanja Woyke and colleagues use single-cell genomics to target and sequence 201 uncultivated archaeal and bacterial cells from nine diverse habitats. This information reveals numerous intra- and inter-phylum relationships and a number of unexpected metabolic features. On the basis of the new data the authors propose taxonomic revisions to the archaeal and bacterial domains, including a proposal to reorganizing the Archaea into three superphyla.
IMG: the integrated microbial genomes database and comparative analysis systemThe Integrated Microbial Genomes (IMG) system serves as a community resource for comparative analysis of publicly available genomes in a comprehensive integrated context. IMG integrates publicly available draft and complete genomes from all three domains of life with a large number of plasmids and viruses. IMG provides tools and viewers for analyzing and reviewing the annotations of genes and genomes in a comparative context. IMG's data content and analytical capabilities have been continuously extended through regular updates since its first release in March 2005. IMG is available at http://img.jgi.doe.gov. Companion IMG systems provide support for expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er), teaching courses and training in microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu) and analysis of genomes related to the Human Microbiome Project (IMG/HMP: http://www.hmpdacc-resources.org/img_hmp).
Uncovering Earth’s viromeA phylogeny-driven genomic encyclopaedia of Bacteria and ArchaeaThe bacterial and archaeal genomes that have been sequenced to date were chosen for sequencing based mainly on their physiology, which is fine but has resulted in a distinct phylogenetic bias. An alternative approach has been taken in the Genomic Encyclopedia of Bacteria and Archaea (GEBA) project, which advocates choosing genomes based on the organism's phylogenetic position, with the aim filling in the gaps in sequencing along on bacterial and archaeal branches of the tree of life. The value of this approach has been demonstrated by a pilot study of the genome sequences of 56 culturable species selected to maximize phylogenetic coverage. Analysis of the sequences provides insights into phylogenetics, protein function and genome annotation. There are now nearly 1,000 completed bacterial and archaeal genomes available, but as most of them were chosen for sequencing on the basis of their physiology, the data are limited by a highly biased phylogenetic distribution. To explore the value added by choosing microbial genomes for sequencing on the basis of their evolutionary relationships, the genomes of 56 species of Bacteria and Archaea selected to maximize phylogenetic coverage are now sequenced and analysed. Sequencing of bacterial and archaeal genomes has revolutionized our understanding of the many roles played by microorganisms1. There are now nearly 1,000 completed bacterial and archaeal genomes available2, most of which were chosen for sequencing on the basis of their physiology. As a result, the perspective provided by the currently available genomes is limited by a highly biased phylogenetic distribution3,4,5. To explore the value added by choosing microbial genomes for sequencing on the basis of their evolutionary relationships, we have sequenced and analysed the genomes of 56 culturable species of Bacteria and Archaea selected to maximize phylogenetic coverage. Analysis of these genomes demonstrated pronounced benefits (compared to an equivalent set of genomes randomly selected from the existing database) in diverse areas including the reconstruction of phylogenetic history, the discovery of new protein families and biological properties, and the prediction of functions for known genes from other organisms. Our results strongly support the need for systematic ‘phylogenomic’ efforts to compile a phylogeny-driven ‘Genomic Encyclopedia of Bacteria and Archaea’ in order to derive maximum knowledge from existing microbial genome data as well as from genome sequences to come.