A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes

Bas E. Dutilh(Radboud University Medical Center), Noriko A. Cassman(San Diego State University), Katelyn McNair(San Diego State University), Savannah E. Sanchez(San Diego State University), Amaro E. Trindade‐Silva(San Diego State University), Lance Boling(San Diego State University), Jeremy J. Barr(San Diego State University), Daan R. Speth(Radboud University Nijmegen), Victor Seguritan(San Diego State University), Ramy K. Aziz(San Diego State University), Ben Felts(San Diego State University), Elizabeth A. Dinsdale(San Diego State University), John L. Mokili(San Diego State University), Robert A. Edwards(Universidade Federal do Rio de Janeiro)
Nature Communications
July 24, 2014
Cited by 863Open Access
Full Text

Abstract

Metagenomics, or sequencing of the genetic material from a complete microbial community, is a promising tool to discover novel microbes and viruses. Viral metagenomes typically contain many unknown sequences. Here we describe the discovery of a previously unidentified bacteriophage present in the majority of published human faecal metagenomes, which we refer to as crAssphage. Its ~97 kbp genome is six times more abundant in publicly available metagenomes than all other known phages together; it comprises up to 90% and 22% of all reads in virus-like particle (VLP)-derived metagenomes and total community metagenomes, respectively; and it totals 1.68% of all human faecal metagenomic sequencing reads in the public databases. The majority of crAssphage-encoded proteins match no known sequences in the database, which is why it was not detected before. Using a new co-occurrence profiling approach, we predict a Bacteroides host for this phage, consistent with Bacteroides-related protein homologues and a unique carbohydrate-binding domain encoded in the phage genome. Metagenomic studies of microbial communities often report DNA sequences from unidentified viruses. Here, Dutilh et al. analyse metagenomic data to reveal the complete genome of an abundant, ubiquitous virus from human faeces, and predict that the virus infects bacteria of the Bacteroides group.


Related Papers

No related papers found

Powered by citation graph analysis