The Protist Ribosomal Reference database (PR2): a catalog of unicellular eukaryote Small Sub-Unit rRNA sequences with curated taxonomy

Laure Guillou(Centre National de la Recherche Scientifique), Dipankar Bachar(Centre National de la Recherche Scientifique), Stéphane Audic(Centre National de la Recherche Scientifique), David Bass(Natural History Museum), Cédric Berney(Natural History Museum), Lucie Bittner(Centre National de la Recherche Scientifique), Christophe Boutte(Centre National de la Recherche Scientifique), Gaëtan Burgaud(Laboratoire de Biodiversité et Biotechnologies Microbiennes), Colomban de Vargas(Centre National de la Recherche Scientifique), Johan Decelle(Centre National de la Recherche Scientifique), Javier del Campo(Institut Català de Ciències del Clima), John R. Dolan(Centre National de la Recherche Scientifique), Micah Dunthorn(University of Kaiserslautern), Bente Edvardsen(OsloMet – Oslo Metropolitan University), Maria Holzmann(University of Geneva), Wiebe H. C. F. Kooistra(Stazione Zoologica Anton Dohrn), Enrique Lara(University of Neuchâtel), Noan Le Bescot(Centre National de la Recherche Scientifique), Ramiro Logares(Institut Català de Ciències del Clima), Frédéric Mahé(Adaptation et Diversité en Milieu Marin), Ramón Massana(Institut Català de Ciències del Clima), Marina Montresor(Stazione Zoologica Anton Dohrn), Raphaël Morard(Université Claude Bernard Lyon 1), Fabrice Not(Centre National de la Recherche Scientifique), Jan Pawłowski(University of Geneva), Ian Probert(Centre National de la Recherche Scientifique), Anne-Laure Sauvadet(Centre National de la Recherche Scientifique), Raffaele Siano(Ifremer), Thorsten Stoeck(University of Kaiserslautern), Daniel Vaulot(Centre National de la Recherche Scientifique), Pascal Zimmermann(Département d'Informatique), Richard Christen(Centre National de la Recherche Scientifique)
Nucleic Acids Research
November 26, 2012
Cited by 2,368Open Access
Full Text

Abstract

The interrogation of genetic markers in environmental meta-barcoding studies is currently seriously hindered by the lack of taxonomically curated reference data sets for the targeted genes. The Protist Ribosomal Reference database (PR(2), http://ssu-rrna.org/) provides a unique access to eukaryotic small sub-unit (SSU) ribosomal RNA and DNA sequences, with curated taxonomy. The database mainly consists of nuclear-encoded protistan sequences. However, metazoans, land plants, macrosporic fungi and eukaryotic organelles (mitochondrion, plastid and others) are also included because they are useful for the analysis of high-troughput sequencing data sets. Introns and putative chimeric sequences have been also carefully checked. Taxonomic assignation of sequences consists of eight unique taxonomic fields. In total, 136 866 sequences are nuclear encoded, 45 708 (36 501 mitochondrial and 9657 chloroplastic) are from organelles, the remaining being putative chimeric sequences. The website allows the users to download sequences from the entire and partial databases (including representative sequences after clustering at a given level of similarity). Different web tools also allow searches by sequence similarity. The presence of both rRNA and rDNA sequences, taking into account introns (crucial for eukaryotic sequences), a normalized eight terms ranked-taxonomy and updates of new GenBank releases were made possible by a long-term collaboration between experts in taxonomy and computer scientists.


Related Papers

No related papers found

Powered by citation graph analysis