Standardized annotation of translated open reading frames

Jonathan M. Mudge; Jorge Ruiz‐Orera; John R. Prensner; Marie A. Brunet; Ferriol Calvet; Irwin Jungreis; José M. González; Michele Magrane; Thomas F. Martínez; Jana Felicitas Schulz; Yucheng Yang; M. Mar Albà; Julie L. Aspden; Pavel V. Baranov; Ariel Bazzini; Elspeth A. Bruford; María Martin; Lorenzo Calviello; Anne‐Ruxandra Carvunis; Jin Chen; Juan Pablo Couso; Eric W. Deutsch; Paul Flicek; Adam Frankish; Mark Gerstein; Norbert Hübner; Nicholas T. Ingolia; Manolis Kellis; Gerben Menschaert; Robert L. Moritz; Uwe Ohler; Xavier Roucou; Alan Saghatelian; Jonathan S. Weissman; Sebastiaan van Heesch

doi:10.1038/s41587-022-01369-0

Standardized annotation of translated open reading frames

Jonathan M. Mudge(European Bioinformatics Institute), Jorge Ruiz‐Orera(Max Delbrück Center), John R. Prensner(Broad Institute), Marie A. Brunet(Université de Sherbrooke), Ferriol Calvet(European Bioinformatics Institute), Irwin Jungreis(Broad Institute), José M. González(European Bioinformatics Institute), Michele Magrane(European Bioinformatics Institute), Thomas F. Martínez(Salk Institute for Biological Studies), Jana Felicitas Schulz(Max Delbrück Center), Yucheng Yang(Yale University), M. Mar Albà(Institució Catalana de Recerca i Estudis Avançats), Julie L. Aspden(University of Leeds), Pavel V. Baranov(University College Cork), Ariel Bazzini(Stowers Institute for Medical Research), Elspeth A. Bruford(European Bioinformatics Institute), María Martin(European Bioinformatics Institute), Lorenzo Calviello(Human Technopole), Anne‐Ruxandra Carvunis(University of Pittsburgh), Jin Chen(The University of Texas Southwestern Medical Center), Juan Pablo Couso(Centro Andaluz de Biología del Desarrollo), Eric W. Deutsch(Institute for Systems Biology), Paul Flicek(European Bioinformatics Institute), Adam Frankish(European Bioinformatics Institute), Mark Gerstein(Yale University), Norbert Hübner(Max Delbrück Center), Nicholas T. Ingolia(QB3), Manolis Kellis(Broad Institute), Gerben Menschaert(Ghent University), Robert L. Moritz(Institute for Systems Biology), Uwe Ohler(Max Delbrück Center), Xavier Roucou(Université de Sherbrooke), Alan Saghatelian(Salk Institute for Biological Studies), Jonathan S. Weissman(Howard Hughes Medical Institute), Sebastiaan van Heesch(Princess Máxima Center)

Nature Biotechnology

July 1, 2022

10.1038/s41587-022-01369-0

Cited by 246Open Access

Full Text

Abstract

Ribosome profiling (Ribo-seq) has extended our understanding of the translational ‘vocabulary’ of the human genome, uncovering thousands of open reading frames (ORFs) within long noncoding RNAs (lncRNAs) and presumed untranslated regions (UTRs) of protein-coding genes. However, reference gene annotation projects have been circumspect in their incorporation of these ORFs because of uncertainties about their experimental reproducibility and physiological roles. Yet, it is clear that certain ‘Ribo-seq ORFs’ make stable proteins, others mediate gene regulation, and many have medical implications. Ultimately, the absence of standardized ORF annotation has created a circular problem: while Ribo-seq ORFs remain unrecognized by reference annotation databases, this lack of recognition will thwart studies examining their roles. Here, we outline a community-led effort involving Ensembl/GENCODE, the HUGO Gene Nomenclature Committee (HGNC), UniProtKB, HUPO/HPP and PeptideAtlas to produce a standardized catalog of 7,264 human Ribo-seq ORFs; a path to bring protein-level evidence for Ribo-seq ORFs into reference annotation databases; and a roadmap to facilitate research in the global community.

Related Papers

No related papers found

Powered by citation graph analysis