Rfam 15: RNA families database in 2025

Nancy Ontiveros‐Palacios(European Bioinformatics Institute), Emma J. Cooke(Innovate UK), Eric P. Nawrocki(National Institutes of Health), Sandra Triebel(Friedrich Schiller University Jena), Manja Marz(Friedrich Schiller University Jena), Elena Rivas(Harvard University), Sam Griffiths‐Jones(University of Manchester), Anton I. Petrov, Alex Bateman(European Bioinformatics Institute), Blake Sweeney(European Bioinformatics Institute)
Nucleic Acids Research
November 11, 2024
Cited by 151Open Access
Full Text

Abstract

The Rfam database, a widely used repository of non-coding RNA families, has undergone significant updates in release 15.0. This paper introduces major improvements, including the expansion of Rfamseq to 26 106 genomes, a 76% increase, incorporating the latest UniProt reference proteomes and additional viral genomes. Sixty-five RNA families were enhanced using experimentally determined 3D structures, improving the accuracy of consensus secondary structures and annotations. R-scape covariation analysis was used to refine structural predictions in 26 families. Gene Ontology (GO) and Sequence Ontology annotations were comprehensively updated, increasing GO term coverage to 75% of families. The release adds 14 new Hepatitis C Virus RNA families and completes microRNA family synchronization with miRBase, resulting in 1603 microRNA families. New data types, including FULL alignments, have been implemented. Integration with APICURON for improved curator attribution and multiple website enhancements further improve user experience. These updates significantly expand Rfam's coverage and improve annotation quality, reinforcing its critical role in RNA research, genome annotation and the development of machine learning models. Rfam is freely available at https://rfam.org.


Related Papers

No related papers found

Powered by citation graph analysis