JASPAR 2024: 20th anniversary of the open-access database of transcription factor binding profiles

Ieva Rauluševičiūtė(University of Oslo), Rafael Riudavets Puig(University of Oslo), Romain Blanc‐Mathieu(Centre National de la Recherche Scientifique), Jaime A. Castro-Mondragón(University of Oslo), Katalin Ferenc(University of Oslo), Vipin Kumar(University of Oslo), Roza Berhanu Lemma(University of Oslo), Jérémy Lucas(Centre National de la Recherche Scientifique), Jeanne Chèneby(University of Oslo), Damir Baranas̆ić(Hammersmith Hospital), Aziz Khan(University of Oslo), Oriol Fornés(University of British Columbia), Sveinung Gundersen(University of Oslo), Morten Johansen(University of Oslo), Eivind Hovig(Oslo University Hospital), Boris Lenhard(Hammersmith Hospital), Albin Sandelin(University of Copenhagen), Wyeth W. Wasserman(University of British Columbia), François Parcy(Centre National de la Recherche Scientifique), Anthony Mathelier(Oslo University Hospital)
Nucleic Acids Research
November 14, 2023
Cited by 1,008Open Access
Full Text

Abstract

JASPAR (https://jaspar.elixir.no/) is a widely-used open-access database presenting manually curated high-quality and non-redundant DNA-binding profiles for transcription factors (TFs) across taxa. In this 10th release and 20th-anniversary update, the CORE collection has expanded with 329 new profiles. We updated three existing profiles and provided orthogonal support for 72 profiles from the previous release's UNVALIDATED collection. Altogether, the JASPAR 2024 update provides a 20% increase in CORE profiles from the previous release. A trimming algorithm enhanced profiles by removing low information content flanking base pairs, which were likely uninformative (within the capacity of the PFM models) for TFBS predictions and modelling TF-DNA interactions. This release includes enhanced metadata, featuring a refined classification for plant TFs' structural DNA-binding domains. The new JASPAR collections prompt updates to the genomic tracks of predicted TF binding sites (TFBSs) in 8 organisms, with human and mouse tracks available as native tracks in the UCSC Genome browser. All data are available through the JASPAR web interface and programmatically through its API and the updated Bioconductor and pyJASPAR packages. Finally, a new TFBS extraction tool enables users to retrieve predicted JASPAR TFBSs intersecting their genomic regions of interest.


Related Papers

No related papers found

Powered by citation graph analysis