An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci

Edward Mountjoy(Wellcome Sanger Institute), Ellen M. Schmidt(Wellcome Sanger Institute), Miguel Carmona(European Bioinformatics Institute), Jeremy Schwartzentruber(European Bioinformatics Institute), Gareth Peat(European Bioinformatics Institute), Alfredo Miranda(European Bioinformatics Institute), Luca Fumis(European Bioinformatics Institute), James Hayhurst(European Bioinformatics Institute), Annalisa Buniello(European Bioinformatics Institute), Mohd Anisul Karim(Wellcome Sanger Institute), Daniel J. Wright(Wellcome Sanger Institute), Andrew Hercules(European Bioinformatics Institute), Eliseo Papa(Biogen (United States)), Eric B. Fauman(Pfizer (United States)), Jeffrey C. Barrett(Wellcome Sanger Institute), John A. Todd(Centre for Human Genetics), David Ochoa(European Bioinformatics Institute), Ian Dunham(European Bioinformatics Institute), Maya Ghoussaini(Wellcome Sanger Institute)
Nature Genetics
October 28, 2021
Cited by 629Open Access
Full Text

Abstract

Genome-wide association studies (GWASs) have identified many variants associated with complex traits, but identifying the causal gene(s) is a major challenge. In the present study, we present an open resource that provides systematic fine mapping and gene prioritization across 133,441 published human GWAS loci. We integrate genetics (GWAS Catalog and UK Biobank) with transcriptomic, proteomic and epigenomic data, including systematic disease–disease and disease–molecular trait colocalization results across 92 cell types and tissues. We identify 729 loci fine mapped to a single-coding causal variant and colocalized with a single gene. We trained a machine-learning model using the fine-mapped genetics and functional genomics data and 445 gold-standard curated GWAS loci to distinguish causal genes from neighboring genes, outperforming a naive distance-based model. Our prioritized genes were enriched for known approved drug targets (odds ratio = 8.1, 95% confidence interval = 5.7, 11.5). These results are publicly available through a web portal ( http://genetics.opentargets.org ), enabling users to easily prioritize genes at disease-associated loci and assess their potential as drug targets. Open Targets Genetics is a community resource that provides systematic fine mapping at human GWAS loci, enabling users to prioritize genes at disease-associated regions and assess their potential as drug targets.


Related Papers

No related papers found

Powered by citation graph analysis