Integration of variant annotations using deep set networks boosts rare variant association testing

Brian Clarke(German Cancer Research Center), Eva Holtkamp(Helmholtz Association of German Research Centres), Hakime Öztürk(German Cancer Research Center), Marcel Mück(German Cancer Research Center), Magnus Wahlberg(German Cancer Research Center), Kayla Meyer(German Cancer Research Center), Felix Munzlinger(German Cancer Research Center), Felix Brechtmann(Munich Center for Machine Learning), Florian R. Hölzlwimmer(Technical University of Munich), Jonas Lindner(Technical University of Munich), Zhifen Chen(German Centre for Cardiovascular Research), Julien Gagneur(Helmholtz Zentrum München), Oliver Stegle(European Bioinformatics Institute)
Nature Genetics
September 25, 2024
Cited by 19Open Access
Full Text

Abstract

Rare genetic variants can have strong effects on phenotypes, yet accounting for rare variants in genetic analyses is statistically challenging due to the limited number of allele carriers and the burden of multiple testing. While rich variant annotations promise to enable well-powered rare variant association tests, methods integrating variant annotations in a data-driven manner are lacking. Here we propose deep rare variant association testing (DeepRVAT), a model based on set neural networks that learns a trait-agnostic gene impairment score from rare variant annotations and phenotypes, enabling both gene discovery and trait prediction. On 34 quantitative and 63 binary traits, using whole-exome-sequencing data from UK Biobank, we find that DeepRVAT yields substantial gains in gene discoveries and improved detection of individuals at high genetic risk. Finally, we demonstrate how DeepRVAT enables calibrated and computationally efficient rare variant tests at biobank scale, aiding the discovery of genetic risk factors for human disease traits.


Related Papers

No related papers found

Powered by citation graph analysis