Personal transcriptome variation is poorly explained by current genomic deep learning models

Connie Huang(University of California, Berkeley), Richard W. Shuai(University of California, Berkeley), Parth Baokar(University of California, Berkeley), Ryan Chung(University of California, Berkeley), Ruchir Rastogi(University of California, Berkeley), Pooja Kathail(University of California, Berkeley), Nilah M. Ioannidis(Berkeley College)
Nature Genetics
November 30, 2023
Cited by 100Open Access
Full Text

Abstract

Genomic deep learning models can predict genome-wide epigenetic features and gene expression levels directly from DNA sequence. While current models perform well at predicting gene expression levels across genes in different cell types from the reference genome, their ability to explain expression variation between individuals due to cis-regulatory genetic variants remains largely unexplored. Here, we evaluate four state-of-the-art models on paired personal genome and transcriptome data and find limited performance when explaining variation in expression across individuals. In addition, models often fail to predict the correct direction of effect of cis-regulatory genetic variation on expression.


Related Papers

No related papers found

Powered by citation graph analysis