Comprehensive functional genomic resource and integrative model for the human brain

Daifeng Wang(Yale University), Shuang Liu(Yale University), Jonathan Warrell(Yale University), Hyejung Won(University of North Carolina at Chapel Hill), Xu Shi(Yale University), Fábio C. P. Navarro(Yale University), Declan Clarke(Yale University), Mengting Gu(Yale University), Prashant S. Emani(Yale University), Yucheng Yang(Yale University), Min Xu(Yale University), Michael J. Gandal(Charles R. Drew University of Medicine and Science), Shaoke Lou(Yale University), Jing Zhang(Yale University), Jonathan J. Park(Yale University), Chengfei Yan(Yale University), Suhn K. Rhie(University of Southern California), Kasidet Manakongtreecheep(Yale University), Holly Zhou(Yale University), Aparna Nathan(Yale University), Mette A. Peters(Sage Bionetworks), Eugenio Mattei(University of Massachusetts Chan Medical School), Dominic Fitzgerald(Reproductive Genetics Institute), Tonya M. Brunetti(Reproductive Genetics Institute), Jill E. Moore(University of Massachusetts Chan Medical School), Yan Jiang(Icahn School of Medicine at Mount Sinai), Kiran Girdhar(Icahn School of Medicine at Mount Sinai), Gabriel E. Hoffman(Icahn School of Medicine at Mount Sinai), Selim Kalaycı(Icahn School of Medicine at Mount Sinai), Zeynep H. Gümüş(Icahn School of Medicine at Mount Sinai), Gregory E. Crawford(Duke University), Panos Roussos(Icahn School of Medicine at Mount Sinai), Schahram Akbarian(Icahn School of Medicine at Mount Sinai), Andrew E. Jaffe(Johns Hopkins University), Kevin P. White(Tempus Labs (United States)), Zhiping Weng(University of Massachusetts Chan Medical School), Nenad Šestan(Yale University), Daniel H. Geschwind(University of California, Los Angeles), James A. Knowles(SUNY Downstate Health Sciences University), Mark Gerstein(Yale University), Allison E. Ashley‐Koch, Gregory E. Crawford(Duke University), Melanie E. Garrett, Lingyun Song, Alexias Safi, Graham D. Johnson, Gregory A. Wray(Duke University), Timothy E. Reddy, Fernando S. Goes, Peter P. Zandi(Sage Bionetworks), Julien Bryois, Andrew E. Jaffe(Johns Hopkins University), Amanda J. Price, Nikolay A. Ivanov, Leonardo Collado‐Torres, Thomas M. Hyde, Emily E. Burke, Joel E. Kleiman, Ran Tao(Icahn School of Medicine at Mount Sinai), Joo Heon Shin, Schahram Akbarian(Icahn School of Medicine at Mount Sinai), Kiran Girdhar(Icahn School of Medicine at Mount Sinai), Yan Jiang(Icahn School of Medicine at Mount Sinai), Marija Kundaković, Leanne Brown, Bibi Kassim, Royce Park(Yale University), Jennifer Wiseman, Elizabeth Zharovsky, Rivka Jacobov, Olivia Devillers, Elie Flatow, Gabriel E. Hoffman(Icahn School of Medicine at Mount Sinai), Barbara K. Lipska, David A. Lewis, Vahram Haroutunian, Chang-Gyu Hahn, Alexander W. Charney, Stella Dracheva, Alexey Kozlenkov, Judson Belmont, Diane M. Del Valle, Nancy Francoeur, Evi Hadjimichael(Tempus Labs (United States)), Dalila Pinto, Harm van Bakel, Panos Roussos(Icahn School of Medicine at Mount Sinai), John F. Fullard, Jaroslav Bendl, Mads E. Hauberg, Lara M. Mangravite, Mette A. Peters(Sage Bionetworks), Yooree Chae(Charles R. Drew University of Medicine and Science), Junmin Peng, Mingming Niu, Xusheng Wang(Yale University), Maree J. Webster, Thomas G. Beach, Chao Chen(Yale University), Yi Jiang(Icahn School of Medicine at Mount Sinai), Rujia Dai(Yale University), Annie W. Shieh, Chunyu Liu(Yale University), Kay Grennan, Yan Xia(Icahn School of Medicine at Mount Sinai), Ramu Vadukapuram, Yongjun Wang(Yale University), Dominic Fitzgerald(Reproductive Genetics Institute), Lijun Cheng(Yale University), Miguel Brown, Mimi Brown, Tonya M. Brunetti(Reproductive Genetics Institute), Thomas Goodman, Majd Alsayed, Michael J. Gandal(Charles R. Drew University of Medicine and Science), Daniel H. Geschwind(University of California, Los Angeles), Hyejung Won(University of North Carolina at Chapel Hill), Damon Polioudakis, Brie Wamsley(Icahn School of Medicine at Mount Sinai), Jiani Yin, Tarik Hadžić, Luis de la Torre-Ubieta, Vivek Swarup, Stephan Sanders, Matthew W. State, Donna M. Werling, Joon‐Yong An, Brooke Sheppard, A. Jeremy Willsey, Kevin P. White(Tempus Labs (United States)), Mohana Ray, Gina Giase, Amira Kefi, Eugenio Mattei(University of Massachusetts Chan Medical School), Michael Purcaro(Charles R. Drew University of Medicine and Science), Zhiping Weng(University of Massachusetts Chan Medical School), Jill E. Moore(University of Massachusetts Chan Medical School), Henry Pratt, Jack Huey, Tyler Borrman, Patrick F. Sullivan, Paola Giusti‐Rodríguez, Yunjung Kim, Patrick Sullivan, Jin Szatkiewicz(Yale University), Suhn K. Rhie(University of Southern California), Christoper Armoskus, Adrian Camarena, Peggy Farnham, Valeria N. Spitsyna, Heather Witt, Shannon Schreiner, Oleg V. Evgrafov, James A. Knowles(SUNY Downstate Health Sciences University), Mark Gerstein(Yale University), Shuang Liu(Yale University), Daifeng Wang(Yale University), Fábio C. P. Navarro(Yale University), Jonathan Warrell(Yale University), Declan Clarke(Yale University), Prashant S. Emani(Yale University), Mengting Gu(Yale University), Xu Shi(Yale University), Min Xu(Yale University), Yucheng Yang(Yale University), Robert R. Kitchen, Gamze Gürsoy, Jing Zhang(Yale University), Becky C. Carlyle, Angus C. Nairn, Mingfeng Li(Icahn School of Medicine at Mount Sinai), Sirisha Pochareddy, Nenad Šestan(Yale University), Mario Škarica, Zhen Li(Icahn School of Medicine at Mount Sinai), André M. M. Sousa, Gabriel Santpere(Icahn School of Medicine at Mount Sinai), Jinmyung Choi, Ying Zhu, Tianliuyun Gao, Daniel J. Miller, Adriana Cherskov, Mo Yang(Yale University), Anahita Amiri, Gianfilippo Coppola, Jessica Mariani, Soraya Scuderi, Anna Szekely, Flora M. Vaccarino, Feinan Wu, Sherman M. Weissman, Tanmoy Roychowdhury, Alexej Abyzov
Science
December 13, 2018
Cited by 1,083

Abstract

INTRODUCTION Strong genetic associations have been found for a number of psychiatric disorders. However, understanding the underlying molecular mechanisms remains challenging. RATIONALE To address this challenge, the PsychENCODE Consortium has developed a comprehensive online resource and integrative models for the functional genomics of the human brain. RESULTS The base of the pyramidal resource is the datasets generated by PsychENCODE, including bulk transcriptome, chromatin, genotype, and Hi-C datasets and single-cell transcriptomic data from ~32,000 cells for major brain regions. We have merged these with data from Genotype-Tissue Expression (GTEx), ENCODE, Roadmap Epigenomics, and single-cell analyses. Via uniform processing, we created a harmonized resource, allowing us to survey functional genomics data on the brain over a sample size of 1866 individuals. From this uniformly processed dataset, we created derived data products. These include lists of brain-expressed genes, coexpression modules, and single-cell expression profiles for many brain cell types; ~79,000 brain-active enhancers with associated Hi-C loops and topologically associating domains; and ~2.5 million expression quantitative-trait loci (QTLs) comprising ~238,000 linkage-disequilibrium–independent single-nucleotide polymorphisms and of other types of QTLs associated with splice isoforms, cell fractions, and chromatin activity. By using these, we found that >88% of the cross-population variation in brain gene expression can be accounted for by cell fraction changes. Furthermore, a number of disorders and aging are associated with changes in cell-type proportions. The derived data also enable comparison between the brain and other tissues. In particular, by using spectral analyses, we found that the brain has distinct expression and epigenetic patterns, including a greater extent of noncoding transcription than other tissues. The top level of the resource consists of integrative networks for regulation and machine-learning models for disease prediction. The networks include a full gene regulatory network (GRN) for the brain, linking transcription factors, enhancers, and target genes from merging of the QTLs, generalized element-activity correlations, and Hi-C data. By using this network, we link disease genes to genome-wide association study (GWAS) variants for psychiatric disorders. For schizophrenia, we linked 321 genes to the 142 reported GWAS loci. We then embedded the regulatory network into a deep-learning model to predict psychiatric phenotypes from genotype and expression. Our model gives a ~6-fold improvement in prediction over additive polygenic risk scores. Moreover, it achieves a ~3-fold improvement over additive models, even when the gene expression data are imputed, highlighting the value of having just a small amount of transcriptome data for disease prediction. Lastly, it highlights key genes and pathways associated with disorder prediction, including immunological, synaptic, and metabolic pathways, recapitulating de novo results from more targeted analyses. CONCLUSION Our resource and integrative analyses have uncovered genomic elements and networks in the brain, which in turn have provided insight into the molecular mechanisms underlying psychiatric disorders. Our deep-learning model improves disease risk prediction over traditional approaches and can be extended with additional data types (e.g., microRNA and neuroimaging). A comprehensive functional genomic resource for the adult human brain. The resource forms a three-layer pyramid. The bottom layer includes sequencing datasets for traits, such as schizophrenia. The middle layer represents derived datasets, including functional genomic elements and QTLs. The top layer contains integrated models, which link genotypes to phenotypes. DSPN, Deep Structured Phenotype Network; PC1 and PC2, principal components 1 and 2; ref, reference; alt, alternate; H3K27ac, histone H3 acetylation at lysine 27.


Related Papers