Transcript expression-aware annotation improves rare variant interpretation

Beryl B. Cummings(Broad Institute), Konrad J. Karczewski(Broad Institute), Jack A. Kosmicki(Broad Institute), Eleanor G. Seaby(Broad Institute), Nicholas A. Watts(Broad Institute), Moriel Singer‐Berk(Broad Institute), Jonathan M. Mudge(European Bioinformatics Institute), Juha Karjalainen(Broad Institute), F. Kyle Satterstrom(Broad Institute), Anne O’Donnell‐Luria(Broad Institute), Timothy Poterba(Broad Institute), Cotton Seed(Broad Institute), Matthew Solomonson(Broad Institute), Jessica Alföldi(Broad Institute), Genome Aggregation Database Production Team(Broad Institute), Jessica Alföldi(Broad Institute), Irina M. Armean(Broad Institute), Eric Banks(Broad Institute), Louis Bergelson(Broad Institute), Kristian Cibulskis(Broad Institute), Ryan L. Collins(Broad Institute), Kristen M. Connolly(Broad Institute), Miguel Covarrubias(Broad Institute), Beryl B. Cummings(Broad Institute), Mark J. Daly(Broad Institute), Stacey Donnelly(Broad Institute), Yossi Farjoun(Broad Institute), Steven Ferriera(Broad Institute), Laurent C. Francioli(Broad Institute), Stacey Gabriel(Broad Institute), Laura D. Gauthier(Broad Institute), Jeff Gentry(Broad Institute), Namrata Gupta(Broad Institute), Thibault Jeandet(Broad Institute), Diane Kaplan(Broad Institute), Konrad J. Karczewski(Broad Institute), Kristen M. Laricchia(Broad Institute), Christopher Llanwarne(Massachusetts General Hospital), Eric Vallabh Minikel(Broad Institute), Ruchi Munshi(Massachusetts General Hospital), Benjamin M. Neale(Broad Institute), Sam Novod(Broad Institute), Anne O’Donnell‐Luria(Broad Institute), Nikelle Petrillo(Massachusetts General Hospital), Timothy Poterba(Broad Institute), David Roazen(Broad Institute), Valentín Ruano-Rubio(Broad Institute), Andrea Saltzman(Broad Institute), Kaitlin E. Samocha(Broad Institute), Molly Schleicher(Broad Institute), Cotton Seed(Broad Institute), Matthew Solomonson(Broad Institute), José Soto(Massachusetts General Hospital), Grace Tiao(Broad Institute), Kathleen Tibbetts(Broad Institute), Charlotte Tolonen(Massachusetts General Hospital), Christopher Vittal(Broad Institute), Gordon Wade(Massachusetts General Hospital), Arcturus Wang(Massachusetts General Hospital), Qingbo S. Wang(Broad Institute), James S. Ware(Massachusetts General Hospital), Nicholas A. Watts(Broad Institute), Ben Weisburd(Broad Institute), Nicola Whiffin(Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán), Carlos A. Aguilar‐Salinas(Peninsula College of Medicine and Dentistry), Tariq Ahmad(Brigham and Women's Hospital), Christine M. Albert(University of Parma), Diego Ardissino(University of Haifa), Gil Atzmon(Cleveland Clinic Lerner College of Medicine), John Barnard(Sorbonne Université), Laurent Beaugerie(National Heart Lung and Blood Institute), Emelia J. Benjamin(University of Michigan), Michael Boehnke(National Institutes of Health), Lori L. Bonnycastle(Icahn School of Medicine at Mount Sinai), Erwin P. Böttinger(Wake Forest University), Donald W. Bowden(University of Leicester), Matthew J. Bown(Imperial College London), John C. Chambers(Chinese University of Hong Kong), Juliana C.N. Chan(Brigham and Women's Hospital), Daniel I. Chasman(Icahn School of Medicine at Mount Sinai), Judy H. Cho(Cleveland Clinic Lerner College of Medicine), Mina K. Chung(McLean Hospital), Bruce M. Cohen(Jackson Memorial Hospital), Adolfo Correa(Colorado School of Public Health), Dana Dabelea(Massachusetts General Hospital), Mark J. Daly(Broad Institute), Dawood Darbar(Texas Biomedical Research Institute), Ravindranath Duggirala(National Heart Lung and Blood Institute), Josée Dupuis(Massachusetts General Hospital), Patrick T. Ellinor(Broad Institute), Roberto Elosúa(University of Lübeck), Jeanette Erdmann(University of Tartu), Tõnu Esko(Broad Institute), Martti Färkkilâ(Broad Institute), José C. Florez(Broad Institute), André Franke(Broad Institute), Gad Getz(Hadassah Medical Center), Benjamin Gläser(SUNY Upstate Medical University), Stephen J. Glatt(Columbia University Irving Medical Center), David Goldstein(Instituto Nacional de Salud Pública), Clicerio González(Lund University), Leif Groop(Lund University), Christopher Haiman(The University of Texas Health Science Center at Houston), Craig L. Hanis(Columbia University), Matthew Harms(University of Eastern Finland), Mikko Hiltunen(Helsinki University Hospital), Matti Holi(Karolinska Institutet), Christina M. Hultman(Helsinki University Hospital), Mikko Kallela(University of Helsinki), Jaakko Kaprio(Massachusetts General Hospital), Sekar Kathiresan(Broad Institute), Bong-Jo Kim(Korea National Institute of Health), Young Jin Kim(Cardiff University), George Kirov(Ealing Hospital), Jaspal S. Kooner(Finnish Institute for Health and Welfare), Seppo Koskinen(Yale University), Harlan M. Krumholz(Emory University), Subra Kugathasan(Seoul National University Hospital), Soo Heon Kwak(University of Eastern Finland), Markku Laakso(Tampere University), Terho Lehtimäki(Icahn School of Medicine at Mount Sinai), Ruth J. F. Loos(Massachusetts General Hospital), Steven A. Lubitz(Broad Institute), Ronald C.W.(Massachusetts General Hospital), Daniel G. MacArthur(Broad Institute), Jaume Marrugat(Tampere University), Kari M. Mattila(Broad Institute), Steven A. McCarroll(Broad Institute), Mark I. McCarthy(Cedars-Sinai Medical Center), Dermot McGovern(University of Ottawa), Ruth McPherson(Harvard University), James B. Meigs(Broad Institute), Olle Melander(University of Tartu), Andres Metspalu(Massachusetts General Hospital), Benjamin M. Neale(Lund University), Peter M. Nilsson(Cardiff University), Michael O‘Donovan(McLean Hospital), Döst Öngür(National Institute of Genomic Medicine), Lorena Orozco(Cardiff University), Michael J. Owen(University of Dundee), Colin N. A. Palmer(University of Helsinki), Aarno Palotie(Broad Institute), Kyong Soo Park(University of Southern California), Carlos N. Pato(Johns Hopkins University), Ann E. Pulver(Institute of Cancer Research), Nazneen Rahman(University of Oulu), Anne M. Remes(Montreal Heart Institute), John D. Rioux(University of Helsinki), Samuli Ripatti(Broad Institute), Dan M. Roden(University of Pennsylvania), Danish Saleheen(Finnish Institute for Health and Welfare), Veikko Salomaa(Glenfield Hospital), Nilesh J. Samani(Massachusetts General Hospital), Jeremiah M. Scharf(Deutsches Herzzentrum München), Heribert Schunkert(Broad Institute), M. Benjamin Shoemaker(Broad Institute), Pamela Sklar(University of Eastern Finland), Hilkka Soininen(Sorbonne Université), Harry Sokol(King's College London), Tim D. Spector(Karolinska Institutet), Patrick F. Sullivan(Finnish Institute for Health and Welfare), Jaana Suvisaari(National University of Singapore), E Shyong Tai(National University of Singapore), Yik Ying Teo(University of Helsinki), Tuomi Tiinamaija(University of California San Diego), Ming T. Tsuang(Hebrew University of Jerusalem), Dan Turner, Teresa Tusié‐Luna(Finnish Institute for Health and Welfare), Erkki Vartiainen(University of California, Irvine), Marquis P. Vawter(Harefield Hospital), James S. Ware(University of Oxford), Hugh Watkins(University Medical Center Groningen), Rinse K. Weersma(University of Helsinki), Maija Wessman(Jackson Memorial Hospital), James G. Wilson(Broad Institute), Ramnik J. Xavier(Massachusetts General Hospital), Mark J. Daly(Broad Institute), Daniel G. MacArthur(Broad Institute)
Nature
May 27, 2020
Cited by 201Open Access
Full Text

Abstract

Abstract The acceleration of DNA sequencing in samples from patients and population studies has resulted in extensive catalogues of human genetic variation, but the interpretation of rare genetic variants remains problematic. A notable example of this challenge is the existence of disruptive variants in dosage-sensitive disease genes, even in apparently healthy individuals. Here, by manual curation of putative loss-of-function (pLoF) variants in haploinsufficient disease genes in the Genome Aggregation Database (gnomAD) 1 , we show that one explanation for this paradox involves alternative splicing of mRNA, which allows exons of a gene to be expressed at varying levels across different cell types. Currently, no existing annotation tool systematically incorporates information about exon expression into the interpretation of variants. We develop a transcript-level annotation metric known as the ‘proportion expressed across transcripts’, which quantifies isoform expression for variants. We calculate this metric using 11,706 tissue samples from the Genotype Tissue Expression (GTEx) project 2 and show that it can differentiate between weakly and highly evolutionarily conserved exons, a proxy for functional importance. We demonstrate that expression-based annotation selectively filters 22.8% of falsely annotated pLoF variants found in haploinsufficient disease genes in gnomAD, while removing less than 4% of high-confidence pathogenic variants in the same genes. Finally, we apply our expression filter to the analysis of de novo variants in patients with autism spectrum disorder and intellectual disability or developmental disorders to show that pLoF variants in weakly expressed regions have similar effect sizes to those of synonymous variants, whereas pLoF variants in highly expressed exons are most strongly enriched among cases. Our annotation is fast, flexible and generalizable, making it possible for any variant file to be annotated with any isoform expression dataset, and will be valuable for the genetic diagnosis of rare diseases, the analysis of rare variant burden in complex disorders, and the curation and prioritization of variants in recall-by-genotype studies.


Related Papers

No related papers found

Powered by citation graph analysis