The mutational constraint spectrum quantified from variation in 141,456 humans

Konrad J. Karczewski(Broad Institute), Laurent C. Francioli(Broad Institute), Grace Tiao(Broad Institute), Beryl B. Cummings(Broad Institute), Jessica Alföldi(Broad Institute), Qingbo S. Wang(Broad Institute), Ryan L. Collins(Broad Institute), Kristen M. Laricchia(Broad Institute), Andrea Ganna(Broad Institute), Daniel P. Birnbaum(Broad Institute), Laura D. Gauthier(Broad Institute), Harrison Brand(Broad Institute), Matthew Solomonson(Broad Institute), Nicholas A. Watts(Broad Institute), Daniel R. Rhodes(Queen Mary University of London), Moriel Singer‐Berk(Broad Institute), Eleina England(Broad Institute), Eleanor G. Seaby(Broad Institute), Jack A. Kosmicki(Broad Institute), Raymond K. Walters(Broad Institute), Katherine Tashman(Broad Institute), Yossi Farjoun(Broad Institute), Eric Banks(Broad Institute), Timothy Poterba(Broad Institute), Arcturus Wang(Broad Institute), Cotton Seed(Broad Institute), Nicola Whiffin(Broad Institute), Jessica X. Chong(University of Washington), Kaitlin E. Samocha(Wellcome Sanger Institute), Emma Pierce‐Hoffman(Broad Institute), Zachary Zappala(Broad Institute), Anne O’Donnell‐Luria(Broad Institute), Eric Vallabh Minikel(Broad Institute), Ben Weisburd(Broad Institute), Monkol Lek(Yale University), James S. Ware(Broad Institute), Christopher Vittal(Broad Institute), Irina M. Armean(Broad Institute), Louis Bergelson(Broad Institute), Kristian Cibulskis(Broad Institute), Kristen M. Connolly(Broad Institute), Miguel Covarrubias(Broad Institute), Stacey Donnelly(Broad Institute), Steven Ferriera(Broad Institute), Stacey Gabriel(Broad Institute), Jeff Gentry(Broad Institute), Namrata Gupta(Broad Institute), Thibault Jeandet(Broad Institute), Diane Kaplan(Broad Institute), Christopher Llanwarne(Broad Institute), Ruchi Munshi(Broad Institute), Sam Novod(Broad Institute), Nikelle Petrillo(Broad Institute), David Roazen(Broad Institute), Valentín Ruano-Rubio(Broad Institute), Andrea Saltzman(Broad Institute), Molly Schleicher(Broad Institute), José Soto(Broad Institute), Kathleen Tibbetts(Broad Institute), Charlotte Tolonen(Broad Institute), Gordon Wade(Broad Institute), Michael E. Talkowski(Broad Institute), Carlos A. Aguilar‐Salinas(Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán), Tariq Ahmad(Peninsula College of Medicine and Dentistry), Christine M. Albert(Brigham and Women's Hospital), Diego Ardissino(University of Parma), Gil Atzmon(Albert Einstein College of Medicine), John Barnard(Cleveland Clinic Lerner College of Medicine), Laurent Beaugerie(Sorbonne Université), Emelia J. Benjamin(Boston University), Michael Boehnke(University of Michigan), Lori L. Bonnycastle(National Institutes of Health), Erwin P. Böttinger(Icahn School of Medicine at Mount Sinai), Donald W. Bowden(Wake Forest University), Matthew J. Bown(University of Leicester), John C. Chambers(Imperial College Healthcare NHS Trust), Juliana C.N. Chan(Chinese University of Hong Kong), Daniel I. Chasman(Brigham and Women's Hospital), Judy H. Cho(Icahn School of Medicine at Mount Sinai), Mina K. Chung(Cleveland Clinic Lerner College of Medicine), Bruce M. Cohen(Harvard University), Adolfo Correa(Jackson Memorial Hospital), Dana Dabelea(Colorado School of Public Health), Mark J. Daly(Broad Institute), Dawood Darbar(University of Illinois Chicago), Ravindranath Duggirala(Texas Biomedical Research Institute), Josée Dupuis(Boston University), Patrick T. Ellinor(Broad Institute), Roberto Elosúa(Universitat de Vic - Universitat Central de Catalunya), Jeanette Erdmann(German Centre for Cardiovascular Research), Tõnu Esko(Broad Institute), Martti Färkkilâ(University of Helsinki), José C. Florez(Broad Institute), André Franke(Christian-Albrechts-Universität zu Kiel), Gad Getz(Broad Institute), Benjamin Gläser(Hadassah Medical Center), Stephen J. Glatt(SUNY Upstate Medical University), David Goldstein(Columbia University Irving Medical Center), Clicerio González(Instituto Nacional de Salud Pública), Leif Groop(Lund University), Christopher Haiman(Lund University), Craig L. Hanis(The University of Texas Health Science Center at Houston), Matthew Harms(Columbia University), Mikko Hiltunen(University of Eastern Finland), Matti Holi(Helsinki University Hospital), Christina M. Hultman(Karolinska Institutet), Mikko Kallela(Helsinki University Hospital), Jaakko Kaprio(University of Helsinki), Sekar Kathiresan(Broad Institute), Bong-Jo Kim(Korea National Institute of Health), Young Jin Kim(Korea National Institute of Health), George Kirov(Cardiff University), Jaspal S. Kooner(Imperial College Healthcare NHS Trust), Seppo Koskinen(Finnish Institute for Health and Welfare), Harlan M. Krumholz(Yale University), Subra Kugathasan(Emory University), Soo Heon Kwak(Seoul National University Hospital), Markku Laakso(University of Eastern Finland), Terho Lehtimäki(Tampere University), Ruth J. F. Loos(Child Health and Development Institute), Steven A. Lubitz(Broad Institute), Ronald C.W.(Chinese University of Hong Kong), Daniel G. MacArthur(Broad Institute), Jaume Marrugat(Centro de Investigación en Red en Enfermedades Cardiovasculares), Kari M. Mattila(Tampere University), Steven A. McCarroll(Broad Institute), Mark I. McCarthy(Centre for Human Genetics), Dermot McGovern(Cedars-Sinai Medical Center), Ruth McPherson(University of Ottawa), James B. Meigs(Broad Institute), Olle Melander(Lund University), Andres Metspalu(University of Tartu), Benjamin M. Neale(Broad Institute), Peter M. Nilsson(Lund University), Michael O‘Donovan(Cardiff University), Döst Öngür(Harvard University), Lorena Orozco(National Institute of Genomic Medicine), Michael J. Owen(Cardiff University), Colin N. A. Palmer(University of Dundee), Aarno Palotie(Broad Institute), Kyong Soo Park(Seoul National University), Carlos N. Pato(University of Southern California), Ann E. Pulver(Johns Hopkins University), Nazneen Rahman(Institute of Cancer Research), Anne M. Remes(Oulu University Hospital), John D. Rioux(Montreal Heart Institute), Samuli Ripatti(Broad Institute), Dan M. Roden(Vanderbilt University Medical Center), Danish Saleheen(Center for Non-Communicable Diseases), Veikko Salomaa(Finnish Institute for Health and Welfare), Nilesh J. Samani(University of Leicester), Jeremiah M. Scharf(Broad Institute), Heribert Schunkert(Deutsches Herzzentrum München), M. Benjamin Shoemaker(Vanderbilt University), Pamela Sklar(Massachusetts General Hospital), Hilkka Soininen(University of Eastern Finland), Harry Sokol(Sorbonne Université), Tim D. Spector(King's College London), Patrick F. Sullivan(University of North Carolina at Chapel Hill), Jaana Suvisaari(Finnish Institute for Health and Welfare), E Shyong Tai(National University of Singapore), Yik Ying Teo(National University of Singapore), Tuomi Tiinamaija(Helsinki University Hospital), Ming T. Tsuang(University of California San Diego), Dan Turner(Hebrew University of Jerusalem), Teresa Tusié‐Luna(Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán), Erkki Vartiainen(University of Helsinki), Marquis P. Vawter(University of California, Irvine), James S. Ware(Broad Institute), Hugh Watkins(University of Oxford), Rinse K. Weersma(University Medical Center Groningen), Maija Wessman(Institute for Molecular Medicine Finland), James G. Wilson(Jackson Memorial Hospital), Ramnik J. Xavier(Broad Institute), Benjamin M. Neale(Broad Institute), Mark J. Daly(Broad Institute), Daniel G. MacArthur(Broad Institute)
Nature
May 27, 2020
Cited by 10,052Open Access
Full Text

Abstract

Abstract Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes 1 . Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases.


Related Papers

No related papers found

Powered by citation graph analysis