An integrated map of structural variation in 2,504 human genomes

Peter H. Sudmant(University of Washington), Tobias Rausch(European Molecular Biology Laboratory), Eugene J. Gardner(University of Maryland, Baltimore), Robert E. Handsaker(Broad Institute), Alexej Abyzov(Mayo Clinic), John Huddleston(Howard Hughes Medical Institute), Yan Zhang(Whitney Museum of American Art), Kai Ye(Washington University in St. Louis), Goo Jun(Statistical Service), Markus Hsi-Yang Fritz(European Molecular Biology Laboratory), Miriam K. Konkel(Louisiana State University), Ankit Malhotra(Jackson Laboratory), Adrian M. Stütz(European Molecular Biology Laboratory), Xinghua Shi(University of North Carolina at Charlotte), Francesco Paolo Casale(European Bioinformatics Institute), Jieming Chen(Whitney Museum of American Art), Fereydoun Hormozdiari(University of Washington), Gargi Dayama(State Street (United States)), Ken Chen(The University of Texas MD Anderson Cancer Center), Maika Malig(University of Washington), Mark Chaisson(University of Washington), Klaudia Walter(Wellcome Sanger Institute), Sascha Meiers(European Molecular Biology Laboratory), Seva Kashin(Broad Institute), Erik Garrison(Boston College), Adam Auton(Albert Einstein College of Medicine), Hugo Y. K. Lam(ZIN Technologies (United States)), Xinmeng Jasmine Mu(Broad Institute), Can Alkan(Bilkent University), Danny Antaki(University of California San Diego), Taejeong Bae(Mayo Clinic), Eliza Cerveira(Jackson Laboratory), Peter S. Chines(National Institutes of Health), Zechen Chong(The University of Texas MD Anderson Cancer Center), Laura Clarke(European Bioinformatics Institute), Elif Dal(Bilkent University), Li Ding(Washington University in St. Louis), Sarah B. Emery(University of Michigan), Xian Fan(The University of Texas MD Anderson Cancer Center), Madhusudan Gujral(University of California San Diego), Fatma Kahveci(Bilkent University), Jeffrey M. Kidd(University of Michigan), Yu Kong(Albert Einstein College of Medicine), Eric-Wubbo Lameijer(Leiden University Medical Center), Shane McCarthy(Wellcome Sanger Institute), Paul Flicek(European Bioinformatics Institute), Richard A. Gibbs(Baylor College of Medicine), Gábor Marth(Boston College), Christopher E. Mason(Prince Sattam Bin Abdulaziz University), Androniki Menelaou(University Medical Center Utrecht), Donna M. Muzny(Icahn School of Medicine at Mount Sinai), Bradley J. Nelson(University of Washington), Amina Noor(University of California San Diego), Nicholas F. Parrish(Kyoto University), Matthew Pendleton(Icahn School of Medicine at Mount Sinai), Andrew Quitadamo(University of North Carolina at Charlotte), Benjamin Raeder(European Molecular Biology Laboratory), Eric E. Schadt(Icahn School of Medicine at Mount Sinai), Mallory Romanovitch(Jackson Laboratory), Andreas Schlattl(European Molecular Biology Laboratory), Robert Sebra(Icahn School of Medicine at Mount Sinai), Andrey A. Shabalin(Virginia Commonwealth University), Andreas Untergasser(Heidelberg University), Jerilyn A. Walker(Louisiana State University), Min Wang(Baylor College of Medicine), Fuli Yu(Baylor College of Medicine), Chengsheng Zhang(Jackson Laboratory), Jing Zhang(Whitney Museum of American Art), Xiangqun Zheng-Bradley(European Bioinformatics Institute), Wanding Zhou(The University of Texas MD Anderson Cancer Center), Thomas Zichner(European Molecular Biology Laboratory), Jonathan Sebat(University of California San Diego), Mark A. Batzer(Louisiana State University), Steven A. McCarroll(Broad Institute), Ryan E. Mills(State Street (United States)), Mark Gerstein(Whitney Museum of American Art), Ali Bashir(Icahn School of Medicine at Mount Sinai), Oliver Stegle(European Bioinformatics Institute), Scott E. Devine(University of Maryland, Baltimore), Charles Lee(Ewha Womans University), Evan E. Eichler(Howard Hughes Medical Institute), Jan O. Korbel(European Bioinformatics Institute)
Nature
September 29, 2015
Cited by 2,649Open Access
Full Text

Abstract

Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association. The Structural Variation Analysis Group of The 1000 Genomes Project reports an integrated structural variation map based on discovery and genotyping of eight major structural variation classes in 2,504 unrelated individuals from across 26 populations; structural variation is compared within and between populations and its functional impact is quantified. The Structural Variation Analysis Group of The 1000 Genomes Project reports an integrated structural variation map based on discovery and genotyping of eight major structural variation classes in genomes for 2,504 unrelated individuals from across 26 populations. They characterize structural variation within and between populations and quantify its functional effect. The authors further create a phased reference panel that will be valuable for population genetic and disease association studies.


Related Papers

No related papers found

Powered by citation graph analysis