The sequences of 150,119 genomes in the UK Biobank

Bjarni V. Halldórsson(Reykjavík University), Hannes P. Eggertsson(deCODE Genetics (Iceland)), Kristjan H. S. Moore(deCODE Genetics (Iceland)), Hannes Hauswedell(deCODE Genetics (Iceland)), Ögmundur Eiríksson(deCODE Genetics (Iceland)), Magnús Ö. Úlfarsson(deCODE Genetics (Iceland)), Gunnar Pálsson(deCODE Genetics (Iceland)), Marteinn T. Hardarson(Reykjavík University), Ásmundur Oddsson(deCODE Genetics (Iceland)), Brynjar Ö. Jensson(deCODE Genetics (Iceland)), Snædís Kristmundsdóttir(Reykjavík University), Brynja D. Sigurpalsdottir(Reykjavík University), Ólafur Andri Stefánsson(deCODE Genetics (Iceland)), Doruk Beyter(deCODE Genetics (Iceland)), Guillaume Holley(deCODE Genetics (Iceland)), Vinicius Tragante(deCODE Genetics (Iceland)), Arnaldur Gylfason(deCODE Genetics (Iceland)), Pall I. Olason(deCODE Genetics (Iceland)), Florian Zink(deCODE Genetics (Iceland)), Margret Asgeirsdottir(deCODE Genetics (Iceland)), Sverrir T. Sverrisson(deCODE Genetics (Iceland)), Brynjar Sigurdsson(deCODE Genetics (Iceland)), Sigurjón A. Guðjónsson(deCODE Genetics (Iceland)), Gunnar Sigurðsson(deCODE Genetics (Iceland)), Gísli H. Halldórsson(deCODE Genetics (Iceland)), Garðar Sveinbjörnsson(deCODE Genetics (Iceland)), Kristján Norland(deCODE Genetics (Iceland)), Unnur Styrkársdóttir(deCODE Genetics (Iceland)), Droplaug N. Magnúsdóttir(deCODE Genetics (Iceland)), Steinunn Snorradóttir(deCODE Genetics (Iceland)), Kári Kristinsson(deCODE Genetics (Iceland)), Emilia Sobech(deCODE Genetics (Iceland)), Helgi Jónsson(Reykjavík University), Árni Jón Geirsson(Reykjavík University), Ísleifur Ólafsson(Reykjavík University), Pálmi V. Jónsson(Reykjavík University), Ole Birger Pedersen(Zealand University Hospital Køge), Christian Erikstrup(Aarhus University), Søren Brunak(University of Copenhagen), Sisse Rye Ostrowski(University of Copenhagen), Steffen Andersen(Copenhagen Business School), Karina Banasik(University of Copenhagen), Kristoffer Sølvsten Burgdorf(Copenhagen University Hospital), Maria Didriksen(Copenhagen University Hospital), Khoa Manh Dinh(Aarhus University Hospital), Christian Erikstrup(Aarhus University), Daníel F. Guðbjartsson(deCODE Genetics (Iceland)), Thomas Folkmann Hansen(Glostrup Hospital), Henrik Hjalgrim(Statens Serum Institut), Gregor B. E. Jemec(Zealand University Hospital), Poul Jennum(University of Copenhagen), Pär I. Johansson(Copenhagen University Hospital), Margit Anita Hørup Larsen(Copenhagen University Hospital), Susan Mikkelsen(Aarhus University Hospital), Kasper Nielsen(Aalborg University Hospital), Mette Nyegaard(Aarhus University), Sisse Rye Ostrowski(University of Copenhagen), Susanne Gjørup Sækmose(Zealand University Hospital Køge), Erik Sørensen(Copenhagen University Hospital), Unnur Þorsteinsdóttir(deCODE Genetics (Iceland)), Mie Topholm Brun(Odense University Hospital), Henrik Ullum(Statens Serum Institut), Thomas Werge(Copenhagen University Hospital), Guðmar Þorleifsson(deCODE Genetics (Iceland)), Frosti Jónsson(deCODE Genetics (Iceland)), Páll Melsted(deCODE Genetics (Iceland)), Ingileif Jónsdóttir(deCODE Genetics (Iceland)), Þórunn Rafnar(deCODE Genetics (Iceland)), Hilma Hólm(deCODE Genetics (Iceland)), Hreinn Stefánsson(deCODE Genetics (Iceland)), Jona Saemundsdottir(deCODE Genetics (Iceland)), Daníel F. Guðbjartsson(deCODE Genetics (Iceland)), Ólafur Þ. Magnússon(deCODE Genetics (Iceland)), Gísli Másson(deCODE Genetics (Iceland)), Unnur Þorsteinsdóttir(deCODE Genetics (Iceland)), Agnar Helgason(deCODE Genetics (Iceland)), Hákon Jónsson(Reykjavík University), Patrick Sulem(deCODE Genetics (Iceland)), Kāri Stefánsson(deCODE Genetics (Iceland))
Nature
July 20, 2022
Cited by 486Open Access
Full Text

Abstract

Abstract Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data 1,2 . Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank 3 . This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation.


Related Papers

No related papers found

Powered by citation graph analysis