Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel

Jie Huang(Wellcome Sanger Institute), Bryan Howie(Adaptive Biotechnologies (United States)), Shane McCarthy(Wellcome Sanger Institute), Yasin Memari(Wellcome Sanger Institute), Klaudia Walter(Wellcome Sanger Institute), Josine L. Min(University of Bristol), Petr Danecek(Wellcome Sanger Institute), Giovanni Malerba(University of Verona), Elisabetta Trabetti(University of Verona), Hou‐Feng Zheng(Jewish General Hospital), Saeed Al Turki(Wellcome Sanger Institute), Antoinette Amuzu(University of London), Carl A. Anderson(Wellcome Sanger Institute), Richard Anney(St. James's Hospital), Dinu Antony(University College London), María Soler Artigas(University of Leicester), Muhammad Ayub(Queen's University), Senduran Bala(Wellcome Sanger Institute), Jeffrey C. Barrett(Wellcome Sanger Institute), Inês Barroso(University of Cambridge), Phil Beales(University College London), Marianne Benn(University of Copenhagen), Jamie Bentham(Centre for Human Genetics), Shoumo Bhattacharya(Centre for Human Genetics), Ewan Birney(European Bioinformatics Institute), Douglas Blackwood(Royal Edinburgh Hospital), Martin Bobrow(University of Cambridge), Elena G. Bochukova(University of Cambridge), Patrick Bolton(King's College London), Rebecca Bounds(University of Cambridge), Chris Boustred(Great Ormond Street Hospital for Children NHS Foundation Trust), Gerome Breen(King's College London), Mattia Calissano(Great Ormond Street Hospital), Keren Carss(Wellcome Sanger Institute), Juan P. Casas(University of London), John C. Chambers(St Mary's Hospital), Ruth Charlton(St James's University Hospital), Krishna Chatterjee(University of Cambridge), Lu Chen(University of Cambridge), Antonio Ciampi(McGill University), Sebahattin Çırak(Great Ormond Street Hospital), Peter Clapham(Wellcome Sanger Institute), Gail Clement(King's College London), Guy Coates(Wellcome Sanger Institute), Massimiliano Cocca(University of Trieste), David Collier(Eli Lilly (United States)), Catherine Cosgrove(Centre for Human Genetics), Tony Cox(Wellcome Sanger Institute), Nick Craddock(Cardiff University), Lucy Crooks(Wellcome Sanger Institute), Sarah Curran(University of Sussex), David Curtis(University College London), Allan Daly(Wellcome Sanger Institute), Ian N.M. Day(University of Bristol), Aaron Day-Williams(Biogen (United States)), George Dedoussis(Harokopio University of Athens), Thomas A. Down(Guy's Hospital), Yuanping Du(BGI Group (China)), Cornelia M. van Duijn(Erasmus MC), Ian Dunham(European Bioinformatics Institute), Sarah Edkins(Wellcome Sanger Institute), Rosemary Ekong(University College London), Peter Ellis(Wellcome Sanger Institute), David M. Evans(Translational Research Institute), I. Sadaf Farooqi(University of Cambridge), David Fitzpatrick(Western General Hospital), Paul Flicek(European Bioinformatics Institute), James Floyd(Queen Mary University of London), A. Reghan Foley(Great Ormond Street Hospital), Christopher S. Franklin(Wellcome Sanger Institute), Marta Futema(University College London), Louise Gallagher(St. James's Hospital), Paolo Gasparini(Qatar Airways (Qatar)), Tom R. Gaunt(University of Bristol), Matthias Geihs(Wellcome Sanger Institute), Daniel H. Geschwind(University of California, Los Angeles), Celia M.T. Greenwood(Jewish General Hospital), Heather Griffin(University of Oxford), Detelina Grozeva(University of Cambridge), Xiaosen Guo(BGI Group (China)), Xueqin Guo(BGI Group (China)), Hugh Gurling(University College London), Deborah Hart(King's College London), Audrey E. Hendricks(Statistical and Applied Mathematical Sciences Institute), Peter Holmans(Cardiff University), Liren Huang(BGI Group (China)), Tim Hubbard(Guy's Hospital), Steve E. Humphries(University College London), Matthew E. Hurles(Wellcome Sanger Institute), Pirro G. Hysi(King's College London), Valentina Iotchkova(European Bioinformatics Institute), Aaron Isaacs(Erasmus MC), David K. Jackson(Wellcome Sanger Institute), Yalda Jamshidi(St George's, University of London), Jon Johnson(University College London), Christopher Joyce(Wellcome Sanger Institute), Konrad J. Karczewski(Broad Institute), Jane Kaye(University of Oxford), Thomas Keane(Wellcome Sanger Institute), John P. Kemp(Translational Research Institute), Karen Kennedy(Wellcome Sanger Institute), Alastair Kent(Genetic Alliance UK (United Kingdom)), Julia M. Keogh(University of Cambridge), Farrah Khawaja(St George's, University of London), Marcus E. Kleber(Heidelberg University), Margriet van Kogelenberg(Wellcome Sanger Institute), Anja Kolb-Kokocinski(Wellcome Sanger Institute), Jaspal S. Kooner(Imperial College London), Geneviève Lachance(King's College London), Claudia Langenberg(University of Cambridge), Cordelia Langford(Wellcome Sanger Institute), Daniel J. Lawson(University of Bristol), Irene Lee(University College London), Elisabeth M. van Leeuwen(Erasmus MC), Monkol Lek(Massachusetts General Hospital), Rui Li(Jewish General Hospital), Rui Li(BGI Group (China)), Jieqin Liang(BGI Group (China)), Hong Lin(BGI Group (China)), Ryan Liu, Jouko Lönnqvist(Finnish Institute for Health and Welfare), Luís R. Lopes(University of Lisbon), Margarida Lopes(Centre for Human Genetics), Jian’an Luan(University of Cambridge), Daniel G. MacArthur(Broad Institute), Massimo Mangino(King's College London), Gaëlle Marenne(Wellcome Sanger Institute), Winfried März(Medical University of Graz), John Maslen(Wellcome Sanger Institute), Angela Matchan(Wellcome Sanger Institute), Iain Mathieson(Harvard University), Peter McGuffin(King's College London), Andrew M. McIntosh(Royal Edinburgh Hospital), Andrew G. McKechanie(Royal Edinburgh Hospital), Andrew McQuillin(University College London), Sarah Metrustry(King's College London), Nicola Migone(University of Turin), Hannah M. Mitchison(University College London), Alireza Moayyeri(King's College London), James Morris(Wellcome Sanger Institute), Richard Morris, Dawn Muddyman(Wellcome Sanger Institute), Francesco Muntoni(Great Ormond Street Hospital), Børge G. Nordestgaard(University of Copenhagen), Kate Northstone(University of Bristol), Michael O‘Donovan(Cardiff University), Stephen O’Rahilly(University of Cambridge), Alexandros Onoufriadis(Guy's Hospital), Karim Oualkacha(Université du Québec à Montréal), Michael J. Owen(Cardiff University), Aarno Palotie(Broad Institute), Kalliope Panoutsopoulou(Wellcome Sanger Institute), Victoria Parker(University of Cambridge), Jeremy Parr(Newcastle University), Lavinia Paternoster(University of Bristol), Tiina Paunio(University of Helsinki), Felicity Payne(Wellcome Sanger Institute), Stewart J. Payne(Northwick Park Hospital), John R. B. Perry(King's College London), Olli Pietiläinen(University of Helsinki), Vincent Plagnol(University College London), Rebecca C. Pollitt(Sheffield Children's NHS Foundation Trust), Sue Povey(University College London), Michael A. Quail(Wellcome Sanger Institute), Lydia Quaye(King's College London), Lucy Raymond(University of Cambridge), Karola Rehnström(Wellcome Sanger Institute), Cheryl K. Ridout(Guy's Hospital), Susan M. Ring(University of Bristol), Graham R. S. Ritchie(European Bioinformatics Institute), Nicola Roberts(University of Cambridge), Rachel L. Robinson(St James's University Hospital), David B. Savage(University of Cambridge), Peter Scambler(University College London), Stephan Schiffels(Wellcome Sanger Institute), Miriam Schmidts(Radboud University Nijmegen), Nadia Schoenmakers(University of Cambridge), Richard H. Scott(University of Cambridge), Robert A. Scott(University of Cambridge), Robert K. Semple(University of Cambridge), Eva Serra(Wellcome Sanger Institute), Sally I. Sharp(University College London), Adam Shaw(Guy's and St Thomas' NHS Foundation Trust), Hashem A. Shihab(University of Bristol), So–Youn Shin(Wellcome Sanger Institute), David Skuse(University College London), Kerrin S. Small(King's College London), Carol Smee(Wellcome Sanger Institute), George Davey Smith(University of Bristol), Lorraine Southam(Centre for Human Genetics), Olivera Spasić-Bošković(University of Cambridge), Timothy D. Spector(King's College London), David St Clair(University of Aberdeen), Beaté St Pourcain(University Of Bristol Dental Hospital), Jim Stalker(Wellcome Sanger Institute), Elizabeth Stevens(Great Ormond Street Hospital), Jianping Sun(Jewish General Hospital), Gabriela Surdulescu(King's College London), Jaana Suvisaari(Finnish Institute for Health and Welfare), Petros Syrris(University College London), Ioanna Tachmazidou(Wellcome Sanger Institute), Rohan Taylor(St George's, University of London), Jing Tian(BGI Group (China)), Martin D. Tobin(University of Leicester), Daniela Toniolo(San Raffaele University of Rome), Michela Traglia(San Raffaele University of Rome), Anne Tybjærg‐Hansen(University of Copenhagen), Ana M. Valdes(King's College London), Anthony M. Vandersteen, Anette Varbo(University of Copenhagen), Parthiban Vijayarangakannan(Wellcome Sanger Institute), Peter M. Visscher(Translational Research Institute), Louise V. Wain(University of Leicester), James Walters(Cardiff University), Guangbiao Wang(BGI Group (China)), Jun Wang(BGI Group (China)), Yu Wang(BGI Group (China)), Kirsten Ward(King's College London), Eleanor Wheeler(Wellcome Sanger Institute), Peter H. Whincup(St George's, University of London), Tamieka Whyte(Great Ormond Street Hospital), Hywel Williams(University College London), Kathleen A. Williamson(Western General Hospital), Crispian Wilson(University of Cambridge), Scott G. Wilson(The University of Western Australia), Kim Wong(Wellcome Sanger Institute), Changjiang Xu(Jewish General Hospital), Jian Yang(Translational Research Institute), Gianluigi Zaza(University of Verona), Eleftheria Zeggini(Wellcome Sanger Institute), Feng Zhang(King's College London), Pingbo Zhang(BGI Group (China)), Weihua Zhang(St Mary's Hospital), Giovanni Gambaro(Agostino Gemelli University Polyclinic), J. Brent Richards(St Thomas' Hospital), Richard Durbin(Wellcome Sanger Institute), Nicholas J. Timpson(University of Bristol), Jonathan Marchini(Centre for Human Genetics), Nicole Soranzo(University of Cambridge)
Nature Communications
September 14, 2015
Cited by 384Open Access
Full Text

Abstract

Imputing genotypes from reference panels created by whole-genome sequencing (WGS) provides a cost-effective strategy for augmenting the single-nucleotide polymorphism (SNP) content of genome-wide arrays. The UK10K Cohorts project has generated a data set of 3,781 whole genomes sequenced at low depth (average 7x), aiming to exhaustively characterize genetic variation down to 0.1% minor allele frequency in the British population. Here we demonstrate the value of this resource for improving imputation accuracy at rare and low-frequency variants in both a UK and an Italian population. We show that large increases in imputation accuracy can be achieved by re-phasing WGS reference panels after initial genotype calling. We also present a method for combining WGS panels to improve variant coverage and downstream imputation accuracy, which we illustrate by integrating 7,562 WGS haplotypes from the UK10K project with 2,184 haplotypes from the 1000 Genomes Project. Finally, we introduce a novel approximation that maintains speed without sacrificing imputation accuracy for rare variants.


Related Papers

No related papers found

Powered by citation graph analysis