Telomere-to-telomere assembly of a complete human X chromosome

Karen H. Miga(University of California, Santa Cruz), Sergey Koren(National Institutes of Health), Arang Rhie(National Institutes of Health), Mitchell R. Vollger(University of Washington), Ariel Gershman(Johns Hopkins University), Andrey V. Bzikadze(University of California San Diego), Shelise Brooks(National Institutes of Health), Edmund Howe(Stowers Institute for Medical Research), David Porubskỳ(University of Washington), Glennis A. Logsdon(University of Washington), Valérie Schneider(National Institutes of Health), Tamara Potapova(Stowers Institute for Medical Research), Jonathan Wood(Wellcome Sanger Institute), William Chow(Wellcome Sanger Institute), Joel Armstrong(University of California, Santa Cruz), Jeanne Fredrickson(University of Washington), Evgenia Pak(National Institutes of Health), Kristof Tigyi(University of California, Santa Cruz), Milinn Kremitzki(James S. McDonnell Foundation), Christopher Markovic(James S. McDonnell Foundation), Valerie V. Maduro(National Institutes of Health), Amalia Dutra(National Institutes of Health), Gerard G. Bouffard(National Institutes of Health), Alexander M. Chang(National Institutes of Health), Nancy F. Hansen(National Institutes of Health), Amy B. Wilfert(University of Washington), Françoise Thibaud‐Nissen(National Institutes of Health), Anthony D. Schmitt(Arima Genomics (United States)), Jon-Matthew Belton(Arima Genomics (United States)), Siddarth Selvaraj(Arima Genomics (United States)), Megan Y. Dennis(University of California, Davis), Daniela C. Soto(University of California, Davis), Ruta Sahasrabudhe(University of California, Davis), Gulhan Kaya(University of California, Davis), Josh Quick(University of Birmingham), Nicholas J. Loman(University of Birmingham), Nadine Holmes(University of Nottingham), Matthew Loose(University of Nottingham), Urvashi Surti(University of Pittsburgh), Rosa Ana Risques(University of Washington), Tina Lindsay(James S. McDonnell Foundation), Robert S. Fulton(James S. McDonnell Foundation), Ira M. Hall(James S. McDonnell Foundation), Benedict Paten(University of California, Santa Cruz), Kerstin Howe(Wellcome Sanger Institute), Winston Timp(Johns Hopkins University), Alice Young(National Institutes of Health), James C. Mullikin(National Institutes of Health), Pavel A. Pevzner(University of California San Diego), Jennifer L. Gerton(Stowers Institute for Medical Research), Beth A. Sullivan(Duke Medical Center), Evan E. Eichler(Howard Hughes Medical Institute), Adam M. Phillippy(National Institutes of Health)
Nature
July 14, 2020
Cited by 829Open Access
Full Text

Abstract

Abstract After two decades of improvements, the current human reference genome (GRCh38) is the most accurate and complete vertebrate genome ever produced. However, no single chromosome has been finished end to end, and hundreds of unresolved gaps persist 1,2 . Here we present a human genome assembly that surpasses the continuity of GRCh38 2 , along with a gapless, telomere-to-telomere assembly of a human chromosome. This was enabled by high-coverage, ultra-long-read nanopore sequencing of the complete hydatidiform mole CHM13 genome, combined with complementary technologies for quality improvement and validation. Focusing our efforts on the human X chromosome 3 , we reconstructed the centromeric satellite DNA array (approximately 3.1 Mb) and closed the 29 remaining gaps in the current reference, including new sequences from the human pseudoautosomal regions and from cancer-testis ampliconic gene families (CT-X and GAGE). These sequences will be integrated into future human reference genome releases. In addition, the complete chromosome X, combined with the ultra-long nanopore data, allowed us to map methylation patterns across complex tandem repeats and satellite arrays. Our results demonstrate that finishing the entire human genome is now within reach, and the data presented here will facilitate ongoing efforts to complete the other human chromosomes.


Related Papers

No related papers found

Powered by citation graph analysis