The diploid genome sequence of an Asian individual

Jun Wang(BGI Group (China)), Wei Wang(Beijing Institute of Genomics), Ruiqiang Li(Beijing Institute of Genomics), Yingrui Li(BGI Group (China)), Geng Tian(BGI Group (China)), Laurie Goodman(BGI Group (China)), Wei Fan(BGI Group (China)), Junqing Zhang(BGI Group (China)), Jun Li(BGI Group (China)), Juanbin Zhang(BGI Group (China)), Yiran Guo(BGI Group (China)), Binxiao Feng(BGI Group (China)), Heng Li(Wellcome Sanger Institute), Yao Lu(BGI Group (China)), Xiaodong Fang(BGI Group (China)), Huiqing Liang(BGI Group (China)), Zhenglin Du(BGI Group (China)), Dong Li(BGI Group (China)), Yiqing Zhao(BGI Group (China)), Yujie Hu(BGI Group (China)), Zhenzhen Yang(BGI Group (China)), Hancheng Zheng(University of Southern Denmark), Ines Hellmann(University of California, Berkeley), Michael Inouye(Wellcome Sanger Institute), John E. Pool(University of California, Berkeley), Xin Yi(BGI Group (China)), Jing Zhao(BGI Group (China)), Jinjie Duan(BGI Group (China)), Yan Zhou(BGI Group (China)), Junjie Qin(University of Chinese Academy of Sciences), Lijia Ma(University of Chinese Academy of Sciences), Guoqing Li(BGI Group (China)), Zhentao Yang(BGI Group (China)), Guojie Zhang(University of Chinese Academy of Sciences), Bin Yang(BGI Group (China)), Chang Yu(BGI Group (China)), Liang Fang(University of Chinese Academy of Sciences), Wenjie Li(BGI Group (China)), Shaochuan Li(Shenzhen University), Dawei Li(BGI Group (China)), Peixiang Ni(BGI Group (China)), Jue Ruan(BGI Group (China)), Qibin Li(University of Chinese Academy of Sciences), Hongmei Zhu(BGI Group (China)), Dongyuan Liu(BGI Group (China)), Zhike Lu(BGI Group (China)), Ning Li(University of Chinese Academy of Sciences), Guangwu Guo(BGI Group (China)), Jianguo Zhang(BGI Group (China)), Jia Ye(BGI Group (China)), Lin Fang(BGI Group (China)), Qin Hao(BGI Group (China)), Quan Chen(Peking University), Yu Liang(University of Chinese Academy of Sciences), Yeyang Su(BGI Group (China)), Amor A. San Juan(University of Chinese Academy of Sciences), Cuo Ping(BGI Group (China)), Shuang Yang(BGI Group (China)), Fang Chen(BGI Group (China)), Li Li(BGI Group (China)), Ke Zhou(BGI Group (China)), Hongkun Zheng(University of Southern Denmark), Yuanyuan Ren(BGI Group (China)), Ling Yang(BGI Group (China)), Yang Gao(Beijing Institute of Genomics), Guohua Yang(Shenzhen University), Zhuo Li(BGI Group (China)), Xiaoli Feng(BGI Group (China)), Karsten Kristiansen(University of Southern Denmark), Gane Ka‐Shu Wong(BGI Group (China)), Rasmus Nielsen(University of California, Berkeley), Richard Durbin(Wellcome Sanger Institute), Lars Bolund(Aarhus University), Xiuqing Zhang(BGI Group (China)), Songgang Li(Shenzhen University), Huanming Yang(Beijing Institute of Genomics), Jian Wang(BGI Group (China))
Nature
November 1, 2008
Cited by 960Open Access
Full Text

Abstract

Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual’s genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics. The power of the latest massively parallel synthetic DNA sequencing technologies is demonstrated in two major collaborations that shed light on the nature of genomic variation with ethnicity. The first describes the genomic characterization of an individual from the Yoruba ethnic group of west Africa. The second reports a personal genome of a Han Chinese, the group comprising 30% of the world's population. These new resources can now be used in conjunction with the Venter, Watson and NIH reference sequences. A separate study looked at genetic ethnicity on the continental scale, based on data from 1,387 individuals from more than 30 European countries. Overall there was little genetic variation between countries, but the differences that do exist correspond closely to the geographic map. Statistical analysis of the genome data places 50% of the individuals within 310 km of their reported origin. As well as its relevance for testing genetic ancestry, this work has implications for evaluating genome-wide association studies that link genes with diseases.


Related Papers

No related papers found

Powered by citation graph analysis