Human genetic variation database, a reference database of genetic variations in the Japanese population

Koichiro Higasa(Kyoto University), Noriko Miyake(Yokohama City University), Jun Yoshimura(The University of Tokyo), K. Okamura(National Center For Child Health and Development), Tetsuya Niihori(Tohoku University), Hirotomo Saitsu(Yokohama City University), Koichiro Doi(The University of Tokyo), Masakazu Shimizu(Kyoto University), Kazuhiko Nakabayashi(National Center For Child Health and Development), Yoko Aoki(Tohoku University), Yoshinori Tsurusaki(Yokohama City University), Shinichi Morishita(The University of Tokyo), Takahisa Kawaguchi(Kyoto University), Osuke Migita(St. Marianna University School of Medicine), Keiko Nakayama(Tohoku University), Mitsuko Nakashima(Yokohama City University), Jun Mitsui(The University of Tokyo), Maiko Narahara(Kyoto University), Keiko Hayashi(National Center For Child Health and Development), Ryo Funayama(Tohoku University), Daisuke Yamaguchi, Hiroyuki Ishiura(The University of Tokyo), Wen‐Ya Ko(Kyoto University), Kenichiro Hata(National Center For Child Health and Development), Takeshi Nagashima(Tohoku University), Ryo Yamada(Kyoto University), Yoichi Matsubara(Tohoku University), Akihiro Umezawa(National Center For Child Health and Development), Shoji Tsuji(The University of Tokyo), Naomichi Matsumoto(Yokohama City University), Fumihiko Matsuda(Kyoto University)
Journal of Human Genetics
February 25, 2016
Cited by 303Open Access
Full Text

Abstract

Whole-genome and -exome resequencing using next-generation sequencers is a powerful approach for identifying genomic variations that are associated with diseases. However, systematic strategies for prioritizing causative variants from many candidates to explain the disease phenotype are still far from being established, because the population-specific frequency spectrum of genetic variation has not been characterized. Here, we have collected exomic genetic variation from 1208 Japanese individuals through a collaborative effort, and aggregated the data into a prevailing catalog. In total, we identified 156 622 previously unreported variants. The allele frequencies for the majority (88.8%) were lower than 0.5% in allele frequency and predicted to be functionally deleterious. In addition, we have constructed a Japanese-specific major allele reference genome by which the number of unique mapping of the short reads in our data has increased 0.045% on average. Our results illustrate the importance of constructing an ethnicity-specific reference genome for identifying rare variants. All the collected data were centralized to a newly developed database to serve as useful resources for exploring pathogenic variations. Public access to the database is available at http://www.genome.med.kyoto-u.ac.jp/SnpDB/.


Related Papers

No related papers found

Powered by citation graph analysis