Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data

Yoshihiro Kawahara(Institute of Agrobiological Sciences), M. de la Bastide(Cold Spring Harbor Laboratory), John P. Hamilton(Michigan State University), Hiroyuki Kanamori(Institute of Agrobiological Sciences), W. Richard McCombie(Cold Spring Harbor Laboratory), Shu Ouyang(PerkinElmer (United States)), David C. Schwartz(University of Wisconsin–Madison), Tsuyoshi Tanaka(Institute of Agrobiological Sciences), Jianzhong Wu(Institute of Agrobiological Sciences), Shiguo Zhou(University of Wisconsin–Madison), Kevin L. Childs(Michigan State University), Rebecca M. Davidson(National Jewish Health), Haining Lin(Pioneer Hi-Bred), L. M. Quesada-Ocampo(Michigan State University), Brieanne Vaillancourt(Michigan State University), Hiroaki Sakai(Institute of Agrobiological Sciences), Sung Shin Lee(Institute of Agrobiological Sciences), Jungsok Kim(Institute of Agrobiological Sciences), Hisataka Numa(Institute of Agrobiological Sciences), Takeshi Itoh(Institute of Agrobiological Sciences), C. Robin Buell(Michigan State University), Takashi Matsumoto(Institute of Agrobiological Sciences)
Rice
February 6, 2013
Cited by 2,475Open Access
Full Text

Abstract

BACKGROUND: Rice research has been enabled by access to the high quality reference genome sequence generated in 2005 by the International Rice Genome Sequencing Project (IRGSP). To further facilitate genomic-enabled research, we have updated and validated the genome assembly and sequence for the Nipponbare cultivar of Oryza sativa (japonica group). RESULTS: The Nipponbare genome assembly was updated by revising and validating the minimal tiling path of clones with the optical map for rice. Sequencing errors in the revised genome assembly were identified by re-sequencing the genome of two different Nipponbare individuals using the Illumina Genome Analyzer II/IIx platform. A total of 4,886 sequencing errors were identified in 321 Mb of the assembled genome indicating an error rate in the original IRGSP assembly of only 0.15 per 10,000 nucleotides. A small number (five) of insertions/deletions were identified using longer reads generated using the Roche 454 pyrosequencing platform. As the re-sequencing data were generated from two different individuals, we were able to identify a number of allelic differences between the original individual used in the IRGSP effort and the two individuals used in the re-sequencing effort. The revised assembly, termed Os-Nipponbare-Reference-IRGSP-1.0, is now being used in updated releases of the Rice Annotation Project and the Michigan State University Rice Genome Annotation Project, thereby providing a unified set of pseudomolecules for the rice community. CONCLUSIONS: A revised, error-corrected, and validated assembly of the Nipponbare cultivar of rice was generated using optical map data, re-sequencing data, and manual curation that will facilitate on-going and future research in rice. Detection of polymorphisms between three different Nipponbare individuals highlights that allelic differences between individuals should be considered in diversity studies.


Related Papers