The Genomes of Oryza sativa: A History of Duplications

Jun Yu(BGI Group (China)), Jun Wang(BGI Group (China)), Wei Lin(BGI Group (China)), Songgang Li(BGI Group (China)), Heng Li(BGI Group (China)), Jun Zhou(BGI Group (China)), Peixiang Ni(BGI Group (China)), Wei Dong(BGI Group (China)), Songnian Hu(Beijing Institute of Genomics), Changqing Zeng(BGI Group (China)), Jianguo Zhang(BGI Group (China)), Yong Zhang(BGI Group (China)), Ruiqiang Li(BGI Group (China)), Zuyuan Xu(BGI Group (China)), Shengting Li(BGI Group (China)), Xianran Li(BGI Group (China)), Hongkun Zheng(BGI Group (China)), Lijuan Cong(BGI Group (China)), Liang Lin(BGI Group (China)), Jianning Yin(BGI Group (China)), Jianing Geng(BGI Group (China)), Guangyuan Li(BGI Group (China)), Jianping Shi(BGI Group (China)), Juan Liu(BGI Group (China)), Hong Lv(BGI Group (China)), Jun Li(BGI Group (China)), Jing Wang(BGI Group (China)), Yajun Deng(BGI Group (China)), Longhua Ran(Beijing Computing Center), Xiaoli Shi(BGI Group (China)), Xiyin Wang(BGI Group (China)), Qingfa Wu(BGI Group (China)), Changfeng Li(BGI Group (China)), Xiaoyu Ren(BGI Group (China)), Jingqiang Wang(BGI Group (China)), Xiaoling Wang(BGI Group (China)), Dawei Li(BGI Group (China)), Dongyuan Liu(BGI Group (China)), Xiaowei Zhang(BGI Group (China)), Zhendong Ji(BGI Group (China)), Wenming Zhao(BGI Group (China)), Yongqiao Sun(BGI Group (China)), Zhenpeng Zhang(BGI Group (China)), Jingyue Bao(BGI Group (China)), Yujun Han(BGI Group (China)), Lingli Dong(BGI Group (China)), Jia Ji(BGI Group (China)), Peng Chen(BGI Group (China)), Shuming Wu(BGI Group (China)), Jinsong Liu(BGI Group (China)), Ying Xiao(BGI Group (China)), Dongbo Bu(Chinese Academy of Sciences), Jianlong Tan(Chinese Academy of Sciences), Yang Li(BGI Group (China)), Chen Ye(BGI Group (China)), Jingfen Zhang(Chinese Academy of Sciences), Jingyi Xu(Chinese Academy of Sciences), Yan Zhou(Beijing Institute of Genomics), Yingpu Yu(Beijing Institute of Genomics), Bing Zhang(Beijing Institute of Genomics), Shulin Zhuang(Beijing Institute of Genomics), Haibin Wei(Beijing Institute of Genomics), Bin Liu(BGI Group (China)), Lei Meng(BGI Group (China)), Hong Yu(National Institute of Genomic Medicine), Yuanzhe Li(BGI Group (China)), Hao Xu(Beijing Institute of Genomics), Shulin Wei(BGI Group (China)), Ximiao He(BGI Group (China)), Lijun Fang(Beijing Institute of Genomics), Zengjin Zhang(BGI Group (China)), Yunze Zhang(BGI Group (China)), Xiangang Huang(BGI Group (China)), Zhixi Su(Beijing Institute of Genomics), Wei Tong(BGI Group (China)), Jinhong Li(National Institute of Genomic Medicine), Zongzhong Tong(BGI Group (China)), Shuangli Li(BGI Group (China)), Jia Ye(Beijing Institute of Genomics), Li‐Shun Wang(BGI Group (China)), Lin Fang(BGI Group (China)), Tingting Lei(BGI Group (China)), Chen Chen(BGI Group (China)), Huan Chen(Beijing Institute of Genomics), Zhao Xu(BGI Group (China)), Haihong Li(BGI Group (China)), Haiyan Huang(BGI Group (China)), Feng Zhang(BGI Group (China)), Huayong Xu(Beijing Institute of Genomics), Na Li(BGI Group (China)), Caifeng Zhao(BGI Group (China)), Shuting Li(BGI Group (China)), Lijun Dong(BGI Group (China)), Yanqing Huang(BGI Group (China)), Long Li(BGI Group (China)), Yan Xi(BGI Group (China)), Qiuhui Qi(BGI Group (China)), Wenjie Li(BGI Group (China)), Bo Zhang(BGI Group (China)), Wei Hu(BGI Group (China)), Yanling Zhang(BGI Group (China)), Xiangjun Tian(Beijing Institute of Genomics), Yong-Zhi Jiao(BGI Group (China)), Xiaohu Liang(BGI Group (China)), Jiao Jin(BGI Group (China)), Lei Gao(BGI Group (China)), Wei‐Mou Zheng(BGI Group (China)), Bailin Hao(BGI Group (China)), Siqi Liu(BGI Group (China)), Wen Wang(Kunming Institute of Zoology), Longping Yuan(Beijing Institute of Genomics), Mengliang Cao(Beijing Institute of Genomics), Jason McDermott(University of Washington), Ram Samudrala(University of Washington), Jian Wang(BGI Group (China)), Gane Ka‐Shu Wong(BGI Group (China)), Huanming Yang(BGI Group (China))
PLoS Biology
January 21, 2005
Cited by 1,037Open Access
Full Text

Abstract

We report improved whole-genome shotgun sequences for the genomes of indica and japonica rice, both with multimegabase contiguity, or almost 1,000-fold improvement over the drafts of 2002. Tested against a nonredundant collection of 19,079 full-length cDNAs, 97.7% of the genes are aligned, without fragmentation, to the mapped super-scaffolds of one or the other genome. We introduce a gene identification procedure for plants that does not rely on similarity to known genes to remove erroneous predictions resulting from transposable elements. Using the available EST data to adjust for residual errors in the predictions, the estimated gene count is at least 38,000-40,000. Only 2%-3% of the genes are unique to any one subspecies, comparable to the amount of sequence that might still be missing. Despite this lack of variation in gene content, there is enormous variation in the intergenic regions. At least a quarter of the two sequences could not be aligned, and where they could be aligned, single nucleotide polymorphism (SNP) rates varied from as little as 3.0 SNP/kb in the coding regions to 27.6 SNP/kb in the transposable elements. A more inclusive new approach for analyzing duplication history is introduced here. It reveals an ancient whole-genome duplication, a recent segmental duplication on Chromosomes 11 and 12, and massive ongoing individual gene duplications. We find 18 distinct pairs of duplicated segments that cover 65.7% of the genome; 17 of these pairs date back to a common time before the divergence of the grasses. More important, ongoing individual gene duplications provide a never-ending source of raw material for gene genesis and are major contributors to the differences between members of the grass family.


Related Papers