Pan-genome analysis highlights the extent of genomic variation in cultivated and wild rice

Qiang Zhao(Chinese Academy of Sciences), Qi Feng(Chinese Academy of Sciences), Hengyun Lu(Chinese Academy of Sciences), Yan Li(Chinese Academy of Sciences), Ahong Wang(Chinese Academy of Sciences), Qilin Tian(Chinese Academy of Sciences), Qilin Zhan(Chinese Academy of Sciences), Yiqi Lu(Chinese Academy of Sciences), Lei Zhang(Chinese Academy of Sciences), Tao Huang(Chinese Academy of Sciences), Yongchun Wang(Chinese Academy of Sciences), Danlin Fan(Chinese Academy of Sciences), Yan Zhao(Chinese Academy of Sciences), Ziqun Wang(Chinese Academy of Sciences), Congcong Zhou(Chinese Academy of Sciences), Jiaying Chen(Chinese Academy of Sciences), Chuanrang Zhu(Chinese Academy of Sciences), Wenjun Li(Chinese Academy of Sciences), Qijun Weng(Chinese Academy of Sciences), Qun Xu(Chinese Academy of Agricultural Sciences), Zi-Xuan Wang(Chinese Academy of Sciences), Xinghua Wei(Chinese Academy of Agricultural Sciences), Bin Han(Chinese Academy of Sciences), Xuehui Huang(Shanghai Normal University)
Nature Genetics
January 9, 2018
Cited by 690Open Access
Full Text

Abstract

The rich genetic diversity in Oryza sativa and Oryza rufipogon serves as the main sources in rice breeding. Large-scale resequencing has been undertaken to discover allelic variants in rice, but much of the information for genetic variation is often lost by direct mapping of short sequence reads onto the O. sativa japonica Nipponbare reference genome. Here we constructed a pan-genome dataset of the O. sativa–O. rufipogon species complex through deep sequencing and de novo assembly of 66 divergent accessions. Intergenomic comparisons identified 23 million sequence variants in the rice genome. This catalog of sequence variations includes many known quantitative trait nucleotides and will be helpful in pinpointing new causal variants that underlie complex traits. In particular, we systemically investigated the whole set of coding genes using this pan-genome data, which revealed extensive presence and absence of variation among rice accessions. This pan-genome resource will further promote evolutionary and functional studies in rice. A pan-genome dataset of the Oryza sativa–Oryza rufipogon species complex generated through deep sequencing and de novo genome assembly of 66 divergent accessions will be helpful in pinpointing new causal variants underlying complex traits and in promoting evolutionary and functional studies in rice.


Related Papers

No related papers found

Powered by citation graph analysis