The Sorghum bicolor genome and the diversification of grasses
Abstract
Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the ∼730-megabase Sorghum bicolor (L.) Moench genome, placing ∼98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the ∼75% larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidization ∼70 million years ago, most duplicated gene sets lost one member before the sorghum–rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24% of genes are grass-specific and 7% are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum’s drought tolerance. The Sorghum bicolor genome sequence is published this week. Sorghum is a cereal grown widely as food, animal feed, fibre and fuel. Tolerant to hot, dry conditions, it is a staple for large populations in the West African Sahel region. Comparisons of the genome with those of maize and rice shed light on the evolution of grasses and of C4 photosynthesis, which is particularly efficient at assimilating carbon at high temperatures. In addition, protein coding genes and miRNAs that could contribute to sorghum's drought tolerance may also be found. Sorghum yield improvement has lagged behind that of other crops and the availability of the genome sequence could provide a vital boost to work on its improvement. Sorghum is an African grass that is grown for food, animal feed and fuel. The current paper presents an initial analysis of the ∼730 megabase genome of Sorghum bicolor. Genome analysis and its comparison with maize and rice shed light on grass genome evolution and also provide insights into the evolution of C4 photosynthesis, as well as protein coding genes and miRNAs that might contribute to sorghum's drought tolerance.