Reference genome sequence of the model plant SetariaCompletion of genome sequences for the diploid Setaria italica reveals features of C4 photosynthesis that could enable improvement of the polyploid biofuel crop switchgrass (Panicum virgatum). The genetic basis of biotechnologically relevant traits, including drought tolerance, photosynthetic efficiency and flowering control, is also highlighted. We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ∼400-Mb assembly covers ∼80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).
The genome of Eucalyptus grandisEucalypts are the world’s most widely planted hardwood trees. Their outstanding diversity, adaptability and growth have made them a global renewable resource of fibre and energy. We sequenced and assembled >94% of the 640-megabase genome of Eucalyptus grandis. Of 36,376 predicted protein-coding genes, 34% occur in tandem duplications, the largest proportion thus far in plant genomes. Eucalyptus also shows the highest diversity of genes for specialized metabolites such as terpenes that act as chemical defence and provide unique pharmaceutical oils. Genome sequencing of the E. grandis sister species E. globulus and a set of inbred E. grandis tree genomes reveals dynamic genome evolution and hotspots of inbreeding depression. The E. grandis genome is the first reference for the eudicot order Myrtales and is placed here sister to the eurosids. This resource expands our understanding of the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology. The Eucalyptus grandis genome has been sequenced, revealing the greatest number of tandem duplications of any plant genome sequenced so far, and the highest diversity of genes for specialized metabolites that act as chemical defence and provide unique pharmaceutical oils; genome sequencing of the sister species E. globulus and a set of inbred E. grandis tree genomes reveals dynamic genome evolution and hotspots of inbreeding depression. Fast-growing Eucalyptus trees form the basis of an international pulp, paper and chemical cellulose industry and they are also seen as potential biomass feedstocks for bioenergy and biomaterials. The genome of Eucalyptus grandis has now been sequenced. It contains the greatest number of tandem duplications so far found in a plant genome, as well as the highest diversity of genes for specialized metabolites that act as chemical defence and provide unique pharmaceutical oils. Comparison with the sister species E. globulus and with other E. grandis lines reveals dynamic genome evolution and hotspots of inbreeding depression. The availability of comprehensive genomic data will be of use in work on accelerating breeding cycles for productivity and wood quality and developing eucalypt strains suited to a variety of habitats.
Widespread noncoding circular <scp>RNA</scp>s in plantsChuyu Ye, Li Chen, Chen Liu et al.|New Phytologist|2015 A large number of noncoding circular RNAs (circRNAs) with regulatory potency have been identified in animals, but little attention has been given to plant circRNAs. We performed genome-wide identification of circRNAs in Oryza sativa and Arabidopsis thaliana using publically available RNA-Seq data, analyzed and compared features of plant and animal circRNAs. circRNAs (12037 and 6012) were identified in Oryza sativa and Arabidopsis thaliana, respectively, with 56% (10/18) of the sampled rice exonic circRNAs validated experimentally. Parent genes of over 700 exonic circRNAs were orthologues between rice and Arabidopsis, suggesting conservation of circRNAs in plants. The introns flanking plant circRNAs were much longer than introns from linear genes, and possessed less repetitive elements and reverse complementary sequences than the flanking introns of animal circRNAs. Plant circRNAs showed diverse expression patterns, and 27 rice exonic circRNAs were found to be differentially expressed under phosphate-sufficient and -starvation conditions. A significantly positive correlation was observed for the expression profiles of some circRNAs and their parent genes. Our results demonstrated that circRNAs are widespread in plants, revealed the common and distinct features of circRNAs between plants and animals, and suggested that circRNAs could be a critical class of noncoding regulators in plants.
Genome-wide identification and functional prediction of novel and drought-responsive lincRNAs in Populus trichocarpaPeng Shuai, Dan Liang, Sha Tang et al.|Journal of Experimental Botany|2014 Protein-coding genes are considered to be a dominant component of the eukaryotic transcriptome; however, many studies have shown that intergenic, non-coding transcripts also play an important role. Long intergenic non-coding RNAs (lincRNAs) were found to play a vital role in human and Arabidopsis. However, lincRNAs and their regulatory roles remain poorly characterized in woody plants, especially Populus trichocarpa (P. trichocarpa). A large set of Populus RNA-Seq data were examined with high sequencing depth under control and drought conditions and a total of 2542 lincRNA candidates were identified. In total, 51 lincRNAs and 20 lincRNAs were identified as putative targets and target mimics of known Populus miRNAs, respectively. A total of 504 lincRNAs were found to be drought responsive, eight of which were confirmed by RT-qPCR. These findings provide a comprehensive view of Populus lincRNAs, which will enable in-depth functional analysis.
Echinochloa crus-galli genome analysis provides insight into its adaptation and invasiveness as a weedLongbiao Guo, Jie Qiu, Chuyu Ye et al.|Nature Communications|2017 Barnyardgrass (Echinochloa crus-galli) is a pernicious weed in agricultural fields worldwide. The molecular mechanisms underlying its success in the absence of human intervention are presently unknown. Here we report a draft genome sequence of the hexaploid species E. crus-galli, i.e., a 1.27 Gb assembly representing 90.7% of the predicted genome size. An extremely large repertoire of genes encoding cytochrome P450 monooxygenases and glutathione S-transferases associated with detoxification are found. Two gene clusters involved in the biosynthesis of an allelochemical 2,4-dihydroxy-7-methoxy-1,4-benzoxazin-3-one (DIMBOA) and a phytoalexin momilactone A are found in the E. crus-galli genome, respectively. The allelochemical DIMBOA gene cluster is activated in response to co-cultivation with rice, while the phytoalexin momilactone A gene cluster specifically to infection by pathogenic Pyricularia oryzae. Our results provide a new understanding of the molecular mechanisms underlying the extreme adaptation of the weed.