Rice Annotation Project Database (RAP-DB): An Integrative and Interactive Database for Rice GenomicsHiroaki Sakai, Sung Shin Lee, Tsuyoshi Tanaka et al.|Plant and Cell Physiology|2013 The Rice Annotation Project Database (RAP-DB, http://rapdb.dna.affrc.go.jp/) has been providing a comprehensive set of gene annotations for the genome sequence of rice, Oryza sativa (japonica group) cv. Nipponbare. Since the first release in 2005, RAP-DB has been updated several times along with the genome assembly updates. Here, we present our newest RAP-DB based on the latest genome assembly, Os-Nipponbare-Reference-IRGSP-1.0 (IRGSP-1.0), which was released in 2011. We detected 37,869 loci by mapping transcript and protein sequences of 150 monocot species. To provide plant researchers with highly reliable and up to date rice gene annotations, we have been incorporating literature-based manually curated data, and 1,626 loci currently incorporate literature-based annotation data, including commonly used gene names or gene symbols. Transcriptional activities are shown at the nucleotide level by mapping RNA-Seq reads derived from 27 samples. We also mapped the Illumina reads of a Japanese leading japonica cultivar, Koshihikari, and a Chinese indica cultivar, Guangluai-4, to the genome and show alignments together with the single nucleotide polymorphisms (SNPs) and gene functional annotations through a newly developed browser, Short-Read Assembly Browser (S-RAB). We have developed two satellite databases, Plant Gene Family Database (PGFD) and Integrative Database of Cereal Gene Phylogeny (IDCGP), which display gene family and homologous gene relationships among diverse plant species. RAP-DB and the satellite databases offer simple and user-friendly web interfaces, enabling plant and genome researchers to access the data easily and facilitating a broad range of plant research topics.
Analysis of the Set of GABAA Receptor Genes in the Human GenomeJoseph Simon, Hironobu Wakimoto, Norihisa Fujita et al.|Journal of Biological Chemistry|2004 The genes of the ionotropic gamma-aminobutyric acid receptor (GABR) subunits have shown an unusual chromosomal clustering, but only now can this be fully specified by analyses of the human genome. We have characterized the genes encoding the 18 known human GABR subunits, plus one now located here, for their precise locations, sizes, and exon/intron structures. Clusters of 17 of the 19, distributed between five chromosomes, are specified in detail, and their possible significance is considered. By applying search algorithms designed to recognize sequences of all known GABR-type subunits in species from man down to nematodes, we found no new GABR subunit is detectable in the human genome. However, the sequence of the human orthologue of the rat GABR rho3 receptor subunit was uncovered by these algorithms, and its gene could be analyzed. Consistent with those search results, orthologues of the beta4 and gamma4 subunits from the chicken, not cloned from mammals, were not detectable in the human genome by specific searches for them. The relationships are consistent with the mammalian subunit being derived from the beta line and epsilon from the gamma line, with mammalian loss of beta4 and gamma4. In their structures the human GABR genes show a basic pattern of nine coding exons, with six different genomic mechanisms for the alternative splicing found in various subunits. Additional noncoding exons occur for certain subunits, which can be regulatory. A dicysteine loop and its exon show remarkable constancy between all GABR subunits and species, of deduced functional significance.
TENOR: Database for Comprehensive mRNA-Seq Experiments in RiceHere we present TENOR (Transcriptome ENcyclopedia Of Rice, http://tenor.dna.affrc.go.jp), a database that encompasses large-scale mRNA sequencing (mRNA-Seq) data obtained from rice under a wide variety of conditions. Since the elucidation of the ability of plants to adapt to various growing conditions is a key issue in plant sciences, it is of great interest to understand the regulatory networks of genes responsible for environmental changes. We used mRNA-Seq and performed a time-course transcriptome analysis of rice, Oryza sativa L. (cv. Nipponbare), under 10 abiotic stress conditions (high salinity; high and low phosphate; high, low and extremely low cadmium; drought; osmotic; cold; and flood) and two plant hormone treatment conditions (ABA and jasmonic acid). A large number of genes that were responsive to abiotic stresses and plant hormones were detected by differential expression analysis. Furthermore, several responsive genes were found to encode transcription factors that could control the transcriptional network of stress responses, but the timing of the induction of these genes was not uniform across conditions. A significant number of cis-regulatory elements were enriched in the promoter regions of the responsive genes and were shared among conditions. These data suggest that some key components of gene regulation networks are shared between different stress signaling pathways. All the resources (novel genes identified from mRNA-Seq data, expression profiles, co-expressed genes and cis-regulatory elements) can be searched for and are available in TENOR.
Massive parallel sequencing of mRNA in identification of unannotated salinity stress-inducible transcripts in rice (Oryza sativa L.)BACKGROUND: Microarray technology is limited to monitoring the expression of previously annotated genes that have corresponding probes on the array. Computationally annotated genes have not fully been validated, because ESTs and full-length cDNAs cannot cover entire transcribed regions. Here, mRNA-Seq (an Illumina cDNA sequencing application) was used to monitor whole mRNAs of salinity stress-treated rice tissues. RESULTS: Thirty-six-base-pair reads from whole mRNAs were mapped to the rice genomic sequence: 72.0% to 75.2% were mapped uniquely to the genome, and 5.0% to 5.7% bridged exons. From the piling up of short reads mapped on the genome, a series of programs (Bowtie, TopHat, and Cufflinks) comprehensively predicted 51,301 (shoot) and 54,491 (root) transcripts, including 2,795 (shoot) and 3,082 (root) currently unannotated in the Rice Annotation Project database. Of these unannotated transcripts, 995 (shoot) and 1,052 (root) had ORFs similar to those encoding the amino acid sequences of functional proteins in a BLASTX search against UniProt and RefSeq databases. Among the unannotated genes, 213 (shoot) and 436 (root) were differentially expressed in response to salinity stress. Sequence-based and array-based measurements of the expression ratios of previously annotated genes were highly correlated. CONCLUSION: Unannotated transcripts were identified on the basis of the piling up of mapped reads derived from mRNAs in rice. Some of these unannotated transcripts encoding putative functional proteins were expressed differentially in response to salinity stress.
Retrogenes in Rice (Oryza sativa L. ssp. japonica) Exhibit Correlated Expression with Their Source GenesHiroaki Sakai, Hiroshi Mizuno, Yoshihiro Kawahara et al.|Genome Biology and Evolution|2011 Gene duplication occurs by either DNA- or RNA-based processes; the latter duplicates single genes via retroposition of messenger RNA. The expression of a retroposed gene copy (retrocopy) is expected to be uncorrelated with its source gene because upstream promoter regions are usually not part of the retroposition process. In contrast, DNA-based duplication often encompasses both the coding and the intergenic (promoter) regions; hence, expression is often correlated, at least initially, between DNA-based duplicates. In this study, we identified 150 retrocopies in rice (Oryza sativa L. ssp japonica), most of which represent ancient retroposition events. We measured their expression from high-throughput RNA sequencing (RNAseq) data generated from seven tissues. At least 66% of the retrocopies were expressed but at lower levels than their source genes. However, the tissue specificity of retrogenes was similar to their source genes, and expression between retrocopies and source genes was correlated across tissues. The level of correlation was similar between RNA- and DNA-based duplicates, and they decreased over time at statistically indistinguishable rates. We extended these observations to previously identified retrocopies in Arabidopsis thaliana, suggesting they may be general features of the process of retention of plant retrogenes.