Integrative Analysis of the <i>Caenorhabditis elegans</i> Genome by the modENCODE ProjectWe systematically generated large-scale data sets to improve genome annotation for the nematode Caenorhabditis elegans, a key model organism. These data sets include transcriptome profiling across a developmental time course, genome-wide identification of transcription factor-binding sites, and maps of chromatin organization. From this, we created more complete and accurate gene models, including alternative splice forms and candidate noncoding RNAs. We constructed hierarchical networks of transcription factor-binding and microRNA interactions and discovered chromosomal locations bound by an unusually large number of transcription factors. Different patterns of chromatin composition and histone modification were revealed between chromosome arms and centers, with similarly prominent differences between autosomes and the X chromosome. Integrating data types, we built statistical models relating chromatin, transcription factor binding, and gene expression. Overall, our analyses ascribed putative functions to most of the conserved genome.
The Landscape of <i>C. elegans</i> 3′UTRsThree-prime untranslated regions (3'UTRs) of metazoan messenger RNAs (mRNAs) contain numerous regulatory elements, yet remain largely uncharacterized. Using polyA capture, 3' rapid amplification of complementary DNA (cDNA) ends, full-length cDNAs, and RNA-seq, we defined approximately 26,000 distinct 3'UTRs in Caenorhabditis elegans for approximately 85% of the 18,328 experimentally supported protein-coding genes and revised approximately 40% of gene models. Alternative 3'UTR isoforms are frequent, often differentially expressed during development. Average 3'UTR length decreases with animal age. Surprisingly, no polyadenylation signal (PAS) was detected for 13% of polyadenylation sites, predominantly among shorter alternative isoforms. Trans-spliced (versus non-trans-spliced) mRNAs possess longer 3'UTRs and frequently contain no PAS or variant PAS. We identified conserved 3'UTR motifs, isoform-specific predicted microRNA target sites, and polyadenylation of most histone genes. Our data reveal a rich complexity of 3'UTRs, both genome-wide and throughout development.
Genome Architecture and Evolution of a Unichromosomal Asexual NematodeDevelopmental dynamics of gene expression and alternative polyadenylation in the Caenorhabditis elegans germlineBACKGROUND: The 3' untranslated regions (UTRs) of mRNAs play a major role in post-transcriptional regulation of gene expression. Selection of transcript cleavage and polyadenylation sites is a dynamic process that produces multiple transcript isoforms for the same gene within and across different cell types. Using LITE-Seq, a new quantitative method to capture transcript 3' ends expressed in vivo, we have characterized sex- and cell type-specific transcriptome-wide changes in gene expression and 3'UTR diversity in Caenorhabditis elegans germline cells undergoing proliferation and differentiation. RESULTS: We show that nearly half of germline transcripts are alternatively polyadenylated, that differential regulation of endogenous 3'UTR variants is common, and that alternative isoforms direct distinct spatiotemporal protein expression patterns in vivo. Dynamic expression profiling also reveals temporal regulation of X-linked gene expression, selective stabilization of transcripts, and strong evidence for a novel developmental program that promotes nucleolar dissolution in oocytes. We show that the RNA-binding protein NCL-1/Brat is a posttranscriptional regulator of numerous ribosome-related transcripts that acts through specific U-rich binding motifs to down-regulate mRNAs encoding ribosomal protein subunits, rRNA processing factors, and tRNA synthetases. CONCLUSIONS: These results highlight the pervasive nature and functional potential of patterned gene and isoform expression during early animal development.
Novel LOTUS-domain proteins are organizational hubs that recruit C. elegans Vasa to germ granulesgerm granule components that interact with the intrinsically disordered MEG-3 protein. These proteins promote P granule condensation, form granules independently of MEG-3 in the postembryonic germ line, and balance each other in regulating P granule growth and localization. MIP-1 and MIP-2 each contain two LOTUS domains and intrinsically disordered regions and form homo- and heterodimers. They bind and anchor the Vasa homolog GLH-1 within P granules and are jointly required for coalescence of MEG-3, GLH-1, and PGL proteins. Animals lacking MIP-1 and MIP-2 show temperature-sensitive embryonic lethality, sterility, and mortal germ lines. Germline phenotypes include defects in stem cell self-renewal, meiotic progression, and gamete differentiation. We propose that these proteins serve as scaffolds and organizing centers for ribonucleoprotein networks within P granules that help recruit and balance essential RNA processing machinery to regulate key developmental transitions in the germ line.