Integrative Analysis of the <i>Caenorhabditis elegans</i> Genome by the modENCODE ProjectWe systematically generated large-scale data sets to improve genome annotation for the nematode Caenorhabditis elegans, a key model organism. These data sets include transcriptome profiling across a developmental time course, genome-wide identification of transcription factor-binding sites, and maps of chromatin organization. From this, we created more complete and accurate gene models, including alternative splice forms and candidate noncoding RNAs. We constructed hierarchical networks of transcription factor-binding and microRNA interactions and discovered chromosomal locations bound by an unusually large number of transcription factors. Different patterns of chromatin composition and histone modification were revealed between chromosome arms and centers, with similarly prominent differences between autosomes and the X chromosome. Integrating data types, we built statistical models relating chromatin, transcription factor binding, and gene expression. Overall, our analyses ascribed putative functions to most of the conserved genome.
Broad chromosomal domains of histone modification patterns in <i>C. elegans</i>Chromatin immunoprecipitation identifies specific interactions between genomic DNA and proteins, advancing our understanding of gene-level and chromosome-level regulation. Based on chromatin immunoprecipitation experiments using validated antibodies, we define the genome-wide distributions of 19 histone modifications, one histone variant, and eight chromatin-associated proteins in Caenorhabditis elegans embryos and L3 larvae. Cluster analysis identified five groups of chromatin marks with shared features: Two groups correlate with gene repression, two with gene activation, and one with the X chromosome. The X chromosome displays numerous unique properties, including enrichment of monomethylated H4K20 and H3K27, which correlate with the different repressive mechanisms that operate in somatic tissues and germ cells, respectively. The data also revealed striking differences in chromatin composition between the autosomes and between chromosome arms and centers. Chromosomes I and III are globally enriched for marks of active genes, consistent with containing more highly expressed genes, compared to chromosomes II, IV, and especially V. Consistent with the absence of cytological heterochromatin and the holocentric nature of C. elegans chromosomes, markers of heterochromatin such as H3K9 methylation are not concentrated at a single region on each chromosome. Instead, H3K9 methylation is enriched on chromosome arms, coincident with zones of elevated meiotic recombination. Active genes in chromosome arms and centers have very similar histone mark distributions, suggesting that active domains in the arms are interspersed with heterochromatin-like structure. These data, which confirm and extend previous studies, allow for in-depth analysis of the organization and deployment of the C. elegans genome during development.