iRegulon: From a Gene List to a Gene Regulatory Network Using Large Motif and Track CollectionsIdentifying master regulators of biological processes and mapping their downstream gene networks are key challenges in systems biology. We developed a computational method, called iRegulon, to reverse-engineer the transcriptional regulatory network underlying a co-expressed gene set using cis-regulatory sequence analysis. iRegulon implements a genome-wide ranking-and-recovery approach to detect enriched transcription factor motifs and their optimal sets of direct targets. We increase the accuracy of network inference by using very large motif collections of up to ten thousand position weight matrices collected from various species, and linking these to candidate human TFs via a motif2TF procedure. We validate iRegulon on gene sets derived from ENCODE ChIP-seq data with increasing levels of noise, and we compare iRegulon with existing motif discovery methods. Next, we use iRegulon on more challenging types of gene lists, including microRNA target sets, protein-protein interaction networks, and genetic perturbation data. In particular, we over-activate p53 in breast cancer cells, followed by RNA-seq and ChIP-seq, and could identify an extensive up-regulated network controlled directly by p53. Similarly we map a repressive network with no indication of direct p53 regulation but rather an indirect effect via E2F and NFY. Finally, we generalize our computational framework to include regulatory tracks such as ChIP-seq data and show how motif and track discovery can be combined to map functional regulatory interactions among co-expressed genes. iRegulon is available as a Cytoscape plugin from http://iregulon.aertslab.org.
Alteration of the microRNA network during the progression of Alzheimer's diseasePierre Lau, Koen Bossers, Rekin’s Janky et al.|EMBO Molecular Medicine|2013 An overview of miRNAs altered in Alzheimer's disease (AD) was established by profiling the hippocampus of a cohort of 41 late-onset AD (LOAD) patients and 23 controls, showing deregulation of 35 miRNAs. Profiling of miRNAs in the prefrontal cortex of a second independent cohort of 49 patients grouped by Braak stages revealed 41 deregulated miRNAs. We focused on miR-132-3p which is strongly altered in both brain areas. Downregulation of this miRNA occurs already at Braak stages III and IV, before loss of neuron-specific miRNAs. Next-generation sequencing confirmed a strong decrease of miR-132-3p and of three family-related miRNAs encoded by the same miRNA cluster on chromosome 17. Deregulation of miR-132-3p in AD brain appears to occur mainly in neurons displaying Tau hyper-phosphorylation. We provide evidence that miR-132-3p may contribute to disease progression through aberrant regulation of mRNA targets in the Tau network. The transcription factor (TF) FOXO1a appears to be a key target of miR-132-3p in this pathway.
C. elegans ORFeome version 1.1: experimental verification of the genome annotation and resource for proteome-scale protein expressionIndependence of Repressive Histone Marks and Chromatin Compaction during Senescent Heterochromatic Layer FormationRSAT: regulatory sequence analysis toolsThe regulatory sequence analysis tools (RSAT, http://rsat.ulb.ac.be/rsat/) is a software suite that integrates a wide collection of modular tools for the detection of cis-regulatory elements in genome sequences. The suite includes programs for sequence retrieval, pattern discovery, phylogenetic footprint detection, pattern matching, genome scanning and feature map drawing. Random controls can be performed with random gene selections or by generating random sequences according to a variety of background models (Bernoulli, Markov). Beyond the original word-based pattern-discovery tools (oligo-analysis and dyad-analysis), we recently added a battery of tools for matrix-based detection of cis-acting elements, with some original features (adaptive background models, Markov-chain estimation of P-values) that do not exist in other matrix-based scanning tools. The web server offers an intuitive interface, where each program can be accessed either separately or connected to the other tools. In addition, the tools are now available as web services, enabling their integration in programmatic workflows. Genomes are regularly updated from various genome repositories (NCBI and EnsEMBL) and 682 organisms are currently supported. Since 1998, the tools have been used by several hundreds of researchers from all over the world. Several predictions made with RSAT were validated experimentally and published.