K

Kristy S. Mualim

Carnegie Institution for Science

ORCID: 0000-0002-6898-3205

Publishes on RNA Research and Splicing, RNA and protein synthesis mechanisms, Species Distribution and Climate Change. 12 papers and 990 citations.

12Publications
990Total Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

An encyclopedia of enhancer-gene regulatory interactions in the human genome
Andreas R. Gschwind, Kristy S. Mualim, Alireza Karbalayghareh et al.|bioRxiv (Cold Spring Harbor Laboratory)|2023
Cited by 115Open Access

Abstract Identifying transcriptional enhancers and their target genes is essential for understanding gene regulation and the impact of human genetic variation on disease 1–6 . Here we create and evaluate a resource of >13 million enhancer-gene regulatory interactions across 352 cell types and tissues, by integrating predictive models, measurements of chromatin state and 3D contacts, and large-scale genetic perturbations generated by the ENCODE Consortium 7 . We first create a systematic benchmarking pipeline to compare predictive models, assembling a dataset of 10,411 element-gene pairs measured in CRISPR perturbation experiments, >30,000 fine-mapped eQTLs, and 569 fine-mapped GWAS variants linked to a likely causal gene. Using this framework, we develop a new predictive model, ENCODE-rE2G, that achieves state-of-the-art performance across multiple prediction tasks, demonstrating a strategy involving iterative perturbations and supervised machine learning to build increasingly accurate predictive models of enhancer regulation. Using the ENCODE-rE2G model, we build an encyclopedia of enhancer-gene regulatory interactions in the human genome, which reveals global properties of enhancer networks, identifies differences in the functions of genes that have more or less complex regulatory landscapes, and improves analyses to link noncoding variants to target genes and cell types for common, complex diseases. By interpreting the model, we find evidence that, beyond enhancer activity and 3D enhancer-promoter contacts, additional features guide enhancer-promoter communication including promoter class and enhancer-enhancer synergy. Altogether, these genome-wide maps of enhancer-gene regulatory interactions, benchmarking software, predictive models, and insights about enhancer function provide a valuable resource for future studies of gene regulation and human genetics.

Cohesin-mediated 3D contacts tune enhancer-promoter regulation
Philine Guckelberger, Benjamin R. Doughty, Glen Munson et al.|bioRxiv (Cold Spring Harbor Laboratory)|2024
Cited by 29Open Access

Enhancers are key drivers of gene regulation thought to act via 3D physical interactions with the promoters of their target genes. However, genome-wide depletions of architectural proteins such as cohesin result in only limited changes in gene expression, despite a loss of contact domains and loops. Consequently, the role of cohesin and 3D contacts in enhancer function remains debated. Here, we developed CRISPRi of regulatory elements upon degron operation (CRUDO), a novel approach to measure how changes in contact frequency impact enhancer effects on target genes by perturbing enhancers with CRISPRi and measuring gene expression in the presence or absence of cohesin. We systematically perturbed all 1,039 candidate enhancers near five cohesin-dependent genes and identified 34 enhancer-gene regulatory interactions. Of 26 regulatory interactions with sufficient statistical power to evaluate cohesin dependence, 18 show cohesin-dependent effects. A decrease in enhancer-promoter contact frequency upon removal of cohesin is frequently accompanied by a decrease in the regulatory effect of the enhancer on gene expression, consistent with a contact-based model for enhancer function. However, changes in contact frequency and regulatory effects on gene expression vary as a function of distance, with distal enhancers (e.g., >50Kb) experiencing much larger changes than proximal ones (e.g., <50Kb). Because most enhancers are located close to their target genes, these observations can explain how only a small subset of genes - those with strong distal enhancers - are sensitive to cohesin. Together, our results illuminate how 3D contacts, influenced by both cohesin and genomic distance, tune enhancer effects on gene expression.

Genome-wide maps of enhancer regulation connect risk variants to disease genes
Joseph Nasser, Drew T. Bergman, Charles P. Fulco et al.|bioRxiv (Cold Spring Harbor Laboratory)|2020
Cited by 12Open Access

Abstract Genome-wide association studies have now identified tens of thousands of noncoding loci associated with human diseases and complex traits, each of which could reveal insights into biological mechanisms of disease. Many of the underlying causal variants are thought to affect enhancers, but we have lacked genome-wide maps of enhancer-gene regulation to interpret such variants. We previously developed the Activity-by-Contact (ABC) Model to predict enhancer-gene connections and demonstrated that it can accurately predict the results of CRISPR perturbations across several cell types. Here, we apply this ABC Model to create enhancer-gene maps in 131 cell types and tissues, and use these maps to interpret the functions of fine-mapped GWAS variants. For inflammatory bowel disease (IBD), causal variants are &gt;20-fold enriched in enhancers in particular cell types, and ABC outperforms other regulatory methods at connecting noncoding variants to target genes. Across 72 diseases and complex traits, ABC links 5,036 GWAS signals to 2,249 unique genes, including a class of 577 genes that appear to influence multiple phenotypes via variants in enhancers that act in different cell types. Guided by these variant-to-function maps, we show that an enhancer containing an IBD risk variant regulates the expression of PPIF to tune mitochondrial membrane potential. Together, our study reveals insights into principles of genome regulation, illuminates mechanisms that influence IBD, and demonstrates a generalizable strategy to connect common disease risk variants to their molecular and cellular functions.

A consensus variant-to-function score to functionally prioritize variants for disease
Tabassum Fabiha, Ivy Evergreen, Soumya Kundu et al.|bioRxiv (Cold Spring Harbor Laboratory)|2024
Cited by 9Open Access

Identifying and functionally characterizing causal disease variants in genome-wide association studies remains a pressing challenge. Here, we construct a consensus variant-to-function (cV2F) score that assigns a single value to each common single-nucleotide variant in the genome, and helps to predict and characterize causal disease variants. The cV2F score leverages features reflecting variant-level experimentally and computationally predicted function (e.g. allelic imbalance and sequence-based deep learning models) and element-level function (e.g. predicted enhancers), and learns optimal combinations of features by training a gradient boosting model on GWAS fine-mapping results. The cV2F-annotated variants attained an AUPRC of 0.822 at identifying held-out fine-mapped variants. Variants with high cV2F scores are highly enriched for heritability (14.2x, s.e. 0.5) across 66 diseases/traits, are uniquely informative for disease heritability, and are highly predictive of variants implicated by reporter assays; cV2F substantially outperforms previous variant-to-function scores using all of these metrics. GWAS fine-mapping of 110 diseases/traits informed by cV2F identified 14.3% more confidently fine-mapped (PIP &gt; 0.95) variants than non-functionally informed fine-mapping. We further constructed tissue/cell line-specific cV2F scores that prioritize variants based on regulatory potential in specific tissues/cell lines, attaining high heritability enrichment for tissue-related diseases/traits (15.6x, s.e. 2.3) while providing independent information (average correlation of 0.27 with the primary cV2F score). We highlight examples of GWAS loci for which cV2F pinpoints causal variants with high confidence and elucidates their functional role.