W

Wilfred Wong

Memorial Sloan Kettering Cancer Center

ORCID: 0000-0003-1363-5235

Publishes on Genomics and Chromatin Dynamics, Single-cell and spatial transcriptomics, Epigenetics and DNA Methylation. 15 papers and 191 citations.

15Publications
191Total Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

Single-cell multi-ome regression models identify functional and disease-associated enhancers and enable chromatin potential analysis
Sneha Mitra, Ro Malik, Wilfred Wong et al.|Nature Genetics|2024
Cited by 67Open Access

We present a gene-level regulatory model, single-cell ATAC + RNA linking (SCARlink), which predicts single-cell gene expression and links enhancers to target genes using multi-ome (scRNA-seq and scATAC-seq co-assay) sequencing data. The approach uses regularized Poisson regression on tile-level accessibility data to jointly model all regulatory effects at a gene locus, avoiding the limitations of pairwise gene-peak correlations and dependence on peak calling. SCARlink outperformed existing gene scoring methods for imputing gene expression from chromatin accessibility across high-coverage multi-ome datasets while giving comparable to improved performance on low-coverage datasets. Shapley value analysis on trained models identified cell-type-specific gene enhancers that are validated by promoter capture Hi-C and are 11× to 15× and 5× to 12× enriched in fine-mapped eQTLs and fine-mapped genome-wide association study (GWAS) variants, respectively. We further show that SCARlink-predicted and observed gene expression vectors provide a robust way to compute a chromatin potential vector field to enable developmental trajectory analysis.

HiC-DC+ enables systematic 3D interaction calls and differential analysis for Hi-C and HiChIP
Merve Şahin, Wilfred Wong, Yingqian A. Zhan et al.|Nature Communications|2021
Cited by 64Open Access

Recent genome-wide chromosome conformation capture assays such as Hi-C and HiChIP have vastly expanded the resolution and throughput with which we can study 3D genomic architecture and function. Here, we present HiC-DC+, a software tool for Hi-C/HiChIP interaction calling and differential analysis using an efficient implementation of the HiC-DC statistical framework. HiC-DC+ integrates with popular preprocessing and visualization tools and includes topologically associating domain (TAD) and A/B compartment callers. We found that HiC-DC+ can more accurately identify enhancer-promoter interactions in H3K27ac HiChIP, as validated by CRISPRi-FlowFISH experiments, compared to existing methods. Differential HiC-DC+ analyses of published HiChIP and Hi-C data sets in settings of cellular differentiation and cohesin perturbation systematically and quantitatively recovers biological findings, including enhancer hubs, TAD aggregation, and the relationship between promoter-enhancer loop dynamics and gene expression changes. HiC-DC+ therefore provides a principled statistical analysis tool to empower genome-wide studies of 3D chromatin architecture and function.

ChromaFold predicts the 3D contact map from single-cell chromatin accessibility
Vianne R. Gao, Rui Yang, Arnav Das et al.|Nature Communications|2024
Cited by 21Open Access

Identifying cell-type-specific 3D chromatin interactions between regulatory elements can help decipher gene regulation and interpret disease-associated non-coding variants. However, achieving this resolution with current 3D genomics technologies is often infeasible given limited input cell numbers. We therefore present ChromaFold, a deep learning model that predicts 3D contact maps, including regulatory interactions, from single-cell ATAC sequencing (scATAC-seq) data alone. ChromaFold uses pseudobulk chromatin accessibility, co-accessibility across metacells, and a CTCF motif track as inputs and employs a lightweight architecture to train on standard GPUs. Trained on paired scATAC-seq and Hi-C data in human samples, ChromaFold accurately predicts the 3D contact map and peak-level interactions across diverse human and mouse test cell types. Compared to leading contact map prediction models that use ATAC-seq and CTCF ChIP-seq, ChromaFold achieves state-of-the-art performance using only scATAC-seq. Finally, fine-tuning ChromaFold on paired scATAC-seq and Hi-C in a complex tissue enables deconvolution of chromatin interactions across cell subpopulations. Obtaining a high-resolution contact map using current 3D genomics technologies can be challenging with small input cell numbers. Here, the authors develop ChromaFold, a deep learning model that predicts cell-type-specific 3D contact maps from single-cell chromatin accessibility data alone.

CRISPR screening uncovers a long-range enhancer for ONECUT1 in pancreatic differentiation and links a diabetes risk variant
Samuel J. Kaplan, Wilfred Wong, Jielin Yan et al.|Cell Reports|2024
Cited by 11Open Access

Functional enhancer annotation is critical for understanding tissue-specific transcriptional regulation and prioritizing disease-associated non-coding variants. However, unbiased enhancer discovery in disease-relevant contexts remains challenging. To identify enhancers pertinent to diabetes, we conducted a CRISPR interference (CRISPRi) screen in the human pluripotent stem cell (hPSC) pancreatic differentiation system. Among the enhancers identified, we focused on an enhancer we named ONECUT1e-664kb, ∼664 kb from the ONECUT1 promoter. Previous studies have linked ONECUT1 coding mutations to pancreatic hypoplasia and neonatal diabetes. We found that homozygous deletion of ONECUT1e-664kb in hPSCs leads to a near-complete loss of ONECUT1 expression and impaired pancreatic differentiation. ONECUT1e-664kb contains a type 2 diabetes-associated variant (rs528350911) disrupting a GATA motif. Introducing the risk variant into hPSCs reduced binding of key pancreatic transcription factors (GATA4, GATA6, and FOXA2), supporting its causal role in diabetes. This work highlights the utility of unbiased enhancer discovery in disease-relevant settings for understanding monogenic and complex disease.

Discovery of Competent Chromatin Regions in Human Embryonic Stem Cells
Julián Pulecio, Zakieh Tayyebi, Dingyu Liu et al.|bioRxiv (Cold Spring Harbor Laboratory)|2023
Cited by 7Open Access

The mechanisms underlying the ability of embryonic stem cells (ESCs) to rapidly activate lineage-specific genes during differentiation remain largely unknown. Through multiple CRISPR-activation screens, we discovered human ESCs have pre-established transcriptionally competent chromatin regions (CCRs) that support lineage-specific gene expression at levels comparable to differentiated cells. CCRs reside in the same topological domains as their target genes. They lack typical enhancer-associated histone modifications but show enriched occupancy of pluripotent transcription factors, DNA demethylation factors, and histone deacetylases. TET1 and QSER1 protect CCRs from excessive DNA methylation, while HDAC1 family members prevent premature activation. This "push and pull" feature resembles bivalent domains at developmental gene promoters but involves distinct molecular mechanisms. Our study provides new insights into pluripotency regulation and cellular plasticity in development and disease. One sentence summary: We report a class of distal regulatory regions distinct from enhancers that confer human embryonic stem cells with the competence to rapidly activate the expression of lineage-specific genes.