ChromaFold predicts the 3D contact map from single-cell chromatin accessibility

Vianne R. Gao(Memorial Sloan Kettering Cancer Center), Rui Yang(Memorial Sloan Kettering Cancer Center), Arnav Das(University of Washington), Renhe Luo(Kettering University), Hanzhi Luo(Memorial Sloan Kettering Cancer Center), Dylan McNally(Cornell University), Ioannis Karagiannidis(Cornell University), Martín A. Rivas(Cornell University), Zhong-Min Wang(Memorial Sloan Kettering Cancer Center), Darko Barišić(Cornell University), Alireza Karbalayghareh(Memorial Sloan Kettering Cancer Center), Wilfred Wong(Memorial Sloan Kettering Cancer Center), Yingqian A. Zhan(Memorial Sloan Kettering Cancer Center), Christopher R. Chin(Cornell University), William Stafford Noble(University of Washington), Jeff Bilmes(University of Washington), Effie Apostolou(Cornell University), Michael G. Kharas(Memorial Sloan Kettering Cancer Center), Wendy Béguelin(Cornell University), Aaron D. Viny(Columbia University), Danwei Huangfu(Kettering University), Alexander Y. Rudensky(Memorial Sloan Kettering Cancer Center), Ari Melnick(Cornell University), Christina S. Leslie(Memorial Sloan Kettering Cancer Center)
Nature Communications
November 1, 2024
Cited by 21Open Access
Full Text

Abstract

Identifying cell-type-specific 3D chromatin interactions between regulatory elements can help decipher gene regulation and interpret disease-associated non-coding variants. However, achieving this resolution with current 3D genomics technologies is often infeasible given limited input cell numbers. We therefore present ChromaFold, a deep learning model that predicts 3D contact maps, including regulatory interactions, from single-cell ATAC sequencing (scATAC-seq) data alone. ChromaFold uses pseudobulk chromatin accessibility, co-accessibility across metacells, and a CTCF motif track as inputs and employs a lightweight architecture to train on standard GPUs. Trained on paired scATAC-seq and Hi-C data in human samples, ChromaFold accurately predicts the 3D contact map and peak-level interactions across diverse human and mouse test cell types. Compared to leading contact map prediction models that use ATAC-seq and CTCF ChIP-seq, ChromaFold achieves state-of-the-art performance using only scATAC-seq. Finally, fine-tuning ChromaFold on paired scATAC-seq and Hi-C in a complex tissue enables deconvolution of chromatin interactions across cell subpopulations. Obtaining a high-resolution contact map using current 3D genomics technologies can be challenging with small input cell numbers. Here, the authors develop ChromaFold, a deep learning model that predicts cell-type-specific 3D contact maps from single-cell chromatin accessibility data alone.


Related Papers

No related papers found

Powered by citation graph analysis