SNAP: Streamlined Nextflow Analysis Pipeline for Immunoprecipitation-Based Epigenomic Profiling of Circulating Chromatin

Ze Zhang(Harvard University), Paulo Da Silva Cordeiro(Harvard University), Surya B. Chhetri(Harvard University), Brad Fortunato(Dana-Farber Cancer Institute), Zhenjie Jin(Dana-Farber Cancer Institute), Razane El Hajj Chehade(Harvard University), Karl Semaan(Harvard University), Gunsagar S. Gulati(Harvard University), Garyoung Gary Lee(Harvard University), Christopher Hemauer(Harvard University), Weiwei Bian(Dana-Farber Cancer Institute), Shahabeddin Sotudian(Harvard University), Ziwei Zhang(Harvard University), David O. Osei-Hwedieh(Massachusetts General Hospital), Tanya Heim(UPMC Hillman Cancer Center), Corrie Painter, Rashad Nawfal(Harvard University), Marc Eid(Harvard University), Damien Vasseur(Harvard University), John Canniff(Dana-Farber Cancer Institute), Hunter Savignano(Dana-Farber Cancer Institute), Noa Phillips(Dana-Farber Cancer Institute), Ji-Heui Seo(Dana-Farber Cancer Institute), Kurt R. Weiss(UPMC Hillman Cancer Center), Matt Freedman(Harvard University), Sylvan C. Baca(Harvard University)
bioRxiv (Cold Spring Harbor Laboratory)
December 30, 2025
Cited by 3Open Access
Full Text

Abstract

Epigenomic profiling of circulating chromatin is a powerful and minimally invasive approach for detecting and monitoring disease, but there are no bioinformatics pipelines tailored to the unique characteristics of cell-free chromatin. We present SNAP (Streamlined Nextflow Analysis Pipeline), a reproducible, scalable, and modular workflow specifically designed for immunoprecipitation-based methods for profiling cell-free chromatin. SNAP incorporates quality control metrics optimized for circulating chromatin, including enrichment score and fragment count thresholds, as well as direct estimation of circulating tumor DNA (ctDNA) content from fragment length distributions. It also includes SNP fingerprinting to enable sample identity verification. When applied to cfChIP-seq and cfMeDIP-seq data across multiple cancer types, SNAP's quality filters significantly improved classification performance while maintaining high data retention. Independent validation using plasma from patients with osteosarcoma confirmed the detection of tumor-associated epigenomic signatures that correlated with ctDNA levels and reflected disease biology. SNAP's modular architecture enables straightforward extension to additional cell-free immunoprecipitation-based assays, providing a robust framework to support studies of circulating chromatin broadly. SNAP is compatible with cloud and high-performance computing environments and is publicly available at https://github.com/prc992/SNAP/.


Related Papers

No related papers found

Powered by citation graph analysis