ClusterMap for multi-scale clustering analysis of spatial gene expression

Yichun He(Broad Institute), Xin Tang(Broad Institute), Jiahao Huang(Broad Institute), Jingyi Ren(Broad Institute), Haowen Zhou(Broad Institute), Kevin Chen(Harvard University), Albert Liu(Broad Institute), Hailing Shi(Broad Institute), Zuwan Lin(Broad Institute), Qiang Li(Harvard University), Abhishek Aditham(Broad Institute), Johain R. Ounadjela(Broad Institute), Emanuelle I. Grody(Broad Institute), Jian Shu(Broad Institute), Jia Liu(Harvard University), Xiao Wang(Broad Institute)
Nature Communications
October 8, 2021
Cited by 92Open Access
Full Text

Abstract

Quantifying RNAs in their spatial context is crucial to understanding gene expression and regulation in complex tissues. In situ transcriptomic methods generate spatially resolved RNA profiles in intact tissues. However, there is a lack of a unified computational framework for integrative analysis of in situ transcriptomic data. Here, we introduce an unsupervised and annotation-free framework, termed ClusterMap, which incorporates the physical location and gene identity of RNAs, formulates the task as a point pattern analysis problem, and identifies biologically meaningful structures by density peak clustering (DPC). Specifically, ClusterMap precisely clusters RNAs into subcellular structures, cell bodies, and tissue regions in both two- and three-dimensional space, and performs consistently on diverse tissue types, including mouse brain, placenta, gut, and human cardiac organoids. We demonstrate ClusterMap to be broadly applicable to various in situ transcriptomic measurements to uncover gene expression patterns, cell niche, and tissue organization principles from images with high-dimensional transcriptomic profiles.


Related Papers

Matplotlib: A 2D Graphics Environment
John D. Hunter|Computing in Science & Engineering|2007|39.1k
Fast unfolding of communities in large networks
Vincent D. Blondel, Jean‐Loup Guillaume, Renaud Lambiotte et al.|Journal of Statistical Mechanics Theory and Experiment|2008|21.1k
Data Structures for Statistical Computing in Python
Wes McKinney|Proceedings of the Python in Science Conferences|2010|11.1k