BANKSY unifies cell typing and tissue domain segmentation for scalable spatial omics data analysis

Vipul Singhal(Agency for Science, Technology and Research), Nigel Chou(Agency for Science, Technology and Research), Joseph Lee(National University of Singapore), Yifei Yue(National University of Singapore), Jinyue Liu(Agency for Science, Technology and Research), Wan Kee Chock(Agency for Science, Technology and Research), Li Lin(Agency for Science, Technology and Research), Yun‐Ching Chang, Erica Mei Ling Teo, Jonathan Aow(Agency for Science, Technology and Research), Hwee Kuan Lee(Agency for Science, Technology and Research), Kok Hao Chen(Agency for Science, Technology and Research), Shyam Prabhakar(Agency for Science, Technology and Research)
Nature Genetics
February 27, 2024
Cited by 217Open Access
Full Text

Abstract

Spatial omics data are clustered to define both cell types and tissue domains. We present Building Aggregates with a Neighborhood Kernel and Spatial Yardstick (BANKSY), an algorithm that unifies these two spatial clustering problems by embedding cells in a product space of their own and the local neighborhood transcriptome, representing cell state and microenvironment, respectively. BANKSY's spatial feature augmentation strategy improved performance on both tasks when tested on diverse RNA (imaging, sequencing) and protein (imaging) datasets. BANKSY revealed unexpected niche-dependent cell states in the mouse brain and outperformed competing methods on domain segmentation and cell typing benchmarks. BANKSY can also be used for quality control of spatial transcriptomics data and for spatially aware batch effect correction. Importantly, it is substantially faster and more scalable than existing methods, enabling the processing of millions of cell datasets. In summary, BANKSY provides an accurate, biologically motivated, scalable and versatile framework for analyzing spatially resolved omics data.


Related Papers