SAW: an efficient and accurate data analysis workflow for Stereo-seq spatial transcriptomics

Gong Chun(BGI Group (China)), Shengkang Li(BGI Group (China)), Leying Wang(BGI Group (China)), Fuxiang Zhao(BGI Group (China)), Shuangsang Fang(BGI Group (China)), Dong Yuan(BGI Group (China)), Zijian Zhao(BGI Group (China)), Qiqi He(BGI Group (China)), Mei Li(BGI Group (China)), Weiqing Liu(BGI Group (China)), Zhaoxun Li(BGI Group (China)), Hongqing Xie(BGI Group (China)), Sha Liao(BGI Group (China)), Ao Chen(BGI Group (China)), Yong Zhang(BGI Group (China)), Yuxiang Li(BGI Group (China)), Xun Xu(Wuhan College)
Gigabyte
February 19, 2024
Cited by 43Open Access
Full Text

Abstract

The basic analysis steps of spatial transcriptomics require obtaining gene expression information from both space and cells. The existing tools for these analyses incur performance issues when dealing with large datasets. These issues involve computationally intensive spatial localization, RNA genome alignment, and excessive memory usage in large chip scenarios. These problems affect the applicability and efficiency of the analysis. Here, a high-performance and accurate spatial transcriptomics data analysis workflow, called Stereo-seq Analysis Workflow (SAW), was developed for the Stereo-seq technology developed at BGI. SAW includes mRNA spatial position reconstruction, genome alignment, gene expression matrix generation, and clustering. The workflow outputs files in a universal format for subsequent personalized analysis. The execution time for the entire analysis is ∼148 min with 1 GB reads 1 × 1 cm chip test data, 1.8 times faster than with an unoptimized workflow.


Related Papers