Spatially informed clustering, integration, and deconvolution of spatial transcriptomics with GraphSTYahui Long, Kok Siong Ang, Mengwei Li et al.|Nature Communications|2023 Spatial transcriptomics technologies generate gene expression profiles with spatial context, requiring spatially informed analysis tools for three key tasks, spatial clustering, multisample integration, and cell-type deconvolution. We present GraphST, a graph self-supervised contrastive learning method that fully exploits spatial transcriptomics data to outperform existing methods. It combines graph neural networks with self-supervised contrastive learning to learn informative and discriminative spot representations by minimizing the embedding distance between spatially adjacent spots and vice versa. We demonstrated GraphST on multiple tissue types and technology platforms. GraphST achieved 10% higher clustering accuracy and better delineated fine-grained tissue structures in brain and embryo tissues. GraphST is also the only method that can jointly analyze multiple tissue slices in vertical or horizontal integration while correcting batch effects. Lastly, GraphST demonstrated superior cell-type deconvolution to capture spatial niches like lymph node germinal centers and exhausted tumor infiltrating T cells in breast tumor tissue.
Unsupervised spatially embedded deep representation of spatial transcriptomicsHang Xu, Huazhu Fu, Yahui Long et al.|Genome Medicine|2024 Optimal integration of transcriptomics data and associated spatial information is essential towards fully exploiting spatial transcriptomics to dissect tissue heterogeneity and map out inter-cellular communications. We present SEDR, which uses a deep autoencoder coupled with a masked self-supervised learning mechanism to construct a low-dimensional latent representation of gene expression, which is then simultaneously embedded with the corresponding spatial information through a variational graph autoencoder. SEDR achieved higher clustering performance on manually annotated 10 × Visium datasets and better scalability on high-resolution spatial transcriptomics datasets than existing methods. Additionally, we show SEDR's ability to impute and denoise gene expression (URL: https://github.com/JinmiaoChenLab/SEDR/ ).
DISCO: a database of Deeply Integrated human Single-Cell Omics dataMengwei Li, Xiaomeng Zhang, Kok Siong Ang et al.|Nucleic Acids Research|2021 The ability to study cellular heterogeneity at single cell resolution is making single-cell sequencing increasingly popular. However, there is no publicly available resource that offers an integrated cell atlas with harmonized metadata that users can integrate new data with. Here, we present DISCO (https://www.immunesinglecell.org/), a database of Deeply Integrated Single-Cell Omics data. The current release of DISCO integrates more than 18 million cells from 4593 samples, covering 107 tissues/cell lines/organoids, 158 diseases, and 20 platforms. We standardized the associated metadata with a controlled vocabulary and ontology system. To allow large scale integration of single-cell data, we developed FastIntegration, a fast and high-capacity version of Seurat Integration. We also developed CELLiD, an atlas guided automatic cell type identification tool. Employing these two tools on the assembled data, we constructed one global atlas and 27 sub-atlases for different tissues, diseases, and cell types. DISCO provides three online tools, namely Online FastIntegration, Online CELLiD, and CellMapper, for users to integrate, annotate, and project uploaded single-cell RNA-seq data onto a selected atlas. Collectively, DISCO is a versatile platform for users to explore published single-cell data and efficiently perform integrated analysis with their own data.
Deciphering spatial domains from spatial multi-omics with SpatialGlueAdvances in spatial omics technologies now allow multiple types of data to be acquired from the same tissue slice. To realize the full potential of such data, we need spatially informed methods for data integration. Here, we introduce SpatialGlue, a graph neural network model with a dual-attention mechanism that deciphers spatial domains by intra-omics integration of spatial location and omics measurement followed by cross-omics integration. We demonstrated SpatialGlue on data acquired from different tissue types using different technologies, including spatial epigenome-transcriptome and transcriptome-proteome modalities. Compared to other methods, SpatialGlue captured more anatomical details and more accurately resolved spatial domains such as the cortex layers of the brain. Our method also identified cell types like spleen macrophage subsets located at three different zones that were not available in the original data annotations. SpatialGlue scales well with data size and can be used to integrate three modalities. Our spatial multi-omics analysis tool combines the information from complementary omics modalities to obtain a holistic view of cellular and tissue properties.
Efficacy of inactivated poliovirus vaccine in IndiaInactivated poliovirus vaccine (IPV) is efficacious against paralytic disease, but its effect on mucosal immunity is debated. We assessed the efficacy of IPV in boosting mucosal immunity. Participants received IPV, bivalent 1 and 3 oral poliovirus vaccine (bOPV), or no vaccine. A bOPV challenge was administered 4 weeks later, and excretion was assessed 3, 7, and 14 days later. Nine hundred and fifty-four participants completed the study. Any fecal shedding of poliovirus type 1 was 8.8, 9.1, and 13.5% in the IPV group and 14.4, 24.1, and 52.4% in the control group by 6- to 11-month, 5-year, and 10-year groups, respectively (IPV versus control: Fisher's exact test P < 0.001). IPV reduced excretion for poliovirus types 1 and 3 between 38.9 and 74.2% and 52.8 and 75.7%, respectively. Thus, IPV in OPV-vaccinated individuals boosts intestinal mucosal immunity.