DISCO: a database of Deeply Integrated human Single-Cell Omics data

Mengwei Li(Agency for Science, Technology and Research), Xiaomeng Zhang(Agency for Science, Technology and Research), Kok Siong Ang(Agency for Science, Technology and Research), Jingjing Ling(Agency for Science, Technology and Research), Raman Sethi(Agency for Science, Technology and Research), Nicole Yee Shin Lee(Agency for Science, Technology and Research), Florent Ginhoux(Agency for Science, Technology and Research), Jinmiao Chen(Agency for Science, Technology and Research)
Nucleic Acids Research
October 13, 2021
Cited by 204Open Access
Full Text

Abstract

The ability to study cellular heterogeneity at single cell resolution is making single-cell sequencing increasingly popular. However, there is no publicly available resource that offers an integrated cell atlas with harmonized metadata that users can integrate new data with. Here, we present DISCO (https://www.immunesinglecell.org/), a database of Deeply Integrated Single-Cell Omics data. The current release of DISCO integrates more than 18 million cells from 4593 samples, covering 107 tissues/cell lines/organoids, 158 diseases, and 20 platforms. We standardized the associated metadata with a controlled vocabulary and ontology system. To allow large scale integration of single-cell data, we developed FastIntegration, a fast and high-capacity version of Seurat Integration. We also developed CELLiD, an atlas guided automatic cell type identification tool. Employing these two tools on the assembled data, we constructed one global atlas and 27 sub-atlases for different tissues, diseases, and cell types. DISCO provides three online tools, namely Online FastIntegration, Online CELLiD, and CellMapper, for users to integrate, annotate, and project uploaded single-cell RNA-seq data onto a selected atlas. Collectively, DISCO is a versatile platform for users to explore published single-cell data and efficiently perform integrated analysis with their own data.


Related Papers

No related papers found

Powered by citation graph analysis