Searching large-scale scRNA-seq databases via unbiased cell embedding with Cell BLAST

Zhi‐Jie Cao(Peking University), Wei Lin(Peking University), Lu Shen(Peking University), De-Chang Yang(Peking University), Ge Gao(Peking University)
Nature Communications
July 10, 2020
Cited by 160Open Access
Full Text

Abstract

Single-cell RNA-seq (scRNA-seq) is being used widely to resolve cellular heterogeneity. With the rapid accumulation of public scRNA-seq data, an effective and efficient cell-querying method is critical for the utilization of the existing annotations to curate newly sequenced cells. Such a querying method should be based on an accurate cell-to-cell similarity measure, and capable of handling batch effects properly. Herein, we present Cell BLAST, an accurate and robust cell-querying method built on a neural network-based generative model and a customized cell-to-cell similarity metric. Through extensive benchmarks and case studies, we demonstrate the effectiveness of Cell BLAST in annotating discrete cell types and continuous cell differentiation potential, as well as identifying novel cell types. Powered by a well-curated reference database and a user-friendly Web server, Cell BLAST provides the one-stop solution for real-world scRNA-seq cell querying and annotation.


Related Papers

Basic local alignment search tool
Stephen F. Altschul, Warren Gish, Webb Miller et al.|Journal of Molecular Biology|1990|94.2k
Visualizing Data using t-SNE
Laurens van der Maaten, Geoffrey E. Hinton|Journal of Machine Learning Research|2008|35.7k
GAN(Generative Adversarial Nets)
柴田 淳司|Journal of Japan Society for Fuzzy Theory and Intelligent Informatics|2017|21.8k
Fast unfolding of communities in large networks
Vincent D. Blondel, Jean‐Loup Guillaume, Renaud Lambiotte et al.|Journal of Statistical Mechanics Theory and Experiment|2008|21.1k