Consensus prediction of cell type labels in single-cell data with popV

Can Ergen(University of California, Berkeley), Galen Xing(Gladstone Institutes), Chenling Xu(University of California, Berkeley), Martin Kim(University of California, Berkeley), Michael Jayasuriya(University of California, Berkeley), Erin McGeever(Chan Zuckerberg Initiative (United States)), Angela Oliveira Pisco(Chan Zuckerberg Initiative (United States)), Aaron Streets(Chan Zuckerberg Initiative (United States)), Nir Yosef(Ragon Institute of MGH, MIT and Harvard)
Nature Genetics
November 20, 2024
Cited by 38Open Access
Full Text

Abstract

Cell-type classification is a crucial step in single-cell sequencing analysis. Various methods have been proposed for transferring a cell-type label from an annotated reference atlas to unannotated query datasets. Existing methods for transferring cell-type labels lack proper uncertainty estimation for the resulting annotations, limiting interpretability and usefulness. To address this, we propose popular Vote (popV), an ensemble of prediction models with an ontology-based voting scheme. PopV achieves accurate cell-type labeling and provides uncertainty scores. In multiple case studies, popV confidently annotates the majority of cells while highlighting cell populations that are challenging to annotate by label transfer. This additional step helps to reduce the load of manual inspection, which is often a necessary component of the annotation process, and enables one to focus on the most problematic parts of the annotation, streamlining the overall annotation process. Popular Vote (popV) is a simple, ensemble popular vote approach for cell type annotation in single-cell omic data, flexibly incorporating various methods in an open-source Python framework. Across various challenging input datasets, popV offers consistent, accurate performance.


Related Papers

No related papers found

Powered by citation graph analysis