Whole-cell segmentation of tissue images with human-level performance using large-scale data annotation and deep learning

Noah F. Greenwald(Stanford University), Geneva Miller(California Institute of Technology), Erick Moen(California Institute of Technology), Alex Kong(Stanford University), Adam Kagel(Stanford University), Christine Camacho Fullaway(Stanford University), Brianna J. McIntosh(Stanford University), Ke Xuan Leow(Stanford University), Morgan Schwartz(California Institute of Technology), Thomas Dougherty(California Institute of Technology), Cole Pavelchek(California Institute of Technology), Sunny Cui(California Institute of Technology), Isabella Camplisson(California Institute of Technology), Omer Bar-Tal(Weizmann Institute of Science), Jaiveer Singh(Stanford University), Mara Fong(Stanford University), Gautam Chaudhry(Stanford University), Zion Abraham(Stanford University), Jackson Moseley(Stanford University), Shiri Warshawsky(Stanford University), Erin Soon(Stanford University), Shirley Greenbaum(Stanford University), Tyler Risom(Stanford University), Travis J. Hollmann(Memorial Sloan Kettering Cancer Center), Leeat Keren(Weizmann Institute of Science), Will Graf(California Institute of Technology), Michael Angelo(Stanford University), David Van Valen(California Institute of Technology)
bioRxiv (Cold Spring Harbor Laboratory)
March 2, 2021
Cited by 142Open Access
Full Text

Abstract

Abstract Understanding the spatial organization of tissues is of critical importance for both basic and translational research. While recent advances in tissue imaging are opening an exciting new window into the biology of human tissues, interpreting the data that they create is a significant computational challenge. Cell segmentation, the task of uniquely identifying each cell in an image, remains a substantial barrier for tissue imaging, as existing approaches are inaccurate or require a substantial amount of manual curation to yield useful results. Here, we addressed the problem of cell segmentation in tissue imaging data through large-scale data annotation and deep learning. We constructed TissueNet, an image dataset containing >1 million paired whole-cell and nuclear annotations for tissue images from nine organs and six imaging platforms. We created Mesmer, a deep learning-enabled segmentation algorithm trained on TissueNet that performs nuclear and whole-cell segmentation in tissue imaging data. We demonstrated that Mesmer has better speed and accuracy than previous methods, generalizes to the full diversity of tissue types and imaging platforms in TissueNet, and achieves human-level performance for whole-cell segmentation. Mesmer enabled the automated extraction of key cellular features, such as subcellular localization of protein signal, which was challenging with previous approaches. We further showed that Mesmer could be adapted to harness cell lineage information present in highly multiplexed datasets. We used this enhanced version to quantify cell morphology changes during human gestation. All underlying code and models are released with permissive licenses as a community resource.


Related Papers

No related papers found

Powered by citation graph analysis