Deep Learning-Based Multi-Class Classification of Breast Digital Pathology Images

Weiming Mi(Tsinghua University), Junjie Li(Chinese Academy of Medical Sciences & Peking Union Medical College), Yucheng Guo, Xinyu Ren(Chinese Academy of Medical Sciences & Peking Union Medical College), Zhiyong Liang(Chinese Academy of Medical Sciences & Peking Union Medical College), Tao Zhang(Tsinghua University), Hao Zou(Tsinghua University)
Cancer Management and Research
June 1, 2021
Cited by 64Open Access
Full Text

Abstract

INTRODUCTION: Breast cancer, one of the most common health threats to females worldwide, has always been a crucial topic in the medical field. With the rapid development of digital pathology, many scholars have used AI-based systems to classify breast cancer pathological images. However, most existing studies only stayed on the binary classification of breast lesions (normal vs tumor or benign vs malignant), far from meeting the clinical demand. Therefore, we established a multi-class classification system of breast digital pathology images based on AI, which is more clinically practical than the binary classification system. METHODS: In this paper, we adopted a two-stage architecture based on deep learning method and machine learning method for the multi-class classification (normal tissue, benign lesion, ductal carcinoma in situ, and invasive carcinoma) of breast digital pathological images. RESULTS: The proposed approach achieved an overall accuracy of 86.67% at patch-level. At WSI-level, the overall accuracies of our classification system were 88.16% on validation data and 90.43% on test data. Additionally, we used two public datasets, the BreakHis and BACH, for independent verification. The accuracies our model obtained on these two datasets were comparable to related publications. Furthermore, our model could achieve accuracies of 85.19% on multi-classification and 96.30% on binary classification (non-malignant vs malignant) using pathology images of frozen sections, which was proven to have good generalizability. Then, we used t-SNE for visualization of patch classification efficiency. Finally, we analyzed morphological characteristics of patches learned by the model. CONCLUSION: The proposed two-stage model could be effectively applied to the multi-class classification task of breast pathology images and could be a very useful tool for assisting pathologists in diagnosing breast cancer.


Related Papers