Machine learning-based analysis identifies and validates serum exosomal proteomic signatures for the diagnosis of colorectal cancer

Haofan Yin(Sun Yat-sen University), Jinye Xie(Zhongshan People's Hospital), Shan Xing(Sun Yat-sen University), Xiaofang Lu(Sun Yat-sen University), Yu Yu(Sun Yat-sen University), Yong Ren(Guangdong University Of Finances and Economics), Jian Tao(Southern University of Science and Technology), Guirong He(Southern University of Science and Technology), Lijun Zhang(Southern University of Science and Technology), Xiaopeng Yuan(Southern University of Science and Technology), Zheng Yang(Sun Yat-sen University), Zhijian Huang(Sun Yat-sen University)
Cell Reports Medicine
August 1, 2024
Cited by 59Open Access
Full Text

Abstract

The potential of serum extracellular vesicles (EVs) as non-invasive biomarkers for diagnosing colorectal cancer (CRC) remains elusive. We employed an in-depth 4D-DIA proteomics and machine learning (ML) pipeline to identify key proteins, PF4 and AACT, for CRC diagnosis in serum EV samples from a discovery cohort of 37 cases. PF4 and AACT outperform traditional biomarkers, CEA and CA19-9, detected by ELISA in 912 individuals. Furthermore, we developed an EV-related random forest (RF) model with the highest diagnostic efficiency, achieving AUC values of 0.960 and 0.963 in the train and test sets, respectively. Notably, this model demonstrated reliable diagnostic performance for early-stage CRC and distinguishing CRC from benign colorectal diseases. Additionally, multi-omics approaches were employed to predict the functions and potential sources of serum EV-derived proteins. Collectively, our study identified the crucial proteomic signatures in serum EVs and established a promising EV-related RF model for CRC diagnosis in the clinic. • 4D-DIA proteomic profiles of serum EVs in CRC patients and healthy controls • Identification of proteomic signatures in serum EVs for CRC diagnosis • Development of diagnostic model distinguishing CRC from healthy controls and BCD • Prediction of functions and potential cell sources of serum EV-derived proteins Yin et al. utilizes 4D-DIA proteomics and machine learning to identify key biomarkers PF4 and AACT in serum extracellular vesicles for colorectal cancer (CRC) diagnosis. Their random forest model demonstrates superior diagnostic performance for early-stage CRC and distinguishing CRC from benign colorectal diseases, offering a promising tool for clinical application.


Related Papers