Proteomic prediction of common and rare diseases

Julia Carrasco-Zanini(Queen Mary University of London), Maik Pietzner(Queen Mary University of London), Jonathan Davitte, Praveen Surendran(Age UK), Damien C. Croteau‐Chonka, Chloe Robins, Ana Torralbo(University College London), Christopher Tomlinson(National Institute for Health and Care Research), Natalie Fitzpatrick(University College London), Cai Ytsma(University College London), Tokuwa Kanno, Stephan Gade, Daniel F. Freitag(Age UK), Frederik Ziebell, Spiros Denaxas(National Institute for Health and Care Research), Joanna Betts(Age UK), Nicholas J. Wareham(University of Cambridge), Harry Hemingway(National Institute for Health and Care Research), Robert A. Scott(Age UK), Claudia Langenberg(Queen Mary University of London)
medRxiv
July 23, 2023
Cited by 14Open Access
Full Text

Abstract

Abstract Background For many diseases there are delays in diagnosis due to a lack of objective biomarkers for disease onset. Whether measuring thousands of proteins offers predictive information across a wide range of diseases is unknown. Methods In 41,931 individuals from the UK Biobank Pharma Proteomics Project (UKB-PPP), we integrated ∼3000 plasma proteins with clinical information to derive sparse prediction models for the 10-year incidence of 218 common and rare diseases (81 – 6038 cases). We compared prediction models based on proteins with a) basic clinical information alone, b) basic clinical information + 37 clinical biomarkers, and c) genome-wide polygenic risk scores. Results For 67 pathologically diverse diseases, a model including as few as 5 to 20 proteins was superior to clinical models (median delta C-index = 0.07; range = 0.02 – 0.31) and to clinical models with biomarkers for 52 diseases. In multiple myeloma, for example, a set of 5 proteins significantly improved prediction over basic clinical information (delta C-index = 0.25 (95% confidence interval 0.20 – 0.29)). At a 5% false positive rate (FPR), proteomic prediction (5 proteins) identified individuals at high risk of multiple myeloma (detection rate (DR) = 50%), non-Hodgkin lymphoma (DR = 55%) and motor neuron disease (DR = 29%). At a 20% FPR, proteomic prediction identified individuals at high-risk for pulmonary fibrosis (DR= 80%) and dilated cardiomyopathy (DR = 75%). Conclusions Sparse plasma protein signatures offer novel, clinically useful prediction of common and rare diseases, through disease-specific proteins and protein predictors shared across multiple diseases. (Funded by Medical Research Council, NIHR, Wellcome Trust.)


Related Papers

No related papers found

Powered by citation graph analysis