Predicting AT(N) pathologies in Alzheimer’s disease from blood-based proteomic data using neural networks

Yuting Zhang(University of Oxford), Upamanyu Ghose(University of Oxford), Noel J. Buckley(University of Oxford), Sebastiaan Engelborghs(Vrije Universiteit Brussel), Kristel Sleegers(University of Antwerp), Giovanni B. Frisoni(University of Geneva), Anders Wallin(University of Gothenburg), Alberto Lleó(Hospital de Sant Pau), Julius Popp(University of Zurich), Pablo Martínez‐Lage(Fundacion CITA Alzheimer), Cristina Legido‐Quigley(Steno Diabetes Centers), Frederik Barkhof(University College London), Henrik Zetterberg(Sahlgrenska University Hospital), Pieter Jelle Visser(Maastricht University), Lars Bertram(University of Oslo), Simon Lovestone(University of Oxford), Alejo Nevado‐Holgado(University of Oxford), Liu Shi(University of Oxford)
Frontiers in Aging Neuroscience
November 29, 2022
Cited by 13Open Access
Full Text

Abstract

Background and objective Blood-based biomarkers represent a promising approach to help identify early Alzheimer’s disease (AD). Previous research has applied traditional machine learning (ML) to analyze plasma omics data and search for potential biomarkers, but the most modern ML methods based on deep learning has however been scarcely explored. In the current study, we aim to harness the power of state-of-the-art deep learning neural networks (NNs) to identify plasma proteins that predict amyloid, tau, and neurodegeneration (AT[N]) pathologies in AD. Methods We measured 3,635 proteins using SOMAscan in 881 participants from the European Medical Information Framework for AD Multimodal Biomarker Discovery study (EMIF-AD MBD). Participants underwent measurements of brain amyloid β (Aβ) burden, phosphorylated tau (p-tau) burden, and total tau (t-tau) burden to determine their AT(N) statuses. We ranked proteins by their association with Aβ, p-tau, t-tau, and AT(N), and fed the top 100 proteins along with age and apolipoprotein E ( APOE ) status into NN classifiers as input features to predict these four outcomes relevant to AD. We compared NN performance of using proteins, age, and APOE genotype with performance of using age and APOE status alone to identify protein panels that optimally improved the prediction over these main risk factors. Proteins that improved the prediction for each outcome were aggregated and nominated for pathway enrichment and protein–protein interaction enrichment analysis. Results Age and APOE alone predicted Aβ, p-tau, t-tau, and AT(N) burden with area under the curve (AUC) scores of 0.748, 0.662, 0.710, and 0.795. The addition of proteins significantly improved AUCs to 0.782, 0.674, 0.734, and 0.831, respectively. The identified proteins were enriched in five clusters of AD-associated pathways including human immunodeficiency virus 1 infection, p53 signaling pathway, and phosphoinositide-3-kinase–protein kinase B/Akt signaling pathway. Conclusion Combined with age and APOE genotype, the proteins identified have the potential to serve as blood-based biomarkers for AD and await validation in future studies. While the NNs did not achieve better scores than the support vector machine model used in our previous study, their performances were likely limited by small sample size.


Related Papers

No related papers found

Powered by citation graph analysis