Scalable Prediction of Acute Myeloid Leukemia Using High-Dimensional Machine Learning and Blood Transcriptomics

Stefanie Warnat‐Herresthal(University of Bonn), Konstantinos Perrakis(German Center for Neurodegenerative Diseases), Bernd Taschler(German Center for Neurodegenerative Diseases), Matthias Becker(University of Bonn), Kevin Baßler(University of Bonn), Marc Beyer(University of Bonn), Patrick Günther(University of Bonn), Jonas Schulte-Schrepping(University of Bonn), Lea Seep(University of Bonn), Kathrin Klee(University of Bonn), Thomas Ulas(University of Bonn), Torsten Haferlach(Munich Leukemia Laboratory (Germany)), Sach Mukherjee(German Center for Neurodegenerative Diseases), Joachim L. Schultze(University of Bonn)
iScience
December 18, 2019
Cited by 114Open Access
Full Text

Abstract

Acute myeloid leukemia (AML) is a severe, mostly fatal hematopoietic malignancy. We were interested in whether transcriptomic-based machine learning could predict AML status without requiring expert input. Using 12,029 samples from 105 different studies, we present a large-scale study of machine learning-based prediction of AML in which we address key questions relating to the combination of machine learning and transcriptomics and their practical use. We find data-driven, high-dimensional approaches-in which multivariate signatures are learned directly from genome-wide data with no prior knowledge-to be accurate and robust. Importantly, these approaches are highly scalable with low marginal cost, essentially matching human expert annotation in a near-automated workflow. Our results support the notion that transcriptomics combined with machine learning could be used as part of an integrated -omics approach wherein risk prediction, differential diagnosis, and subclassification of AML are achieved by genomics while diagnosis could be assisted by transcriptomic-based machine learning.


Related Papers

No related papers found

Powered by citation graph analysis