Prediction of Allogeneic Hematopoietic Stem-Cell Transplantation Mortality 100 Days After Transplantation Using a Machine Learning Algorithm: A European Group for Blood and Marrow Transplantation Acute Leukemia Working Party Retrospective Data Mining Study

Roni Shouval(Bar-Ilan University), Myriam Labopin(Sheba Medical Center), Ori Bondi(Sheba Medical Center), Hila Mishan‐Shamay(Sheba Medical Center), Avichai Shimoni(Sheba Medical Center), Fabio Ciceri(Sheba Medical Center), Jordi Esteve(Sheba Medical Center), Sebastian Giebel(Sheba Medical Center), Norbert Claude Gorin(Sheba Medical Center), Christoph Schmid(Sheba Medical Center), Emmanuelle Polge(Sheba Medical Center), Mahmoud Aljurf(Sheba Medical Center), Nicolaus Kröger(Sheba Medical Center), Charles Craddock(Sheba Medical Center), Andrea Bacigalupo(Sheba Medical Center), Jan J. Cornelissen(Sheba Medical Center), Frédéric Baron(Sheba Medical Center), Ron Unger(Sheba Medical Center), Arnon Nagler(Sheba Medical Center), Mohamad Mohty(Sheba Medical Center)
Journal of Clinical Oncology
August 4, 2015
Cited by 163Open Access
Full Text

Abstract

PURPOSE: Allogeneic hematopoietic stem-cell transplantation (HSCT) is potentially curative for acute leukemia (AL), but carries considerable risk. Machine learning algorithms, which are part of the data mining (DM) approach, may serve for transplantation-related mortality risk prediction. PATIENTS AND METHODS: This work is a retrospective DM study on a cohort of 28,236 adult HSCT recipients from the AL registry of the European Group for Blood and Marrow Transplantation. The primary objective was prediction of overall mortality (OM) at 100 days after HSCT. Secondary objectives were estimation of nonrelapse mortality, leukemia-free survival, and overall survival at 2 years. Donor, recipient, and procedural characteristics were analyzed. The alternating decision tree machine learning algorithm was applied for model development on 70% of the data set and validated on the remaining data. RESULTS: OM prevalence at day 100 was 13.9% (n=3,936). Of the 20 variables considered, 10 were selected by the model for OM prediction, and several interactions were discovered. By using a logistic transformation function, the crude score was transformed into individual probabilities for 100-day OM (range, 3% to 68%). The model's discrimination for the primary objective performed better than the European Group for Blood and Marrow Transplantation score (area under the receiver operating characteristics curve, 0.701 v 0.646; P<.001). Calibration was excellent. Scores assigned were also predictive of secondary objectives. CONCLUSION: The alternating decision tree model provides a robust tool for risk evaluation of patients with AL before HSCT, and is available online (http://bioinfo.lnx.biu.ac.il/∼bondi/web1.html). It is presented as a continuous probabilistic score for the prediction of day 100 OM, extending prediction to 2 years. The DM method has proved useful for clinical prediction in HSCT.


Related Papers

No related papers found

Powered by citation graph analysis