Multimodal data integration using machine learning improves risk stratification of high-grade serous ovarian cancer

Kevin Boehm(Memorial Sloan Kettering Cancer Center), Emily A. Aherne(Memorial Sloan Kettering Cancer Center), Lora H. Ellenson(Memorial Sloan Kettering Cancer Center), Ines Nikolovski(Memorial Sloan Kettering Cancer Center), Mohammed Alghamdi(Memorial Sloan Kettering Cancer Center), Ignacio Vázquez-Garćıa(Memorial Sloan Kettering Cancer Center), Dmitriy Zamarin(Memorial Sloan Kettering Cancer Center), Kara Long Roche(Memorial Sloan Kettering Cancer Center), Ying L. Liu(Memorial Sloan Kettering Cancer Center), Druv Patel(Memorial Sloan Kettering Cancer Center), Andrew Aukerman(Memorial Sloan Kettering Cancer Center), Arfath Pasha(Memorial Sloan Kettering Cancer Center), Doori Rose(Memorial Sloan Kettering Cancer Center), Pier Selenica(Memorial Sloan Kettering Cancer Center), Pamela Causa Andrieu(Memorial Sloan Kettering Cancer Center), Chris Fong(Memorial Sloan Kettering Cancer Center), Marinela Capanu(Memorial Sloan Kettering Cancer Center), Jorge S. Reis‐Filho(Memorial Sloan Kettering Cancer Center), R. Vanguri(Memorial Sloan Kettering Cancer Center), Harini Veeraraghavan(Memorial Sloan Kettering Cancer Center), Natalie Gangai(Memorial Sloan Kettering Cancer Center), Ramon E. Sosa(Memorial Sloan Kettering Cancer Center), Samantha Leung(Memorial Sloan Kettering Cancer Center), Andrew McPherson(Memorial Sloan Kettering Cancer Center), Jianjiong Gao(Memorial Sloan Kettering Cancer Center), Yulia Lakhman(Memorial Sloan Kettering Cancer Center), Sohrab P. Shah(Memorial Sloan Kettering Cancer Center)
Nature Cancer
June 28, 2022
Cited by 335Open Access
Full Text

Abstract

Patients with high-grade serous ovarian cancer suffer poor prognosis and variable response to treatment. Known prognostic factors for this disease include homologous recombination deficiency status, age, pathological stage and residual disease status after debulking surgery. Recent work has highlighted important prognostic information captured in computed tomography and histopathological specimens, which can be exploited through machine learning. However, little is known about the capacity of combining features from these disparate sources to improve prediction of treatment response. Here, we assembled a multimodal dataset of 444 patients with primarily late-stage high-grade serous ovarian cancer and discovered quantitative features, such as tumor nuclear size on staining with hematoxylin and eosin and omental texture on contrast-enhanced computed tomography, associated with prognosis. We found that these features contributed complementary prognostic information relative to one another and clinicogenomic features. By fusing histopathological, radiologic and clinicogenomic machine-learning models, we demonstrate a promising path toward improved risk stratification of patients with cancer through multimodal data integration.


Related Papers