Swarm learning for decentralized artificial intelligence in cancer histopathology

Oliver Lester Saldanha(RWTH Aachen University), Philip Quirke(University of Leeds), Nicholas P. West(University of Leeds), Jacqueline A. James(Queen's University Belfast), Maurice B. Loughrey(Queen's University Belfast), Heike I. Grabsch(University of Leeds), Manuel Salto‐Tellez(Queen's University Belfast), Elizabeth Alwers(German Cancer Research Center), Didem Çifçi(RWTH Aachen University), Narmin Ghaffari Laleh(RWTH Aachen University), Tobias Seibel(RWTH Aachen University), Richard Gray(University of Oxford), Gordon Hutchins(University of Leeds), Hermann Brenner(German Cancer Research Center), Marko van Treeck(RWTH Aachen University), Tanwei Yuan(German Cancer Research Center), Titus J. Brinker(German Cancer Research Center), Jenny Chang‐Claude(Universität Hamburg), Firas Khader(RWTH Aachen University), Andreas Schuppert(Universitätsklinikum Aachen), Tom Luedde(Düsseldorf University Hospital), Christian Trautwein(RWTH Aachen University), Hannah Sophie Muti(RWTH Aachen University), Sebastian Foersch(Johannes Gutenberg University Mainz), Michael Hoffmeister(German Cancer Research Center), Daniel Truhn(RWTH Aachen University), Jakob Nikolas Kather(Heidelberg University)
Nature Medicine
April 25, 2022
Cited by 205Open Access
Full Text

Abstract

Artificial intelligence (AI) can predict the presence of molecular alterations directly from routine histopathology slides. However, training robust AI systems requires large datasets for which data collection faces practical, ethical and legal obstacles. These obstacles could be overcome with swarm learning (SL), in which partners jointly train AI models while avoiding data transfer and monopolistic data governance. Here, we demonstrate the successful use of SL in large, multicentric datasets of gigapixel histopathology images from over 5,000 patients. We show that AI models trained using SL can predict BRAF mutational status and microsatellite instability directly from hematoxylin and eosin (H&E)-stained pathology slides of colorectal cancer. We trained AI models on three patient cohorts from Northern Ireland, Germany and the United States, and validated the prediction performance in two independent datasets from the United Kingdom. Our data show that SL-trained AI models outperform most locally trained models, and perform on par with models that are trained on the merged datasets. In addition, we show that SL-based AI models are data efficient. In the future, SL can be used to train distributed AI models for any histopathology image analysis task, eliminating the need for data transfer.


Related Papers

No related papers found

Powered by citation graph analysis