Swarm Learning as a privacy-preserving machine learning approach for disease classification

Stefanie Warnat‐Herresthal(University of Bonn), Hartmut Schultze(Hewlett Packard Enterprise (United States)), Krishnaprasad Lingadahalli Shastry(Hewlett Packard Enterprise (United States)), Sathyanarayanan Manamohan(Hewlett Packard Enterprise (United States)), Saikat Mukherjee(Hewlett Packard Enterprise (United States)), Vishesh Garg(Hewlett Packard Enterprise (United States)), Ravi Sarveswara(Hewlett Packard Enterprise (United States)), Kristian Händler(University of Bonn), Peter Pickkers(Radboud University Nijmegen), N. Ahmad Aziz(University of Bonn), Sofia Ira Ktena(National and Kapodistrian University of Athens), Christian Siever(Hewlett Packard Enterprise (United States)), Michael Kraut(University of Bonn), Milind Y. Desai(Hewlett Packard Enterprise (United States)), Bruno Monnet(Hewlett Packard Enterprise (United States)), Maria Saridaki(National and Kapodistrian University of Athens), Charles Siegel(Hewlett Packard Enterprise (United States)), Anna Drews(University of Bonn), Melanie Nuesch-Germano(University of Bonn), Heidi Theis(University of Bonn), Mihai G. Netea(University of Bonn), Fabian J. Theis(Helmholtz Zentrum München), Anna C. Aschenbrenner(University of Bonn), Thomas Ulas(University of Bonn), Monique M.B. Breteler(University of Bonn), Evangelos J. Giamarellos‐Bourboulis(National and Kapodistrian University of Athens), Matthijs Kox(Radboud University Nijmegen), Matthias Becker(University of Bonn), S.C. Cheran(Hewlett Packard Enterprise (United States)), Michael S. Woodacre(Hewlett Packard Enterprise (United States)), Eng Lim Goh(Hewlett Packard Enterprise (United States)), Joachim L. Schultze(University of Bonn), German COVID-19 OMICS Initiative (DeCOI)
bioRxiv (Cold Spring Harbor Laboratory)
June 26, 2020
Cited by 24Open Access
Full Text

Abstract

Abstract Identification of patients with life-threatening diseases including leukemias or infections such as tuberculosis and COVID-19 is an important goal of precision medicine. We recently illustrated that leukemia patients are identified by machine learning (ML) based on their blood transcriptomes. However, there is an increasing divide between what is technically possible and what is allowed because of privacy legislation. To facilitate integration of any omics data from any data owner world-wide without violating privacy laws, we here introduce Swarm Learning (SL), a decentralized machine learning approach uniting edge computing, blockchain-based peer-to-peer networking and coordination as well as privacy protection without the need for a central coordinator thereby going beyond federated learning. Using more than 14,000 blood transcriptomes derived from over 100 individual studies with non-uniform distribution of cases and controls and significant study biases, we illustrate the feasibility of SL to develop disease classifiers based on distributed data for COVID-19, tuberculosis or leukemias that outperform those developed at individual sites. Still, SL completely protects local privacy regulations by design. We propose this approach to noticeably accelerate the introduction of precision medicine.


Related Papers

No related papers found

Powered by citation graph analysis