TY - JOUR
T1 - Swarm learning for decentralized artificial intelligence in cancer histopathology
AU - Saldanha, Oliver Lester
AU - Quirke, Philip
AU - West, Nicholas P
AU - James, Jacqueline A
AU - Loughrey, Maurice B
AU - Grabsch, Heike I
AU - Salto-Tellez, Manuel
AU - Alwers, Elizabeth
AU - Cifci, Didem
AU - Ghaffari Laleh, Narmin
AU - Seibel, Tobias
AU - Gray, Richard
AU - Hutchins, Gordon G A
AU - Brenner, Hermann
AU - van Treeck, Marko
AU - Yuan, Tanwei
AU - Brinker, Titus J
AU - Chang-Claude, Jenny
AU - Khader, Firas
AU - Schuppert, Andreas
AU - Luedde, Tom
AU - Trautwein, Christian
AU - Muti, Hannah Sophie
AU - Foersch, Sebastian
AU - Hoffmeister, Michael
AU - Truhn, Daniel
AU - Kather, Jakob Nikolas
N1 - © 2022. The Author(s).
PY - 2022/4/25
Y1 - 2022/4/25
N2 - Artificial intelligence (AI) can predict the presence of molecular alterations directly from routine histopathology slides. However, training robust AI systems requires large datasets for which data collection faces practical, ethical and legal obstacles. These obstacles could be overcome with swarm learning (SL), in which partners jointly train AI models while avoiding data transfer and monopolistic data governance. Here, we demonstrate the successful use of SL in large, multicentric datasets of gigapixel histopathology images from over 5,000 patients. We show that AI models trained using SL can predict BRAF mutational status and microsatellite instability directly from hematoxylin and eosin (H&E)-stained pathology slides of colorectal cancer. We trained AI models on three patient cohorts from Northern Ireland, Germany and the United States, and validated the prediction performance in two independent datasets from the United Kingdom. Our data show that SL-trained AI models outperform most locally trained models, and perform on par with models that are trained on the merged datasets. In addition, we show that SL-based AI models are data efficient. In the future, SL can be used to train distributed AI models for any histopathology image analysis task, eliminating the need for data transfer.
AB - Artificial intelligence (AI) can predict the presence of molecular alterations directly from routine histopathology slides. However, training robust AI systems requires large datasets for which data collection faces practical, ethical and legal obstacles. These obstacles could be overcome with swarm learning (SL), in which partners jointly train AI models while avoiding data transfer and monopolistic data governance. Here, we demonstrate the successful use of SL in large, multicentric datasets of gigapixel histopathology images from over 5,000 patients. We show that AI models trained using SL can predict BRAF mutational status and microsatellite instability directly from hematoxylin and eosin (H&E)-stained pathology slides of colorectal cancer. We trained AI models on three patient cohorts from Northern Ireland, Germany and the United States, and validated the prediction performance in two independent datasets from the United Kingdom. Our data show that SL-trained AI models outperform most locally trained models, and perform on par with models that are trained on the merged datasets. In addition, we show that SL-based AI models are data efficient. In the future, SL can be used to train distributed AI models for any histopathology image analysis task, eliminating the need for data transfer.
U2 - 10.1038/s41591-022-01768-5
DO - 10.1038/s41591-022-01768-5
M3 - Article
C2 - 35469069
SN - 1078-8956
JO - Nature Medicine
JF - Nature Medicine
ER -