使用行进模型范例的帕金森病分类多中心分布式学习方法

Frontiers in Artificial Intelligence Pub Date : 2024-02-07 DOI:10.3389/frai.2024.1301997

Raissa Souza, Emma A. M. Stanley, Milton Camacho, Richard Camicioli, O. Monchi, Zahinoor Ismail, M. Wilms, Nils D. Forkert

{"title":"使用行进模型范例的帕金森病分类多中心分布式学习方法","authors":"Raissa Souza, Emma A. M. Stanley, Milton Camacho, Richard Camicioli, O. Monchi, Zahinoor Ismail, M. Wilms, Nils D. Forkert","doi":"10.3389/frai.2024.1301997","DOIUrl":null,"url":null,"abstract":"Distributed learning is a promising alternative to central learning for machine learning (ML) model training, overcoming data-sharing problems in healthcare. Previous studies exploring federated learning (FL) or the traveling model (TM) setup for medical image-based disease classification often relied on large databases with a limited number of centers or simulated artificial centers, raising doubts about real-world applicability. This study develops and evaluates a convolution neural network (CNN) for Parkinson's disease classification using data acquired by 83 diverse real centers around the world, mostly contributing small training samples. Our approach specifically makes use of the TM setup, which has proven effective in scenarios with limited data availability but has never been used for image-based disease classification. Our findings reveal that TM is effective for training CNN models, even in complex real-world scenarios with variable data distributions. After sufficient training cycles, the TM-trained CNN matches or slightly surpasses the performance of the centrally trained counterpart (AUROC of 83% vs. 80%). Our study highlights, for the first time, the effectiveness of TM in 3D medical image classification, especially in scenarios with limited training samples and heterogeneous distributed data. These insights are relevant for situations where ML models are supposed to be trained using data from small or remote medical centers, and rare diseases with sparse cases. The simplicity of this approach enables a broad application to many deep learning tasks, enhancing its clinical utility across various contexts and medical facilities.","PeriodicalId":508738,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"358 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A multi-center distributed learning approach for Parkinson's disease classification using the traveling model paradigm\",\"authors\":\"Raissa Souza, Emma A. M. Stanley, Milton Camacho, Richard Camicioli, O. Monchi, Zahinoor Ismail, M. Wilms, Nils D. Forkert\",\"doi\":\"10.3389/frai.2024.1301997\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Distributed learning is a promising alternative to central learning for machine learning (ML) model training, overcoming data-sharing problems in healthcare. Previous studies exploring federated learning (FL) or the traveling model (TM) setup for medical image-based disease classification often relied on large databases with a limited number of centers or simulated artificial centers, raising doubts about real-world applicability. This study develops and evaluates a convolution neural network (CNN) for Parkinson's disease classification using data acquired by 83 diverse real centers around the world, mostly contributing small training samples. Our approach specifically makes use of the TM setup, which has proven effective in scenarios with limited data availability but has never been used for image-based disease classification. Our findings reveal that TM is effective for training CNN models, even in complex real-world scenarios with variable data distributions. After sufficient training cycles, the TM-trained CNN matches or slightly surpasses the performance of the centrally trained counterpart (AUROC of 83% vs. 80%). Our study highlights, for the first time, the effectiveness of TM in 3D medical image classification, especially in scenarios with limited training samples and heterogeneous distributed data. These insights are relevant for situations where ML models are supposed to be trained using data from small or remote medical centers, and rare diseases with sparse cases. The simplicity of this approach enables a broad application to many deep learning tasks, enhancing its clinical utility across various contexts and medical facilities.\",\"PeriodicalId\":508738,\"journal\":{\"name\":\"Frontiers in Artificial Intelligence\",\"volume\":\"358 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/frai.2024.1301997\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frai.2024.1301997","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在机器学习（ML）模型训练中，分布式学习是中心学习的一种有前途的替代方法，可以克服医疗保健领域的数据共享问题。以往探索基于医学影像的疾病分类的联合学习（FL）或巡回模型（TM）设置的研究往往依赖于中心数量有限的大型数据库或模拟人工中心，这让人对其在现实世界中的适用性产生怀疑。本研究利用全球 83 个不同真实中心获得的数据，开发并评估了用于帕金森病分类的卷积神经网络（CNN），这些中心大多提供了少量训练样本。我们的方法特别使用了 TM 设置，该设置已被证明在数据可用性有限的情况下有效，但从未用于基于图像的疾病分类。我们的研究结果表明，即使在数据分布多变的复杂现实世界场景中，TM 也能有效地训练 CNN 模型。经过足够的训练周期后，TM 训练的 CNN 的性能可与集中训练的 CNN 相媲美，甚至略胜一筹（AUROC 为 83% 对 80%）。我们的研究首次强调了 TM 在三维医学图像分类中的有效性，尤其是在训练样本有限和异构分布式数据的情况下。这些见解对于使用来自小型或偏远医疗中心的数据训练 ML 模型以及病例稀少的罕见疾病具有重要意义。这种方法非常简单，可广泛应用于许多深度学习任务，从而提高其在各种环境和医疗设施中的临床实用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A multi-center distributed learning approach for Parkinson's disease classification using the traveling model paradigm

Distributed learning is a promising alternative to central learning for machine learning (ML) model training, overcoming data-sharing problems in healthcare. Previous studies exploring federated learning (FL) or the traveling model (TM) setup for medical image-based disease classification often relied on large databases with a limited number of centers or simulated artificial centers, raising doubts about real-world applicability. This study develops and evaluates a convolution neural network (CNN) for Parkinson's disease classification using data acquired by 83 diverse real centers around the world, mostly contributing small training samples. Our approach specifically makes use of the TM setup, which has proven effective in scenarios with limited data availability but has never been used for image-based disease classification. Our findings reveal that TM is effective for training CNN models, even in complex real-world scenarios with variable data distributions. After sufficient training cycles, the TM-trained CNN matches or slightly surpasses the performance of the centrally trained counterpart (AUROC of 83% vs. 80%). Our study highlights, for the first time, the effectiveness of TM in 3D medical image classification, especially in scenarios with limited training samples and heterogeneous distributed data. These insights are relevant for situations where ML models are supposed to be trained using data from small or remote medical centers, and rare diseases with sparse cases. The simplicity of this approach enables a broad application to many deep learning tasks, enhancing its clinical utility across various contexts and medical facilities.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Frontiers in Artificial Intelligence

自引率

0.00%

发文量