Machine Learning Models for Parkinson Disease: Systematic Review

IF 3.1 3区医学 Q2 MEDICAL INFORMATICS JMIR Medical Informatics Pub Date : 2024-05-17 DOI:10.2196/50117

Thasina Tabashum, Robert Cooper Snyder, Megan K O'Brien, Mark V Albert

{"title":"Machine Learning Models for Parkinson Disease: Systematic Review","authors":"Thasina Tabashum, Robert Cooper Snyder, Megan K O'Brien, Mark V Albert","doi":"10.2196/50117","DOIUrl":null,"url":null,"abstract":"Background: With the increasing availability of data, computing resources, and easier-to-use software libraries, machine learning (ML) is increasingly utilized in disease detection and prediction, including Parkinson’s disease (PD). Despite the large number of studies published every year, very few ML systems have been adopted for real-world subject use. Objective: In particular, a lack of external validity may result in poor performance of these systems in clinical practice. Additional methodological issues in ML design and reporting can also hinder clinical adoption, even for applications that would benefit from such data-driven systems. To sample the current ML practices in PD applications, we conducted a systematic review of studies in 2020 and 2021 that use ML models to diagnose PD or to track PD progression. Methods: We conducted a systematic literature review in accordance with PRISMA in PubMed between January 2020 - April 2021, using the exact string “Parkinson’s” AND (“ML” OR “prediction” OR “classification” OR “detection” or “artificial intelligence” OR “AI”), resulting in 1085 publications from the search results. After a search query and review, we found 113 publications that used ML for the classification or regression-based prediction of PD or PD-related symptoms. Results: Only 25.7% of studies used a hold-out test set to avoid potentially inflated accuracies, and approximately half of the studies without a hold-out test set did not state this as a potential concern. Surprisingly, 38.9% of studies did not report on how or if models were tuned, and an additional 27.4% used ad hoc model tuning, which is generally frowned upon in ML model optimization. Only 15% of studies performed direct comparisons of results with other models, severely limiting the interpretation of results. Conclusions: This review highlights the notable limitations of current ML systems and techniques that may contribute to a gap between reported performance in research and the real-life applicability of ML models aiming to detect and predict diseases such as PD.","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.1000,"publicationDate":"2024-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/50117","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: With the increasing availability of data, computing resources, and easier-to-use software libraries, machine learning (ML) is increasingly utilized in disease detection and prediction, including Parkinson’s disease (PD). Despite the large number of studies published every year, very few ML systems have been adopted for real-world subject use. Objective: In particular, a lack of external validity may result in poor performance of these systems in clinical practice. Additional methodological issues in ML design and reporting can also hinder clinical adoption, even for applications that would benefit from such data-driven systems. To sample the current ML practices in PD applications, we conducted a systematic review of studies in 2020 and 2021 that use ML models to diagnose PD or to track PD progression. Methods: We conducted a systematic literature review in accordance with PRISMA in PubMed between January 2020 - April 2021, using the exact string “Parkinson’s” AND (“ML” OR “prediction” OR “classification” OR “detection” or “artificial intelligence” OR “AI”), resulting in 1085 publications from the search results. After a search query and review, we found 113 publications that used ML for the classification or regression-based prediction of PD or PD-related symptoms. Results: Only 25.7% of studies used a hold-out test set to avoid potentially inflated accuracies, and approximately half of the studies without a hold-out test set did not state this as a potential concern. Surprisingly, 38.9% of studies did not report on how or if models were tuned, and an additional 27.4% used ad hoc model tuning, which is generally frowned upon in ML model optimization. Only 15% of studies performed direct comparisons of results with other models, severely limiting the interpretation of results. Conclusions: This review highlights the notable limitations of current ML systems and techniques that may contribute to a gap between reported performance in research and the real-life applicability of ML models aiming to detect and predict diseases such as PD.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

帕金森病的机器学习模型：系统回顾

背景：随着数据、计算资源和易于使用的软件库的可用性不断提高，机器学习（ML）越来越多地被用于疾病检测和预测，包括帕金森病（PD）。尽管每年都有大量研究成果发表，但很少有 ML 系统被实际应用。目标：特别是，缺乏外部有效性可能会导致这些系统在临床实践中表现不佳。此外，ML 设计和报告中的其他方法问题也会阻碍临床应用，即使是那些能从此类数据驱动型系统中获益的应用。为了对目前帕金森病应用中的 ML 实践进行抽样调查，我们对 2020 年和 2021 年使用 ML 模型诊断帕金森病或跟踪帕金森病进展的研究进行了系统回顾。方法：2020 年 1 月至 2021 年 4 月期间，我们按照 PRISMA 在 PubMed 上进行了系统性文献综述，使用精确字符串 "帕金森病 "和（"ML "或 "预测 "或 "分类 "或 "检测 "或 "人工智能 "或 "AI"），从搜索结果中获得了 1085 篇出版物。经过搜索查询和审查，我们找到了 113 篇使用 ML 对帕金森病或帕金森病相关症状进行分类或回归预测的文献。结果发现只有 25.7% 的研究使用了暂缓测试集以避免可能出现的误差，而在没有使用暂缓测试集的研究中，约有一半的研究没有将此作为潜在的关注点。令人惊讶的是，有 38.9% 的研究没有报告如何或是否对模型进行了调整，另有 27.4% 的研究使用了特别的模型调整，而这在 ML 模型优化中通常是不受欢迎的。只有 15% 的研究将结果与其他模型进行了直接比较，这严重限制了对结果的解释。结论：本综述强调了当前 ML 系统和技术的显著局限性，这些局限性可能导致研究报告的性能与旨在检测和预测 PD 等疾病的 ML 模型的实际应用性之间存在差距。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

JMIR Medical Informatics Medicine-Health Informatics

CiteScore

7.90

自引率

3.10%

发文量

173

审稿时长

12 weeks

期刊介绍： JMIR Medical Informatics (JMI, ISSN 2291-9694) is a top-rated, tier A journal which focuses on clinical informatics, big data in health and health care, decision support for health professionals, electronic health records, ehealth infrastructures and implementation. It has a focus on applied, translational research, with a broad readership including clinicians, CIOs, engineers, industry and health informatics professionals. Published by JMIR Publications, publisher of the Journal of Medical Internet Research (JMIR), the leading eHealth/mHealth journal (Impact Factor 2016: 5.175), JMIR Med Inform has a slightly different scope (emphasizing more on applications for clinicians and health professionals rather than consumers/citizens, which is the focus of JMIR), publishes even faster, and also allows papers which are more technical or more formative than what would be published in the Journal of Medical Internet Research.