Thasina Tabashum, Robert Cooper Snyder, Megan K O'Brien, Mark V Albert
{"title":"Machine Learning Models for Parkinson Disease: Systematic Review","authors":"Thasina Tabashum, Robert Cooper Snyder, Megan K O'Brien, Mark V Albert","doi":"10.2196/50117","DOIUrl":null,"url":null,"abstract":"Background: With the increasing availability of data, computing resources, and easier-to-use software libraries, machine learning (ML) is increasingly utilized in disease detection and prediction, including Parkinson’s disease (PD). Despite the large number of studies published every year, very few ML systems have been adopted for real-world subject use. Objective: In particular, a lack of external validity may result in poor performance of these systems in clinical practice. Additional methodological issues in ML design and reporting can also hinder clinical adoption, even for applications that would benefit from such data-driven systems. To sample the current ML practices in PD applications, we conducted a systematic review of studies in 2020 and 2021 that use ML models to diagnose PD or to track PD progression. Methods: We conducted a systematic literature review in accordance with PRISMA in PubMed between January 2020 - April 2021, using the exact string “Parkinson’s” AND (“ML” OR “prediction” OR “classification” OR “detection” or “artificial intelligence” OR “AI”), resulting in 1085 publications from the search results. After a search query and review, we found 113 publications that used ML for the classification or regression-based prediction of PD or PD-related symptoms. Results: Only 25.7% of studies used a hold-out test set to avoid potentially inflated accuracies, and approximately half of the studies without a hold-out test set did not state this as a potential concern. Surprisingly, 38.9% of studies did not report on how or if models were tuned, and an additional 27.4% used ad hoc model tuning, which is generally frowned upon in ML model optimization. Only 15% of studies performed direct comparisons of results with other models, severely limiting the interpretation of results. Conclusions: This review highlights the notable limitations of current ML systems and techniques that may contribute to a gap between reported performance in research and the real-life applicability of ML models aiming to detect and predict diseases such as PD.","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.1000,"publicationDate":"2024-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/50117","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: With the increasing availability of data, computing resources, and easier-to-use software libraries, machine learning (ML) is increasingly utilized in disease detection and prediction, including Parkinson’s disease (PD). Despite the large number of studies published every year, very few ML systems have been adopted for real-world subject use. Objective: In particular, a lack of external validity may result in poor performance of these systems in clinical practice. Additional methodological issues in ML design and reporting can also hinder clinical adoption, even for applications that would benefit from such data-driven systems. To sample the current ML practices in PD applications, we conducted a systematic review of studies in 2020 and 2021 that use ML models to diagnose PD or to track PD progression. Methods: We conducted a systematic literature review in accordance with PRISMA in PubMed between January 2020 - April 2021, using the exact string “Parkinson’s” AND (“ML” OR “prediction” OR “classification” OR “detection” or “artificial intelligence” OR “AI”), resulting in 1085 publications from the search results. After a search query and review, we found 113 publications that used ML for the classification or regression-based prediction of PD or PD-related symptoms. Results: Only 25.7% of studies used a hold-out test set to avoid potentially inflated accuracies, and approximately half of the studies without a hold-out test set did not state this as a potential concern. Surprisingly, 38.9% of studies did not report on how or if models were tuned, and an additional 27.4% used ad hoc model tuning, which is generally frowned upon in ML model optimization. Only 15% of studies performed direct comparisons of results with other models, severely limiting the interpretation of results. Conclusions: This review highlights the notable limitations of current ML systems and techniques that may contribute to a gap between reported performance in research and the real-life applicability of ML models aiming to detect and predict diseases such as PD.
期刊介绍:
JMIR Medical Informatics (JMI, ISSN 2291-9694) is a top-rated, tier A journal which focuses on clinical informatics, big data in health and health care, decision support for health professionals, electronic health records, ehealth infrastructures and implementation. It has a focus on applied, translational research, with a broad readership including clinicians, CIOs, engineers, industry and health informatics professionals.
Published by JMIR Publications, publisher of the Journal of Medical Internet Research (JMIR), the leading eHealth/mHealth journal (Impact Factor 2016: 5.175), JMIR Med Inform has a slightly different scope (emphasizing more on applications for clinicians and health professionals rather than consumers/citizens, which is the focus of JMIR), publishes even faster, and also allows papers which are more technical or more formative than what would be published in the Journal of Medical Internet Research.