Fine-Tuned Machine Learning Classifiers for Diagnosing Parkinson's Disease Using Vocal Characteristics: A Comparative Analysis.

IF 3.3 3区 医学 Q1 MEDICINE, GENERAL & INTERNAL Diagnostics Pub Date : 2025-03-06 DOI:10.3390/diagnostics15050645
Mehmet Meral, Ferdi Ozbilgin, Fatih Durmus
{"title":"Fine-Tuned Machine Learning Classifiers for Diagnosing Parkinson's Disease Using Vocal Characteristics: A Comparative Analysis.","authors":"Mehmet Meral, Ferdi Ozbilgin, Fatih Durmus","doi":"10.3390/diagnostics15050645","DOIUrl":null,"url":null,"abstract":"<p><p><b>Background/Objectives</b>: This paper is significant in highlighting the importance of early and precise diagnosis of Parkinson's Disease (PD) that affects both motor and non-motor functions to achieve better disease control and patient outcomes. This study seeks to assess the effectiveness of machine learning algorithms optimized to classify PD based on vocal characteristics to serve as a non-invasive and easily accessible diagnostic tool. <b>Methods</b>: This study used a publicly available dataset of vocal samples from 188 people with PD and 64 controls. Acoustic features like baseline characteristics, time-frequency components, Mel Frequency Cepstral Coefficients (MFCCs), and wavelet transform-based metrics were extracted and analyzed. The Chi-Square test was used for feature selection to determine the most important attributes that enhanced the accuracy of the classification. Six different machine learning classifiers, namely SVM, k-NN, DT, NN, Ensemble and Stacking models, were developed and optimized via Bayesian Optimization (BO), Grid Search (GS) and Random Search (RS). Accuracy, precision, recall, F1-score and AUC-ROC were used for evaluation. <b>Results</b>: It has been found that Stacking models, especially those fine-tuned via Grid Search, yielded the best performance with 92.07% accuracy and an F1-score of 0.95. In addition to that, the choice of relevant vocal features, in conjunction with the Chi-Square feature selection method, greatly enhanced the computational efficiency and classification performance. <b>Conclusions</b>: This study highlights the potential of combining advanced feature selection techniques with hyperparameter optimization strategies to enhance machine learning-based PD diagnosis using vocal characteristics. Ensemble models proved particularly effective in handling complex datasets, demonstrating robust diagnostic performance. Future research may focus on deep learning approaches and temporal feature integration to further improve diagnostic accuracy and scalability for clinical applications.</p>","PeriodicalId":11225,"journal":{"name":"Diagnostics","volume":"15 5","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11898924/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diagnostics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3390/diagnostics15050645","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background/Objectives: This paper is significant in highlighting the importance of early and precise diagnosis of Parkinson's Disease (PD) that affects both motor and non-motor functions to achieve better disease control and patient outcomes. This study seeks to assess the effectiveness of machine learning algorithms optimized to classify PD based on vocal characteristics to serve as a non-invasive and easily accessible diagnostic tool. Methods: This study used a publicly available dataset of vocal samples from 188 people with PD and 64 controls. Acoustic features like baseline characteristics, time-frequency components, Mel Frequency Cepstral Coefficients (MFCCs), and wavelet transform-based metrics were extracted and analyzed. The Chi-Square test was used for feature selection to determine the most important attributes that enhanced the accuracy of the classification. Six different machine learning classifiers, namely SVM, k-NN, DT, NN, Ensemble and Stacking models, were developed and optimized via Bayesian Optimization (BO), Grid Search (GS) and Random Search (RS). Accuracy, precision, recall, F1-score and AUC-ROC were used for evaluation. Results: It has been found that Stacking models, especially those fine-tuned via Grid Search, yielded the best performance with 92.07% accuracy and an F1-score of 0.95. In addition to that, the choice of relevant vocal features, in conjunction with the Chi-Square feature selection method, greatly enhanced the computational efficiency and classification performance. Conclusions: This study highlights the potential of combining advanced feature selection techniques with hyperparameter optimization strategies to enhance machine learning-based PD diagnosis using vocal characteristics. Ensemble models proved particularly effective in handling complex datasets, demonstrating robust diagnostic performance. Future research may focus on deep learning approaches and temporal feature integration to further improve diagnostic accuracy and scalability for clinical applications.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用声音特征诊断帕金森病的微调机器学习分类器:比较分析。
背景/目的:本文强调了早期和精确诊断影响运动和非运动功能的帕金森病(PD)的重要性,以实现更好的疾病控制和患者预后。本研究旨在评估优化的机器学习算法的有效性,该算法基于声音特征对PD进行分类,以作为一种无创且易于获取的诊断工具。方法:本研究使用了188名PD患者和64名对照组的公开可用的声音样本数据集。提取并分析了声学特征,如基线特征、时频分量、Mel频率倒谱系数(MFCCs)和基于小波变换的指标。使用卡方检验进行特征选择,以确定提高分类准确性的最重要属性。通过贝叶斯优化(BO)、网格搜索(GS)和随机搜索(RS),开发并优化了SVM、k-NN、DT、NN、Ensemble和Stacking 6种不同的机器学习分类器。采用正确率、精密度、召回率、f1评分和AUC-ROC进行评价。结果:通过网格搜索优化后的叠加模型,准确率达到92.07%,f1得分为0.95。除此之外,选择相关的声音特征,结合卡方特征选择方法,大大提高了计算效率和分类性能。结论:本研究强调了将先进的特征选择技术与超参数优化策略相结合的潜力,以增强基于机器学习的PD诊断。集成模型被证明在处理复杂数据集方面特别有效,展示了强大的诊断性能。未来的研究可能会集中在深度学习方法和时间特征集成上,以进一步提高诊断的准确性和临床应用的可扩展性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Diagnostics
Diagnostics Biochemistry, Genetics and Molecular Biology-Clinical Biochemistry
CiteScore
4.70
自引率
8.30%
发文量
2699
审稿时长
19.64 days
期刊介绍: Diagnostics (ISSN 2075-4418) is an international scholarly open access journal on medical diagnostics. It publishes original research articles, reviews, communications and short notes on the research and development of medical diagnostics. There is no restriction on the length of the papers. Our aim is to encourage scientists to publish their experimental and theoretical research in as much detail as possible. Full experimental and/or methodological details must be provided for research articles.
期刊最新文献
Cephalometric Assessment of Airway-Related Hyoid Position and Velar Morphology Across Skeletal Malocclusions: A Cross-Sectional Study. Comparative Diagnostic Performance of TST and IGRAs in the Diagnosis of Latent Tuberculosis Infection: A Systematic Review and Diagnostic Meta-Analysis. Uncertainty-Aware Framework for CT Radiation Dose Optimization in the Active Surveillance of Small Renal Masses: Clinical and Radiological Considerations. Digital Specimen Tracking- and ISO 15189-Oriented Risk Management in Anatomic Pathology: A Qualitative Study of Expert Perspectives in Western Austria. Intrapartum Ultrasound in Vacuum Operative Delivery: A Comprehensive Review and Proposal of the Novel Ultrasound Flexion Point Method.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1