Sai Bharadwaj Appakaya, Ruchira Pratihar, Ravi Sankar
{"title":"Parkinson’s Disease Classification Framework Using Vocal Dynamics in Connected Speech","authors":"Sai Bharadwaj Appakaya, Ruchira Pratihar, Ravi Sankar","doi":"10.3390/a16110509","DOIUrl":null,"url":null,"abstract":"Parkinson’s disease (PD) classification through speech has been an advancing field of research because of its ease of acquisition and processing. The minimal infrastructure requirements of the system have also made it suitable for telemonitoring applications. Researchers have studied the effects of PD on speech from various perspectives using different speech tasks. Typical speech deficits due to PD include voice monotony (e.g., monopitch), breathy or rough quality, and articulatory errors. In connected speech, these symptoms are more emphatic, which is also the basis for speech assessment in popular rating scales used for PD, like the Unified Parkinson’s Disease Rating Scale (UPDRS) and Hoehn and Yahr (HY). The current study introduces an innovative framework that integrates pitch-synchronous segmentation and an optimized set of features to investigate and analyze continuous speech from both PD patients and healthy controls (HC). Comparison of the proposed framework against existing methods has shown its superiority in classification performance and mitigation of overfitting in machine learning models. A set of optimal classifiers with unbiased decision-making was identified after comparing several machine learning models. The outcomes yielded by the classifiers demonstrate that the framework effectively learns the intrinsic characteristics of PD from connected speech, which can potentially offer valuable assistance in clinical diagnosis.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"30 16","pages":"0"},"PeriodicalIF":1.8000,"publicationDate":"2023-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Algorithms","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/a16110509","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Parkinson’s disease (PD) classification through speech has been an advancing field of research because of its ease of acquisition and processing. The minimal infrastructure requirements of the system have also made it suitable for telemonitoring applications. Researchers have studied the effects of PD on speech from various perspectives using different speech tasks. Typical speech deficits due to PD include voice monotony (e.g., monopitch), breathy or rough quality, and articulatory errors. In connected speech, these symptoms are more emphatic, which is also the basis for speech assessment in popular rating scales used for PD, like the Unified Parkinson’s Disease Rating Scale (UPDRS) and Hoehn and Yahr (HY). The current study introduces an innovative framework that integrates pitch-synchronous segmentation and an optimized set of features to investigate and analyze continuous speech from both PD patients and healthy controls (HC). Comparison of the proposed framework against existing methods has shown its superiority in classification performance and mitigation of overfitting in machine learning models. A set of optimal classifiers with unbiased decision-making was identified after comparing several machine learning models. The outcomes yielded by the classifiers demonstrate that the framework effectively learns the intrinsic characteristics of PD from connected speech, which can potentially offer valuable assistance in clinical diagnosis.
通过言语分类帕金森病(PD)因其易于获取和处理而成为一个前沿研究领域。该系统对基础设施的最低要求也使其适合远程监控应用。研究者利用不同的言语任务从不同的角度研究了PD对言语的影响。由PD引起的典型言语缺陷包括声音单调(例如,单音),呼吸或粗糙的质量,以及发音错误。在关联言语中,这些症状更加突出,这也是常用的PD评定量表(如统一帕金森病评定量表(UPDRS)和Hoehn and Yahr (HY))的言语评估基础。目前的研究引入了一个创新的框架,该框架集成了音高同步分割和一组优化的功能,用于调查和分析PD患者和健康对照(HC)的连续语音。将提出的框架与现有方法进行比较,表明其在分类性能和缓解机器学习模型的过拟合方面具有优势。通过比较几种机器学习模型,确定了一组具有无偏决策的最优分类器。分类器产生的结果表明,该框架可以有效地从连接语音中学习PD的内在特征,这可能为临床诊断提供有价值的帮助。