喉切除术后声带参数提取的康复监测

A. Carullo, A. Atzori, L. Midolo, A. Vallan, M. Fantini, G. Succo
{"title":"喉切除术后声带参数提取的康复监测","authors":"A. Carullo, A. Atzori, L. Midolo, A. Vallan, M. Fantini, G. Succo","doi":"10.1109/MeMeA54994.2022.9856487","DOIUrl":null,"url":null,"abstract":"This paper deals with the analysis of substitution voices of post-laryngectomy patients with the aim of identifying a methodology to track the effectiveness of rehabilitation therapies. The available data-set includes 22 patients that had undergone Open Partial Horizontal Laryngectomy (OPHL) of type II and 10 patients that had undergone OPHL of type III. A pre-processing algorithm that relies on the spectral kurtosis has been designed to remove non-harmonic frames from the available recordings of patients, thus minimizing the number of outliers among the extracted parameters. Such an algorithm has been tuned starting form the results of a control group of 10 healthy subjects. After this preliminary step, from the harmonic frames a series of parameters have been extracted that belong to spectral domain (tilt, kurtosis, entropy and Soft Phonation Index) and cepstral domain (Mel-Frequency Cepstral Coefficients, MFCCs, and Cepstral Peak Prominence Smoothed, CPPS). Then, the patients have been subdivided into two classes according to the index I (Intelligibility) of the auditory perceptual scale INFVo and a Kolmogorov-Smirnov two-samples test has been run, which has highlighted that low-band MFCCs, spectral entropy and spectral kurtosis show the best discrimination capability of substitution voices. This outcome has been confirmed by an alternative method that is based on the performance of classification algorithms. A classification accuracy of about 81% has been obtained using a logistic regression model that was trained with median of MFCC3, range of MFCC4 and 95° percentiles of MFCC6 and MFCC9. The same accuracy has been provided by a coarse decision tree algorithm trained with skewness of MFCC1, median of MFCC3 and 95° percentile of spectral entropy.","PeriodicalId":106228,"journal":{"name":"2022 IEEE International Symposium on Medical Measurements and Applications (MeMeA)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Rehabilitation Monitoring of Post-Laryngectomy Patients through the Extraction of Vocal Parameters\",\"authors\":\"A. Carullo, A. Atzori, L. Midolo, A. Vallan, M. Fantini, G. Succo\",\"doi\":\"10.1109/MeMeA54994.2022.9856487\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper deals with the analysis of substitution voices of post-laryngectomy patients with the aim of identifying a methodology to track the effectiveness of rehabilitation therapies. The available data-set includes 22 patients that had undergone Open Partial Horizontal Laryngectomy (OPHL) of type II and 10 patients that had undergone OPHL of type III. A pre-processing algorithm that relies on the spectral kurtosis has been designed to remove non-harmonic frames from the available recordings of patients, thus minimizing the number of outliers among the extracted parameters. Such an algorithm has been tuned starting form the results of a control group of 10 healthy subjects. After this preliminary step, from the harmonic frames a series of parameters have been extracted that belong to spectral domain (tilt, kurtosis, entropy and Soft Phonation Index) and cepstral domain (Mel-Frequency Cepstral Coefficients, MFCCs, and Cepstral Peak Prominence Smoothed, CPPS). Then, the patients have been subdivided into two classes according to the index I (Intelligibility) of the auditory perceptual scale INFVo and a Kolmogorov-Smirnov two-samples test has been run, which has highlighted that low-band MFCCs, spectral entropy and spectral kurtosis show the best discrimination capability of substitution voices. This outcome has been confirmed by an alternative method that is based on the performance of classification algorithms. A classification accuracy of about 81% has been obtained using a logistic regression model that was trained with median of MFCC3, range of MFCC4 and 95° percentiles of MFCC6 and MFCC9. The same accuracy has been provided by a coarse decision tree algorithm trained with skewness of MFCC1, median of MFCC3 and 95° percentile of spectral entropy.\",\"PeriodicalId\":106228,\"journal\":{\"name\":\"2022 IEEE International Symposium on Medical Measurements and Applications (MeMeA)\",\"volume\":\"55 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Symposium on Medical Measurements and Applications (MeMeA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MeMeA54994.2022.9856487\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Symposium on Medical Measurements and Applications (MeMeA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MeMeA54994.2022.9856487","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

本文分析了喉切除术后患者的替代声音,目的是确定一种方法来跟踪康复治疗的有效性。现有数据集包括22例II型开放式部分水平喉切除术(OPHL)患者和10例III型OPHL患者。一种基于谱峰度的预处理算法被设计用于从患者的可用记录中去除非谐波帧,从而最大限度地减少提取参数中的异常值的数量。这种算法是从10名健康受试者的对照组的结果开始调整的。在这一初步步骤之后,从谐波帧中提取了一系列属于谱域(倾斜、峰度、熵和软音指数)和倒谱域(Mel-Frequency cepstral Coefficients, MFCCs,和cepstral Peak珥Smoothed, CPPS)的参数。然后,根据听觉感知量表(INFVo)的可理解性指数I将患者细分为两类,并进行Kolmogorov-Smirnov双样本检验,结果表明低频mfcc、谱熵和谱峰度表现出最佳的替代语音识别能力。这一结果已被一种基于分类算法性能的替代方法所证实。使用MFCC3的中位数、MFCC4的极差以及MFCC6和MFCC9的95°百分位数训练的逻辑回归模型,获得了约81%的分类准确率。用MFCC1的偏度、MFCC3的中位数和谱熵的95°百分位数训练的粗糙决策树算法也能提供相同的精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Rehabilitation Monitoring of Post-Laryngectomy Patients through the Extraction of Vocal Parameters
This paper deals with the analysis of substitution voices of post-laryngectomy patients with the aim of identifying a methodology to track the effectiveness of rehabilitation therapies. The available data-set includes 22 patients that had undergone Open Partial Horizontal Laryngectomy (OPHL) of type II and 10 patients that had undergone OPHL of type III. A pre-processing algorithm that relies on the spectral kurtosis has been designed to remove non-harmonic frames from the available recordings of patients, thus minimizing the number of outliers among the extracted parameters. Such an algorithm has been tuned starting form the results of a control group of 10 healthy subjects. After this preliminary step, from the harmonic frames a series of parameters have been extracted that belong to spectral domain (tilt, kurtosis, entropy and Soft Phonation Index) and cepstral domain (Mel-Frequency Cepstral Coefficients, MFCCs, and Cepstral Peak Prominence Smoothed, CPPS). Then, the patients have been subdivided into two classes according to the index I (Intelligibility) of the auditory perceptual scale INFVo and a Kolmogorov-Smirnov two-samples test has been run, which has highlighted that low-band MFCCs, spectral entropy and spectral kurtosis show the best discrimination capability of substitution voices. This outcome has been confirmed by an alternative method that is based on the performance of classification algorithms. A classification accuracy of about 81% has been obtained using a logistic regression model that was trained with median of MFCC3, range of MFCC4 and 95° percentiles of MFCC6 and MFCC9. The same accuracy has been provided by a coarse decision tree algorithm trained with skewness of MFCC1, median of MFCC3 and 95° percentile of spectral entropy.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Time and Frequency domain analysis of APB muscles Abduction in adult dominant hand using surface electromyography signals Comparison of Noise Reduction Techniques for Dysarthric Speech Recognition Effects of ROI positioning on the measurement of engineered muscle tissue contractility with an optical tracking method Atrial Fibrillation Detection by Means of Edge Computing on Wearable Device: A Feasibility Assessment Unraveling the biological meaning of radiomic features
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1