Non-intrusive technique for pathological voice classification using jitter and shimmer

N. Sripriya, S. Poornima, R. Shivaranjani, P. Thangaraju
{"title":"Non-intrusive technique for pathological voice classification using jitter and shimmer","authors":"N. Sripriya, S. Poornima, R. Shivaranjani, P. Thangaraju","doi":"10.1109/ICCCSP.2017.7944104","DOIUrl":null,"url":null,"abstract":"Speech signal contains two characteristics, system and source. When there is disturbance in vocal cord function, there is notable change in source characteristic. Despite the technological advances in the medical field, the voice pathologists use endoscopic methods to view the vocal cord flap movements for patients with infections and disturbances in vocal cords which are painful. This work is an alternative for classifying pathological voice from normal voice by evaluating the jitter and shimmer variations in the speech signal of an affected person. When there is any distortion in voice, it is reflected in the source characteristics. Pitch being the fundamental source characteristic, analyzing pitch helps us classify pathological voice from normal voice. Jitter and shimmer are derived characteristics of pitch. The glottal closure instants are better representatives of source compared to pitch. In this work, we have explored using the glottal closure instants to calculate the jitter, shimmer and other speech parameters instead of the pitch period. Analyzing these jitter and shimmer parameters for various pathological voices and normal voices help us to classify them. Experiments were carried out using a database containing normal and pathological voices. An accuracy of 85% was achieved for normal-pathological voice classification.","PeriodicalId":269595,"journal":{"name":"2017 International Conference on Computer, Communication and Signal Processing (ICCCSP)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Computer, Communication and Signal Processing (ICCCSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCSP.2017.7944104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

Speech signal contains two characteristics, system and source. When there is disturbance in vocal cord function, there is notable change in source characteristic. Despite the technological advances in the medical field, the voice pathologists use endoscopic methods to view the vocal cord flap movements for patients with infections and disturbances in vocal cords which are painful. This work is an alternative for classifying pathological voice from normal voice by evaluating the jitter and shimmer variations in the speech signal of an affected person. When there is any distortion in voice, it is reflected in the source characteristics. Pitch being the fundamental source characteristic, analyzing pitch helps us classify pathological voice from normal voice. Jitter and shimmer are derived characteristics of pitch. The glottal closure instants are better representatives of source compared to pitch. In this work, we have explored using the glottal closure instants to calculate the jitter, shimmer and other speech parameters instead of the pitch period. Analyzing these jitter and shimmer parameters for various pathological voices and normal voices help us to classify them. Experiments were carried out using a database containing normal and pathological voices. An accuracy of 85% was achieved for normal-pathological voice classification.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于抖动和闪烁的非侵入性病理语音分类技术
语音信号包含系统和源两个特征。当声带功能受到干扰时,声源特征发生显著变化。尽管医学领域的技术进步,但声音病理学家使用内窥镜方法来观察声带感染和声带紊乱患者的声带皮瓣运动,这是痛苦的。这项工作是通过评估受影响的人的语音信号中的抖动和闪烁变化来对病理语音和正常语音进行分类的一种替代方法。当声音有失真时,就会反映在音源的特性上。音高是音源的基本特征,分析音高有助于我们区分病理声音和正常声音。抖动和闪烁是基音的派生特性。与音高相比,声门关闭的瞬间更能代表音源。在这项工作中,我们探索了使用声门关闭瞬间来计算抖动,闪烁和其他语音参数,而不是音高周期。分析各种病理性声音和正常声音的抖动和闪烁参数有助于我们对它们进行分类。实验使用包含正常和病理声音的数据库进行。正常-病理语音分类准确率达到85%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Copyright form Copyright page Development of efficient VLSI architecture for speech processing in mobile communication Detection of sleep apnea from multiparameter monitor signals using empirical mode decomposition Utilization based prediction model for resource provisioning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1