Enhancement of electrolaryngeal speech using Frequency Auditory Masking and GMM based voice conversion

P. Malathi, G. R. Sureshw, M. Moorthi
{"title":"Enhancement of electrolaryngeal speech using Frequency Auditory Masking and GMM based voice conversion","authors":"P. Malathi, G. R. Sureshw, M. Moorthi","doi":"10.1109/AEEICB.2018.8480968","DOIUrl":null,"url":null,"abstract":"Laryngectomees lose their voice box after surgery and adapt various methods to restore their voice, one of them being Electrolaryngeal speech. The Electrolarynx suffers from producing natural speech by generating mechanical form of speech with suppressed unvoiced features, device and environment noise. This paper tends to remove the echo noise, device noise and environmental noise thereby enhancing the Electrolaryngeal speech to be more intelligible by spectral mapping using Gaussian Mixture Model (GMM) and auditory masking. The low frequency noise is masked by the pre-emphasised speech signal by determining the absolute threshold of masking. The spectral mapping technique using GMM based voice conversion in association with the source-filter model improves the voice quality and prosody. The objective and subjective evaluation measures, depict the significant enhancement of electrolaryngeal speech compared to previous enhancement methods which removed only low frequency noise and failed to include voice quality.","PeriodicalId":423671,"journal":{"name":"2018 Fourth International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Fourth International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AEEICB.2018.8480968","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

Laryngectomees lose their voice box after surgery and adapt various methods to restore their voice, one of them being Electrolaryngeal speech. The Electrolarynx suffers from producing natural speech by generating mechanical form of speech with suppressed unvoiced features, device and environment noise. This paper tends to remove the echo noise, device noise and environmental noise thereby enhancing the Electrolaryngeal speech to be more intelligible by spectral mapping using Gaussian Mixture Model (GMM) and auditory masking. The low frequency noise is masked by the pre-emphasised speech signal by determining the absolute threshold of masking. The spectral mapping technique using GMM based voice conversion in association with the source-filter model improves the voice quality and prosody. The objective and subjective evaluation measures, depict the significant enhancement of electrolaryngeal speech compared to previous enhancement methods which removed only low frequency noise and failed to include voice quality.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用频率听觉掩蔽和基于GMM的语音转换增强喉电语音
喉切除术患者术后失去了声带,采用各种方法来恢复声音,其中一种是电喉言语。电喉通过产生机械形式的语音来产生自然语音,这些语音带有被抑制的未发声特征、设备和环境噪声。本文采用高斯混合模型(Gaussian Mixture Model, GMM)和听觉掩蔽相结合的频谱映射方法,去除回声噪声、设备噪声和环境噪声,提高喉电语音的可理解性。通过确定屏蔽的绝对阈值,低频噪声被预先强化的语音信号所掩盖。基于GMM的语音转换频谱映射技术与源-滤波器模型相结合,提高了语音质量和音韵。客观和主观评价指标,描述了与之前的增强方法相比,电喉语音的显著增强,这些增强方法仅去除低频噪声,未能包括语音质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Microbial Isolates for Enhancement of Seed Germination A Precise on Wearable ECG Electrodes for Detection Of Heart Rate and Arrthymia Classification Web based Biometric Validation Using Biological Identities: An Elaborate Survey Induction Motor Parameter Monitoring System using Zig bee Protocol & MATLAB GUI : Automated Monitoring System Compressive Sensing and Hyper-Chaos Based Image Compression-Encryption
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1