利用频率听觉掩蔽和基于GMM的语音转换增强喉电语音

2018 Fourth International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB) Pub Date : 2018-02-01 DOI:10.1109/AEEICB.2018.8480968

P. Malathi, G. R. Sureshw, M. Moorthi

{"title":"利用频率听觉掩蔽和基于GMM的语音转换增强喉电语音","authors":"P. Malathi, G. R. Sureshw, M. Moorthi","doi":"10.1109/AEEICB.2018.8480968","DOIUrl":null,"url":null,"abstract":"Laryngectomees lose their voice box after surgery and adapt various methods to restore their voice, one of them being Electrolaryngeal speech. The Electrolarynx suffers from producing natural speech by generating mechanical form of speech with suppressed unvoiced features, device and environment noise. This paper tends to remove the echo noise, device noise and environmental noise thereby enhancing the Electrolaryngeal speech to be more intelligible by spectral mapping using Gaussian Mixture Model (GMM) and auditory masking. The low frequency noise is masked by the pre-emphasised speech signal by determining the absolute threshold of masking. The spectral mapping technique using GMM based voice conversion in association with the source-filter model improves the voice quality and prosody. The objective and subjective evaluation measures, depict the significant enhancement of electrolaryngeal speech compared to previous enhancement methods which removed only low frequency noise and failed to include voice quality.","PeriodicalId":423671,"journal":{"name":"2018 Fourth International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Enhancement of electrolaryngeal speech using Frequency Auditory Masking and GMM based voice conversion\",\"authors\":\"P. Malathi, G. R. Sureshw, M. Moorthi\",\"doi\":\"10.1109/AEEICB.2018.8480968\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Laryngectomees lose their voice box after surgery and adapt various methods to restore their voice, one of them being Electrolaryngeal speech. The Electrolarynx suffers from producing natural speech by generating mechanical form of speech with suppressed unvoiced features, device and environment noise. This paper tends to remove the echo noise, device noise and environmental noise thereby enhancing the Electrolaryngeal speech to be more intelligible by spectral mapping using Gaussian Mixture Model (GMM) and auditory masking. The low frequency noise is masked by the pre-emphasised speech signal by determining the absolute threshold of masking. The spectral mapping technique using GMM based voice conversion in association with the source-filter model improves the voice quality and prosody. The objective and subjective evaluation measures, depict the significant enhancement of electrolaryngeal speech compared to previous enhancement methods which removed only low frequency noise and failed to include voice quality.\",\"PeriodicalId\":423671,\"journal\":{\"name\":\"2018 Fourth International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Fourth International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AEEICB.2018.8480968\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Fourth International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AEEICB.2018.8480968","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

摘要

喉切除术患者术后失去了声带，采用各种方法来恢复声音，其中一种是电喉言语。电喉通过产生机械形式的语音来产生自然语音，这些语音带有被抑制的未发声特征、设备和环境噪声。本文采用高斯混合模型(Gaussian Mixture Model, GMM)和听觉掩蔽相结合的频谱映射方法，去除回声噪声、设备噪声和环境噪声，提高喉电语音的可理解性。通过确定屏蔽的绝对阈值，低频噪声被预先强化的语音信号所掩盖。基于GMM的语音转换频谱映射技术与源-滤波器模型相结合，提高了语音质量和音韵。客观和主观评价指标，描述了与之前的增强方法相比，电喉语音的显著增强，这些增强方法仅去除低频噪声，未能包括语音质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Enhancement of electrolaryngeal speech using Frequency Auditory Masking and GMM based voice conversion

Laryngectomees lose their voice box after surgery and adapt various methods to restore their voice, one of them being Electrolaryngeal speech. The Electrolarynx suffers from producing natural speech by generating mechanical form of speech with suppressed unvoiced features, device and environment noise. This paper tends to remove the echo noise, device noise and environmental noise thereby enhancing the Electrolaryngeal speech to be more intelligible by spectral mapping using Gaussian Mixture Model (GMM) and auditory masking. The low frequency noise is masked by the pre-emphasised speech signal by determining the absolute threshold of masking. The spectral mapping technique using GMM based voice conversion in association with the source-filter model improves the voice quality and prosody. The objective and subjective evaluation measures, depict the significant enhancement of electrolaryngeal speech compared to previous enhancement methods which removed only low frequency noise and failed to include voice quality.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 Fourth International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB)

自引率

0.00%

发文量