Real-time enhancement of electrolaryngeal speech by spectral subtraction

S. K. Basha, P. C. Pandey
Published in: 2012 National Conference on Communications (NCC), 2012-04-03. DOI: 10.1109/NCC.2012.6176807 (https://doi.org/10.1109/NCC.2012.6176807)
Citations: 13

Abstract

An electrolarynx, a vibrator held against the neck tissue, is used by laryngectomy patients to provide excitation to the vocal tract as a substitute for that provided by the glottis. The quality and intelligibility of electrolaryngeal speech are generally poor because of background noise caused by leakage of acoustic energy from the vibrator and the vibrator-tissue interface. This noise can be suppressed by pitch-synchronous application of spectral subtraction. The paper presents a real-time implementation of spectral subtraction for enhancement of electrolaryngeal speech, using a 16-bit fixed-point DSP board. Electrolaryngeal speech is continuously acquired at 12 kHz into the input buffers using a codec and DMA. It is processed using a 256-point FFT, 3-frame 4-stage cascaded median-based dynamic estimation of noise, spectral subtraction, and an IFFT, with a two-pitch-period window and 50% overlap. The resynthesized speech is output using DMA and the codec.
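The processing chain named in the abstract (windowed FFT, median-based noise estimate, spectral subtraction, IFFT, 50% overlap-add) can be sketched offline as below. This is a minimal floating-point illustration, not the authors' fixed-point DSP code, and it rests on several assumptions that are not in the abstract: a fixed 256-sample periodic Hann window stands in for the pitch-synchronous two-pitch-period window; "3-frame 4-stage cascaded median" is read as four chained 3-point medians per frequency bin, each stage feeding its median to the next; subtraction is done on magnitudes with an over-subtraction factor `alpha` and a spectral floor `beta`, both illustrative values; and no subtraction is applied until the last median stage has produced its first estimate. The names `fft`, `CascadedMedian`, and `enhance` are hypothetical.

```python
import cmath
import math
from statistics import median

N = 256        # FFT size (from the abstract)
HOP = N // 2   # 50% overlap (from the abstract)


def fft(x, inverse=False):
    """Recursive radix-2 FFT. len(x) must be a power of two.
    The inverse transform is left unscaled (caller divides by len(x))."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(sign * 2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


class CascadedMedian:
    """Chain of `stages` medians, each taken over `frames` values.
    Assumed reading of the abstract: a stage emits its median to the
    next stage once it has collected `frames` inputs, then resets."""

    def __init__(self, frames=3, stages=4):
        self.frames = frames
        self.buffers = [[] for _ in range(stages)]
        self.estimate = None  # None until the last stage has fired once

    def update(self, value):
        for buf in self.buffers:
            buf.append(value)
            if len(buf) < self.frames:
                return self.estimate  # keep the previous estimate
            value = median(buf)
            buf.clear()
        self.estimate = value
        return self.estimate


def enhance(signal, alpha=2.0, beta=0.01, frames=3, stages=4):
    """Magnitude spectral subtraction with overlap-add resynthesis."""
    # Periodic Hann window: shifted copies sum to 1 at 50% overlap.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / N) for n in range(N)]
    trackers = [CascadedMedian(frames, stages) for _ in range(N // 2 + 1)]
    out = [0.0] * len(signal)
    for start in range(0, len(signal) - N + 1, HOP):
        spec = fft([signal[start + n] * window[n] for n in range(N)])
        for k in range(N // 2 + 1):
            mag = abs(spec[k])
            noise = trackers[k].update(mag) or 0.0  # 0 until an estimate exists
            clean = max(mag - alpha * noise, beta * mag)  # subtract, then floor
            spec[k] *= clean / mag if mag > 0 else 0.0    # keep the noisy phase
            if 0 < k < N // 2:
                spec[N - k] = spec[k].conjugate()         # keep the output real
        frame = fft(spec, inverse=True)
        for n in range(N):
            out[start + n] += frame[n].real / N
    return out
```

Calling `enhance(samples)` on a list of floats sampled at 12 kHz returns a list of the same length. With the default four stages the noise estimate is refreshed only once every 81 frames (3 to the power 4), so the first 81 frames pass through unchanged in this sketch; a real-time implementation would presumably need some initial estimate, which the abstract does not describe.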