A modified spectral subtraction method for speech enhancement based on masking property of human auditory system

Bingyin Xia, Yan Liang, C. Bao
{"title":"A modified spectral subtraction method for speech enhancement based on masking property of human auditory system","authors":"Bingyin Xia, Yan Liang, C. Bao","doi":"10.1109/WCSP.2009.5371466","DOIUrl":null,"url":null,"abstract":"This paper addresses the problem of musical noise introduced by conventional spectral subtraction method for speech enhancement. A modified spectral subtraction algorithm based on the masking properties of human auditory system is proposed. In comparison with Virag's algorithm, the modification of proposed method is made from four aspects. Firstly, VAD(Voice Activity Detection) is substituted by MCRA(Minima-Controlled Recursive Averaging) algorithm to estimate the background noise; Secondly, the masking threshold is calculated based on enhanced speech by multi-band spectral subtraction method; Thirdly, the adaptive parameters of spectral subtraction method is adjusted; Finally, a modified form of parametric spectral subtraction is employed. The performance of the proposed method is evaluated under ITU-T G.160 standard. The results shows that, comparing with the reference algorithms, the proposed method provides acceptable amount of signal-to-noise ratio(SNR) improvement and noise reduction with a little impact on the level of speech. The objective speech quality is improved evidently at the same time.","PeriodicalId":244652,"journal":{"name":"2009 International Conference on Wireless Communications & Signal Processing","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Wireless Communications & Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCSP.2009.5371466","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

This paper addresses the problem of musical noise introduced by conventional spectral subtraction method for speech enhancement. A modified spectral subtraction algorithm based on the masking properties of human auditory system is proposed. In comparison with Virag's algorithm, the modification of proposed method is made from four aspects. Firstly, VAD(Voice Activity Detection) is substituted by MCRA(Minima-Controlled Recursive Averaging) algorithm to estimate the background noise; Secondly, the masking threshold is calculated based on enhanced speech by multi-band spectral subtraction method; Thirdly, the adaptive parameters of spectral subtraction method is adjusted; Finally, a modified form of parametric spectral subtraction is employed. The performance of the proposed method is evaluated under ITU-T G.160 standard. The results shows that, comparing with the reference algorithms, the proposed method provides acceptable amount of signal-to-noise ratio(SNR) improvement and noise reduction with a little impact on the level of speech. The objective speech quality is improved evidently at the same time.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种基于人听觉掩蔽特性的改进频谱减法语音增强方法
本文解决了传统频谱减法在语音增强中引入音乐噪声的问题。提出了一种基于人类听觉系统掩蔽特性的改进谱减法算法。通过与Virag算法的比较,从四个方面对本文方法进行了改进。首先,用最小控制递归平均(MCRA)算法代替VAD(Voice Activity Detection)算法来估计背景噪声;其次,采用多频带谱减法计算基于增强语音的掩蔽阈值;第三,调整谱减法的自适应参数;最后,采用了一种改进的参数谱减法。根据ITU-T G.160标准对该方法的性能进行了评估。结果表明,与参考算法相比,本文提出的方法在对语音水平影响不大的情况下,提供了可接受的信噪比提升和降噪效果。同时,客观语音质量明显提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
U-filter's gaussianization function for interference background Performance analysis on Call Admission Control in C3G-A system Performance analysis of fixed gain relaying systems in Nakagami-m fading channels A greedy algorithm for cognitive network resource allocation based on minimum remaining constraint space A new optimum jitter protection for conversational VoIP
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1