Speech enhancement of non-stationary noise based on controlled forward moving average

D. Farrokhi, R. Togneri, A. Zaknich
{"title":"Speech enhancement of non-stationary noise based on controlled forward moving average","authors":"D. Farrokhi, R. Togneri, A. Zaknich","doi":"10.1109/ISCIT.2007.4392263","DOIUrl":null,"url":null,"abstract":"A pre and post processing technique is proposed to enhance the speech signal of highly non-stationary noisy speech. The purpose of this research has been to build on current speech enhancement algorithms to produce an improved algorithm for enhancement of speech contaminated with non-stationary babble type noise. The pre processing involves two stages. In stage one, the variance of the noisy speech spectrum is reduced by utilizing the Discrete or Prolate Spheroidal Sequence (DPSS) multi-taper algorithm plus a Controlled Forward Moving Average (CFMA) technique. We introduced the CFMA algorithm to smooth and reduce variance of the estimated non-stationary noise spectrum. In the second stage the noisy speech power spectrum is de-noised by applying Stein's Unbiased Risk Estimator (SURE) wavelet thresholding technique. In the third layer, use is made of a noise estimation algorithm with rapid adaptation for a highly non-stationary noise environment. The noise estimate is updated in three frequency sub-bands, by averaging the noisy speech power spectrum using a frequency dependent smoothing factor, which is adjusted, based on a signal presence probability factor. In the fourth layer a spectral subtraction algorithm is used to enhance the speech signal, by subtracting each estimated noise from the original noisy speech. The new proposed post processing is then applied to the complete signal when the speech enhancement is processed using segmental speech enhancement. The enhanced signal is further improved by applying a soft wavelet thresholding technique to the un-segmented enhanced speech at the final processing stage. The results show improvements both quantitatively and qualitatively compared to the speech enhancement that does not apply the CFMA algorithm.","PeriodicalId":331439,"journal":{"name":"2007 International Symposium on Communications and Information Technologies","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 International Symposium on Communications and Information Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCIT.2007.4392263","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

A pre and post processing technique is proposed to enhance the speech signal of highly non-stationary noisy speech. The purpose of this research has been to build on current speech enhancement algorithms to produce an improved algorithm for enhancement of speech contaminated with non-stationary babble type noise. The pre processing involves two stages. In stage one, the variance of the noisy speech spectrum is reduced by utilizing the Discrete or Prolate Spheroidal Sequence (DPSS) multi-taper algorithm plus a Controlled Forward Moving Average (CFMA) technique. We introduced the CFMA algorithm to smooth and reduce variance of the estimated non-stationary noise spectrum. In the second stage the noisy speech power spectrum is de-noised by applying Stein's Unbiased Risk Estimator (SURE) wavelet thresholding technique. In the third layer, use is made of a noise estimation algorithm with rapid adaptation for a highly non-stationary noise environment. The noise estimate is updated in three frequency sub-bands, by averaging the noisy speech power spectrum using a frequency dependent smoothing factor, which is adjusted, based on a signal presence probability factor. In the fourth layer a spectral subtraction algorithm is used to enhance the speech signal, by subtracting each estimated noise from the original noisy speech. The new proposed post processing is then applied to the complete signal when the speech enhancement is processed using segmental speech enhancement. The enhanced signal is further improved by applying a soft wavelet thresholding technique to the un-segmented enhanced speech at the final processing stage. The results show improvements both quantitatively and qualitatively compared to the speech enhancement that does not apply the CFMA algorithm.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于可控前向移动平均的非平稳噪声语音增强
针对高非平稳噪声语音,提出了一种预处理和后处理技术来增强语音信号。本研究的目的是在现有的语音增强算法的基础上,提出一种改进的算法,用于增强受非平稳胡言乱语类型噪声污染的语音。预处理包括两个阶段。在第一阶段,通过使用离散或延长球体序列(DPSS)多锥度算法加上可控前向移动平均(CFMA)技术来减小噪声语音频谱的方差。引入CFMA算法对估计的非平稳噪声谱进行平滑处理并减小方差。第二阶段采用Stein无偏风险估计(SURE)小波阈值技术对噪声语音功率谱进行降噪。在第三层,使用了一种快速适应高度非平稳噪声环境的噪声估计算法。通过使用频率相关平滑因子对噪声语音功率谱进行平均,并根据信号存在概率因子对其进行调整,从而在三个频率子带中更新噪声估计。在第四层,通过从原始带噪声的语音中减去每个估计的噪声,使用频谱减法算法来增强语音信号。当使用分段语音增强处理语音增强时,将新提出的后处理应用于完整信号。在最后的处理阶段,对未分割的增强语音进行软小波阈值处理,进一步改善了增强信号。结果表明,与不使用CFMA算法的语音增强相比,语音增强在数量和质量上都有提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
An improved quantization scheme for lattice-reduction aided MIMO detection PAPR reduction in the OFDM system employing Tone Reservation based on FFT/IFFT Implementation and analysis of configurable Real Time Address Trace Compressor for embedded microprocessors Measured improvement of indoor coverage for fixed wireless loops with multiple antenna receivers Real-time wideband MIMO demonstrator
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1