Kernel fitting for speech detection and enhancement

Benyong Liu, Jing Zhang, Xiang Liao
Published in: IEEE 10th International Conference on Signal Processing Proceedings, 2010-12-03
DOI: 10.1109/ICOSP.2010.5656090 (https://doi.org/10.1109/ICOSP.2010.5656090)
Citations: 2

Abstract

A kernel fitting algorithm is proposed for speech denoising, with the aim of improving both the precision of voice activity detection (VAD) and the speech enhancement performance of some popular algorithms. In the algorithm, a noisy speech frame is first filtered by kernel fitting; its power spectral density is then estimated and weighted by a gain factor constructed from the frame energy and the zero-crossing rate, so that speech frames are clearly discriminated from nonspeech ones. By incorporating the VAD outputs and the noise effect into the kernel fitting process, a speech frame is enhanced with better performance than the spectral subtraction algorithm achieves. Experiments are conducted on a real-life speech signal with added simulated noise, and the results show the potential of the proposed algorithms for speech detection and enhancement.
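
The detection pipeline described in the abstract (kernel-fit filtering, power spectral density estimation, weighting by a gain built from frame energy and zero-crossing rate) might be sketched as follows. This is only an illustration under stated assumptions, not the authors' method: the abstract does not specify the kernel, the regularization, or the gain formula, so the Gaussian kernel ridge fit, the parameters `width` and `ridge`, the energy-over-ZCR gain, and all function names here are hypothetical choices.

```python
import numpy as np

def kernel_fit(frame, width=2.0, ridge=0.1):
    """Smooth one frame by Gaussian kernel ridge regression over the sample index.

    Assumption: the abstract does not name the kernel or regularizer.
    """
    n = np.arange(len(frame), dtype=float)
    K = np.exp(-((n[:, None] - n[None, :]) ** 2) / (2.0 * width ** 2))
    coeffs = np.linalg.solve(K + ridge * np.eye(len(frame)), frame)
    return K @ coeffs

def frame_energy(frame):
    """Mean squared amplitude of the frame."""
    return float(np.mean(frame ** 2))

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    signs = np.sign(frame)
    signs[signs == 0] = 1  # treat exact zeros as positive
    return float(np.mean(signs[1:] != signs[:-1]))

def weighted_psd_score(frame):
    """Total periodogram power of the fitted frame, scaled by a gain factor.

    Assumption: gain = energy / (ZCR + small constant) is a hypothetical
    stand-in for the gain factor mentioned in the abstract.
    """
    fitted = kernel_fit(frame)
    psd = np.abs(np.fft.rfft(fitted)) ** 2 / len(fitted)
    gain = frame_energy(fitted) / (zero_crossing_rate(fitted) + 1e-3)
    return gain * float(np.sum(psd))

# Synthetic frames: a 200 Hz tone in noise (speech-like) versus noise alone.
rng = np.random.default_rng(0)
t = np.arange(256) / 8000.0
noise = 0.3 * rng.standard_normal(256)
voiced = np.sin(2 * np.pi * 200 * t) + noise

print("voiced score:", weighted_psd_score(voiced))
print("noise score: ", weighted_psd_score(noise))
```

With this choice of gain, a frame with high energy and few zero crossings scores far above a noise-only frame, so a simple threshold on the score could serve as the VAD decision. The threshold itself would have to be tuned on real data.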