Binaural Wind-Noise Tracking with Steering Preset

Stefan Thaleiser, G. Enzner
{"title":"Binaural Wind-Noise Tracking with Steering Preset","authors":"Stefan Thaleiser, G. Enzner","doi":"10.23919/eusipco55093.2022.9909804","DOIUrl":null,"url":null,"abstract":"Optimal performance of many speech enhancement methods is bound to an accurate noise power-spectral density (PSD) estimation. While for stationary noises, such as the white Gaussian or car noise, several approaches have proven themselves to perform sufficiently good, non-stationary noise types like the wind noise are more challenging. In the binaural setting and in multichannel systems, the speech-blocking method is essential to recent developments for non-stationary noise estimation. It critically requires information of the acoustic channel transfer function from source to listener. In this paper, we propose such noise-subspace approach for wind-noise PSD estimation, which relies on data-driven blind channel identification in speech presence and on a-priori acoustic channel information (i.e., the steering preset) in speech pause, where the smooth transition of both is controlled by a-priori SNR. The algorithm is designed for entire online operation based on the current noisy frame input. It improves on straightforward recursive subspace analysis and on established single-channel estimation in the wind-noise scenario, while dealing well with speech presence or babble noise too.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 30th European Signal Processing Conference (EUSIPCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/eusipco55093.2022.9909804","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Optimal performance of many speech enhancement methods is bound to an accurate noise power-spectral density (PSD) estimation. While for stationary noises, such as the white Gaussian or car noise, several approaches have proven themselves to perform sufficiently good, non-stationary noise types like the wind noise are more challenging. In the binaural setting and in multichannel systems, the speech-blocking method is essential to recent developments for non-stationary noise estimation. It critically requires information of the acoustic channel transfer function from source to listener. In this paper, we propose such noise-subspace approach for wind-noise PSD estimation, which relies on data-driven blind channel identification in speech presence and on a-priori acoustic channel information (i.e., the steering preset) in speech pause, where the smooth transition of both is controlled by a-priori SNR. The algorithm is designed for entire online operation based on the current noisy frame input. It improves on straightforward recursive subspace analysis and on established single-channel estimation in the wind-noise scenario, while dealing well with speech presence or babble noise too.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
双耳风噪声跟踪与转向预设
许多语音增强方法的最佳性能取决于准确的噪声功率谱密度(PSD)估计。虽然对于平稳噪声,如白高斯噪声或汽车噪声,有几种方法已经证明自己表现得足够好,但像风噪声这样的非平稳噪声类型更具挑战性。在双耳环境和多声道系统中,语音阻塞方法是非平稳噪声估计的重要发展方向。它迫切需要声道从声源到听者传递函数的信息。在本文中,我们提出了这种用于风噪声PSD估计的噪声子空间方法,该方法在语音存在时依赖于数据驱动的盲信道识别,在语音暂停时依赖于先验声学信道信息(即转向预设),其中两者的平滑过渡由先验信噪比控制。该算法是基于当前有噪声帧输入的全在线运行算法。它改进了直接递归子空间分析和在风噪声场景下建立的单通道估计,同时也能很好地处理语音存在或呀呀学噪声。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Assessing Bias in Face Image Quality Assessment Electrically evoked auditory steady state response detection in cochlear implant recipients using a system identification approach Uncovering cortical layers with multi-exponential analysis: a region of interest study Phaseless Passive Synthetic Aperture Imaging with Regularized Wirtinger Flow The faster proximal algorithm, the better unfolded deep learning architecture ? The study case of image denoising
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1