Perceptual Effects Of Noise Disturbances On Phase Spectrum In Stft Analysis/synthesis Procedures. Application To Restoration Processes

O. Cappé, A. Chaigne
{"title":"Perceptual Effects Of Noise Disturbances On Phase Spectrum In Stft Analysis/synthesis Procedures. Application To Restoration Processes","authors":"O. Cappé, A. Chaigne","doi":"10.1109/ASPAA.1991.634141","DOIUrl":null,"url":null,"abstract":"Restoration of audio recordings degraded by surface noise can be viewed as an analysWsynthesis procedure where the modulus of the Fouriex transform is replaced by an estimator before resynthesis. The goal is usually to find the best possible estimator in terms of noise power reduction. In current restoration procedures there are no modifications of the phase specuum, because most authors consider that the phase is perceptually irrelevant [l]. Therefore the prime objective of the work presented here was to check the validity of these assumptions, and put some emphasis on the degradation of the phase. For that purpose, an analysis/synthesis procedure simulating a restoration p m s has been carried out on artificially degraded signals. An overview of this procedure can be Seen in Fig. 1. addibve noise phase Original signal-EK\"\"'\"' phase Fig. 1 Simulation of a restoration process based on Short-Time-Fourier-Transform, with perfect modulus recovering and degraded phase. In these later experiments, the restored signal is obtained from the modulus of the original signal and from the phase of the degraded signal. Thus the estimator of the modulus is equivalent to the one which would be obtained through \"perfect\" cancellation of the noise. As a consequence, the remaining degradation of the restored signal is only due to the influence of the additive noise in the phase spectrum. The first goal of the work is to calculate an estimator for the phase deviation in the restored signal, which depends on both the noise characteristics and the parameters of the analysis/synthesis plocedure. The results are then compared with psychoacoustical data related to the perception of modulations. This comparison is aimed at providing an appropriate selection for the STFT parameters. Following Vary [2]. the noise component is assumed to be gaussian. For a sine wave of frequency fo = p Fe / N, where Fe is the sampling frequency and N the size of the window, it can be shown that the expectation for the maximum phase deviation at fo is given by","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASPAA.1991.634141","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Restoration of audio recordings degraded by surface noise can be viewed as an analysWsynthesis procedure where the modulus of the Fouriex transform is replaced by an estimator before resynthesis. The goal is usually to find the best possible estimator in terms of noise power reduction. In current restoration procedures there are no modifications of the phase specuum, because most authors consider that the phase is perceptually irrelevant [l]. Therefore the prime objective of the work presented here was to check the validity of these assumptions, and put some emphasis on the degradation of the phase. For that purpose, an analysis/synthesis procedure simulating a restoration p m s has been carried out on artificially degraded signals. An overview of this procedure can be Seen in Fig. 1. addibve noise phase Original signal-EK""'"' phase Fig. 1 Simulation of a restoration process based on Short-Time-Fourier-Transform, with perfect modulus recovering and degraded phase. In these later experiments, the restored signal is obtained from the modulus of the original signal and from the phase of the degraded signal. Thus the estimator of the modulus is equivalent to the one which would be obtained through "perfect" cancellation of the noise. As a consequence, the remaining degradation of the restored signal is only due to the influence of the additive noise in the phase spectrum. The first goal of the work is to calculate an estimator for the phase deviation in the restored signal, which depends on both the noise characteristics and the parameters of the analysis/synthesis plocedure. The results are then compared with psychoacoustical data related to the perception of modulations. This comparison is aimed at providing an appropriate selection for the STFT parameters. Following Vary [2]. the noise component is assumed to be gaussian. For a sine wave of frequency fo = p Fe / N, where Fe is the sampling frequency and N the size of the window, it can be shown that the expectation for the maximum phase deviation at fo is given by
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Stft分析/合成过程中噪声干扰对相位谱的感知影响。应用于恢复程序
被表面噪声退化的录音的恢复可以看作是一个分析合成过程,在重新合成之前,傅里叶变换的模量被一个估计量取代。目标通常是在降低噪声功率方面找到最好的估计器。在目前的恢复过程中,相位谱没有变化,因为大多数作者认为相位在感知上是不相关的[1]。因此,这里提出的工作的主要目标是检查这些假设的有效性,并把一些重点放在阶段的退化上。为此,对人为退化的信号进行了模拟恢复过程的分析/合成过程。这个过程的概述可以在图1中看到。图1基于短时傅立叶变换的模量完全恢复、相位退化的恢复过程仿真。在这些后期的实验中,从原始信号的模量和退化信号的相位中获得恢复信号。因此,模量的估计量等于通过“完全”消除噪声而得到的估计量。因此,恢复信号的剩余退化仅是由于相位谱中加性噪声的影响。这项工作的第一个目标是计算恢复信号中相位偏差的估计量,这取决于噪声特性和分析/合成过程的参数。然后将结果与与调制感知相关的心理声学数据进行比较。这种比较的目的是为STFT参数提供一个适当的选择。Following Vary[2]。假设噪声分量是高斯的。对于频率为fo = p Fe / N的正弦波,其中Fe为采样频率,N为窗口大小,可以表明,在fo处最大相位偏差的期望为
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
An Overview of the MPEG/audio Compression Algorithm Acoustic Echo Cancellation for Stereophonic Teleconferencing A Single-Input Hearing Aid Based on the Auditory Perceptual Features to Improve Speech Intelligibility in Noise Models of Pitch Perception Perceptual Linear Predictive (PLP) Analysis-Resynthesis Technique
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1