首页 > 最新文献

Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452)最新文献

英文 中文
Compatible scrambling of compressed audio 压缩音频的兼容置乱
J. Herre, E. Allamanche
Stimulated by the technological revolution in both networking technology (the Internet) and highly efficient perceptual audio coding algorithms (e.g. MPEG audio), a tremendous amount of music piracy has emerged recently. In contrast to this, a controlled distribution of music or multimedia content commonly employs so-called secure envelope techniques which "package" the audio bitstream into a secure container by means of ciphering all or part of the payload bitstream. In this way, access to the payload (i.e. decoding of the bitstream) is possible only for authorized persons who are in the possession of the proper key for decryption. While decoding of such a secure envelope bitstream requires a two-stage process (deciphering and source decoding), this paper presents a novel technique integrating both deciphering and source decoding into one combined process. This is achieved by "scrambling" the bitstream of the coded signal in a syntax-compatible way such that playback of the scrambled bitstream without access to the proper key will result in a stable playback at a degraded quality level ("soft-envelope" technique). The approach allows the content authors to select the amount of degradation, does not impose a bitrate or quality burden and can be applied to a wide range of coders. Examples of the scrambling technique are given for an MPEG-2 advanced audio coding (AAC) system.
在网络技术(Internet)和高效感知音频编码算法(如MPEG音频)的技术革命的刺激下,最近出现了大量的音乐盗版。与此相反,音乐或多媒体内容的受控分发通常采用所谓的安全信封技术,通过对全部或部分有效负载比特流进行加密,将音频比特流“打包”到安全容器中。这样,只有拥有适当解密密钥的授权人员才能访问有效载荷(即解码比特流)。虽然这种安全包络码流的解码需要两个阶段的过程(解密和源解码),但本文提出了一种将解密和源解码集成为一个组合过程的新技术。这是通过以一种语法兼容的方式对编码信号的比特流进行“置乱”来实现的,这样,在不访问适当密钥的情况下,置乱后的比特流将以降级的质量级别(“软信封”技术)进行稳定的重放。该方法允许内容作者选择降级的数量,不施加比特率或质量负担,并且可以应用于广泛的编码器。给出了MPEG-2高级音频编码(AAC)系统中置乱技术的实例。
{"title":"Compatible scrambling of compressed audio","authors":"J. Herre, E. Allamanche","doi":"10.1109/ASPAA.1999.810841","DOIUrl":"https://doi.org/10.1109/ASPAA.1999.810841","url":null,"abstract":"Stimulated by the technological revolution in both networking technology (the Internet) and highly efficient perceptual audio coding algorithms (e.g. MPEG audio), a tremendous amount of music piracy has emerged recently. In contrast to this, a controlled distribution of music or multimedia content commonly employs so-called secure envelope techniques which \"package\" the audio bitstream into a secure container by means of ciphering all or part of the payload bitstream. In this way, access to the payload (i.e. decoding of the bitstream) is possible only for authorized persons who are in the possession of the proper key for decryption. While decoding of such a secure envelope bitstream requires a two-stage process (deciphering and source decoding), this paper presents a novel technique integrating both deciphering and source decoding into one combined process. This is achieved by \"scrambling\" the bitstream of the coded signal in a syntax-compatible way such that playback of the scrambled bitstream without access to the proper key will result in a stable playback at a degraded quality level (\"soft-envelope\" technique). The approach allows the content authors to select the amount of degradation, does not impose a bitrate or quality burden and can be applied to a wide range of coders. Examples of the scrambling technique are given for an MPEG-2 advanced audio coding (AAC) system.","PeriodicalId":229733,"journal":{"name":"Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129686674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Wave field synthesis and analysis using array technology 基于阵列技术的波场合成与分析
D. de Vries, M. M. Boone
The concept of wave field synthesis (WFS) was introduced by Betkhout in 1988. It enables the generation of sound fields with natural temporal and spatial properties within a volume or area bounded by arrays of loudspeakers. Applications are found in real time performances as well as in reproduction of multitrack recordings. A logic next step was the formulation of a new wave field analysis (WFA) concept by Berkhout in 1997, where sound fields in enclosures are recorded with arrays of microphones and analyzed with postprocessing techniques commonly used in acoustical imaging. This way, both the temporal and spatial properties of the sound field can be investigated and understood. WFS and WFA meet in auralization applications: sound fields measured (or modeled) along arrays of microphone positions can be generated by arrays of loudspeakers for perceptual evaluation.
波场合成(WFS)的概念是由Betkhout于1988年提出的。它能够在扬声器阵列限定的音量或区域内产生具有自然时间和空间特性的声场。应用被发现在实时表演,以及在多轨录音的再现。一个合乎逻辑的下一步是Berkhout在1997年提出了一个新的波场分析(WFA)概念,即用麦克风阵列记录外壳中的声场,并用声学成像中常用的后处理技术进行分析。这样,声场的时间和空间特性都可以被研究和理解。WFS和WFA在听觉化应用中相结合:沿着麦克风位置阵列测量(或建模)的声场可以由扬声器阵列产生,用于感知评估。
{"title":"Wave field synthesis and analysis using array technology","authors":"D. de Vries, M. M. Boone","doi":"10.1109/ASPAA.1999.810838","DOIUrl":"https://doi.org/10.1109/ASPAA.1999.810838","url":null,"abstract":"The concept of wave field synthesis (WFS) was introduced by Betkhout in 1988. It enables the generation of sound fields with natural temporal and spatial properties within a volume or area bounded by arrays of loudspeakers. Applications are found in real time performances as well as in reproduction of multitrack recordings. A logic next step was the formulation of a new wave field analysis (WFA) concept by Berkhout in 1997, where sound fields in enclosures are recorded with arrays of microphones and analyzed with postprocessing techniques commonly used in acoustical imaging. This way, both the temporal and spatial properties of the sound field can be investigated and understood. WFS and WFA meet in auralization applications: sound fields measured (or modeled) along arrays of microphone positions can be generated by arrays of loudspeakers for perceptual evaluation.","PeriodicalId":229733,"journal":{"name":"Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117214465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 56
The effect of a Poisson "internal noise" process on theoretical acoustic signal detectability 泊松“内噪声”过程对理论声信号可探测性的影响
L. Gresham, L. Collins
Historically, theoretical predictions of human auditory perception have not agreed with experimental measurements. We have previously demonstrated that using signal detection theory to analyze the outputs of deterministic computational auditory models yields more accurate predictions of experimental performance than traditional approaches (Gresham and Collins 1998). However, discrepancies remained between predicted and actual performance. In this paper, the effects of stimulus uncertainty and neural variability on the detectability of a tone in noise are studied. The results suggest that remarkably accurate predictions of detection performance can be generated when such uncertainty is incorporated into the problem.
从历史上看,人类听觉感知的理论预测与实验测量并不一致。我们之前已经证明,使用信号检测理论来分析确定性计算听觉模型的输出,可以比传统方法更准确地预测实验性能(Gresham and Collins 1998)。然而,预测和实际表现之间仍然存在差异。本文研究了刺激不确定性和神经变异性对噪声中音调可检测性的影响。结果表明,当将这种不确定性纳入问题中时,可以产生非常准确的检测性能预测。
{"title":"The effect of a Poisson \"internal noise\" process on theoretical acoustic signal detectability","authors":"L. Gresham, L. Collins","doi":"10.1109/ASPAA.1999.810889","DOIUrl":"https://doi.org/10.1109/ASPAA.1999.810889","url":null,"abstract":"Historically, theoretical predictions of human auditory perception have not agreed with experimental measurements. We have previously demonstrated that using signal detection theory to analyze the outputs of deterministic computational auditory models yields more accurate predictions of experimental performance than traditional approaches (Gresham and Collins 1998). However, discrepancies remained between predicted and actual performance. In this paper, the effects of stimulus uncertainty and neural variability on the detectability of a tone in noise are studied. The results suggest that remarkably accurate predictions of detection performance can be generated when such uncertainty is incorporated into the problem.","PeriodicalId":229733,"journal":{"name":"Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130680130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Shunting networks for multi-band AM-FM-decomposition 多频带am - fm分解的分流网络
R. Baxter, T. Quatieri
We describe a transduction-based, neurodynamic approach to estimating the amplitude-modulated (AM) and frequency-modulated (FM) components of a signal. We show that the transduction approach can be realized as a bank of constant-Q bandpass filters followed by envelope detectors and shunting neural networks, and the resulting dynamical system is capable of robust AM-FM estimation. Our model is consistent with previous psychophysical experiments that indicate AM and FM components of acoustic signals may be transformed into a common neural code in the brain stem via FM-to-AM transduction (Saberi and Hafter 1995). The shunting network for AM-FM decomposition is followed by a contrast enhancement shunting network that provides a mechanism for robustly selecting auditory filter channels as the FM of an input stimulus sweeps across the multiple filters. The AM-FM output of the shunting networks may provide a robust feature representation and is being considered for applications in signal recognition and multi-component decomposition problems.
我们描述了一种基于转导的神经动力学方法来估计信号的调幅(AM)和调频(FM)成分。我们证明了这种转导方法可以通过一组恒q带通滤波器,然后是包络检测器和分路神经网络来实现,并且所得到的动态系统能够进行稳健的AM-FM估计。我们的模型与先前的心理物理实验一致,这些实验表明声信号的调幅和调频成分可能通过FM- AM转导在脑干中转化为共同的神经编码(Saberi和Hafter 1995)。在调幅调频分解的分流网络之后是对比度增强分流网络,该分流网络提供了一种机制,当输入刺激的调频扫过多个滤波器时,可以鲁棒地选择听觉滤波器通道。分路网络的AM-FM输出可以提供鲁棒的特征表示,并被考虑用于信号识别和多分量分解问题。
{"title":"Shunting networks for multi-band AM-FM-decomposition","authors":"R. Baxter, T. Quatieri","doi":"10.1109/ASPAA.1999.810891","DOIUrl":"https://doi.org/10.1109/ASPAA.1999.810891","url":null,"abstract":"We describe a transduction-based, neurodynamic approach to estimating the amplitude-modulated (AM) and frequency-modulated (FM) components of a signal. We show that the transduction approach can be realized as a bank of constant-Q bandpass filters followed by envelope detectors and shunting neural networks, and the resulting dynamical system is capable of robust AM-FM estimation. Our model is consistent with previous psychophysical experiments that indicate AM and FM components of acoustic signals may be transformed into a common neural code in the brain stem via FM-to-AM transduction (Saberi and Hafter 1995). The shunting network for AM-FM decomposition is followed by a contrast enhancement shunting network that provides a mechanism for robustly selecting auditory filter channels as the FM of an input stimulus sweeps across the multiple filters. The AM-FM output of the shunting networks may provide a robust feature representation and is being considered for applications in signal recognition and multi-component decomposition problems.","PeriodicalId":229733,"journal":{"name":"Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127005752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A head-and-torso model for low-frequency binaural elevation effects 低频双耳抬高效应的头部-躯干模型
C. Avendaño, V. Algazi, R. Duda
Low-frequency elevation-dependent features appear in HRTF (head related transfer function) measurements because of torso and shoulder reflections and head diffraction effects. A simple structural model that accounts for these features is presented. Listening tests show that the model produces significant elevation cues for virtual sound sources whose spectra are limited to frequencies below 3 kHz. The low-frequency binaural elevation cues are perceptually significant away from the median plane, and complement high-frequency monaural pinna cues.
由于躯干和肩部反射以及头部衍射效应,在HRTF(头部相关传递函数)测量中出现低频仰角相关特征。提出了一个简单的结构模型来解释这些特征。听力测试表明,该模型对频谱限制在3khz以下的虚拟声源产生显著的高程提示。低频双耳抬高信号在远离中位面处具有显著的知觉意义,并与高频单耳耳廓信号形成互补。
{"title":"A head-and-torso model for low-frequency binaural elevation effects","authors":"C. Avendaño, V. Algazi, R. Duda","doi":"10.1109/ASPAA.1999.810879","DOIUrl":"https://doi.org/10.1109/ASPAA.1999.810879","url":null,"abstract":"Low-frequency elevation-dependent features appear in HRTF (head related transfer function) measurements because of torso and shoulder reflections and head diffraction effects. A simple structural model that accounts for these features is presented. Listening tests show that the model produces significant elevation cues for virtual sound sources whose spectra are limited to frequencies below 3 kHz. The low-frequency binaural elevation cues are perceptually significant away from the median plane, and complement high-frequency monaural pinna cues.","PeriodicalId":229733,"journal":{"name":"Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452)","volume":"712 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122922024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Filter bank design using nilpotent matrices 利用幂零矩阵设计滤波器组
G. Schuller, W. Sweldens
We present a design method for filter banks with unequal length of the impulse responses for the analysis and synthesis part. This is useful e.g. for audio coding applications. A further advantage of the design method is the possibility to explicitly control the overall system delay of the filter bank, when causal filters are desired. The design method is based on a factorization of the polyphase matrices into factors with nilpotent matrices. These factors guarantee mathematical perfect reconstruction of the filter bank, and lead to FIR filters for analysis and synthesis. Using matrices with nilpotency of higher order than 2 leads to FIR filter banks with unequal filter length for analysis and synthesis.
在分析和合成部分,我们提出了一种脉冲响应长度不等的滤波器组设计方法。这是有用的,例如音频编码应用程序。该设计方法的另一个优点是,当需要因果滤波器时,可以显式控制滤波器组的整体系统延迟。设计方法是将多相矩阵分解为幂零矩阵的因子。这些因素保证了滤波器组在数学上的完美重构,并导致了FIR滤波器的分析和合成。使用零幂次大于2的矩阵会导致分析和合成的FIR滤波器组的滤波器长度不等。
{"title":"Filter bank design using nilpotent matrices","authors":"G. Schuller, W. Sweldens","doi":"10.1109/ASPAA.1999.810847","DOIUrl":"https://doi.org/10.1109/ASPAA.1999.810847","url":null,"abstract":"We present a design method for filter banks with unequal length of the impulse responses for the analysis and synthesis part. This is useful e.g. for audio coding applications. A further advantage of the design method is the possibility to explicitly control the overall system delay of the filter bank, when causal filters are desired. The design method is based on a factorization of the polyphase matrices into factors with nilpotent matrices. These factors guarantee mathematical perfect reconstruction of the filter bank, and lead to FIR filters for analysis and synthesis. Using matrices with nilpotency of higher order than 2 leads to FIR filter banks with unequal filter length for analysis and synthesis.","PeriodicalId":229733,"journal":{"name":"Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128501573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Maximization of the subjective loudness of speech with constrained amplitude 在受限制的振幅下,使言语的主观响度最大化
J. Seppanen, S. Kananoja, Jari-Yli-Hietanen, K. Koppinen, J. Sjoberg
We introduce an adaptive algorithm for constraining the amplitude of speech signals while at the same time trying to maintain the subjective loudness and trying not to produce disturbing artifacts. The algorithm can be applied to compensate for the clipping distortion of amplifiers in speech reproduction devices. The algorithm analyzes the speech signal on multiple frequency bands and applies an internal audibility law in order to make inaudible changes to the signal. An example of the audibility law, presented in the form of a matrix, is described, associated with a specific speech reproduction device. Multiple band-pass signals are processed with a waveshaper to accomplish soft-clipping and to constrain the amplitude of the processed signal. When processed with the proposed algorithm, the computational loudness value of speech signals was found to diminish only slightly (approximately 6 sones) during processing, while at the same time the signal amplitude could be reduced by even 15 dB.
我们引入了一种自适应算法来限制语音信号的幅度,同时尽量保持主观响度,尽量不产生干扰的伪影。该算法可用于语音再现设备中放大器的剪切失真补偿。该算法对多个频带上的语音信号进行分析,并应用内部可听性规律对信号进行可听性改变。可听性定律的一个例子,以矩阵的形式呈现,被描述,与一个特定的语音复制设备相关联。用整形器处理多个带通信号以实现软剪裁并约束处理后信号的幅度。在处理过程中,语音信号的计算响度值仅略有下降(约6声),而信号幅度可降低15 dB。
{"title":"Maximization of the subjective loudness of speech with constrained amplitude","authors":"J. Seppanen, S. Kananoja, Jari-Yli-Hietanen, K. Koppinen, J. Sjoberg","doi":"10.1109/ASPAA.1999.810869","DOIUrl":"https://doi.org/10.1109/ASPAA.1999.810869","url":null,"abstract":"We introduce an adaptive algorithm for constraining the amplitude of speech signals while at the same time trying to maintain the subjective loudness and trying not to produce disturbing artifacts. The algorithm can be applied to compensate for the clipping distortion of amplifiers in speech reproduction devices. The algorithm analyzes the speech signal on multiple frequency bands and applies an internal audibility law in order to make inaudible changes to the signal. An example of the audibility law, presented in the form of a matrix, is described, associated with a specific speech reproduction device. Multiple band-pass signals are processed with a waveshaper to accomplish soft-clipping and to constrain the amplitude of the processed signal. When processed with the proposed algorithm, the computational loudness value of speech signals was found to diminish only slightly (approximately 6 sones) during processing, while at the same time the signal amplitude could be reduced by even 15 dB.","PeriodicalId":229733,"journal":{"name":"Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115044966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feedback cancellation in hearing aids using constrained adaptation 基于约束适应的助听器反馈消除
J. Kates
In feedback cancellation in hearing aids, the output of an adaptive filter is subtracted from the microphone signal to cancel the acoustic and mechanical feedback signals picked up by the microphone. The feedback cancellation filter typically adapts the hearing-aid input signal, and signal cancellation and coloration artifacts can occur for a narrowband input. In this paper, two procedures for LMS adaptation with a constraint on the magnitude of the adaptive weight vector are derived. The constraints greatly reduce the probability that the adaptive filter will cancel a narrowband input. Simulation results are used to demonstrate the efficacy of the constrained adaptation.
在助听器的反馈抵消中,从麦克风信号中减去自适应滤波器的输出,以抵消麦克风拾取的声学和机械反馈信号。反馈抵消滤波器通常适应助听器输入信号,信号抵消和彩色伪影可能出现在窄带输入。本文推导了两个具有自适应权向量大小约束的LMS自适应过程。这些约束大大降低了自适应滤波器取消窄带输入的概率。仿真结果验证了约束自适应的有效性。
{"title":"Feedback cancellation in hearing aids using constrained adaptation","authors":"J. Kates","doi":"10.1109/ASPAA.1999.810892","DOIUrl":"https://doi.org/10.1109/ASPAA.1999.810892","url":null,"abstract":"In feedback cancellation in hearing aids, the output of an adaptive filter is subtracted from the microphone signal to cancel the acoustic and mechanical feedback signals picked up by the microphone. The feedback cancellation filter typically adapts the hearing-aid input signal, and signal cancellation and coloration artifacts can occur for a narrowband input. In this paper, two procedures for LMS adaptation with a constraint on the magnitude of the adaptive weight vector are derived. The constraints greatly reduce the probability that the adaptive filter will cancel a narrowband input. Simulation results are used to demonstrate the efficacy of the constrained adaptation.","PeriodicalId":229733,"journal":{"name":"Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452)","volume":"579 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123409760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Linear transforms and filterbanks based on vector ARMA models 基于向量ARMA模型的线性变换和滤波器组
U. Laine
Linear transformations, like wavelet transforms, and filterbanks of IIR-type and of arbitrary time-frequency plane tilings can be efficiently realized by vector ARMA models. The quality of the realization depends on how well the basis functions or impulse responses of the filterbank can be approximated by the actual VARMA based pole-zero model. The vector AR part gives an MSE-optimal block-recursive model for the target basis functions. The vector MA part is formed of the vector AR residual and further optimized by an iterative algorithm.
线性变换,如小波变换,以及iir型和任意时频平面平铺的滤波器组都可以通过矢量ARMA模型有效地实现。实现的质量取决于滤波器组的基函数或脉冲响应如何被实际的基于VARMA的极点-零模型所近似。向量AR部分给出了目标基函数的最小均方差最优块递归模型。向量MA部分由向量AR残差组成,并通过迭代算法进一步优化。
{"title":"Linear transforms and filterbanks based on vector ARMA models","authors":"U. Laine","doi":"10.1109/ASPAA.1999.810851","DOIUrl":"https://doi.org/10.1109/ASPAA.1999.810851","url":null,"abstract":"Linear transformations, like wavelet transforms, and filterbanks of IIR-type and of arbitrary time-frequency plane tilings can be efficiently realized by vector ARMA models. The quality of the realization depends on how well the basis functions or impulse responses of the filterbank can be approximated by the actual VARMA based pole-zero model. The vector AR part gives an MSE-optimal block-recursive model for the target basis functions. The vector MA part is formed of the vector AR residual and further optimized by an iterative algorithm.","PeriodicalId":229733,"journal":{"name":"Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120989896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Narrow-band interference cancellation for enhanced speaker identification 窄带干扰消除,增强扬声器识别
S.J. Wenndt, A. Noga
While the cepstrum feature has been widely used for speaker identification (SID), studies have shown that it can be sensitive to changes in environmental conditions. Many experiments have examined the effects of additive white Gaussian noise on the cepstral feature, but few, if any, have been conducted using additive narrow-band interference. Since such interference appears in an unpredictable fashion due to adverse signal environments or equipment anomalies in communication systems, it is important to understand its impact along with the affect of interference removal algorithms on SID performance. This paper examines two interference removal algorithms for enhancing SID performance. One is a simple notch filter suitable for tone removal. The other is a newly introduced method suitable for mitigating more general forms of interference, including interfering signals that can be modeled as being angle-modulated.
虽然倒频谱特征已广泛用于说话人识别(SID),但研究表明它对环境条件的变化很敏感。许多实验已经研究了加性高斯白噪声对倒谱特征的影响,但很少(如果有的话)使用加性窄带干扰进行。由于通信系统中不利的信号环境或设备异常,此类干扰以不可预测的方式出现,因此了解其影响以及干扰消除算法对SID性能的影响非常重要。本文研究了两种增强SID性能的干扰去除算法。一种是简单的陷波滤波器,适用于音调去除。另一种是一种新引入的方法,适用于减轻更一般形式的干扰,包括可以建模为角度调制的干扰信号。
{"title":"Narrow-band interference cancellation for enhanced speaker identification","authors":"S.J. Wenndt, A. Noga","doi":"10.1109/ASPAA.1999.810865","DOIUrl":"https://doi.org/10.1109/ASPAA.1999.810865","url":null,"abstract":"While the cepstrum feature has been widely used for speaker identification (SID), studies have shown that it can be sensitive to changes in environmental conditions. Many experiments have examined the effects of additive white Gaussian noise on the cepstral feature, but few, if any, have been conducted using additive narrow-band interference. Since such interference appears in an unpredictable fashion due to adverse signal environments or equipment anomalies in communication systems, it is important to understand its impact along with the affect of interference removal algorithms on SID performance. This paper examines two interference removal algorithms for enhancing SID performance. One is a simple notch filter suitable for tone removal. The other is a newly introduced method suitable for mitigating more general forms of interference, including interfering signals that can be modeled as being angle-modulated.","PeriodicalId":229733,"journal":{"name":"Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124968188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1