首页 > 最新文献

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics最新文献

英文 中文
A method for spectral transposition of speech signal applicable in profound hearing loss 一种适用于重度听力损失的语音信号频谱变换方法
A. Czyzewski, R. Królikowski, B. Kostek, H. Skarżyńśki, A. Lorens
Experiments that have been carried out by the authors so far point out that locating the essential components of the speech signal in a low frequency band can improve the speech intelligibility of some hearing impaired people. It is caused by the fact that such persons can maintain their hearing ability in this range of frequencies. The experiments were performed with the use of an electronic device operating in real-time with an algorithm for lowering the audio signals frequency. The satisfactory results of these tests encouraged them to run a research project that involves designing a new digital speech processor and elaborating an advanced algorithm for the spectral transposition of speech. The paper presents a new concept of a hearing aid employing the spectral transposition method. It presents the general scheme of the vocoder-based transposition of speech signals and gives a brief description of the new speech processor implementing this method.
到目前为止,作者进行的实验表明,将语音信号的基本成分定位在低频段可以提高一些听障人士的语音清晰度。这是由于这些人可以在这个频率范围内保持他们的听力能力。实验是使用一个实时操作的电子设备进行的,该设备带有降低音频信号频率的算法。这些测试令人满意的结果鼓励他们开展一项研究项目,包括设计一种新的数字语音处理器,并为语音的频谱变换制定一种先进的算法。本文提出了一种采用频谱变换方法的助听器的新概念。给出了基于声码器的语音信号转置的一般方案,并简要介绍了实现该方法的新型语音处理器。
{"title":"A method for spectral transposition of speech signal applicable in profound hearing loss","authors":"A. Czyzewski, R. Królikowski, B. Kostek, H. Skarżyńśki, A. Lorens","doi":"10.1109/ASPAA.1997.625581","DOIUrl":"https://doi.org/10.1109/ASPAA.1997.625581","url":null,"abstract":"Experiments that have been carried out by the authors so far point out that locating the essential components of the speech signal in a low frequency band can improve the speech intelligibility of some hearing impaired people. It is caused by the fact that such persons can maintain their hearing ability in this range of frequencies. The experiments were performed with the use of an electronic device operating in real-time with an algorithm for lowering the audio signals frequency. The satisfactory results of these tests encouraged them to run a research project that involves designing a new digital speech processor and elaborating an advanced algorithm for the spectral transposition of speech. The paper presents a new concept of a hearing aid employing the spectral transposition method. It presents the general scheme of the vocoder-based transposition of speech signals and gives a brief description of the new speech processor implementing this method.","PeriodicalId":347087,"journal":{"name":"Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"141 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133927393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Interpolation and extrapolation of room transfer functions based on common acoustical poles and their residues 基于共同声学极点及其残差的室内传递函数插值与外推
Y. Haneda, Y. Kaneda
We propose a new method of modeling a room transfer function (RTF) that uses common acoustical poles and their residues. The common acoustical poles correspond to the resonance frequencies (eigenvalues) of the room, and their residues are composed of the eigenfunctions of the source and receiver positions in the room. Because the common acoustical poles do not depend on the source and receiver positions, this model expresses the RTF variations due to changes in the source and receiver positions by using residue variations. We also propose methods of interpolating and extrapolating RTFs based on the proposed common-acoustical-pole and residue model. Computer simulation demonstrated that unknown RTFs can be well estimated from known (measured) RTFs by using these methods.
我们提出了一种新的方法来模拟一个房间传递函数(RTF),使用共同的声学极点和它们的残馀。公共声极对应于房间的共振频率(特征值),其残差由房间中源和接收器位置的特征函数组成。由于共同声极不依赖于声源和接收机位置,因此该模型利用残差变化来表达由于声源和接收机位置变化而引起的RTF变化。我们还提出了基于所提出的共声极点和剩余模型的插值和外推rtf的方法。计算机模拟表明,使用这些方法可以很好地从已知(测量)rtf中估计未知rtf。
{"title":"Interpolation and extrapolation of room transfer functions based on common acoustical poles and their residues","authors":"Y. Haneda, Y. Kaneda","doi":"10.1109/ASPAA.1997.625593","DOIUrl":"https://doi.org/10.1109/ASPAA.1997.625593","url":null,"abstract":"We propose a new method of modeling a room transfer function (RTF) that uses common acoustical poles and their residues. The common acoustical poles correspond to the resonance frequencies (eigenvalues) of the room, and their residues are composed of the eigenfunctions of the source and receiver positions in the room. Because the common acoustical poles do not depend on the source and receiver positions, this model expresses the RTF variations due to changes in the source and receiver positions by using residue variations. We also propose methods of interpolating and extrapolating RTFs based on the proposed common-acoustical-pole and residue model. Computer simulation demonstrated that unknown RTFs can be well estimated from known (measured) RTFs by using these methods.","PeriodicalId":347087,"journal":{"name":"Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133275482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Superdirective microphone array for a set-top videoconferencing system 机顶式视频会议系统的超指令麦克风阵列
P. Chu
In set-top videoconferencing, the complete videoconferencing system fits unobtrusively on top of the television. The microphone sound pickup system is one of the most important functional blocks with constraints of small size, high performance, and low cost. Persons speaking several feet away from the system must be picked up satisfactorily while noise generated internally in the system by the cooling fan and hard drive, and noise generated externally from air conditioning and nearby computers must be attenuated. In this paper, a three microphone superdirective array is described which meets these constraints. An analog highpass and lowpass filter are used to merge two of the microphone signals to form a single channel, so that a single stereo A/D converter is required to process the three microphone signals. The microphone signals are then linearly combined so as to maximize the signal-to-noise ratio, resulting in nulls steered toward nearby objectionable noise sources.
在机顶盒视频会议中,完整的视频会议系统可以不显眼地安装在电视的顶部。传声器拾音系统是最重要的功能模块之一,具有体积小、性能好、成本低等特点。距离系统几英尺远的人说话的声音必须被令人满意地拾取,而系统内部由冷却风扇和硬盘驱动器产生的噪音,以及外部由空调和附近计算机产生的噪音必须被衰减。本文描述了一种满足这些约束条件的三传声器超指令阵列。使用模拟高通和低通滤波器将两个麦克风信号合并成一个通道,这样就需要一个立体声a /D转换器来处理三个麦克风信号。然后将传声器信号线性组合,以最大限度地提高信噪比,从而使零点转向附近令人反感的噪声源。
{"title":"Superdirective microphone array for a set-top videoconferencing system","authors":"P. Chu","doi":"10.1109/ASPAA.1997.625632","DOIUrl":"https://doi.org/10.1109/ASPAA.1997.625632","url":null,"abstract":"In set-top videoconferencing, the complete videoconferencing system fits unobtrusively on top of the television. The microphone sound pickup system is one of the most important functional blocks with constraints of small size, high performance, and low cost. Persons speaking several feet away from the system must be picked up satisfactorily while noise generated internally in the system by the cooling fan and hard drive, and noise generated externally from air conditioning and nearby computers must be attenuated. In this paper, a three microphone superdirective array is described which meets these constraints. An analog highpass and lowpass filter are used to merge two of the microphone signals to form a single channel, so that a single stereo A/D converter is required to process the three microphone signals. The microphone signals are then linearly combined so as to maximize the signal-to-noise ratio, resulting in nulls steered toward nearby objectionable noise sources.","PeriodicalId":347087,"journal":{"name":"Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128400644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The one-filter Keefe clarinet tonehole 单滤基夫单簧管音孔
Julius O. Smith, G. Scavone
Two "one-filter" scattering junctions are derived which provide very accurate models of woodwind toneholes in the context of a digital waveguide model. Because toneholes in the clarinet possess only one resonance and/or anti-resonance within the audio band, a second-order digital filter suffices.
导出了两个“单滤波器”散射结,它们在数字波导模型的背景下提供了非常精确的木管乐器音孔模型。因为单簧管的音孔在音频频带内只有一个共振和/或反共振,所以一个二阶数字滤波器就足够了。
{"title":"The one-filter Keefe clarinet tonehole","authors":"Julius O. Smith, G. Scavone","doi":"10.1109/ASPAA.1997.625618","DOIUrl":"https://doi.org/10.1109/ASPAA.1997.625618","url":null,"abstract":"Two \"one-filter\" scattering junctions are derived which provide very accurate models of woodwind toneholes in the context of a digital waveguide model. Because toneholes in the clarinet possess only one resonance and/or anti-resonance within the audio band, a second-order digital filter suffices.","PeriodicalId":347087,"journal":{"name":"Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134510602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Phase-vocoder: about this phasiness business 相位声码器:关于相位业务
Jean Laroche, M. Dolson
The phase-vocoder is a well-known tool for the frequency domain processing of speech or audio signals, with applications such as time compression or expansion, pitch-scale modification, noise reduction, etc. In the context of time-scale or pitch-scale modification, the phase-vocoder is usually considered to yield high quality results, especially when large modification factors are used on polyphonic or non-pitched signals. However, the phase-vocoder is also known for an artifact that plagues its output, and has been described in the literature as either "phasiness", "reverberation", or "loss of presence". Research has been devoted to understanding and reducing this artifact, and solutions have been proposed which either significantly improve the quality of the output at the cost of a very high additional computation time, or are inexpensive but only marginally effective. This paper examines the problem of phasiness in the context of time-scale modification of signals, and presents two new phase synchronization schemes which are shown to both significantly improve the sound quality, and reduce the computational cost of such modifications.
相位声码器是一种众所周知的用于语音或音频信号频域处理的工具,应用于时间压缩或扩展,音高尺度修改,降噪等。在时间尺度或音高尺度修改的情况下,相位声码器通常被认为可以产生高质量的结果,特别是当在复音或非音高信号上使用大的修改因子时。然而,相位声码器也因其输出受到干扰而闻名,并且在文献中被描述为“相位”,“混响”或“存在损失”。研究一直致力于理解和减少这个工件,并且已经提出了解决方案,这些解决方案要么以非常高的额外计算时间为代价显着提高输出质量,要么价格低廉但仅略微有效。本文研究了信号时间尺度修改中的相位问题,并提出了两种新的相位同步方案,这两种方案既能显著提高声音质量,又能减少这种修改的计算成本。
{"title":"Phase-vocoder: about this phasiness business","authors":"Jean Laroche, M. Dolson","doi":"10.1109/ASPAA.1997.625603","DOIUrl":"https://doi.org/10.1109/ASPAA.1997.625603","url":null,"abstract":"The phase-vocoder is a well-known tool for the frequency domain processing of speech or audio signals, with applications such as time compression or expansion, pitch-scale modification, noise reduction, etc. In the context of time-scale or pitch-scale modification, the phase-vocoder is usually considered to yield high quality results, especially when large modification factors are used on polyphonic or non-pitched signals. However, the phase-vocoder is also known for an artifact that plagues its output, and has been described in the literature as either \"phasiness\", \"reverberation\", or \"loss of presence\". Research has been devoted to understanding and reducing this artifact, and solutions have been proposed which either significantly improve the quality of the output at the cost of a very high additional computation time, or are inexpensive but only marginally effective. This paper examines the problem of phasiness in the context of time-scale modification of signals, and presents two new phase synchronization schemes which are shown to both significantly improve the sound quality, and reduce the computational cost of such modifications.","PeriodicalId":347087,"journal":{"name":"Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"134 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133864962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 72
Nonlinearity-tolerated active noise control using an artificial neural network 基于人工神经网络的非线性容忍有源噪声控制
C. X. Tan, H. Tachibana
A nonlinearity-tolerated neural active noise control scheme is presented. Time-space patterns are adaptively integrated within its architecture. A learning algorithm with time-delayed memory corresponding to the secondary acoustic paths is adopted. Simulation experiments with a hybrid structure of vibrating radiation and sound in an enclosure are conducted. It is demonstrated that the proposed approach can achieve effective noise attenuation over the whole spectrum of interest, even with a strong nonlinear environment, while the conventional filtered-x LMS active noise controller falls in chaos.
提出了一种非线性容忍神经主动噪声控制方案。时空模式自适应地集成在其架构中。采用了一种与次声路径相对应的延时记忆学习算法。对振动辐射声混合结构进行了仿真实验。结果表明,即使在强非线性环境下,该方法也能在整个频谱范围内实现有效的噪声衰减,而传统的滤波-x LMS有源噪声控制器则会陷入混沌。
{"title":"Nonlinearity-tolerated active noise control using an artificial neural network","authors":"C. X. Tan, H. Tachibana","doi":"10.1109/ASPAA.1997.625610","DOIUrl":"https://doi.org/10.1109/ASPAA.1997.625610","url":null,"abstract":"A nonlinearity-tolerated neural active noise control scheme is presented. Time-space patterns are adaptively integrated within its architecture. A learning algorithm with time-delayed memory corresponding to the secondary acoustic paths is adopted. Simulation experiments with a hybrid structure of vibrating radiation and sound in an enclosure are conducted. It is demonstrated that the proposed approach can achieve effective noise attenuation over the whole spectrum of interest, even with a strong nonlinear environment, while the conventional filtered-x LMS active noise controller falls in chaos.","PeriodicalId":347087,"journal":{"name":"Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129977054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A DSP implementation of a digital hearing aid with recruitment of loudness compensation and acoustic echo cancellation 一种具有响度补偿和回声消除功能的数字助听器的DSP实现
S. Wyrsch, A. Kaelin
This paper describes a DSP implementation of a digital hearing aid realized in the frequency domain that compensates for recruitment of loudness and cancels acoustic echos. In contrast to conventional systems which are based on a noise-probe signal, our echo canceler is adapted using only the available (e.g. speech) input signal. The main problems caused by a nonlinear feedforward filter are discussed using analytical results of the steady state behavior of the closed-loop hearing-aid system. The implemented DSP system is tested with a dummy behind-the-ear (bte) hearing-aid device on a KEMAR-head and results are presented.
本文介绍了一种在频域实现的数字助听器的DSP实现,该助听器可以补偿响度的增加和消除回声。与基于噪声探测信号的传统系统相比,我们的回声消除器仅使用可用的(例如语音)输入信号。利用闭环助听器系统稳态特性的分析结果,讨论了非线性前馈滤波器引起的主要问题。采用kemar头戴式耳后助听器对所实现的DSP系统进行了测试,并给出了测试结果。
{"title":"A DSP implementation of a digital hearing aid with recruitment of loudness compensation and acoustic echo cancellation","authors":"S. Wyrsch, A. Kaelin","doi":"10.1109/ASPAA.1997.625579","DOIUrl":"https://doi.org/10.1109/ASPAA.1997.625579","url":null,"abstract":"This paper describes a DSP implementation of a digital hearing aid realized in the frequency domain that compensates for recruitment of loudness and cancels acoustic echos. In contrast to conventional systems which are based on a noise-probe signal, our echo canceler is adapted using only the available (e.g. speech) input signal. The main problems caused by a nonlinear feedforward filter are discussed using analytical results of the steady state behavior of the closed-loop hearing-aid system. The implemented DSP system is tested with a dummy behind-the-ear (bte) hearing-aid device on a KEMAR-head and results are presented.","PeriodicalId":347087,"journal":{"name":"Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127624855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Spectral noise-shaping in integrate-and-fire neural networks 集成-火神经网络的频谱噪声整形
R. Adams
A theory of coordinated neuronal firing events is proposed that allows the low-noise transmission of analog signals through a network of coupled neurons. The inherent quantization noise of a group of neurons can be shaped in the frequency domain in such a way as to provide a high signal-to-noise ratio over some specified signal bandwidth. This shaping is accomplished by using neural interconnections that resemble lateral inhibition.
提出了一种协调神经元放电事件的理论,该理论允许模拟信号通过耦合神经元网络进行低噪声传输。一组神经元的固有量化噪声可以在频域以这样一种方式形成,以便在某些指定的信号带宽上提供高信噪比。这种形成是通过使用类似侧抑制的神经互连来完成的。
{"title":"Spectral noise-shaping in integrate-and-fire neural networks","authors":"R. Adams","doi":"10.1109/ASPAA.1997.625619","DOIUrl":"https://doi.org/10.1109/ASPAA.1997.625619","url":null,"abstract":"A theory of coordinated neuronal firing events is proposed that allows the low-noise transmission of analog signals through a network of coupled neurons. The inherent quantization noise of a group of neurons can be shaped in the frequency domain in such a way as to provide a high signal-to-noise ratio over some specified signal bandwidth. This shaping is accomplished by using neural interconnections that resemble lateral inhibition.","PeriodicalId":347087,"journal":{"name":"Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122949596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Range-dependence of the HRTF for a spherical head 球头HRTF的距离依赖性
R. Duda, W. Martens
This paper examines the range dependence of the head-related transfer function (HRTF) for a simple spherical model of the head in both the time-domain and the frequency domain. The variation of low-frequency interaural level difference (ILD) with range is shown to be significant for ranges smaller than five times the sphere radius. The impulse response explains the source of the ripples in the frequency response, and provides direct evidence that the interaural time delay (ITD) is not a strong function of range. Time-delay measurements confirm the Woodworth/Schlosberg formula. Numerical analysis indicates that the HRTF is minimum phase. Thus, except for the time delay, the impulse response can be reconstructed from a simple principle components analysis of the magnitude response.
本文研究了头部简单球面模型在时域和频域的头部相关传递函数(HRTF)的范围依赖性。低频耳间电平差(ILD)随距离的变化在小于球半径5倍的范围内是显著的。脉冲响应解释了频率响应中波纹的来源,并直接证明了声间时间延迟(ITD)不是距离的强函数。延时测量证实了Woodworth/Schlosberg公式。数值分析表明,HRTF为最小相位。因此,除了时间延迟外,脉冲响应可以通过对幅度响应的简单主分量分析来重建。
{"title":"Range-dependence of the HRTF for a spherical head","authors":"R. Duda, W. Martens","doi":"10.1109/ASPAA.1997.625597","DOIUrl":"https://doi.org/10.1109/ASPAA.1997.625597","url":null,"abstract":"This paper examines the range dependence of the head-related transfer function (HRTF) for a simple spherical model of the head in both the time-domain and the frequency domain. The variation of low-frequency interaural level difference (ILD) with range is shown to be significant for ranges smaller than five times the sphere radius. The impulse response explains the source of the ripples in the frequency response, and provides direct evidence that the interaural time delay (ITD) is not a strong function of range. Time-delay measurements confirm the Woodworth/Schlosberg formula. Numerical analysis indicates that the HRTF is minimum phase. Thus, except for the time delay, the impulse response can be reconstructed from a simple principle components analysis of the magnitude response.","PeriodicalId":347087,"journal":{"name":"Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130642589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Continuously signal-adaptive filterbank for high-quality perceptual audio coding 用于高质量感知音频编码的连续信号自适应滤波器组
J. Herre, J. Johnston
Historically, the choice of the optimum filterbank has been the subject of much research and discussion in the development of perceptual audio coders. Desirable properties of a good filterbank include both a good extraction of the signal's redundancy and effective utilization of that redundancy while maintaining control over perceptual demands. Often, there is a conflict between the use of perceptual constraints and the redundancy extraction, in that a filterbank with good resolution in both time and frequency is needed. Recently, a method for performing temporal noise shaping (TNS) of the error signal of a perceptual audio coder has been proposed, providing control over both the time and frequency structure of the coding noise. This paper focuses on the core part of the scheme, forming a continuously adaptive filterbank, and discusses its theoretical background, properties and limitations.
从历史上看,在感知音频编码器的发展中,最佳滤波器组的选择一直是许多研究和讨论的主题。良好滤波器组的理想特性包括良好的信号冗余提取和有效利用冗余,同时保持对感知需求的控制。通常,在使用感知约束和冗余提取之间存在冲突,因为需要在时间和频率上都具有良好分辨率的滤波器组。最近,提出了一种对感知音频编码器的误差信号进行时间噪声整形(TNS)的方法,提供了对编码噪声的时间和频率结构的控制。本文重点研究了该方案的核心部分,即组成连续自适应滤波器组,并讨论了其理论背景、性质和局限性。
{"title":"Continuously signal-adaptive filterbank for high-quality perceptual audio coding","authors":"J. Herre, J. Johnston","doi":"10.1109/ASPAA.1997.625588","DOIUrl":"https://doi.org/10.1109/ASPAA.1997.625588","url":null,"abstract":"Historically, the choice of the optimum filterbank has been the subject of much research and discussion in the development of perceptual audio coders. Desirable properties of a good filterbank include both a good extraction of the signal's redundancy and effective utilization of that redundancy while maintaining control over perceptual demands. Often, there is a conflict between the use of perceptual constraints and the redundancy extraction, in that a filterbank with good resolution in both time and frequency is needed. Recently, a method for performing temporal noise shaping (TNS) of the error signal of a perceptual audio coder has been proposed, providing control over both the time and frequency structure of the coding noise. This paper focuses on the core part of the scheme, forming a continuously adaptive filterbank, and discusses its theoretical background, properties and limitations.","PeriodicalId":347087,"journal":{"name":"Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125573966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
期刊
Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1