Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics最新文献

英文中文

Modeling of reflections and air absorption in acoustical spaces a digital filter design approach 声学空间中反射和空气吸收的建模——数字滤波器设计方法

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625594

J. Huopaniemi, L. Savioja, M. Karjalainen

A method is presented for modeling sound propagation in rooms using a signal processing approach. Low order digital filters are designed to match to sound propagation transfer functions calculated from boundary material and air absorption data. The technique is applied to low frequency, finite difference time domain (FDTD) simulation of room acoustics and to real-time image-source based virtual acoustics.

提出了一种利用信号处理方法模拟室内声音传播的方法。设计了低阶数字滤波器，以匹配由边界材料和空气吸收数据计算的声音传播传递函数。该技术应用于室内声学的低频时域有限差分(FDTD)仿真和基于实时图像源的虚拟声学。

引用次数: 72

Computational auditory scene analysis exploiting speech-recognition knowledge 利用语音识别知识的计算听觉场景分析

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625625

Daniel P. W. Ellis

The field of computational auditory scene analysis (CASA) strives to build computer models of the human ability to interpret sound mixtures as the combination of distinct sources. A major obstacle to this enterprise is defining and incorporating the kind of high level knowledge of real-world signal structure exploited by listeners. Speech recognition, while typically ignoring the problem of nonspeech inclusions, has been very successful at deriving powerful statistical models of speech structure from training data. In this paper, we describe a scene analysis system that includes both speech and nonspeech components, addressing the problem of working backwards from speech recognizer output to estimate the speech component of a mixture. Ultimately, such hybrid approaches will require more radical adaptation of current speech recognition approaches.

计算听觉场景分析(CASA)领域致力于建立人类能力的计算机模型，将声音混合解释为不同来源的组合。这项事业的一个主要障碍是定义和整合听众所利用的真实世界信号结构的高级知识。语音识别通常忽略非语音包含问题，但在从训练数据中获得强大的语音结构统计模型方面非常成功。在本文中，我们描述了一个包含语音和非语音成分的场景分析系统，解决了从语音识别器输出向后工作以估计混合语音成分的问题。最终，这种混合方法将需要对当前的语音识别方法进行更彻底的调整。

引用次数: 9

A pitch-based approach to time-delay estimation of reverberant speech 基于音高的混响语音时延估计方法

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625637

M. Brandstein

Generalized cross-correlation (GCC) has been the traditional method for estimating the relative time-delay associated with speech signals received by a pair of microphones in a reverberant, noisy environment. The filtering criterion employed is either focussed on the signal degradations due to additive noise or those due exclusively to multipath channel effects. There has been relatively little success at applying GCC weighting schemes which are robust to both of these conditions. This paper details an alternative approach which attempts to employ a signal dependent criterion, namely the estimated periodicity of harmonic spectral intervals, to design a GCC filter appropriate for the combination of noise and multipath signal distortions. Simulations are performed across a range of room conditions to illustrate the utility of the proposed time-delay estimation method relative to conventional GCC filtering approaches.

广义互相关(GCC)是估计一对传声器在混响、噪声环境中接收到的语音信号的相对时延的传统方法。所采用的滤波准则要么集中于由加性噪声引起的信号退化，要么集中于仅由多径信道效应引起的信号退化。在应用对这两种条件都具有鲁棒性的GCC加权方案方面取得的成功相对较少。本文详细介绍了一种替代方法，该方法试图采用信号相关准则，即谐波谱间隔的估计周期性，来设计适合噪声和多径信号失真组合的GCC滤波器。在一系列室内条件下进行了仿真，以说明所提出的延迟估计方法相对于传统GCC滤波方法的实用性。

引用次数: 57

Some notes on feedback suppression with adaptive filters in hearing aids 助听器中自适应滤波器反馈抑制的一些注意事项

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625583

W. Knecht

This paper discusses some important issues concerning feedback cancellation in hearing aids using adaptive filters. We summarize the main benefits of an anti-feedback system and comment on the pros and cons of a frequency-domain implementation. The length of the adaptive filter is dealt with and important delay elements in the algorithm are described. Finally, we comment on the problem of adaptation in the presence of uncorrelated additive noise which, in this case, is the desired input signal to the hearing aid.

本文讨论了自适应滤波器在助听器反馈消除中的一些重要问题。我们总结了反反馈系统的主要优点，并对频域实现的优缺点进行了评论。讨论了自适应滤波器的长度，并对算法中重要的延迟元素进行了描述。最后，我们评论了在不相关的加性噪声存在下的自适应问题，在这种情况下，这些噪声是助听器所需的输入信号。

引用次数: 9

Auditory segregation of vowel-like sounds with static and dynamic spectral properties 具有静态和动态频谱特性的类元音声音的听觉分离

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625624

P. Divenyi, R. Carré, A. P. Algazi

Experiments were conducted to determine the extent to which a fundamental frequency or formant frequency transition influenced segregation of a simultaneous pair of single-formant harmonic complexes. Results showed that even a minute transition facilitated segregation. The effect was larger for formant frequency than fundamental frequency transitions. It is concluded that dynamic aspects of speech must be taken into account when explaining auditory scene analysis by humans and when designing computational scene analysis methods.

进行了实验，以确定在何种程度上，一个基本频率或形成峰频率转变影响的分离，同时对单形成峰谐波复合体。结果表明，即使是一分钟的转变也会促进分离。形成峰频率的影响大于基频跃迁。结论是，在解释人类的听觉场景分析和设计计算场景分析方法时，必须考虑语音的动态方面。

引用次数: 8

Compression circuit of a multiband analog system for hearing aid

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625578

J. Fernandez Ramos, J. C. Tejero, J. A. Hidalgo, M. J. Martin, A. Gago

This paper describes the design and evaluation of a circuit which performs the compression of narrow-band signals within a multiband analog system for a hearing aid. The system has twelve narrow-band modules. Each module is formed by four stages. The first stage is a band-pass filter which selects the bandwidth of the module. The second stage, the object of this paper, is a compression circuit which performs a nonlinear operation for removing the attack and release times typical of automatic gain control systems. The third stage is another band-pass filter like the first, the function of which is to reduce the distortion produced by the compression stage. The fourth stage is a controlled linear gain stage. The simulation and experimental results obtained show that the compression circuit has good accuracy within the dynamic range of speech signals. The output narrow-band filter reduces to a great extent the harmonic and intermodulation distortion inherent in all compression systems when the input signal has some important formants very close.

本文介绍了一种用于助听器多波段模拟系统的窄带信号压缩电路的设计和评价。该系统有12个窄带模块。每个模块由四个阶段组成。第一级是带通滤波器，用于选择模块的带宽。第二阶段，本文的研究对象，是一个执行非线性运算的压缩电路，以消除自动增益控制系统中典型的攻击和释放时间。第三级是与第一级类似的另一个带通滤波器，其功能是减少压缩级产生的失真。第四级是可控线性增益级。仿真和实验结果表明，该压缩电路在语音信号的动态范围内具有良好的精度。当输入信号的一些重要共振峰非常接近时，输出窄带滤波器在很大程度上降低了所有压缩系统固有的谐波和互调失真。

引用次数: 0

Voice source localization for automatic camera pointing system in videoconferencing 视频会议中自动摄像机指向系统的语音源定位

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625639

H. Wang, P. Chu

This paper describes the voice source localization algorithm used in the PictureTel automatic camera pointing system (LimeLight/sup TM/, dynamic speech locating technology). The system uses an array of 46 cm wide and 30 cm high, which contains 4 microphones, and is mounted on top of the monitor. The three dimensional position of a sound source is calculated from the time delays of 4 pairs of microphones. In time delay estimation, the averaging of signal onsets of each frequency band is combined with phase correlation to reduce the influence of noise and reverberation. With this approach, it is possible to provide reliable three dimensional voice source localization by a small microphone array. Post processing based on a priori knowledge is also introduced to eliminate the influences of reflections from furniture such as tables. Results of speech source localization under real conference room conditions are given. Some system related issues are also discussed.

本文介绍了PictureTel自动摄像机指向系统中使用的语音源定位算法(LimeLight/sup TM/，动态语音定位技术)。该系统使用宽46厘米、高30厘米的阵列，其中包含4个麦克风，安装在监视器的顶部。声源的三维位置由4对传声器的延时计算得到。在时延估计中，将各频段信号初始值的平均与相位相关相结合，降低了噪声和混响的影响。利用这种方法，可以通过一个小型麦克风阵列提供可靠的三维声源定位。还引入了基于先验知识的后处理，以消除家具(如桌子)反射的影响。给出了真实会议室条件下的语音源定位结果。并对系统相关问题进行了讨论。

引用次数: 74

M-band wavelet packets and filter bank trees as flexible tools in audio signal processing m波段小波包和滤波器组树作为音频信号处理的灵活工具

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625604

F. Kurth, M. Clausen

This article discusses M-band wavelet packets which combine the well-known construction of 2-band wavelet packets with concepts of M-band wavelet theory. To make the resulting tilings of the time-frequency plane even more flexible, the concept of a filter bank tree (FBT) is presented. Within this framework the design of decimated filter bank cascades, realizing some arbitrary time-frequency tiling, is possible. Extensive tests on the denoising of high fidelity audio signals and comparison with several standard wavelet packet denoising techniques show the advantages of the new methods. The generality of the concept suggests its application to other problems, not necessarily restricted to the field of audio signal processing.

本文讨论了m波段小波包，它结合了众所周知的2波段小波包构造和m波段小波理论的概念。为了使得到的时频平面分层更加灵活，提出了滤波器组树(FBT)的概念。在这个框架内，可以设计抽取滤波器组级联，实现任意时频平铺。通过对高保真音频信号去噪的大量实验和与几种标准小波包去噪技术的比较，表明了新方法的优越性。这个概念的概括性表明它可以应用于其他问题，而不一定局限于音频信号处理领域。

引用次数: 1

Extraction of spectral peak parameters using a short-time Fourier transform modeling and no sidelobe windows 利用短时傅里叶变换建模和无旁瓣窗提取谱峰参数

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625600

P. Depalle, Thomas Hélie

A new method which improves the estimation of frequency, amplitude and phase of the partials of a sound is presented. It allows the reduction of the analysis-window size from four periods to two periods. It therefore gives better accuracy in parameter determination, and has proved to remain efficient at low signal-to-noise ratios. The basic idea consists of using a parametric modeling of the short-time Fourier transform. The method alternately estimates the complex amplitudes and the frequencies starting from the result of the classical analysis method. It uses the least-square procedure and a first-order limited expansion of the model around previous estimations. This method leads us to design new windows which do not have any sidelobes in order to help the convergence. Finally an analysis algorithm which has been built according to the observed behavior of the method for various kinds of sound is presented.

提出了一种改进声音偏频、幅值和相位估计的新方法。它允许将分析窗口大小从四个周期减少到两个周期。因此，它在参数确定方面具有更好的准确性，并且已被证明在低信噪比下仍然有效。其基本思想是使用短时傅里叶变换的参数化建模。该方法从经典分析方法的结果出发，交替估计复振幅和频率。它使用最小二乘过程和围绕先前估计的模型的一阶有限展开。这种方法导致我们设计没有任何副瓣的新窗口，以帮助收敛。最后，根据该方法对各种声音的观测行为，建立了一种分析算法。

引用次数: 66

Spectral transformation for musical tones via time domain filtering 通过时域滤波的谱变换的音乐音调

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625613

S. Yim, Y. Ding, A. McCree

We present a spectral transformation technique for musical tones. It can be used to modify the brightness or timbre of a musical tone. We perform spectral transformation by modifying the line spectral frequencies or LPC roots of the original spectral envelope and filtering the tone with a spectral transformation filter in the time domain. One of the application is pitch modification where frequency scaling is used to modify the fundamental frequency of a tone and spectral transformation is used to capture the original brightness and timbre. We have applied this spectral transformation technique to restore the original brightness of frequency scaled musical tones produced from a variety of musical instruments, such as piano, trumpet, violin, flute and bassoon by one octave up or down. In all experiments conducted, the pitch modified tones resemble the original brightness and timbre. This technique allows us to control the timbre of musical tones such as brightness, friction or other modulation by manipulating the spectral parameters.

我们提出了一种用于音乐音调的频谱变换技术。它可以用来修改音乐音调的亮度或音色。我们通过修改原始频谱包络的线谱频率或LPC根，并在时域内用频谱变换滤波器滤波音调来进行频谱变换。其中一个应用是音高修改，其中使用频率缩放来修改音调的基本频率，使用频谱变换来捕获原始亮度和音色。我们应用这种频谱变换技术，将钢琴、小号、小提琴、长笛和巴松管等多种乐器产生的频率缩放音乐音调的原始亮度提高或降低一个八度。在所有进行的实验中，音调修改后的音调与原始亮度和音色相似。这种技术允许我们通过操纵频谱参数来控制音乐音调的音色，如亮度，摩擦或其他调制。

引用次数: 2

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀