Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics最新文献

英文中文

Design of fractional delay filters using convex optimization 基于凸优化的分数阶延迟滤波器设计

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625616

W. Putnam, J. Smith

Fractional sample delay (FD) filters are useful and necessary in many applications, such as the accurate steering of acoustic arrays, delay lines for physical models of musical instruments, and time delay estimation. This paper addresses the design of finite impulse response (FIR) FD filters. The problem is posed as a convex optimization problem in which the maximum modulus of the complex error is minimized. Several design examples are presented, along with an empirical formula for the filter order required to meet a given worst case group delay error specification.

分数采样延迟(FD)滤波器在许多应用中都是有用和必要的，例如声学阵列的精确转向，乐器物理模型的延迟线和时间延迟估计。本文讨论了有限脉冲响应(FIR) FD滤波器的设计。该问题是一个求最大复误差模量最小的凸优化问题。给出了几个设计实例，以及满足给定最坏情况组延迟误差规范所需的滤波器阶数的经验公式。

引用次数: 25

Elimination of delay-free loops in discrete-time models of nonlinear acoustic systems 非线性声学系统离散时间模型中无延迟回路的消除

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625617

G. Borin, Giovanni De Poli, D. Rocchesso

Nonlinear acoustic systems are often described by means of nonlinear maps which act as instantaneous constraints on the solutions of a system of linear differential equations. This description leads to discrete-time models exhibiting non-computable loops. This paper presents a solution to this computability problem by means of geometrical transformation of the nonlinearities and algebraic transformation of the time-dependent equations. The proposed leads to stable and accurate simulations even at relatively low sampling rates.

非线性声学系统通常用非线性映射来描述，非线性映射作为线性微分方程组解的瞬时约束。这种描述导致离散时间模型表现出不可计算的循环。本文利用非线性的几何变换和时变方程的代数变换，给出了这一可计算性问题的解法。即使在相对较低的采样率下，所提出的方法也能实现稳定和准确的模拟。

引用次数: 109

A hybrid mono/stereo acoustic echo canceler 混合单声道/立体声回声消除器

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625634

J. Benesty, D. Morgan, M. Sondhi

In many applications, such as teleconferencing, multimedia workstations, televideo games, etc., stereo sound is already, or will soon be, implemented to give spatial realism that mono systems cannot offer. In such hands-free systems, stereophonic acoustic echo cancelers are absolutely necessary for full-duplex communication. We propose a new acoustic echo canceler (AEC) based on a fundamental experimental observation that the stereo effect is due mostly to sound energy below about 1000 Hz. The principle of the hybrid mono/stereo AEC is to use stereophonic sound with a stereo AEC at low frequencies (e.g., below 1000 Hz) and monophonic sound with a conventional mono AEC at higher frequencies (e.g., above 1000 Hz). This solution is a good compromise between the complexity of a full-band stereo AEC and spatial realism. For the stereo case we borrow from a previous innovation and add a small nonlinearity into each channel in order to accurately identify the two receiving room impulse responses.

在许多应用中，如电话会议、多媒体工作站、电视游戏等，立体声已经或即将实现，以提供单声道系统无法提供的空间真实感。在这种免提系统中，立体声回声消除器对于全双工通信是绝对必要的。我们提出了一种新的声学回声消除器(AEC)，基于一个基本的实验观察，立体声效应主要是由于声能低于1000赫兹。单声道/立体声混合AEC的原理是在低频(例如低于1000赫兹)使用立体声AEC的立体声声音，在高频(例如高于1000赫兹)使用传统单声道AEC的单声道声音。这种解决方案是在全波段立体声AEC的复杂性和空间真实感之间的一个很好的折衷。对于立体声的情况，我们借鉴了以前的创新，并在每个通道中添加了一个小的非线性，以便准确地识别两个接收室的脉冲响应。

引用次数: 34

Warped filters and their audio applications 扭曲过滤器和他们的音频应用

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625615

M. Karjalainen, A. Harma, U. Laine, J. Huopaniemi

An inherent property of many DSP algorithms is that they tend to exhibit uniform frequency resolution from zero to the Nyquist frequency. This is a direct consequence of using unit delays as building blocks; a frequency independent delay implies uniform frequency resolution. In audio applications, however, this is often an undesirable feature since the response properties are typically specified and measured on a logarithmic scale, following the behavior of the human auditory system. We present an overview of warped filters and DSP techniques which can be designed to better match the audio and auditory criteria. Audio applications, including modeling of auditory and musical phenomena, equalization techniques, auralization, and audio coding, are presented.

许多DSP算法的一个固有特性是，它们倾向于表现出从零到奈奎斯特频率的均匀频率分辨率。这是使用单位延迟作为构建模块的直接结果;频率无关的延迟意味着均匀的频率分辨率。然而，在音频应用中，这通常是一个不希望出现的特性，因为响应属性通常是按照人类听觉系统的行为以对数尺度指定和测量的。我们介绍了扭曲滤波器和DSP技术的概述，可以设计成更好地匹配音频和听觉标准。音频应用，包括听觉和音乐现象的建模，均衡技术，听觉化，和音频编码，提出。

引用次数: 36

Establishing the tonal context for musical pattern recognition 建立音乐模式识别的调性背景

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625608

I. Shmulevich, E. Coyle

We develop a method for establishing tonal contexts of musical patterns in a musical composition. This is subsequently incorporated into a system for recognition of musical patterns. Krumhansl's (1990) key-finding algorithm is used as a basis. The sequence of maximum correlations that it outputs is smoothed with a cubic spline and is used to determine weights for perceptual and absolute pitch errors. Statistically significant maximum correlations are used to create the assigned key sequence, which is then median filtered to improve the structure of the output of the key finding algorithm.

我们发展了一种在音乐作品中建立音乐模式的调性背景的方法。这随后被整合到音乐模式识别系统中。使用Krumhansl(1990)的寻键算法作为基础。它输出的最大相关序列用三次样条平滑，并用于确定感知和绝对音高误差的权重。使用统计上显著的最大相关性来创建分配的键序列，然后对其进行中值过滤，以改进键查找算法的输出结构。

引用次数: 14

Modeling the Haas effect: a first step for solving the CASA problem 模拟哈斯效应:解决CASA问题的第一步

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625627

J. K. Bates

Auditory scene analysis and its older cousin, the Haas/precedence effect both involve the same acoustic and auditory phenomena. In each case it is necessary to explain the ear's ability both to hear and pay attention to sources within a background of reverberations. Thus, a successful model of the Haas effect should be capable of being extended to CASA applications. We present a model based on a vector space of elementary meaning that is somewhat similar to Divenyi's (1995) "three cardinal dimensions". Test results replicating the Haas effect demonstrate ability to select and track, without foreknowledge, the azimuth direction of arrival of acoustic sources in a background of reverberations and environment noise.

听觉场景分析和它的前辈，哈斯/优先效应都涉及相同的声学和听觉现象。在每种情况下，都有必要解释耳朵在混响背景中听到和注意音源的能力。因此，一个成功的哈斯效应模型应该能够推广到CASA应用中。我们提出了一个基于基本意义的向量空间的模型，有点类似于Divenyi(1995)的模型。“三维”。模拟哈斯效应的测试结果表明，在没有预知的情况下，可以在混响和环境噪声背景下选择和跟踪声源到达的方位方向。

引用次数: 0

Robust time delay estimation for sound source localization in noisy environments 噪声环境下声源定位的鲁棒时延估计

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625638

P.G. Georgiou, C. Kyriakakis, P. Tsakalides

This paper addresses the problem of robust localization of a sound source in a wide range of operating environments. We use fractional lower order statistics in the frequency domain of two-sensor measurements to accurately locate the source in impulsive noise. We demonstrate a significant improvement in detection via simulation experiments of a sound source in /spl alpha/-stable noise. Applications of this technique include the efficient steering of a microphone array in teleconference applications.

本文讨论了在各种操作环境下声源的鲁棒定位问题。我们在双传感器测量的频域中使用分数阶低阶统计量来精确定位脉冲噪声中的源。我们通过声源在/spl α /-稳定噪声的模拟实验证明了检测的显着改进。该技术的应用包括电话会议应用中麦克风阵列的有效转向。

引用次数: 21

An efficient HRTF model for 3-D sound 三维声音的高效HRTF模型

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625596

Phillip Brown, R. Duda

A simple model is presented for synthesizing binaural sound from a monaural source. The model produces vertical as well as horizontal and externalization effects. The simplicity of the model permits efficient implementation, allowing for real-time multisource operation. Additionally, the parameters in the model can be adjusted to fit a particular individual's characteristics.

提出了一种从单声源合成双声源的简单模型。该模型既产生纵向效应，也产生横向效应和外部化效应。模型的简单性允许有效的实现，允许实时多源操作。此外，模型中的参数可以调整以适应特定个体的特征。

引用次数: 52

The continuous frequency dynamic range compressor 连续频率动态范围压缩机

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625580

E. Lindemann

The typical multiband audio compressor (TMC), such as that used in many modern hearing aids, consists of a bandpass filter bank coupled to a compression circuit which applies gain to each frequency band as a function of power in that band. Generally the filter bank is designed so that the sum of magnitude responses of the filters is unity with the band edges as steep as the implementation will allow, to minimize overlap between bands. There are a number of problems with this approach. Difficult decisions must be made regarding placement of band edges. While the composite response for broad band signals may be flat, the narrow band-e.g. swept sine-response exhibits bumps near the band edges. In other words, the system is non-shift invariant with respect to frequency. We show that these problems can be eliminated by increasing the number of bands, and by extending the overlap region between bands. The problem is examined in terms of frequency domain sampling of the power spectrum. If the sampling rate is sufficiently high then artifacts disappear, and the system can be viewed as continuous in frequency with no band edges.

典型的多频带音频压缩器(TMC)，如许多现代助听器中使用的，由一个带通滤波器组耦合到一个压缩电路，该电路将每个频带的增益作为该频带功率的函数。通常，滤波器组的设计使滤波器的幅度响应之和与实现允许的频带边缘尽可能陡一致，以尽量减少频带之间的重叠。这种方法存在许多问题。关于带边的位置必须做出困难的决定。虽然宽带信号的复合响应可能是平坦的，但窄带信号(例如:扫描正弦响应在带边缘附近呈现凸起。换句话说，系统对频率是非移不变的。我们表明，这些问题可以通过增加频带数量和扩大频带之间的重叠区域来消除。从功率谱的频域采样的角度来研究这个问题。如果采样率足够高，则伪影消失，并且系统可以被视为连续的频率，没有带边。

引用次数: 11

Towards a model of loudness recalibration 朝向一个响度重新校准的模型

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Pub Date : 1997-10-19 DOI: 10.1109/ASPAA.1997.625621

D. Mapes-Riordan, W. Yost

The Zwicker (1977, 1990) loudness model is a standard for predicting the loudness of a sound. This model, along with Moore and Glasberg's (see Acustica, vol.82, p.335-45, 1996) revision of it, is fairly accurate at predicting the loudness of steady-state sounds, but falls short for many types of temporally varying sounds. One temporal effect not accounted for in the Zwicker model is loudness recalibration. Loudness recalibration is a fatigue-like effect that makes a quiet tone at one frequency even quieter when it is preceded by a louder tone at the same frequency. The evidence suggests that loudness recalibration occurs in the central nervous system. Two means of modeling loudness recalibration are proposed. The first is an algorithmic description of the recalibration effect that could be added to the later stages of the Zwicker model. The other method uses a neural network and is based on a spike-train timing theory of hearing rather than a rate-place theory as assumed by the Zwicker model. This spike-train timing approach is unique in that spike-train averaging is postponed until a final loudness estimate is made. A more complete and accurate model of loudness recalibration will have to wait until more experimental data is collected.

Zwicker(1977,1990)响度模型是预测声音响度的标准。这个模型，以及摩尔和格拉斯伯格对它的修正(见《声学》，第82卷，第335-45页，1996年)，在预测稳态声音的响度方面相当准确，但对许多类型的时间变化声音的预测就不够了。Zwicker模型中没有考虑到的一个时间效应是响度重新校准。响度重新校准是一种类似疲劳的效果，当一个频率的安静音调之前有一个相同频率的更响亮的音调时，它会变得更安静。有证据表明，响度重新校准发生在中枢神经系统。提出了两种模拟响度再标定的方法。第一个是对重新校准效应的算法描述，可以添加到Zwicker模型的后期阶段。另一种方法使用神经网络，并基于听觉的尖峰列车定时理论，而不是Zwicker模型假设的率位理论。这种尖峰列车定时方法的独特之处在于，尖峰列车的平均被推迟到最后的响度估计完成。一个更完整和准确的响度重新校准模型必须等到收集到更多的实验数据。

{"title":"Towards a model of loudness recalibration","authors":"D. Mapes-Riordan, W. Yost","doi":"10.1109/ASPAA.1997.625621","DOIUrl":"https://doi.org/10.1109/ASPAA.1997.625621","url":null,"abstract":"The Zwicker (1977, 1990) loudness model is a standard for predicting the loudness of a sound. This model, along with Moore and Glasberg's (see Acustica, vol.82, p.335-45, 1996) revision of it, is fairly accurate at predicting the loudness of steady-state sounds, but falls short for many types of temporally varying sounds. One temporal effect not accounted for in the Zwicker model is loudness recalibration. Loudness recalibration is a fatigue-like effect that makes a quiet tone at one frequency even quieter when it is preceded by a louder tone at the same frequency. The evidence suggests that loudness recalibration occurs in the central nervous system. Two means of modeling loudness recalibration are proposed. The first is an algorithmic description of the recalibration effect that could be added to the later stages of the Zwicker model. The other method uses a neural network and is based on a spike-train timing theory of hearing rather than a rate-place theory as assumed by the Zwicker model. This spike-train timing approach is unique in that spike-train averaging is postponed until a final loudness estimate is made. A more complete and accurate model of loudness recalibration will have to wait until more experimental data is collected.","PeriodicalId":347087,"journal":{"name":"Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115406000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀