首页 > 最新文献

Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics最新文献

英文 中文
Efficient Methods for Simulating a Moving Talker in a Rectangular Room 矩形房间中移动说话者的有效模拟方法
B. Champagne, A. Lobo, P. Kabal
In this paper, we describe two methods for efficiently simulating the response of a microphone to a moving talker in a rectangular room. Both methods are based on an extension of the image method to moving sources. In the first method, the microphone output signal is obtained by performing a time-domain filtering operation on the original speech signal, while in the second method, a timefrequency representation of this filtering operation is used. In each case, computational load and memory requirements are considerably reduced by taking advantage of the fact that the talker velocity is much smaller than the speed of sound.
在本文中,我们描述了两种有效地模拟麦克风对矩形房间中移动的说话者的响应的方法。这两种方法都是基于对移动源的图像方法的扩展。在第一种方法中,通过对原始语音信号进行时域滤波操作获得麦克风输出信号,而在第二种方法中,使用该滤波操作的时频表示。在每种情况下,通过利用通话速度远小于声速的事实,计算负载和内存需求都大大减少。
{"title":"Efficient Methods for Simulating a Moving Talker in a Rectangular Room","authors":"B. Champagne, A. Lobo, P. Kabal","doi":"10.1109/ASPAA.1991.634134","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634134","url":null,"abstract":"In this paper, we describe two methods for efficiently simulating the response of a microphone to a moving talker in a rectangular room. Both methods are based on an extension of the image method to moving sources. In the first method, the microphone output signal is obtained by performing a time-domain filtering operation on the original speech signal, while in the second method, a timefrequency representation of this filtering operation is used. In each case, computational load and memory requirements are considerably reduced by taking advantage of the fact that the talker velocity is much smaller than the speed of sound.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129312967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Data sonification: issues and challenges 数据声音化:问题和挑战
S. Smith
Data "sonification." the representation of da t a in sound. is the auditory counterpart of da ta visualization. '171itle data isualizatioii is a matuiiiig-if not actuall? mature-field, sonification is quite young. It criii 1~~~ h a i d i o l i ( ~ e beguii i i t l i B1h ' 5 pioneeiing 1982 stud). Because tlie field 1s $0 nen , sonification li,is not >et eitahli5lietl i t . slur as tool for exploring and understanding data; inoleover. sonification f,lcc> t i i f f i ( l i l t ol)5tCic lei to i t> toritiniied eiJolution. These obstacles are the focus of this presentation.
数据“声音化”:数据在声音中的表现。是数据可视化的听觉对应物。“小数据可视化是一种成熟——如果不是实际上的话?”成熟领域的超声技术相当年轻。criii 1 h ~ ~ ~我d l o (t l ~ e beguii 我我B1h 1982 pioneeiing螺栓)。因为这个领域的15美元的新,声波li,并不是> >在它上面的10美元。slur作为探索和理解数据的工具;inoleover。f, lcc > t我我f f (l i l t ol) 5 tcic lei我t > toritiniied eiJolution。这些障碍是本次演讲的重点。
{"title":"Data sonification: issues and challenges","authors":"S. Smith","doi":"10.1109/ASPAA.1991.634106","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634106","url":null,"abstract":"Data \"sonification.\" the representation of da t a in sound. is the auditory counterpart of da ta visualization. '171itle data isualizatioii is a matuiiiig-if not actuall? mature-field, sonification is quite young. It criii 1~~~ h a i d i o l i ( ~ e beguii i i t l i B1h ' 5 pioneeiing 1982 stud). Because tlie field 1s $0 nen , sonification li,is not >et eitahli5lietl i t . slur as tool for exploring and understanding data; inoleover. sonification f,lcc> t i i f f i ( l i l t ol)5tCic lei to i t> toritiniied eiJolution. These obstacles are the focus of this presentation.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114262117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Active Attenuation With Overall System Modeling 主动衰减与整体系统建模
L. Eriksson, M. Allie
Adaptive filters are an attractive approach for control of an active attenuation system due to their ability t o adapt t o changes in the acoustical system or noise source. One approach, based on the filtered-X algorithm, uses a finite impulse response (FIR) filter structure with coefficients that are adapted using the least mean squares (LMS) algorithm [ll. The filtered-U algorithm features an infinite impulse response (IIR) filter structure and uses the recursive least mean squares (RLMS) adaptive algorithm [21. Both algorithms require knowledge of auxiliary path transfer functions following the adaptive filter to ensure proper convergence of the algorithm. One approach to obtaining these transfer functions has been previously described by the authors and uses an independent random noise source, as shown in Fig. 1, for the filtered-U algorithm [31. This presentation will explore the use of an alternative approach to auxiliary path modeling that does not require an additional noise source. This approach utilizes an overall system model, Q, and auxiliary path model, T, and is known as the Q-T modeling algorithm [2,4]. As shown in Fig. 2, two error signals are combined in this approach to form an overall error signal, ET(z), that is used t o adapt Q(z) and T(z): where the residual acoustic noise, and the difference of the outputs of models Q and T, EJz) = E(i<)-E(z) (1) The model, M(z), adapts to minimize E(z) while Q(z) and T(z) adapt to minimize E,Cz). The model, M(z), may use either a finite impulse response (FIR) filter structure or an infinite impulse response (IIR) filter structure. The supplementary models, Q(z) and T(z), could also use either an FIR or IIR model structure. Adaptation can be done using the LMS o r RLMS algorithms for the FIR or IIR structures, respectively. The error signal, E(z), goes t o zero for an IIW model formed from A(z) and B(z) when: M(z) = P(z)/[H(z)(l-P(~)F(z))l = A(z)/[l-B(z)] (4) where P(z> is the direct plant, F(z) is the feedback plant, and H(z) is the auxiliary path transfer function. In general, there are many possible solutions for A(z) and B(z) for various physical parameters. The overall error signal, ET(z), goes to zero and the residual noise is minimized when: E(z) = E'(z) = 0 (5) which requires $!(z)~(z) = M(z) = P(z)/[H(z)(l-P(~)F(z))l (6) and there are again many solutions for Q(z) and T(z) for various physical parameters. However, T(z) is also used …
自适应滤波器由于能够适应声学系统或噪声源的变化而成为控制有源衰减系统的一种有吸引力的方法。一种基于滤波- x算法的方法,使用有限脉冲响应(FIR)滤波器结构,其系数采用最小均方(LMS)算法[11]。滤波后的u算法具有无限脉冲响应(IIR)滤波器结构,并使用递归最小均方(RLMS)自适应算法[21]。这两种算法都需要了解自适应滤波器后的辅助路径传递函数,以确保算法的适当收敛。获得这些传递函数的一种方法之前已经被作者描述过,该方法使用一个独立的随机噪声源,如图1所示,用于滤波- u算法[31]。本演讲将探讨一种不需要额外噪声源的辅助路径建模的替代方法的使用。该方法利用整体系统模型Q和辅助路径模型T,称为Q-T建模算法[2,4]。如图2所示,该方法将两个误差信号组合成一个整体误差信号ET(z),用于自适应Q(z)和t (z):其中残余噪声与模型Q和t的输出之差EJz) = E(i为直接植物,F(z)为反馈植物,H(z)为辅助路径传递函数。一般来说,对于不同的物理参数,A(z)和B(z)有许多可能的解。当E(z) = E'(z) = 0(5)时,总体误差信号ET(z)趋于零,残余噪声最小,这需要$!(z)~(z) = M(z) = P(z)/[H(z)(l-P(~)F(z)))l(6),并且对于各种物理参数,Q(z)和T(z)也有许多解。然而,T(z)也被用来…
{"title":"Active Attenuation With Overall System Modeling","authors":"L. Eriksson, M. Allie","doi":"10.1109/ASPAA.1991.634144","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634144","url":null,"abstract":"Adaptive filters are an attractive approach for control of an active attenuation system due to their ability t o adapt t o changes in the acoustical system or noise source. One approach, based on the filtered-X algorithm, uses a finite impulse response (FIR) filter structure with coefficients that are adapted using the least mean squares (LMS) algorithm [ll. The filtered-U algorithm features an infinite impulse response (IIR) filter structure and uses the recursive least mean squares (RLMS) adaptive algorithm [21. Both algorithms require knowledge of auxiliary path transfer functions following the adaptive filter to ensure proper convergence of the algorithm. One approach to obtaining these transfer functions has been previously described by the authors and uses an independent random noise source, as shown in Fig. 1, for the filtered-U algorithm [31. This presentation will explore the use of an alternative approach to auxiliary path modeling that does not require an additional noise source. This approach utilizes an overall system model, Q, and auxiliary path model, T, and is known as the Q-T modeling algorithm [2,4]. As shown in Fig. 2, two error signals are combined in this approach to form an overall error signal, ET(z), that is used t o adapt Q(z) and T(z): where the residual acoustic noise, and the difference of the outputs of models Q and T, EJz) = E(i<)-E(z) (1) The model, M(z), adapts to minimize E(z) while Q(z) and T(z) adapt to minimize E,Cz). The model, M(z), may use either a finite impulse response (FIR) filter structure or an infinite impulse response (IIR) filter structure. The supplementary models, Q(z) and T(z), could also use either an FIR or IIR model structure. Adaptation can be done using the LMS o r RLMS algorithms for the FIR or IIR structures, respectively. The error signal, E(z), goes t o zero for an IIW model formed from A(z) and B(z) when: M(z) = P(z)/[H(z)(l-P(~)F(z))l = A(z)/[l-B(z)] (4) where P(z> is the direct plant, F(z) is the feedback plant, and H(z) is the auxiliary path transfer function. In general, there are many possible solutions for A(z) and B(z) for various physical parameters. The overall error signal, ET(z), goes to zero and the residual noise is minimized when: E(z) = E'(z) = 0 (5) which requires $!(z)~(z) = M(z) = P(z)/[H(z)(l-P(~)F(z))l (6) and there are again many solutions for Q(z) and T(z) for various physical parameters. However, T(z) is also used …","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129437537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Demonstration of The Ircam Signal Processing Workstation Ircam信号处理工作站的演示
M. Puckette, E. Lindemann, C. Lippe
This is a project begun in August 19SS involving eight to ten engineers. The intention is to provide a system which is n-ell adapted to both real-time signal processing and event processing. The system uses the NeXT machine as host computer. We have developed a high-speed general purpose multiprocessor configured as plugin boards for the NeXT cube. The board, designed at IRCAM, uses two Intel i860 processors for number crunching and a 560001 for I/O. Three boards can be plugged into a NeXT cube for a total of 6 i860's.
这个项目开始于1919年8月,涉及8到10名工程师。其目的是提供一种同时适应实时信号处理和事件处理的系统。该系统使用NeXT机作为主机。我们开发了一个高速通用多处理器,配置为NeXT多维数据集的插件板。该板由IRCAM设计,使用两个英特尔i860处理器进行数字处理,一个560001处理器用于I/O。三个板可以插入到NeXT立方体中,总共有6个i860。
{"title":"A Demonstration of The Ircam Signal Processing Workstation","authors":"M. Puckette, E. Lindemann, C. Lippe","doi":"10.1109/ASPAA.1991.634153","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634153","url":null,"abstract":"This is a project begun in August 19SS involving eight to ten engineers. The intention is to provide a system which is n-ell adapted to both real-time signal processing and event processing. The system uses the NeXT machine as host computer. We have developed a high-speed general purpose multiprocessor configured as plugin boards for the NeXT cube. The board, designed at IRCAM, uses two Intel i860 processors for number crunching and a 560001 for I/O. Three boards can be plugged into a NeXT cube for a total of 6 i860's.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133581246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time versus Frequency Resolution in a Low-Rate, High Quality Audio Transform Coder 低速率、高质量音频变换编码器的时间与频率分辨率
M. Bosi, G. Davidson, L. Fielder
An adaptive block size transform coder for high quality music has been developed. The adaptability of the input size of the transform combined with the properties of the transform as developed in the Dolby AC-2 technology allows one to exploit both maximum time and frequency resolution while keeping the bit rate as low as 128 kb/s per channel. The low complexity of the system permits a real-time implementation of encoder or decoder using one general purpose, programmable DSP chip per channel pair.
开发了一种适用于高质量音乐的自适应块大小变换编码器。在杜比AC-2技术中开发的转换的输入大小的适应性与转换的特性相结合,使人们能够利用最大的时间和频率分辨率,同时保持比特率低至128 kb/s每通道。系统的低复杂性允许实时实现编码器或解码器,每个通道对使用一个通用的,可编程的DSP芯片。
{"title":"Time versus Frequency Resolution in a Low-Rate, High Quality Audio Transform Coder","authors":"M. Bosi, G. Davidson, L. Fielder","doi":"10.1109/ASPAA.1991.634112","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634112","url":null,"abstract":"An adaptive block size transform coder for high quality music has been developed. The adaptability of the input size of the transform combined with the properties of the transform as developed in the Dolby AC-2 technology allows one to exploit both maximum time and frequency resolution while keeping the bit rate as low as 128 kb/s per channel. The low complexity of the system permits a real-time implementation of encoder or decoder using one general purpose, programmable DSP chip per channel pair.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132355655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Adaptive High Pass Filtering with Expansion 自适应高通滤波扩展
B.D. Wwdruff, D. Preves, T. Fortune
Traditionally, the goal of a hearing aid fitting is to bring the acoustic level of speech above the hearing threshold of the impaired ear. Linear amplification is usually sufficient to bring the overall level of speech above threshold. Unfortunately, the information-bearing;, but low level, high frequency spectral components of speech often do not receive sufficient amplification from linear hearing aids to ensure their audibility.
传统上,助听器安装的目标是使语音的声学水平高于受损耳朵的听力阈值。线性放大通常足以使语音的整体水平高于阈值。不幸的是,语音的信息承载,但低水平,高频频谱成分往往没有得到线性助听器足够的放大,以确保其可听性。
{"title":"Adaptive High Pass Filtering with Expansion","authors":"B.D. Wwdruff, D. Preves, T. Fortune","doi":"10.1109/ASPAA.1991.634118","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634118","url":null,"abstract":"Traditionally, the goal of a hearing aid fitting is to bring the acoustic level of speech above the hearing threshold of the impaired ear. Linear amplification is usually sufficient to bring the overall level of speech above threshold. Unfortunately, the information-bearing;, but low level, high frequency spectral components of speech often do not receive sufficient amplification from linear hearing aids to ensure their audibility.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128423787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Adaptive Q Cochlear Filter in Phoneme Recognition 一种用于音素识别的自适应Q耳蜗滤波器
T. Hirahara
Introduction It has been expected that speech recognition performance can be improved by replacing a traditional front-end with a model of the auditory periphery. The underlying assumption is that if a model could be designed properly, it should generate a more efficient representation compared to traditional physical spectrum representations. From this viewpoint, several works have been reported [11-[41. However, these studies do not always show an auditory front-end to be superior to a traditional front-end. Some auditory front-ends are superior only for noisy speech, but many show little, if any, superiority in processing clean speech. We also have been developing an auditory model characterized by an adaptive Q cochlear filter not only for the front-end of a speech recognition system but also for a general purpose spectral1 analyzer in speech research [SI. In this paper, several auditory front-ends based on the adaptive Q cochlear filter and its relatives are tested in speaker dependent phoneme recognition using different stochastic pattern classifier!;, a shift invariant multi template matching system using LVQ2-trained codebook, and a VQ-HMM system. Further, we will discuss problems of using an auditory model as a frontend of an automatic speech recognition system. 2. Adaptive Q Cochlear Filter An adaptive Q cochlear filter (AQF) is a computational filter that functionally simulates the nonlinear filtering characteristics of the basilar membrane vibrating system. The AQF consists of three parts: (1) cascaded second-order notch filters (NOTCH), (2) second-order band pass filters (BPF) connected to each NOTCH output, and (3) adaptive Q circuits connected to each BPF output. The adaptive Q circuit consists of a second-order low-pass filter (LPF) in
人们一直期望通过用听觉外围模型取代传统的前端来提高语音识别性能。潜在的假设是,如果一个模型可以被适当地设计,它应该比传统的物理频谱表示产生更有效的表示。从这一观点出发,已经报道了一些作品[11-[41]。然而,这些研究并不总是表明听觉前端优于传统前端。一些听觉前端只对嘈杂的语音有优势,但许多在处理干净的语音方面几乎没有优势。我们还开发了一种以自适应Q耳蜗滤波器为特征的听觉模型,不仅用于语音识别系统的前端,也用于语音研究中的通用频谱分析仪[SI]。本文采用不同的随机模式分类器,对基于自适应Q耳蜗滤波器及其相关听觉前端进行了基于说话人的音素识别测试;采用lvq2训练码本的移位不变多模板匹配系统,以及VQ-HMM系统。此外,我们将讨论使用听觉模型作为自动语音识别系统前端的问题。2. 自适应Q耳蜗滤波器(Adaptive Q Filter, AQF)是一种模拟基底膜振动系统非线性滤波特性的计算滤波器。AQF由三部分组成:(1)级联二阶陷波滤波器(notch),(2)连接到每个notch输出端的二阶带通滤波器(BPF),以及(3)连接到每个BPF输出端的自适应Q电路。自适应Q电路由一个二阶低通滤波器(LPF)组成
{"title":"An Adaptive Q Cochlear Filter in Phoneme Recognition","authors":"T. Hirahara","doi":"10.1109/ASPAA.1991.634091","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634091","url":null,"abstract":"Introduction It has been expected that speech recognition performance can be improved by replacing a traditional front-end with a model of the auditory periphery. The underlying assumption is that if a model could be designed properly, it should generate a more efficient representation compared to traditional physical spectrum representations. From this viewpoint, several works have been reported [11-[41. However, these studies do not always show an auditory front-end to be superior to a traditional front-end. Some auditory front-ends are superior only for noisy speech, but many show little, if any, superiority in processing clean speech. We also have been developing an auditory model characterized by an adaptive Q cochlear filter not only for the front-end of a speech recognition system but also for a general purpose spectral1 analyzer in speech research [SI. In this paper, several auditory front-ends based on the adaptive Q cochlear filter and its relatives are tested in speaker dependent phoneme recognition using different stochastic pattern classifier!;, a shift invariant multi template matching system using LVQ2-trained codebook, and a VQ-HMM system. Further, we will discuss problems of using an auditory model as a frontend of an automatic speech recognition system. 2. Adaptive Q Cochlear Filter An adaptive Q cochlear filter (AQF) is a computational filter that functionally simulates the nonlinear filtering characteristics of the basilar membrane vibrating system. The AQF consists of three parts: (1) cascaded second-order notch filters (NOTCH), (2) second-order band pass filters (BPF) connected to each NOTCH output, and (3) adaptive Q circuits connected to each BPF output. The adaptive Q circuit consists of a second-order low-pass filter (LPF) in","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130300520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modelling of Binaural Listening Situation and Its Realization by Using Tenth Scaled-Down Modelling-Technique 双耳听力情境建模及十倍缩小建模技术的实现
N. Xiang, K. Genuit
A normal listening situation in a room involes, at least, a listener who has a pair of functioning ears. The emphasis does lie on the word ears because our auditory system can be referred to as a final receiver in this listening situation, while the human external-ear can be observed as an antenna of acoustic signals in the sense of telecomunication. Generally speaking, a listening situation in a room usually contains one or several sound source(s). In the perspective of system theory, such a situation may be modelled by a transmission system between the inputs of sound sources and the outputs of receivers. This acoustic environment can often be described, particularly for one listener, by a linear time-invariant system with two outputs and probably several inputs (FigJ), if the source(s) and the receiver are not moving. Define an impulse response hi, between each input and each output, the relation between the output signals E, and the input signals Ij of the system can therefore be presented by the following equation:
在一个房间里,一个正常的听力情况至少需要一个有一双正常听觉的听众。重点确实在于“耳朵”这个词,因为我们的听觉系统可以被称为这种听力情况下的最终接收器,而人类的外耳可以被视为通信意义上的声学信号天线。一般来说,一个房间里的聆听环境通常包含一个或几个声源。从系统理论的角度来看,这种情况可以用声源输入和接收器输出之间的传输系统来模拟。如果源(s)和接收器不移动,这种声学环境通常可以通过具有两个输出和可能几个输入的线性时不变系统来描述,特别是对于一个听众(FigJ)。定义一个脉冲响应hi,在每个输入和每个输出之间,系统的输出信号E与输入信号Ij之间的关系可以用下式表示:
{"title":"Modelling of Binaural Listening Situation and Its Realization by Using Tenth Scaled-Down Modelling-Technique","authors":"N. Xiang, K. Genuit","doi":"10.1109/ASPAA.1991.634099","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634099","url":null,"abstract":"A normal listening situation in a room involes, at least, a listener who has a pair of functioning ears. The emphasis does lie on the word ears because our auditory system can be referred to as a final receiver in this listening situation, while the human external-ear can be observed as an antenna of acoustic signals in the sense of telecomunication. Generally speaking, a listening situation in a room usually contains one or several sound source(s). In the perspective of system theory, such a situation may be modelled by a transmission system between the inputs of sound sources and the outputs of receivers. This acoustic environment can often be described, particularly for one listener, by a linear time-invariant system with two outputs and probably several inputs (FigJ), if the source(s) and the receiver are not moving. Define an impulse response hi, between each input and each output, the relation between the output signals E, and the input signals Ij of the system can therefore be presented by the following equation:","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130024685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Study of the perceptive variations of room effect for different systems of sound recording and replaying 不同录音和回放系统对房间效果的感知变化研究
J. Jullien, O. Warusfel
Background For telecommunication services such as teleconferencing or visioconferencing, the true restitution of sound environment, through a complete transmission system, has become a new goal. The aim now encompasses the presentation of the localization of distant speakers as well as other aspects of the distant sound ambiance such as the room effect which contains auditive infoxmations about its size and the positions of the speakers within the room. This extended version of sound transmission fidelity will become a necessity when will be available sophisticated syste:ms for image transmission, now under project, offering high definition, large screen and stereo vision. At present, the applications of what is often called "virtual acousiics" are found primarily in the musical domain. One goal is to allow an auditor tio listen just as if he were in the hall during the performance although he listens through a complete system including performance recording and replaying or transmission. An other application for contemporary music is to achieve sound diffusion of digitally transformed signals of natural sources, previously recorded or synthesized with loudspeakers.
对于电话会议或视频会议等电信业务而言,通过完整的传输系统真正还原声音环境已成为新的目标。现在的目标包括远距离扬声器的定位,以及远距离声音环境的其他方面,如房间效果,其中包含有关其大小和扬声器在房间内位置的听觉信息。这种声音传输保真度的扩展版本将成为一种必要,因为将有先进的图像传输系统:ms,目前正在项目中,提供高清晰度,大屏幕和立体视觉。目前,通常被称为“虚拟声学”的应用主要是在音乐领域。其中一个目标是,通过录音、重放、传输等完整的系统,让听众在演奏过程中像在大厅里一样聆听。当代音乐的另一个应用是实现自然来源的数字转换信号的声音扩散,这些信号以前是用扬声器录制或合成的。
{"title":"Study of the perceptive variations of room effect for different systems of sound recording and replaying","authors":"J. Jullien, O. Warusfel","doi":"10.1109/ASPAA.1991.634103","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634103","url":null,"abstract":"Background For telecommunication services such as teleconferencing or visioconferencing, the true restitution of sound environment, through a complete transmission system, has become a new goal. The aim now encompasses the presentation of the localization of distant speakers as well as other aspects of the distant sound ambiance such as the room effect which contains auditive infoxmations about its size and the positions of the speakers within the room. This extended version of sound transmission fidelity will become a necessity when will be available sophisticated syste:ms for image transmission, now under project, offering high definition, large screen and stereo vision. At present, the applications of what is often called \"virtual acousiics\" are found primarily in the musical domain. One goal is to allow an auditor tio listen just as if he were in the hall during the performance although he listens through a complete system including performance recording and replaying or transmission. An other application for contemporary music is to achieve sound diffusion of digitally transformed signals of natural sources, previously recorded or synthesized with loudspeakers.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122833614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Perceptual Effects Of Noise Disturbances On Phase Spectrum In Stft Analysis/synthesis Procedures. Application To Restoration Processes Stft分析/合成过程中噪声干扰对相位谱的感知影响。应用于恢复程序
O. Cappé, A. Chaigne
Restoration of audio recordings degraded by surface noise can be viewed as an analysWsynthesis procedure where the modulus of the Fouriex transform is replaced by an estimator before resynthesis. The goal is usually to find the best possible estimator in terms of noise power reduction. In current restoration procedures there are no modifications of the phase specuum, because most authors consider that the phase is perceptually irrelevant [l]. Therefore the prime objective of the work presented here was to check the validity of these assumptions, and put some emphasis on the degradation of the phase. For that purpose, an analysis/synthesis procedure simulating a restoration p m s has been carried out on artificially degraded signals. An overview of this procedure can be Seen in Fig. 1. addibve noise phase Original signal-EK""'"' phase Fig. 1 Simulation of a restoration process based on Short-Time-Fourier-Transform, with perfect modulus recovering and degraded phase. In these later experiments, the restored signal is obtained from the modulus of the original signal and from the phase of the degraded signal. Thus the estimator of the modulus is equivalent to the one which would be obtained through "perfect" cancellation of the noise. As a consequence, the remaining degradation of the restored signal is only due to the influence of the additive noise in the phase spectrum. The first goal of the work is to calculate an estimator for the phase deviation in the restored signal, which depends on both the noise characteristics and the parameters of the analysis/synthesis plocedure. The results are then compared with psychoacoustical data related to the perception of modulations. This comparison is aimed at providing an appropriate selection for the STFT parameters. Following Vary [2]. the noise component is assumed to be gaussian. For a sine wave of frequency fo = p Fe / N, where Fe is the sampling frequency and N the size of the window, it can be shown that the expectation for the maximum phase deviation at fo is given by
被表面噪声退化的录音的恢复可以看作是一个分析合成过程,在重新合成之前,傅里叶变换的模量被一个估计量取代。目标通常是在降低噪声功率方面找到最好的估计器。在目前的恢复过程中,相位谱没有变化,因为大多数作者认为相位在感知上是不相关的[1]。因此,这里提出的工作的主要目标是检查这些假设的有效性,并把一些重点放在阶段的退化上。为此,对人为退化的信号进行了模拟恢复过程的分析/合成过程。这个过程的概述可以在图1中看到。图1基于短时傅立叶变换的模量完全恢复、相位退化的恢复过程仿真。在这些后期的实验中,从原始信号的模量和退化信号的相位中获得恢复信号。因此,模量的估计量等于通过“完全”消除噪声而得到的估计量。因此,恢复信号的剩余退化仅是由于相位谱中加性噪声的影响。这项工作的第一个目标是计算恢复信号中相位偏差的估计量,这取决于噪声特性和分析/合成过程的参数。然后将结果与与调制感知相关的心理声学数据进行比较。这种比较的目的是为STFT参数提供一个适当的选择。Following Vary[2]。假设噪声分量是高斯的。对于频率为fo = p Fe / N的正弦波,其中Fe为采样频率,N为窗口大小,可以表明,在fo处最大相位偏差的期望为
{"title":"Perceptual Effects Of Noise Disturbances On Phase Spectrum In Stft Analysis/synthesis Procedures. Application To Restoration Processes","authors":"O. Cappé, A. Chaigne","doi":"10.1109/ASPAA.1991.634141","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634141","url":null,"abstract":"Restoration of audio recordings degraded by surface noise can be viewed as an analysWsynthesis procedure where the modulus of the Fouriex transform is replaced by an estimator before resynthesis. The goal is usually to find the best possible estimator in terms of noise power reduction. In current restoration procedures there are no modifications of the phase specuum, because most authors consider that the phase is perceptually irrelevant [l]. Therefore the prime objective of the work presented here was to check the validity of these assumptions, and put some emphasis on the degradation of the phase. For that purpose, an analysis/synthesis procedure simulating a restoration p m s has been carried out on artificially degraded signals. An overview of this procedure can be Seen in Fig. 1. addibve noise phase Original signal-EK\"\"'\"' phase Fig. 1 Simulation of a restoration process based on Short-Time-Fourier-Transform, with perfect modulus recovering and degraded phase. In these later experiments, the restored signal is obtained from the modulus of the original signal and from the phase of the degraded signal. Thus the estimator of the modulus is equivalent to the one which would be obtained through \"perfect\" cancellation of the noise. As a consequence, the remaining degradation of the restored signal is only due to the influence of the additive noise in the phase spectrum. The first goal of the work is to calculate an estimator for the phase deviation in the restored signal, which depends on both the noise characteristics and the parameters of the analysis/synthesis plocedure. The results are then compared with psychoacoustical data related to the perception of modulations. This comparison is aimed at providing an appropriate selection for the STFT parameters. Following Vary [2]. the noise component is assumed to be gaussian. For a sine wave of frequency fo = p Fe / N, where Fe is the sampling frequency and N the size of the window, it can be shown that the expectation for the maximum phase deviation at fo is given by","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122543648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1