首页 > 最新文献

Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics最新文献

英文 中文
An Overview of the MPEG/audio Compression Algorithm MPEG/音频压缩算法概述
D. Pan
This paper gives a summary of the MPEG/audio compression algorithm. This algorithm was developed by the Motion Picture Experts Group (MPEG), as an International Organization for Standardization standard for the high fidelity compression of digital audio. The MPEG/audio compression standard is one part of a multiple part standard that addresses the compression of video (11172-2), the compression of audio (11172-3), and the synchronization of the audio, video, and related data streams (11172-1) to an aggregate bit rate of about 1.5 Mbit/sec. The MPEG/audio standard also can be used for audio-only applications to compress high fidelity audio data at much lower bit rates. While the MPEG/audio compression algorithm is lossy, often it can provide `transparent', perceptually lossless, compression even with compression factors of 6-to-1 or more. The algorithm works by exploiting the perceptual weaknesses of the human ear. This paper also will cover the basics of psychoacoustic modeling and the methods used by the MPEG/audio algorithm to compress audio data with least perceptible degradation.
本文对MPEG/音频压缩算法进行了综述。该算法是由电影专家组(MPEG)开发的,作为国际标准化组织对数字音频进行高保真压缩的标准。MPEG/音频压缩标准是多部分标准的一部分,该标准解决了视频(11172-2)、音频(11172-3)的压缩,以及音频、视频和相关数据流(11172-1)的同步,总比特率约为1.5 Mbit/sec。MPEG/audio标准还可以用于纯音频应用程序,以更低的比特率压缩高保真音频数据。虽然MPEG/音频压缩算法是有损的,但通常它可以提供“透明”的、感知上无损的压缩,即使压缩系数为6比1或更高。该算法通过利用人耳的感知弱点来工作。本文还将涵盖心理声学建模的基础知识以及MPEG/音频算法用于压缩音频数据的方法,使其具有最小的可感知退化。
{"title":"An Overview of the MPEG/audio Compression Algorithm","authors":"D. Pan","doi":"10.1117/12.174960","DOIUrl":"https://doi.org/10.1117/12.174960","url":null,"abstract":"This paper gives a summary of the MPEG/audio compression algorithm. This algorithm was developed by the Motion Picture Experts Group (MPEG), as an International Organization for Standardization standard for the high fidelity compression of digital audio. The MPEG/audio compression standard is one part of a multiple part standard that addresses the compression of video (11172-2), the compression of audio (11172-3), and the synchronization of the audio, video, and related data streams (11172-1) to an aggregate bit rate of about 1.5 Mbit/sec. The MPEG/audio standard also can be used for audio-only applications to compress high fidelity audio data at much lower bit rates. While the MPEG/audio compression algorithm is lossy, often it can provide `transparent', perceptually lossless, compression even with compression factors of 6-to-1 or more. The algorithm works by exploiting the perceptual weaknesses of the human ear. This paper also will cover the basics of psychoacoustic modeling and the methods used by the MPEG/audio algorithm to compress audio data with least perceptible degradation.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1994-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126514097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Acoustic Echo Cancellation for Stereophonic Teleconferencing 立体声电话会议的声学回声消除
M. Mohan Sondhi, D. Morgan
In long‐distance telephony, echoes arise due to impedance mismatches at various points in the telephone circuit. Adaptive line echo cancelers have been used successfully for over a decade to combat this problem. Echoes also arise in teleconferencing, due to acoustic coupling between microphone and loudspeaker in each conference room. This problem is similar to the line echo problem; however, the echo paths are much longer and much more variable in this case. In this paper a further complication that arises if stereophonic transmission is used for teleconferencing is discussed: There is an inherent nonuniqueness in estimating the echo paths. It appears that the only way to resolve this nonuniqueness is by somehow decorrelating the signals in the two stereo channels. Several methods of decorrelation are discussed and how they affect adaptive echo canceller performance as well as stereophonic perception is shown.
在长途电话中,由于电话电路中各点的阻抗不匹配而产生回声。自适应线回波消除器已经成功地使用了十多年来解决这个问题。在电话会议中,由于每个会议室的麦克风和扬声器之间的声学耦合,也会产生回声。这个问题类似于线回波问题;然而,在这种情况下,回声路径要长得多,变化也大得多。在本文中,进一步的复杂性出现,如果使用立体声传输的电话会议进行了讨论:有一个固有的非唯一性在估计回波路径。似乎解决这种非唯一性的唯一方法是通过某种方式解除两个立体声通道中的信号的相关性。讨论了几种去相关的方法,并分析了它们对自适应回声消除性能和立体声感知的影响。
{"title":"Acoustic Echo Cancellation for Stereophonic Teleconferencing","authors":"M. Mohan Sondhi, D. Morgan","doi":"10.1109/ASPAA.1991.634135","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634135","url":null,"abstract":"In long‐distance telephony, echoes arise due to impedance mismatches at various points in the telephone circuit. Adaptive line echo cancelers have been used successfully for over a decade to combat this problem. Echoes also arise in teleconferencing, due to acoustic coupling between microphone and loudspeaker in each conference room. This problem is similar to the line echo problem; however, the echo paths are much longer and much more variable in this case. In this paper a further complication that arises if stereophonic transmission is used for teleconferencing is discussed: There is an inherent nonuniqueness in estimating the echo paths. It appears that the only way to resolve this nonuniqueness is by somehow decorrelating the signals in the two stereo channels. Several methods of decorrelation are discussed and how they affect adaptive echo canceller performance as well as stereophonic perception is shown.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1993-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132387416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 59
A Single-Input Hearing Aid Based on the Auditory Perceptual Features to Improve Speech Intelligibility in Noise 基于听觉感知特征的单输入助听器在噪声环境下提高语音清晰度
C. N. Canagarajah, P. Rayner
One of the main problems of the sensorineural hearing impaired listeners is the partial or complete loss of frequency selectivity. It is now well established that most of the auditory perceptual features are very well represented in the ear by the spectrum of the incoming sound signal. Thus the loss of frequency selectivity means that it is difficult for the impaired listener to discriminate between two sounds or to understand speech in a noisy environment. This handicap is referred to as the cocktail party eflect. It is widely accepted, and proven by the experiments carried out on impaired listeners, that one of the main causes for this impairment is the broad and tilted auditory filter shapes in the damaged cochlea compared to an undamaged normal ear. As a result, in noisy surroundings these broad filters allow more noise than a normal ear making detection of signal in noise difficult. Therefore to improve intelligibility a hearing aid must, not only suppress the noise in speech but also alleviate the problems of reduced frequency selectivity. There are a few hearing aids proposed in the literature to enhance speech in noise. Most of them are based on Adaptive noise cancellation or Adaptive beamforming principles. They have proved to be very useful in situations where there are few noise sources or when there is a reference noise available. Very often the environment contains many uncorrelated noise sources effectively creating a diffusive noise source. Hence obtaining a reference noise signal that is correlated with the noise in the other inputs is impossible. In these situations the above methods produce very little speech enhancement. There are many conventional single-input systems to suppress noise but like the multi-microphone methods mentioned above, they have proved to be of very little use in increasing the intelligibility of the speech for the hearing impaired. In this paper we illustrate how a single-input system incorporating the auditory perceptual features could be employed to increase intelligibility in hearing aids. Spectral Subtraction (SS) is an efficient way of reducing noise in single-input systems. In this method an estimate of the magnitude spectrum of the noise, #(U), is obtained during nonspeech activity and is subtracted from the magnitude spectrum of the noisy speech, X(w), to obtain the enhanced speech, S(u). This performs satisfactorily when the noise source is stationary. The main drawback of this system is it does not consider the problems of the hearing impaired and as a result is of very little benefit to them. Furthermore it introduces a residual or mwacal nobe in the processed speech. It is shown in this paper that by incorporating the perceptual features like masking and excitation patterns the above problems can be eliminated. The technique proposed here, firstly transforms the power (not magnitude) spectrum of the noisy speech (X(w)) into auditory excitation patterns, E(w). The auditory system consists of a
感音神经性听障听众的主要问题之一是部分或完全丧失频率选择性。现在已经确定,大多数听觉感知特征在耳朵中通过输入声音信号的频谱很好地表示出来。因此,频率选择性的丧失意味着受损的听者很难区分两种声音或在嘈杂的环境中理解讲话。这种缺陷被称为鸡尾酒会反射。人们普遍接受并通过对受损听者进行的实验证明,造成这种损害的主要原因之一是受损耳蜗与未受损的正常耳朵相比,听觉过滤器形状较宽且倾斜。因此,在嘈杂的环境中,这些宽滤波器比普通耳朵允许更多的噪声,使得在噪声中检测信号变得困难。因此,为了提高可听性,助听器不仅要抑制语音中的噪声,还要缓解频率选择性降低的问题。文献中提出了几种助听器来增强噪声环境下的语音。它们大多基于自适应噪声消除或自适应波束形成原理。事实证明,在噪声源很少或有参考噪声可用的情况下,它们非常有用。通常情况下,环境中包含许多不相关的噪声源,从而有效地形成扩散噪声源。因此,获得与其他输入噪声相关的参考噪声信号是不可能的。在这些情况下,上述方法产生很少的语音增强。有许多传统的单输入系统来抑制噪声,但就像上面提到的多麦克风方法一样,它们已被证明在提高听力受损者的语音清晰度方面用处不大。在本文中,我们说明了如何一个单输入系统结合听觉感知特征可以用来提高可理解性的助听器。谱减法(SS)是单输入系统中一种有效的降噪方法。在该方法中,在非言语活动期间获得噪声的幅度谱估计#(U),并从噪声语音的幅度谱X(w)中减去,得到增强的语音S(U)。当噪声源静止时,这种方法的效果令人满意。这个系统的主要缺点是它没有考虑到听障人士的问题,因此对他们几乎没有好处。在处理后的语音中引入残差或残差信号。本文表明,通过结合掩蔽和激励模式等感知特征,可以消除上述问题。本文提出的技术首先将噪声语音(X(w))的功率谱(而不是幅度)转换为听觉激发模式(E(w))。听觉系统由一组以对数尺度(巴克尺度)的恒定带宽带通滤波器组成。通过将信号频谱与这些滤波器进行卷积得到激励模式。E(w)现在表示信号的功率谱,就像它被正常的耳朵处理过一样。这些激发模式使听力受损的人能够相当成功地对频率成分进行分组,从而增加了他们的频率选择性,尽管他们的听觉系统中有广泛的过滤器。这种转换还消除了受损耳朵可能处理过的不需要的噪音和语音。
{"title":"A Single-Input Hearing Aid Based on the Auditory Perceptual Features to Improve Speech Intelligibility in Noise","authors":"C. N. Canagarajah, P. Rayner","doi":"10.1109/ASPAA.1991.634123","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634123","url":null,"abstract":"One of the main problems of the sensorineural hearing impaired listeners is the partial or complete loss of frequency selectivity. It is now well established that most of the auditory perceptual features are very well represented in the ear by the spectrum of the incoming sound signal. Thus the loss of frequency selectivity means that it is difficult for the impaired listener to discriminate between two sounds or to understand speech in a noisy environment. This handicap is referred to as the cocktail party eflect. It is widely accepted, and proven by the experiments carried out on impaired listeners, that one of the main causes for this impairment is the broad and tilted auditory filter shapes in the damaged cochlea compared to an undamaged normal ear. As a result, in noisy surroundings these broad filters allow more noise than a normal ear making detection of signal in noise difficult. Therefore to improve intelligibility a hearing aid must, not only suppress the noise in speech but also alleviate the problems of reduced frequency selectivity. There are a few hearing aids proposed in the literature to enhance speech in noise. Most of them are based on Adaptive noise cancellation or Adaptive beamforming principles. They have proved to be very useful in situations where there are few noise sources or when there is a reference noise available. Very often the environment contains many uncorrelated noise sources effectively creating a diffusive noise source. Hence obtaining a reference noise signal that is correlated with the noise in the other inputs is impossible. In these situations the above methods produce very little speech enhancement. There are many conventional single-input systems to suppress noise but like the multi-microphone methods mentioned above, they have proved to be of very little use in increasing the intelligibility of the speech for the hearing impaired. In this paper we illustrate how a single-input system incorporating the auditory perceptual features could be employed to increase intelligibility in hearing aids. Spectral Subtraction (SS) is an efficient way of reducing noise in single-input systems. In this method an estimate of the magnitude spectrum of the noise, #(U), is obtained during nonspeech activity and is subtracted from the magnitude spectrum of the noisy speech, X(w), to obtain the enhanced speech, S(u). This performs satisfactorily when the noise source is stationary. The main drawback of this system is it does not consider the problems of the hearing impaired and as a result is of very little benefit to them. Furthermore it introduces a residual or mwacal nobe in the processed speech. It is shown in this paper that by incorporating the perceptual features like masking and excitation patterns the above problems can be eliminated. The technique proposed here, firstly transforms the power (not magnitude) spectrum of the noisy speech (X(w)) into auditory excitation patterns, E(w). The auditory system consists of a ","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1991-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121661588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Models of Pitch Perception 音高感知模型
Tim Aarset, B. Gold
Abstract : Two pitch perception modeling algorithms are described. The first algorithm models periodicity pitch perception, and the second algorithm models place pitch perception. The two models are now applied to various psychoacoustic stimuli. Both periodicity and place models yield results that are in general agreement with psychoacoustic measurements for the missing fundamental and for inharmonic stimuli. The place algorithm proved to be a better approximation than periodicity for processing comb-filtered noise. Periodicity was more successful for periodic pulse train stimuli.
摘要:介绍了两种基音感知建模算法。第一种算法建立周期性基音感知模型,第二种算法建立位置基音感知模型。这两个模型现在应用于各种心理声刺激。周期性和位置模型产生的结果与缺失基波和非谐波刺激的心理声学测量结果基本一致。在处理梳滤波噪声时,位置算法比周期算法具有更好的逼近性。对于周期性脉冲序列刺激,周期性更成功。
{"title":"Models of Pitch Perception","authors":"Tim Aarset, B. Gold","doi":"10.21437/Eurospeech.1991-300","DOIUrl":"https://doi.org/10.21437/Eurospeech.1991-300","url":null,"abstract":"Abstract : Two pitch perception modeling algorithms are described. The first algorithm models periodicity pitch perception, and the second algorithm models place pitch perception. The two models are now applied to various psychoacoustic stimuli. Both periodicity and place models yield results that are in general agreement with psychoacoustic measurements for the missing fundamental and for inharmonic stimuli. The place algorithm proved to be a better approximation than periodicity for processing comb-filtered noise. Periodicity was more successful for periodic pulse train stimuli.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1991-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127099508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Perceptual Linear Predictive (PLP) Analysis-Resynthesis Technique 感知线性预测(PLP)分析-再合成技术
H. Hermansky, L. Cox
A common wisdom in speech re-synthesis is that while the vocal tract excitation can be modified to represent the message prosody, the accurate preservation of the formants is needed in order to ensure that both the linguistic message and the speaker-dependent information is well represented in the synthesized speech. Formants are speaker-dependent. A further decomposition of the formant-based speech representation into its message-bearing and the speaker-dependent parts and the inverse problem of combining those two sources of speech information is of interest. The current paper addresses this issues.
在语音重新合成中,一个普遍的观点是,虽然可以修改声道激发来表示信息韵律,但需要准确地保留共振峰,以确保在合成的语音中既能很好地表示语言信息,也能很好地表示与说话人相关的信息。共振峰依赖于说话者。将基于共振峰的语音表示进一步分解为其消息承载部分和依赖于说话人的部分,以及将这两个语音信息源结合起来的逆问题是我们感兴趣的。本文解决了这个问题。
{"title":"Perceptual Linear Predictive (PLP) Analysis-Resynthesis Technique","authors":"H. Hermansky, L. Cox","doi":"10.21437/Eurospeech.1991-88","DOIUrl":"https://doi.org/10.21437/Eurospeech.1991-88","url":null,"abstract":"A common wisdom in speech re-synthesis is that while the vocal tract excitation can be modified to represent the message prosody, the accurate preservation of the formants is needed in order to ensure that both the linguistic message and the speaker-dependent information is well represented in the synthesized speech. Formants are speaker-dependent. A further decomposition of the formant-based speech representation into its message-bearing and the speaker-dependent parts and the inverse problem of combining those two sources of speech information is of interest. The current paper addresses this issues.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1991-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130814242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 37
Subjective Assessments Of Low Bit-rate Audio Codecs 低比特率音频编解码器的主观评价
C. Grewin, T. Rydén
Subjective assessments or listening tests have always been an important part in the evaluation of audio equipment, maybe even more so today. For certain types of digital audio equipment there are no adequate methods of objective measurements available. This is certainly true for advanced bitrate reduction systems. Subjective assessments therefore play a very important role in the choice of a source coding algorithm for DAB, Digital Audio Broadcasting.
主观评估或听力测试一直是音频设备评估的重要组成部分,今天可能更加重要。对于某些类型的数字音频设备,没有适当的客观测量方法。对于高级比特率降低系统来说,这当然是正确的。因此,主观评价在选择数字音频广播(DAB)的源编码算法时起着非常重要的作用。
{"title":"Subjective Assessments Of Low Bit-rate Audio Codecs","authors":"C. Grewin, T. Rydén","doi":"10.1109/ASPAA.1991.634117","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634117","url":null,"abstract":"Subjective assessments or listening tests have always been an important part in the evaluation of audio equipment, maybe even more so today. For certain types of digital audio equipment there are no adequate methods of objective measurements available. This is certainly true for advanced bitrate reduction systems. Subjective assessments therefore play a very important role in the choice of a source coding algorithm for DAB, Digital Audio Broadcasting.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1991-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126342060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Error surfaces and adaption behaviour of active noise control systems 主动噪声控制系统的误差曲面和自适应行为
Stuart J Fxockton
Input mise characteristics Knowledge of the geometry of the error surface is essential to the understanding of any adaptive system, and especially for one using any form of gradient descent algorithm. Most adaptive active noise control systems in the published literature use one form or another of the LMS adaptive algorithm (either in its standard non-recursive form [ 13 or in Feintuch's extension to the recursive form [2]). Because this algorithm is an approximation to a steepest descent algorithm its performance is very strongly affwted by the {gradient of the error surface and if the eigenvalues of the performance surface have substantially differing magnitudes the cmnvergence rate that can be achieved is poor. The comparative simplicity of implementation of the algorithm, however, has so far been sufficient to make it the preferred candidate i n real systems. Number of Linearity of Feedback Acoustic control transmission from reverberation channels paths secondary present sowce(s) to &tector(s) It is a problem with many real systems that the dimensionality of the error surface is so great as to make it rather difficult to perceive their character. However in many cases the essence of the system can be captured using a grossly simplified model with only a few coefficients in the adaptive system (and hence an error surface whose dimension is reasonably small). Input mise characteristics sinusoidal quasi-stationary periodic non-stationary random The following parameters may be used to separate active noise control systems into classes having different complexities. The variety of these classes is indicated in the following (rather arbitrary) table; in each case the complexity will generally increase from top to bottom of a column. Each column is of course independent of all the others so the table indicates that there are perhaps 324 significantly different complexities of active noise control system. Number of Linearity of Feedback Acoustic control transmission from reverberation channels paths secondary present sowce(s) to &tector(s)
对于任何自适应系统的理解,特别是对于使用任何形式的梯度下降算法的系统,误差曲面的几何知识都是必不可少的。在已发表的文献中,大多数自适应有源噪声控制系统使用LMS自适应算法的一种或另一种形式(要么是其标准的非递归形式[13],要么是Feintuch对递归形式的扩展[2])。由于该算法近似于最陡下降算法,其性能受到误差曲面梯度的强烈影响,如果性能曲面的特征值具有显著不同的大小,则可以实现的收敛率很差。然而,到目前为止,该算法的实现相对简单,足以使其成为实际系统中的首选候选算法。反馈声控制从混响通道、二次声源到检波器的传播线性数在实际系统中,误差曲面的维数非常大,以致难于察觉其特性。然而,在许多情况下,系统的本质可以使用一个粗略简化的模型来捕获,在自适应系统中只有几个系数(因此误差曲面的尺寸相当小)。输入模态特征正弦准平稳周期非平稳随机下列参数可用于将有源噪声控制系统划分为具有不同复杂程度的类别。这些类别的多样性显示在下面的表格中(相当随意);在每种情况下,复杂性通常会从列的顶部到底部增加。当然,每一列都是独立于其他列的,因此该表表明,主动噪声控制系统的复杂性可能有324个显著不同。反馈声控制从混响通道路径到二次声源到检波器的线性数
{"title":"Error surfaces and adaption behaviour of active noise control systems","authors":"Stuart J Fxockton","doi":"10.1109/ASPAA.1991.634142","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634142","url":null,"abstract":"Input mise characteristics Knowledge of the geometry of the error surface is essential to the understanding of any adaptive system, and especially for one using any form of gradient descent algorithm. Most adaptive active noise control systems in the published literature use one form or another of the LMS adaptive algorithm (either in its standard non-recursive form [ 13 or in Feintuch's extension to the recursive form [2]). Because this algorithm is an approximation to a steepest descent algorithm its performance is very strongly affwted by the {gradient of the error surface and if the eigenvalues of the performance surface have substantially differing magnitudes the cmnvergence rate that can be achieved is poor. The comparative simplicity of implementation of the algorithm, however, has so far been sufficient to make it the preferred candidate i n real systems. Number of Linearity of Feedback Acoustic control transmission from reverberation channels paths secondary present sowce(s) to &tector(s) It is a problem with many real systems that the dimensionality of the error surface is so great as to make it rather difficult to perceive their character. However in many cases the essence of the system can be captured using a grossly simplified model with only a few coefficients in the adaptive system (and hence an error surface whose dimension is reasonably small). Input mise characteristics sinusoidal quasi-stationary periodic non-stationary random The following parameters may be used to separate active noise control systems into classes having different complexities. The variety of these classes is indicated in the following (rather arbitrary) table; in each case the complexity will generally increase from top to bottom of a column. Each column is of course independent of all the others so the table indicates that there are perhaps 324 significantly different complexities of active noise control system. Number of Linearity of Feedback Acoustic control transmission from reverberation channels paths secondary present sowce(s) to &tector(s)","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116378892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IRIS Indigo Audio and Graphics Programming Environment A Case Study using Csouncl and Scrub IRIS Indigo音频和图形编程环境—使用Csouncl和Scrub的案例研究
P. Lacombe
Multimedia is coming of age, and 16 bit, 44.1kHz audio will soon be as common as the mouse. The IRIS Indigo represents the first Advanced Computing Environment (ACE) compatible workstation (ACE consortium members at last count was greater than forty). There are still many issues to be resolved, like standard file format:;, libraries, and synchronization, etc.. However the tools exist, and the open systems environment permits the industry to leverage technology through combined efforts. This paper presents one such environment. Outline IRIS Indigo architecture, followed by brief descriptions of the Audio and Graphics libraries, concluding with two audio applications presented as a case study in software development Csoundm and Scrub. Hardware The IRIS Indigo is a MIPS R3000/R3010 based, 56001 coprocessor, graphics/audio workstation (biendiari). The audio subsystem consists of a 56001, 32K x 24 bits SRAM, 16-bit stereo 64x oversampling delta-sigma ADC, ‘18-bit stereo 8x oversampled DAC, third-order filtering, MDAC attenuator software controlled, and IEC958/AES3 digital I/O. Supported sampling rates are 29.4, 32, 44.1, 48kHz, and any of these divided by integers 2 through 8. Audio Library 1.0 The basic construct implemented in release 1.0 of the Audio Library are audio ports. Programmers open ports to listen or generate sounds. These ports have intermediate buffers to relax real-time O/S and program requirements. Audio ports may be configured to different buffer sizes, sample widths, number of channels (1,2) and two configuration management calls. Hardware state parameters control the ports sampling rate, gain and input source. The audio library is currently designed around goals similar to our early implementations of the Graphics Library. Simplicity So the programmer can quickly learn how to use it. Completeness If the hardware can do it,. the library should let you. Efficiency Close enough to the metal to be optimum, yet not stifle hardware evolution. Graphics Library 4.0 The Graphics Library has evolved over the past ten years, at least eight graphics architectures (known to the author), and a CISC to RISC migration, with major efforts placed on minimizing obsolete Csound is a trademark of MIT functions. There are currently over 300 functions ranging from drawing primitives, text, modeling transformations, and performing raster operations, to the more esoteric functions like alpha-blending, fog and haze, lighting, NURBS, stencil planes, overlays, underlays, accumulation buffer, and zbuffer support.
多媒体即将成熟,16位、44.1kHz的音频将很快像鼠标一样普及。IRIS Indigo代表了第一个与高级计算环境(ACE)兼容的工作站(ACE联盟成员在最近的统计中超过了40个)。还有许多问题需要解决,比如标准文件格式、库和同步等。然而,工具是存在的,并且开放的系统环境允许业界通过联合努力来利用技术。本文提出了一个这样的环境。概述IRIS Indigo架构,然后简要描述音频和图形库,最后介绍两个音频应用程序,作为软件开发中的案例研究:Csoundm和Scrub。IRIS Indigo是基于MIPS R3000/R3010的56001协同处理器,图形/音频工作站(biendiari)。音频子系统由56001、32K × 24位SRAM、16位立体声64x过采样delta-sigma ADC、18位立体声8x过采样DAC、三阶滤波、MDAC衰减器软件控制和IEC958/AES3数字I/O组成。支持的采样率是29.4、32、44.1、48kHz,并将其中任何一个除以整数2到8。Audio Library 1.0版本中实现的基本结构是音频端口。程序员打开端口来监听或生成声音。这些端口具有中间缓冲区,以放松实时O/S和程序要求。音频端口可以配置为不同的缓冲区大小、采样宽度、通道数量(1、2)和两个配置管理调用。硬件状态参数控制端口的采样率、增益和输入源。音频库目前的设计目标类似于我们早期实现的图形库。所以程序员可以很快学会如何使用它。完整性如果硬件可以做到,。图书馆应该会允许的。效率足够接近金属,达到最佳状态,但又不会扼杀硬件的进化。图形库在过去的十年中已经发展,至少有八种图形架构(作者已知),以及从CISC到RISC的迁移,主要的努力是最小化过时的Csound是MIT功能的标志。目前有超过300个功能,从绘制原语、文本、建模转换和执行光栅操作,到更深奥的功能,如alpha混合、雾和雾霾、照明、NURBS、模板平面、覆盖、底层、累积缓冲区和zbuffer支持。
{"title":"IRIS Indigo Audio and Graphics Programming Environment A Case Study using Csouncl and Scrub","authors":"P. Lacombe","doi":"10.1109/ASPAA.1991.634133","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634133","url":null,"abstract":"Multimedia is coming of age, and 16 bit, 44.1kHz audio will soon be as common as the mouse. The IRIS Indigo represents the first Advanced Computing Environment (ACE) compatible workstation (ACE consortium members at last count was greater than forty). There are still many issues to be resolved, like standard file format:;, libraries, and synchronization, etc.. However the tools exist, and the open systems environment permits the industry to leverage technology through combined efforts. This paper presents one such environment. Outline IRIS Indigo architecture, followed by brief descriptions of the Audio and Graphics libraries, concluding with two audio applications presented as a case study in software development Csoundm and Scrub. Hardware The IRIS Indigo is a MIPS R3000/R3010 based, 56001 coprocessor, graphics/audio workstation (biendiari). The audio subsystem consists of a 56001, 32K x 24 bits SRAM, 16-bit stereo 64x oversampling delta-sigma ADC, ‘18-bit stereo 8x oversampled DAC, third-order filtering, MDAC attenuator software controlled, and IEC958/AES3 digital I/O. Supported sampling rates are 29.4, 32, 44.1, 48kHz, and any of these divided by integers 2 through 8. Audio Library 1.0 The basic construct implemented in release 1.0 of the Audio Library are audio ports. Programmers open ports to listen or generate sounds. These ports have intermediate buffers to relax real-time O/S and program requirements. Audio ports may be configured to different buffer sizes, sample widths, number of channels (1,2) and two configuration management calls. Hardware state parameters control the ports sampling rate, gain and input source. The audio library is currently designed around goals similar to our early implementations of the Graphics Library. Simplicity So the programmer can quickly learn how to use it. Completeness If the hardware can do it,. the library should let you. Efficiency Close enough to the metal to be optimum, yet not stifle hardware evolution. Graphics Library 4.0 The Graphics Library has evolved over the past ten years, at least eight graphics architectures (known to the author), and a CISC to RISC migration, with major efforts placed on minimizing obsolete Csound is a trademark of MIT functions. There are currently over 300 functions ranging from drawing primitives, text, modeling transformations, and performing raster operations, to the more esoteric functions like alpha-blending, fog and haze, lighting, NURBS, stencil planes, overlays, underlays, accumulation buffer, and zbuffer support.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129341398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reconstruction of Overlapping Echos with Unknown Shape in Time and Frequency Domain 未知形状重叠回波的时频域重构
K. Gork, D. Guicking
In many situations one h i s to resolve an unknown number of overlapping and noisy echos of a signal. For this problem a numerical procedure 111 was developed which works as a modified version of the MUSIC algorithm [2,3]. The shape of the signal s’(t) must be known to estimate the number D and the t:ime delays fd of the echos. If K independent records are collected, all differing in the amplitudes mk,d of the echos qt fd) and the noise realization < k ( t ) , then the received and sampled waveforms can be written as
在许多情况下,人们需要解决未知数量的重叠和噪声回波的信号。针对这一问题,开发了一个数值过程111,作为MUSIC算法的修改版本[2,3]。必须知道信号s ' (t)的形状,才能估计回波的数量D和时间延迟fd。如果收集到K条独立的记录,回声的振幅mk,d qt fd)不同,噪声实现< K (t),则接收到的和采样到的波形可以写成
{"title":"Reconstruction of Overlapping Echos with Unknown Shape in Time and Frequency Domain","authors":"K. Gork, D. Guicking","doi":"10.1109/ASPAA.1991.634137","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634137","url":null,"abstract":"In many situations one h i s to resolve an unknown number of overlapping and noisy echos of a signal. For this problem a numerical procedure 111 was developed which works as a modified version of the MUSIC algorithm [2,3]. The shape of the signal s’(t) must be known to estimate the number D and the t:ime delays fd of the echos. If K independent records are collected, all differing in the amplitudes mk,d of the echos qt fd) and the noise realization < k ( t ) , then the received and sampled waveforms can be written as","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127709117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple Model Adaptive Systems For Active Noise Attenuation 主动噪声衰减的多模型自适应系统
H. Nam, S. Elliott
The characteristics of :most active control systems change with time. In particular, the characteristics of the transfer functions between the secondary loudspeakers and error slensors (the "secondary path") can be time-varying. In many situations, an adaptive scheme to estimate these transfer functions is needed. This is in addition to the adaptive filter implementing the controller. Most adaptive control filters have used FIR structures based on filtered-x LMS algorithms. :Recently, Eriksson er al [ 11 showed that IIR structures are more desirable for the active control of duct noise in order to remove the poles introduced by the acoustic feedback and presented an algorithm to adjust the coefficients of an IIR filter using the recursive least mean square: (RLMS) algorithm of Feintuch [2]. Since both of these approaches require knowledge of the secondary path transfer function, some adaptive algorithms which simultaneoiisly estimate the transfer function of a secondary path have been presented [1,3]. Such adaptive techniques have a tendency to diverge when the parameters vary rapidly and it is difficullt to apply them to the multiple sensor multiple speaker cases [4] because there are too many parameters to be estimated in each step. We present a new algorithm using multiple models to reduce the tendency to diverge compared with previous adaptive algorithms under time-varying conditions. Since this approach requires only a small amount of computation, it may also be used in the multiple channel case. The block diagxim of the multiple model adaptive control (MMAC) technique for noise attenuation is shown in Figure 1.
大多数主动控制系统的特性随时间而变化。特别是,次级扬声器和误差传感器(“次级路径”)之间的传递函数的特性可以是时变的。在许多情况下,需要一种自适应方案来估计这些传递函数。这是对实现控制器的自适应滤波器的补充。大多数自适应控制滤波器都使用基于filtered-x LMS算法的FIR结构。最近,Eriksson等[11]表明,IIR结构更适合于主动控制管道噪声,以消除声反馈引入的极点,并提出了一种使用Feintuch[2]的递推最小均方(RLMS)算法来调整IIR滤波器系数的算法。由于这两种方法都需要了解辅助路径传递函数,因此已经提出了一些同时估计辅助路径传递函数的自适应算法[1,3]。这种自适应技术在参数快速变化时容易出现发散,并且由于每一步需要估计的参数太多,难以应用于多传感器多扬声器的情况。在时变条件下,与已有的自适应算法相比,本文提出了一种新的多模型自适应算法。由于这种方法只需要少量的计算,因此它也可以用于多通道的情况。多模型自适应控制(MMAC)降噪技术的框图如图1所示。
{"title":"Multiple Model Adaptive Systems For Active Noise Attenuation","authors":"H. Nam, S. Elliott","doi":"10.1109/ASPAA.1991.634146","DOIUrl":"https://doi.org/10.1109/ASPAA.1991.634146","url":null,"abstract":"The characteristics of :most active control systems change with time. In particular, the characteristics of the transfer functions between the secondary loudspeakers and error slensors (the \"secondary path\") can be time-varying. In many situations, an adaptive scheme to estimate these transfer functions is needed. This is in addition to the adaptive filter implementing the controller. Most adaptive control filters have used FIR structures based on filtered-x LMS algorithms. :Recently, Eriksson er al [ 11 showed that IIR structures are more desirable for the active control of duct noise in order to remove the poles introduced by the acoustic feedback and presented an algorithm to adjust the coefficients of an IIR filter using the recursive least mean square: (RLMS) algorithm of Feintuch [2]. Since both of these approaches require knowledge of the secondary path transfer function, some adaptive algorithms which simultaneoiisly estimate the transfer function of a secondary path have been presented [1,3]. Such adaptive techniques have a tendency to diverge when the parameters vary rapidly and it is difficullt to apply them to the multiple sensor multiple speaker cases [4] because there are too many parameters to be estimated in each step. We present a new algorithm using multiple models to reduce the tendency to diverge compared with previous adaptive algorithms under time-varying conditions. Since this approach requires only a small amount of computation, it may also be used in the multiple channel case. The block diagxim of the multiple model adaptive control (MMAC) technique for noise attenuation is shown in Figure 1.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130024314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1