Structured encoding of the singing voice using prior knowledge of the musical score
Y.E. Kim
Pub Date: 1999-10-17. DOI: 10.1109/ASPAA.1999.810846
Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA'99)
The human voice is the most difficult musical instrument to simulate convincingly. Yet a great deal of progress has been made in voice coding: the parameterization and re-synthesis of a source signal according to an assumed voice model. Source-filter models of the human voice, particularly linear predictive coding (LPC), are the basis of most low-bit-rate speech coding techniques in use today. This paper introduces a technique for coding the singing voice that uses LPC together with prior knowledge of the musical score to aid the encoding process, reducing the amount of data required to represent the voice. This approach moves the singing voice closer to a structured-audio model in which musical parameters such as pitch, duration, and phonemes are represented orthogonally to the synthesis technique and can therefore be modified prior to re-synthesis.
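The LPC analysis/re-synthesis loop that underlies such a coder can be sketched as follows. This is a minimal illustration only: the synthetic vowel-like source, model order, and sample rate are arbitrary choices, and the paper's score-aware encoding is not shown.

```python
import numpy as np

def lpc_coefficients(frame, order):
    """Autocorrelation-method LPC: solve the normal equations R a = r."""
    r = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += 1e-6 * np.eye(order)  # mild regularization for numerical safety
    return np.linalg.solve(R, r[1:order + 1])  # predictor taps a[0..order-1]

fs, f0, order = 8000, 200, 10
n = np.arange(2048)
# Crude glottal-like source: a pulse train coloured by a one-pole "vocal tract"
x = (n % (fs // f0) == 0).astype(float)
for k in range(1, len(x)):
    x[k] += 0.9 * x[k - 1]

a = lpc_coefficients(x, order)

# Residual via inverse filtering: e[n] = x[n] - sum_k a[k] x[n-1-k]
e = x.copy()
for k in range(order, len(x)):
    e[k] = x[k] - a @ x[k - order:k][::-1]

# Re-synthesis: drive the all-pole filter 1/A(z) with the residual
y = e.copy()
for k in range(order, len(y)):
    y[k] = e[k] + a @ y[k - order:k][::-1]

print(np.max(np.abs(x[order:] - y[order:])))  # near-zero reconstruction error
```

Because the synthesis filter is the exact inverse of the analysis filter, the residual-driven re-synthesis reconstructs the input; a coder gains compression by quantizing the coefficients and modelling the residual instead of transmitting it.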
Psychoacoustical excitation of the (N)LMS algorithm for acoustical system identification
M. Peters
Pub Date: 1999-10-17. DOI: 10.1109/ASPAA.1999.810887
This paper presents an algorithm, and an implementation, that uses orthogonal perfect-correlation sequences for acoustical system identification while exploiting psychoacoustical masking effects. To this end, the common NLMS algorithm is modified to incorporate hidden orthogonal Ipatov and Huffman sequences for fast system identification. Using this method, the speed and accuracy of the identification of the loudspeaker-room-microphone (LRM) system are increased, and the overall performance of echo and noise cancellation is improved.
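The NLMS update that the paper modifies can be sketched in its plain form: a generic NLMS identifier of an unknown FIR path. The random "room" response, white excitation, filter length, and step size are illustrative stand-ins, not the authors' Ipatov/Huffman-excited variant.

```python
import numpy as np

rng = np.random.default_rng(0)

h = rng.standard_normal(16)           # unknown LRM impulse response (FIR stand-in)
x = rng.standard_normal(20000)        # excitation signal
d = np.convolve(x, h)[:len(x)]        # microphone signal (noise-free for clarity)

w = np.zeros_like(h)                  # adaptive filter estimate
mu, eps = 0.5, 1e-8
for n in range(len(h), len(x)):
    u = x[n:n - len(h):-1]            # most recent len(h) input samples, newest first
    e = d[n] - w @ u                  # a-priori error
    w += mu * e * u / (u @ u + eps)   # normalized LMS update

print(np.linalg.norm(w - h) / np.linalg.norm(h))  # small misalignment after adaptation
```

The normalization by the input energy `u @ u` is what makes the step size robust to the excitation level, which is why the choice of excitation sequence (the paper's contribution) can be varied independently of the adaptation rule.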
Immersion and content: a framework for audio research
M. Karjalainen
Pub Date: 1999-10-17. DOI: 10.1109/ASPAA.1999.810852
Audio technology is rapidly expanding in many directions. In addition to the recording, transmission, storage, and reproduction of sounds, it supports practically unlimited modification and generation of sounds and their properties, making it possible to create ever more immersive virtual soundscapes. Audio-related information is also no longer treated merely as signals or data: sound can be analyzed ever more deeply, approaching its content. This article discusses a framework in which modern audio signal and information processing can be placed so as to see more clearly the explored and unexplored realms of research. It is the author's personal framework, one that has helped shape research in the Laboratory of Acoustics and Audio Signal Processing at the Helsinki University of Technology, and some of its ideas and visions may be useful to others working in the field.
Bayesian single channel blind deconvolution using parametric signal and channel models
J. Hopgood, P. Rayner
Pub Date: 1999-10-01. DOI: 10.1109/ASPAA.1999.810872
This paper considers single-channel blind deconvolution, in which a degraded observed signal is modelled as the convolution of a non-stationary source signal with a stationary distortion operator. The source signal is recovered from the observation by modelling the source as a time-varying autoregressive process and the distortion operator as an IIR filter, and then using a Bayesian framework to estimate the parameters of the distorting filter, which can in turn be used to deconvolve the observed signal. The paper also discusses how the non-stationarity of the source signal allows the distortion operator to be uniquely identified.
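The signal model can be sketched as a forward simulation: a time-varying AR(2) source passed through a fixed all-pole channel. The parameter values below are assumed for illustration, and the paper's Bayesian estimation of the channel is not shown; the sketch only confirms that, once the channel is known, deconvolution is exact.

```python
import numpy as np

rng = np.random.default_rng(1)
N = 4000

# Time-varying AR(2) source: fixed pole radius, slowly drifting pole angle
s = np.zeros(N)
theta = np.linspace(0.2 * np.pi, 0.4 * np.pi, N)   # slowly varying resonance
for n in range(2, N):
    a1, a2 = 2 * 0.95 * np.cos(theta[n]), -0.95 ** 2
    s[n] = a1 * s[n - 1] + a2 * s[n - 2] + rng.standard_normal()

# Stationary distortion: a fixed all-pole (IIR) channel applied to the source
b1, b2 = 0.6, -0.2
x = np.zeros(N)
for n in range(N):
    x[n] = s[n] + (b1 * x[n - 1] if n >= 1 else 0.0) + (b2 * x[n - 2] if n >= 2 else 0.0)

# With the channel parameters known, the all-pole filter inverts exactly (FIR inverse)
s_hat = x - b1 * np.concatenate(([0.0], x[:-1])) - b2 * np.concatenate(([0.0, 0.0], x[:-2]))
print(np.max(np.abs(s - s_hat)))  # ~0: source recovered when the channel is known
```

The blind problem is precisely that `b1, b2` are unknown; the paper's contribution is estimating them from `x` alone, exploiting the source's non-stationarity to make the channel identifiable.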
Considering non-stationarity for blind signal separation
A. Ahmed, P. Rayner, S. Godsill
Pub Date: 1999-09-16. DOI: 10.1109/ASPAA.1999.810862
We investigate the exploitation of non-stationarity for signal separation. A second-order decorrelation method is used to separate synthetic independent autoregressive signals, built from stationary blocks, that have been convolutively mixed. We compare results obtained with and without taking the non-stationarity into account: under certain conditions, exploiting non-stationarity yields more robust separation, and we present simulation results that support this. In addition, we apply the decorrelation method to real microphone signals to see how exploiting non-stationarity affects separation quality.
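The second-order principle can be sketched for the simpler instantaneous (rather than convolutive) case: non-stationary sources have different power ratios in different blocks, so jointly diagonalizing two block covariance matrices identifies the unmixing matrix. The mixing matrix and block structure below are illustrative, not the authors' convolutive algorithm.

```python
import numpy as np

rng = np.random.default_rng(2)
N = 10000

# Two independent non-stationary sources: powers swap between the two halves
s = np.vstack([
    np.concatenate([1.0 * rng.standard_normal(N), 0.2 * rng.standard_normal(N)]),
    np.concatenate([0.2 * rng.standard_normal(N), 1.0 * rng.standard_normal(N)]),
])
A = np.array([[1.0, 0.6], [0.4, 1.0]])     # instantaneous mixing matrix
x = A @ s

# Covariances from two (locally stationary) blocks: R_k = A D_k A^T
R1 = np.cov(x[:, :N])
R2 = np.cov(x[:, N:])

# Eigenvectors of R1^{-1} R2 are the columns of A^{-T}; rows of V.T unmix
_, V = np.linalg.eig(np.linalg.solve(R1, R2))
V = V.real                                  # eigenvalues are real and distinct here
y = V.T @ x

# Each recovered signal should match one source up to scale, sign, and permutation
corr = np.abs(np.corrcoef(np.vstack([y, s]))[:2, 2:])
print(corr.round(3))
```

The identifiability condition is visible in the algebra: if the two blocks had identical source power ratios (i.e. the sources were stationary in that sense), the eigenvalues would coincide and the eigenvectors, hence the unmixing matrix, would not be unique.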
Polyphonic pitch tracking using joint Bayesian estimation of multiple frame parameters
Paul J. Walmsley, S. Godsill, P. Rayner
Pub Date: 1999-09-16. DOI: 10.1109/ASPAA.1999.810864
We present a novel approach to pitch estimation and note detection in polyphonic audio signals. We pose the problem in a Bayesian probabilistic framework, which allows us to incorporate prior knowledge about the nature of musical data into the model. We exploit the high correlation between model parameters in adjacent frames of data by explicitly modelling the frequency variation over time using latent variables. Parameters are estimated jointly across a number of adjacent frames to increase the robustness of the estimation against transient events. Individual frames of data are modelled as the sum of harmonic sinusoids. Parameter estimation is performed using Markov chain Monte Carlo (MCMC) methods.
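The per-frame model, a sum of harmonic sinusoids, can be sketched via a least-squares fit at a candidate fundamental. The frame length, partial count, and residual-minimization pitch pick below are illustrative stand-ins for the likelihood inside the paper's joint multi-frame MCMC estimation.

```python
import numpy as np

def harmonic_fit_residual(frame, f0, fs, n_partials):
    """Least-squares fit of sin/cos pairs at multiples of f0; returns residual energy."""
    t = np.arange(len(frame)) / fs
    cols = []
    for k in range(1, n_partials + 1):
        cols.append(np.cos(2 * np.pi * k * f0 * t))
        cols.append(np.sin(2 * np.pi * k * f0 * t))
    H = np.column_stack(cols)                       # harmonic basis matrix
    amps, *_ = np.linalg.lstsq(H, frame, rcond=None)
    return np.sum((frame - H @ amps) ** 2)

fs = 8000
t = np.arange(1024) / fs
# Test frame: four decaying partials of a 220 Hz tone
frame = sum(1.0 / k * np.sin(2 * np.pi * k * 220 * t) for k in range(1, 5))

# The residual is smallest at the true fundamental among the candidate pitches
candidates = [196.0, 220.0, 247.0]
residuals = [harmonic_fit_residual(frame, f0, fs, 8) for f0 in candidates]
print(candidates[int(np.argmin(residuals))])  # 220.0
```

Because the model is linear in the sinusoid amplitudes given the fundamental, those amplitudes can be fitted (or integrated out, as in the Bayesian treatment) cheaply, leaving the fundamental frequencies as the hard, nonlinear parameters to sample.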
Vocal interfaces to musical material
M. Kahrs
Pub Date: 1999-09-01. DOI: 10.1109/ASPAA.1999.810861
As the World Wide Web and the Internet become the dominant form of information distribution, consideration must be given to the indexing of musical material, including themes, melodies, rhythm tracks, and so forth. This paper describes the implementation of an algorithm for locating song titles from the vocal input of amateur singers. The prototype algorithm exceeds 90% accuracy across 9 different singers.
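A query-by-singing index of this kind can be sketched with pitch intervals and edit distance. The song database, interval representation, and matching rule below are hypothetical illustrations; the paper does not specify this particular algorithm.

```python
def edit_distance(a, b):
    """Classic dynamic-programming (Levenshtein) distance between interval sequences."""
    dp = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        prev, dp[0] = dp[0], i
        for j, y in enumerate(b, 1):
            prev, dp[j] = dp[j], min(dp[j] + 1, dp[j - 1] + 1, prev + (x != y))
    return dp[-1]

def intervals(midi_notes):
    """Transposition-invariant contour: successive pitch differences in semitones."""
    return [b - a for a, b in zip(midi_notes, midi_notes[1:])]

# Hypothetical database: song title -> note sequence (MIDI note numbers)
songs = {
    "Twinkle Twinkle": [60, 60, 67, 67, 69, 69, 67],
    "Frere Jacques":   [60, 62, 64, 60, 60, 62, 64, 60],
}

# A sung query: transposed up a tone and slightly off on the last two notes
query = [62, 62, 69, 69, 71, 70, 69]
best = min(songs, key=lambda t: edit_distance(intervals(query), intervals(songs[t])))
print(best)  # Twinkle Twinkle
```

Working on intervals rather than absolute pitches makes the match invariant to the key the amateur happens to sing in, and the edit distance absorbs the occasional wrong or extra note.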
SVD-based optimal filtering with applications to noise reduction in speech signals
S. Doclo, M. Moonen
Pub Date: 1999-10-17. DOI: 10.1109/ASPAA.1999.810870
A class of SVD-based signal enhancement procedures is described, which amounts to a specific optimal filtering technique for the case where the so-called 'desired response' signal cannot be observed. It is shown that this optimal filter can be written as a function of the generalized singular vectors and singular values of so-called speech and noise data matrices. A number of simple symmetry properties of the optimal filter are derived, valid for both the white-noise and the coloured-noise case. The averaging step of the standard one-microphone SVD-based noise reduction techniques is also investigated, raising serious doubts about its necessity. When this technique is applied to multi-microphone noise reduction, it is shown that for simple scenarios, with localised sources and no multipath propagation, it exhibits a kind of beamforming behaviour. We further compare its performance with standard beamforming techniques, showing that for all reverberation times the SVD-based optimal filter outperforms beamforming.
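The optimal filter at the heart of this family of methods can be sketched in plain covariance form: with the noisy-speech covariance R_y and a noise covariance R_n estimated from noise-only segments, the MMSE filter is W = R_y^{-1}(R_y - R_n), applied to stacked sample windows. The AR(1) "speech" and white noise below are illustrative; the paper computes the same filter numerically via the GSVD of speech and noise data matrices, which this sketch does not reproduce.

```python
import numpy as np

rng = np.random.default_rng(3)
N, L = 20000, 12          # number of samples and stacking (filter) length

# "Speech": a strongly correlated AR(1) process; noise: white, uncorrelated with it
s = np.zeros(N)
for n in range(1, N):
    s[n] = 0.95 * s[n - 1] + 0.3 * rng.standard_normal()
noise = rng.standard_normal(N)
y = s + noise

def stacked_cov(v, L):
    """Covariance of length-L sliding windows (the data-matrix formulation)."""
    X = np.lib.stride_tricks.sliding_window_view(v, L)
    return (X.T @ X) / len(X)

Ry = stacked_cov(y, L)
Rn = stacked_cov(noise, L)         # in practice estimated from noise-only frames

W = np.linalg.solve(Ry, Ry - Rn)   # MMSE filter: W = Ry^{-1} (Ry - Rn)

Y = np.lib.stride_tricks.sliding_window_view(y, L)
s_hat = (Y @ W)[:, -1]             # estimate of the newest sample in each window

err_before = np.mean((y[L - 1:] - s[L - 1:]) ** 2)
err_after = np.mean((s_hat - s[L - 1:]) ** 2)
print(err_after < err_before)      # True: the filter reduces the mean-square error
```

The identity R_y - R_n = R_s (speech and noise uncorrelated) is what lets the filter be built without ever observing the clean 'desired response' signal, which is exactly the situation the paper addresses.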