1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)最新文献

英文中文

A perturbation-based pre-processing algorithm for CELP-coders 基于微扰的celp编码器预处理算法

1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)

Pub Date : 1999-06-20 DOI: 10.1109/SCFT.1999.781515

J. Jensen, S. H. Jensen, E. Hansen

A novel pre-processing algorithm for CELP-coders is proposed. The algorithm aims at perturbing the original signal slightly, such that the perturbed signal is subjectively indistinguishable from the original but can be coded more effectively. A key feature of the algorithm is the possibility of controlling the frequency domain properties of the perturbations. Preliminary simulations with the proposed algorithm in combination with a CELP-like coder indicate improvements in terms of segmental SNR and subjective speech quality.

提出了一种新的celp编码器预处理算法。该算法旨在对原始信号进行轻微扰动，使扰动后的信号在主观上与原始信号无法区分，但可以更有效地进行编码。该算法的一个关键特点是可以控制扰动的频域特性。将该算法与类似celp的编码器结合进行的初步模拟表明，在分段信噪比和主观语音质量方面有所改善。

引用次数: 4

A new technique for wideband enhancement of coded narrowband speech 一种编码窄带语音的宽带增强新技术

1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)

Pub Date : 1999-06-20 DOI: 10.1109/SCFT.1999.781522

J. Epps, W. Holmes

Telephone speech is typically bandlimited to 4 kHz, resulting in a 'muffled' quality. Coding speech with a bandwidth greater than 4 kHz reduces this distortion, but requires a higher bit rate to avoid other types of distortion. An alternative to coding wider bandwidth speech is to exploit correlations between the 0-4 kHz and 4-8 kHz speech bands to re-synthesize wideband speech from decoded narrowband speech. This paper proposes a new technique for highband spectral envelope prediction, based upon codebook mapping with codebooks split by voicing. An objective comparison with several existing methods reveals that this new technique produces the smallest highband spectral distortion. Combined with a suitable highband excitation synthesis scheme, this envelope prediction scheme produces a significant quality improvement in speech that has been coded using narrowband standards.

电话语音的带宽通常限制在4千赫，导致“模糊”的质量。对带宽大于4khz的语音进行编码可以减少这种失真，但需要更高的比特率来避免其他类型的失真。编码更宽带宽语音的另一种方法是利用0-4 kHz和4-8 kHz语音带之间的相关性，从解码的窄带语音中重新合成宽带语音。本文提出了一种基于码本映射的高频段频谱包络预测新技术。与几种现有方法的客观比较表明，该方法产生的高频光谱失真最小。结合合适的高频段激励综合方案，该包络预测方案对使用窄带标准编码的语音产生了显著的质量改善。

引用次数: 82

An improved background noise coding mode for variable rate speech coders 一种改进的可变速率语音编码器的背景噪声编码模式

1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)

Pub Date : 1999-06-20 DOI: 10.1109/SCFT.1999.781509

K. El-Maleh, P. Kabal

In this paper, we present a novel background noise coding scheme for variable rate speech coders. Existing approaches to noise coding at very low bit rates (i.e. below 1 kbps) fail to faithfully reproduce background noise resulting in a degradation of the overall perceptual quality. In our approach, classification of the noise type is used to select the type of excitation to be used at the receiver. To illustrate the benefits of our scheme, we have modified the noise coding mode of the CDMA enhanced variable rate codec (EVRC) to include the proposed class-dependent noise excitation model. Evaluation tests have shown that we have improved the overall quality with the proposed noise coding scheme without an increase in bit rate.

本文提出了一种新的用于可变速率语音编码器的背景噪声编码方案。现有的非常低比特率(即低于1kbps)的噪声编码方法不能忠实地再现背景噪声，从而导致整体感知质量的下降。在我们的方法中，噪声类型的分类用于选择在接收器上使用的激励类型。为了说明该方案的优点，我们修改了CDMA增强可变速率编解码器(EVRC)的噪声编码模式，以包含所提出的类相关噪声激励模型。评估测试表明，我们在不增加比特率的情况下提高了噪声编码方案的整体质量。

引用次数: 2

SD optimization of spectral coders 频谱编码器的SD优化

1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)

Pub Date : 1999-06-20 DOI: 10.1109/SCFT.1999.781473

P. Hedelin, F. Nordén, J. Skoglund

In spectral coding of speech, several different criteria are in use for designing and evaluating quantizers. One measure, spectral distortion (SD), has become dominant for comparisons between coders. At run-time, a coder normally quantizes vectors according to other measures, e.g. line spectrum frequency (LSF) distance, in order to keep computational complexity down. In this study, we adopt the SD criterion both in coder design and for quantizer operation. The quantizer is optimized to give minimal average SD scores, This allows us to address the question, is average SD measure really a good criterion, matching subjective ratings. We perform a few objective and subjective tests based on SD optimized coding and some versions thereof. Our tests imply that minimizing average SD may not lead to the best subjective scoring.

在语音的频谱编码中，量化器的设计和评价采用了几种不同的标准。光谱失真(SD)这一指标已成为编码器之间比较的主要指标。在运行时，编码器通常根据其他度量(例如线谱频率(LSF)距离)对矢量进行量化，以降低计算复杂度。在本研究中，我们在编码器设计和量化器操作中都采用了SD准则。量化器被优化为给出最小的平均SD分数，这使我们能够解决这个问题，平均SD测量是否真的是一个很好的标准，匹配主观评分。我们基于SD优化编码和一些版本进行了一些客观和主观的测试。我们的测试表明，最小化平均SD可能不会导致最佳的主观得分。

引用次数: 16

A time warper for speech signals 语音信号的时间扭曲器

1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)

Pub Date : 1999-06-20 DOI: 10.1109/SCFT.1999.781514

R. Sluijter, A.J.E.M. Janssen

A parabolic time warper designed to enhance the stationarity of voiced speech segments, is presented. It is shown how, for a harmonic signal segment, the parabolic time warping function can remove the part of the frequency variation which progresses linearly with time, without changing the time duration of that segment. In the actual implementation of the time warping system, the linear part of the pitch frequency variation in a segment is removed on the basis of maximization of the pitch-related autocorrelation peak of the warped signal. As a by-product, the time warper yields a very reliable pitch estimation. An example on real speech is discussed.

提出了一种抛物线型时间失真器，用于提高浊音段的平稳性。对于谐波信号段，抛物线时间翘曲函数可以去除随时间线性发展的频率变化部分，而不改变该段的持续时间。在时间整波系统的实际实现中，在使被整波信号的音高相关自相关峰值最大化的基础上，去除一段音高频率变化的线性部分。作为副产品，时间扭曲产生了非常可靠的基音估计。最后讨论了一个真实语音的例子。

引用次数: 13

Closed-loop tracking of sinusoids for speech and audio coding 用于语音和音频编码的正弦波闭环跟踪

1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)

Pub Date : 1999-06-20 DOI: 10.1109/SCFT.1999.781464

R. Taori, R. Sluijter

A well recognised problem in low bit rate representation of audio and speech signals, based on the sinusoidal model, is that of tracking the sinusoidal components. Imperfections in the analysis process and the presence of components over a limited duration of time gives rise to ambiguities in the tracking process. As a solution to this problem, we propose a mechanism to achieve closed-loop tracking by means of using analysis-by-synthesis incorporating phase prediction. A simple implementation of such an algorithm is discussed by considering an overlap-add synthesizer. Finally, the results are presented using a voiced speech segment as an example.

在基于正弦模型的音频和语音信号的低比特率表示中，一个公认的问题是跟踪正弦分量。分析过程中的缺陷和在有限时间内存在的组件会导致跟踪过程中的模糊性。为了解决这一问题，我们提出了一种利用结合相位预测的合成分析来实现闭环跟踪的机制。通过考虑重叠加合成器，讨论了这种算法的一个简单实现。最后，以一个浊音段为例给出了结果。

引用次数: 2

Joint speech codec parameter and channel decoding of parameter individual block codes (PIBC) 参数分组码(PIBC)的联合语音编解码参数和信道解码

1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)

Pub Date : 1999-06-20 DOI: 10.1109/SCFT.1999.781489

T. Fingscheidt, S. Heinen, P. Vary

In digital mobile speech transmission usually the most important (class la) bits provided by the speech coding scheme are protected by a CRC for error detection. As a consequence all parameters spanned by the class la bits have to be marked at the receiver either as reliable or as unreliable. In contrast to this somewhat coarse approach we propose the usage of what we call parameter individual block codes (PIBC) for the most important codec parameters. This allows joint speech codec parameter and PIBC decoding taking advantage of the error concealing properties of soft-bit speech decoding.

在数字移动语音传输中，通常由语音编码方案提供的最重要的(la类)位由CRC保护以进行错误检测。因此，在接收端，所有由la类比特所跨越的参数都必须被标记为可靠或不可靠。与这种略显粗糙的方法相反，我们建议对最重要的编解码器参数使用我们称之为参数单个块码(PIBC)的方法。这允许联合语音编解码器参数和PIBC解码，利用软位语音解码的错误隐藏特性。

引用次数: 20

Coding distortion caused by a phase difference between the LP filter and its residual 由低电平滤波器与其残差之间的相位差引起的编码失真

1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)

Pub Date : 1999-06-20 DOI: 10.1109/SCFT.1999.781498

M. Tammi, V.T. Ruoppila, S. Kuusisto, J. Saarinen

Several speech coding algorithms modify the time scale of the residual signal to facilitate efficient coding of pitch information. Time scaling, however, results in a phase difference between the coded residual signal and the time-variant linear prediction (LP) filter used for synthesis in the decoder. In this paper, we examine the coding distortion induced by this phase difference. Moreover, we show that it may cause audible artifacts to the synthesized speech even if lossless coding of all parameters is employed. These artifacts occur particularly at onsets when the frequency response of successive LP filters changes rapidly. A waveform interpolation coder is used to illustrate the effects of the phase mismatch.

有几种语音编码算法通过修改残差信号的时间尺度来实现对音高信息的有效编码。然而，时间尺度导致编码残差信号与解码器中用于合成的时变线性预测(LP)滤波器之间存在相位差。本文研究了这种相位差引起的编码失真。此外，我们表明，即使采用所有参数的无损编码，也可能对合成语音造成可听伪影。当连续低电压滤波器的频率响应迅速变化时，这些伪影尤其会发生。波形插值编码器用来说明相位失配的影响。

引用次数: 8

How to deflate polynomials in LSP computation 如何消除LSP计算中的多项式

1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)

Pub Date : 1999-06-20 DOI: 10.1109/SCFT.1999.781481

B. Dumitrescu, I. Tabus

In this paper we propose a new deflation algorithm for line spectral pair (LSP) computation in speech coding. This algorithm is much more reliable than other methods based on deflation.

本文提出了一种新的用于语音编码中线谱对(LSP)计算的压缩算法。该算法比其他基于通货紧缩的方法可靠得多。

引用次数: 4

Robust voice activity detection for DTX operation of speech coders 语音编码器DTX操作的鲁棒语音活动检测

1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)

Pub Date : 1999-06-20 DOI: 10.1109/SCFT.1999.781483

F. Basbug, S. Nandkumar, K. Swaminathan

Robust detection of voice activity for short-term speech frames is essential for discontinuous transmission (DTX) mode of operation of vocoders such as IS-641. A reference VAD for the IS-641 coder has been chosen for such a purpose and is based on the GSM-EFR (enhance full rate) VAD. We show by developing a comprehensive evaluation procedure that the reference VAD is sensitive to speech level variations. For example, a significant increase is seen in frames falsely classified as active at speech levels of 10 dB above or below nominal level. We propose a solution based on automatic gain control to reduce level sensitivity. Objective performance measures confirm the robustness of our proposed VAD.

短期语音帧的语音活动鲁棒检测对于is -641等声码器的不连续传输(DTX)操作模式至关重要。为此，选择了is -641编码器的参考VAD，它基于GSM-EFR(增强全速率)VAD。我们通过开发一个综合评估程序表明，参考VAD对语音水平变化很敏感。例如，当语音水平高于或低于标称水平10 dB时，被错误地分类为活动的帧显著增加。我们提出了一种基于自动增益控制的方案来降低电平灵敏度。客观性能测量证实了我们提出的VAD的鲁棒性。

引用次数: 6

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀