The Application of dither to the quantization of speech signals
N. Jayant, L. Rabiner
Bell System Technical Journal, pp. 1293-1304, published 1972-07-08
DOI: https://doi.org/10.1002/J.1538-7305.1972.TB02653.X
Citations: 50
Abstract
By adding a pseudo-random “dither” noise to a signal X that is to be quantized, and by subtracting an identical noise sequence from the quantizer output, it is possible to break up undesirable signal-dependent patterns in the quantization error sequence, without increasing the variance of the error E. The idea has been widely discussed in the context of picture coding, and it is the purpose of this paper to demonstrate application of the technique to the quantization of speech signals. Computer simulations have shown how the use of dither whitens the quantization error sequence in PCM encoding, and renders it more acceptable than signal-correlated errors of equal variance. We demonstrate, for conditions of dither and no dither, typical speech recordings, illustrative error waveforms, and data on signal-to-error correlation C, and indicate how the advantage of dithering increases monotonically with crudeness of signal quantization and becomes significant when the number of bits per sample is less than about six. While the parameter C is a simple criterion for demonstrating the effect of dither, it must be emphasized that the truly relevant criterion is the statistical independence of E and X, and not merely the decorrelation of these functions. Thus, for example, we show that for the case of a reciprocal PDF (probability density function) for X, a zero value of C can be achieved without dither. For purposes of implementation, it is desirable to employ dither noise values characterized by a discrete PDF, with a support that is equal to an integral multiple of the step-size Δx in the quantizer. We show that for effective dithering, the step-size ΔN in the noise PDF need be no smaller, typically, than Δx/4. Finally, we indicate an application of dither to the quantization of speech signals by delta modulation.
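The add-then-subtract scheme described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' simulation: it assumes a unit-variance Gaussian sequence as a stand-in for speech, a uniform rounding quantizer with an illustrative step of 3, and hypothetical helper names (`quantize`, `uniform_dither`, `discrete_dither`, `simulate`). The discrete dither uses four equally likely values spaced one quarter of the quantizer step apart, which is one possible reading of the Δx/4 guideline.

```python
import math
import random


def quantize(x, step):
    """Uniform mid-tread (rounding) quantizer with step size `step`."""
    return step * math.floor(x / step + 0.5)


def uniform_dither(rng, step):
    """Continuous dither, uniform over one quantizer step (an assumption)."""
    return (rng.random() - 0.5) * step


def discrete_dither(rng, step, levels=4):
    """Discrete dither: `levels` equally likely values spaced step / levels apart."""
    return ((rng.randrange(levels) + 0.5) / levels - 0.5) * step


def correlation(a, b):
    """Sample correlation coefficient between two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((u - ma) * (v - mb) for u, v in zip(a, b))
    va = sum((u - ma) ** 2 for u in a)
    vb = sum((v - mb) ** 2 for v in b)
    return cov / math.sqrt(va * vb)


def variance(a):
    """Population variance of a sequence."""
    m = sum(a) / len(a)
    return sum((u - m) ** 2 for u in a) / len(a)


def simulate(dither=None, step=3.0, n=50_000, seed=0):
    """Return (C, error variance) for a unit-variance Gaussian test signal.

    `dither` is None (plain PCM) or a function (rng, step) -> noise value.
    """
    rng = random.Random(seed)
    xs, es = [], []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)
        d = dither(rng, step) if dither else 0.0
        y = quantize(x + d, step) - d  # add dither before quantizing, subtract it after
        xs.append(x)
        es.append(y - x)               # quantization error E
    return correlation(xs, es), variance(es)


if __name__ == "__main__":
    cases = [("no dither", None),
             ("uniform dither", uniform_dither),
             ("discrete, step/4", discrete_dither)]
    for name, fn in cases:
        c, var = simulate(fn)
        print(f"{name:>18}: C = {c:+.3f}, error variance = {var:.3f}")
```

With subtractive dither that is uniform over one step, the error is itself uniform over one step, so its variance is Δx²/12 whatever the signal. The undithered figure depends on the signal and, at a step this coarse, need not match it. Note also that a small C only shows decorrelation, which, as the abstract stresses, is weaker than statistical independence of E and X.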