抖动在语音信号量化中的应用

N. Jayant, L. Rabiner
{"title":"抖动在语音信号量化中的应用","authors":"N. Jayant, L. Rabiner","doi":"10.1002/J.1538-7305.1972.TB02653.X","DOIUrl":null,"url":null,"abstract":"By adding a pseudo-random “dither” noise to a signal X that is to be quantized, and by subtracting an identical noise sequence from the quantizer output, it is possible to break up undesirable signal-dependent patterns in the quantization error sequence, without increasing the variance of the error E. The idea has been widely discussed in the context of picture coding, and it is the purpose of this paper to demonstrate application of the technique to the quantization of speech signals. Computer simulations have shown how the use of dither whitens the quantization error sequence in PCM encoding, and renders it more acceptable than signal-correlated errors of equal variance. We demonstrate, for conditions of dither and no dither, typical speech recordings, illustrative error waveforms, and data on signal-to-error correlation C, and indicate how the advantage of dithering increases monotonically with crudeness of signal quantization and becomes significant when the number of bits per sample is less than about six. While the parameter C is a simple criterion for demonstrating the effect of dither, it must be emphasized that the truly relevant criterion is the statistical independence of E and X, and not merely the decorrelation of these functions. Thus, for example, we show that for the case of a reciprocal PDF (probability density function) for X, a zero value of C can be achieved without dither. For purposes of implementation, it is desirable to employ dither noise values characterized by a discrete PDF, with a support that is equal to an integral multiple of the step-size Δ x in the quantizer. We show that for effective dithering, the step-size Δ N in the noise PDF need be no smaller, typically, than Δ x /4. Finally, we indicate an application of dither to the quantization of speech signals by delta modulation.","PeriodicalId":55391,"journal":{"name":"Bell System Technical Journal","volume":"12 1","pages":"1293-1304"},"PeriodicalIF":0.0000,"publicationDate":"1972-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"50","resultStr":"{\"title\":\"The Application of dither to the quantization of speech signals\",\"authors\":\"N. Jayant, L. Rabiner\",\"doi\":\"10.1002/J.1538-7305.1972.TB02653.X\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"By adding a pseudo-random “dither” noise to a signal X that is to be quantized, and by subtracting an identical noise sequence from the quantizer output, it is possible to break up undesirable signal-dependent patterns in the quantization error sequence, without increasing the variance of the error E. The idea has been widely discussed in the context of picture coding, and it is the purpose of this paper to demonstrate application of the technique to the quantization of speech signals. Computer simulations have shown how the use of dither whitens the quantization error sequence in PCM encoding, and renders it more acceptable than signal-correlated errors of equal variance. We demonstrate, for conditions of dither and no dither, typical speech recordings, illustrative error waveforms, and data on signal-to-error correlation C, and indicate how the advantage of dithering increases monotonically with crudeness of signal quantization and becomes significant when the number of bits per sample is less than about six. While the parameter C is a simple criterion for demonstrating the effect of dither, it must be emphasized that the truly relevant criterion is the statistical independence of E and X, and not merely the decorrelation of these functions. Thus, for example, we show that for the case of a reciprocal PDF (probability density function) for X, a zero value of C can be achieved without dither. For purposes of implementation, it is desirable to employ dither noise values characterized by a discrete PDF, with a support that is equal to an integral multiple of the step-size Δ x in the quantizer. We show that for effective dithering, the step-size Δ N in the noise PDF need be no smaller, typically, than Δ x /4. Finally, we indicate an application of dither to the quantization of speech signals by delta modulation.\",\"PeriodicalId\":55391,\"journal\":{\"name\":\"Bell System Technical Journal\",\"volume\":\"12 1\",\"pages\":\"1293-1304\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1972-07-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"50\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bell System Technical Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/J.1538-7305.1972.TB02653.X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bell System Technical Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/J.1538-7305.1972.TB02653.X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 50

摘要

通过添加一个伪随机“优柔寡断”噪声信号X是量子化的,减去一个相同的噪声序列从量化器输出,可以打破不良相互依赖模式的量化误差序列,不增加误差的方差大肠的想法已经被广泛讨论的上下文中图片编码,它的目的是本文演示的应用技术,语音信号的量化。计算机模拟显示了抖动如何使PCM编码中的量化误差序列白化,并使其比等方差的信号相关误差更容易接受。对于抖动和无抖动的条件,我们展示了典型的语音记录、说白了的误差波形和信号误差相关C的数据,并指出抖动的优势如何随着信号量化的粗糙程度单调增加,并在每个样本的比特数小于约6时变得显著。虽然参数C是证明抖动效应的一个简单准则,但必须强调的是,真正相关的准则是E和X的统计独立性,而不仅仅是这些函数的去相关。因此,例如,我们证明了对于X的倒数PDF(概率密度函数)的情况,C的零值可以在没有抖动的情况下实现。为了实现目的,希望采用离散PDF特征的抖动噪声值,其支持等于量化器中步长Δ x的整数倍。我们表明,为了有效抖动,噪声PDF中的步长Δ N通常不需要小于Δ x /4。最后,我们指出了抖动在增量调制语音信号量化中的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
The Application of dither to the quantization of speech signals
By adding a pseudo-random “dither” noise to a signal X that is to be quantized, and by subtracting an identical noise sequence from the quantizer output, it is possible to break up undesirable signal-dependent patterns in the quantization error sequence, without increasing the variance of the error E. The idea has been widely discussed in the context of picture coding, and it is the purpose of this paper to demonstrate application of the technique to the quantization of speech signals. Computer simulations have shown how the use of dither whitens the quantization error sequence in PCM encoding, and renders it more acceptable than signal-correlated errors of equal variance. We demonstrate, for conditions of dither and no dither, typical speech recordings, illustrative error waveforms, and data on signal-to-error correlation C, and indicate how the advantage of dithering increases monotonically with crudeness of signal quantization and becomes significant when the number of bits per sample is less than about six. While the parameter C is a simple criterion for demonstrating the effect of dither, it must be emphasized that the truly relevant criterion is the statistical independence of E and X, and not merely the decorrelation of these functions. Thus, for example, we show that for the case of a reciprocal PDF (probability density function) for X, a zero value of C can be achieved without dither. For purposes of implementation, it is desirable to employ dither noise values characterized by a discrete PDF, with a support that is equal to an integral multiple of the step-size Δ x in the quantizer. We show that for effective dithering, the step-size Δ N in the noise PDF need be no smaller, typically, than Δ x /4. Finally, we indicate an application of dither to the quantization of speech signals by delta modulation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Information Management System: The off-the-shelf system — a packaged information management system Stability of a general type of pulse-width-modulated feedback system Information management system: Interactive information management systems Error rates of digital signals in charge transfer devices Information Management System: The natural dialogue system
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1