Computationally efficient compression of audio signals by means of RIQ-DPCM

R. Maher
{"title":"Computationally efficient compression of audio signals by means of RIQ-DPCM","authors":"R. Maher","doi":"10.1109/ASPAA.1993.380002","DOIUrl":null,"url":null,"abstract":"The need to transmit large amounts of data over limited bandwidth channels has resulted in many methods for digital data compression. The common approach is to identify and remove redundancy from the input data stream using knowledge of the source characteristics. In the case of signals intended for human observers (speech, music, pictures, etc.) it is also useful to consider the strengths and weaknesses of the human sensory systems in order to achieve a greater degree of data compression. Unfortunately, achieving perceptually transparent compression requires considerable computational resources. For situations requiring extremely low computational complexity without strictly transparent coding, such as multimedia applications on personal computer platforms, a new adaptive differential pulse code modulation (DPCM) data compression scheme is proposed. Although standard DPCM structures are widely used in single-talker speech coding systems, the models and statistical assumptions well-known for speech signals are not applicable to arbitrary audio signals such as music. The new DPCM formulation presented includes a recursively indexed quantizer (RIQ) to eliminate the problem of overload distortion, a simple predictor structure to take advantage of the short-term correlation present in wideband audio signals, and an adaptation strategy to optimize the system to the local statistics of the input signal. Thus, the new RIQ-DPCM formulation is presented as a computationally efficient means of wideband audio compression.<<ETX>>","PeriodicalId":270576,"journal":{"name":"Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1993-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASPAA.1993.380002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

The need to transmit large amounts of data over limited bandwidth channels has resulted in many methods for digital data compression. The common approach is to identify and remove redundancy from the input data stream using knowledge of the source characteristics. In the case of signals intended for human observers (speech, music, pictures, etc.) it is also useful to consider the strengths and weaknesses of the human sensory systems in order to achieve a greater degree of data compression. Unfortunately, achieving perceptually transparent compression requires considerable computational resources. For situations requiring extremely low computational complexity without strictly transparent coding, such as multimedia applications on personal computer platforms, a new adaptive differential pulse code modulation (DPCM) data compression scheme is proposed. Although standard DPCM structures are widely used in single-talker speech coding systems, the models and statistical assumptions well-known for speech signals are not applicable to arbitrary audio signals such as music. The new DPCM formulation presented includes a recursively indexed quantizer (RIQ) to eliminate the problem of overload distortion, a simple predictor structure to take advantage of the short-term correlation present in wideband audio signals, and an adaptation strategy to optimize the system to the local statistics of the input signal. Thus, the new RIQ-DPCM formulation is presented as a computationally efficient means of wideband audio compression.<>
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用RIQ-DPCM对音频信号进行高效压缩
由于需要在有限的带宽信道上传输大量数据,因此产生了许多数字数据压缩方法。常用的方法是使用源特征的知识从输入数据流中识别和删除冗余。在为人类观察者准备的信号(语音、音乐、图片等)的情况下,为了实现更大程度的数据压缩,考虑人类感官系统的优缺点也是有用的。不幸的是,实现感知透明的压缩需要大量的计算资源。针对对计算复杂度要求极低、编码要求不严格透明的情况,如个人计算机平台上的多媒体应用,提出了一种新的自适应差分脉冲编码调制(DPCM)数据压缩方案。虽然标准的DPCM结构在单话音编码系统中得到了广泛的应用,但众所周知的语音信号模型和统计假设并不适用于音乐等任意音频信号。提出的新的DPCM公式包括一个递归索引量化器(RIQ)来消除过载失真问题,一个简单的预测器结构来利用宽带音频信号中存在的短期相关性,以及一个自适应策略来优化系统以适应输入信号的局部统计。因此,新的RIQ-DPCM公式是一种计算效率高的宽带音频压缩方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Multidimensional scaling analysis of head-related transfer functions Robust adaptive processing of microphone array data for hearing aids Local silencing of room acoustic noise using broadband active noise control Computationally efficient compression of audio signals by means of RIQ-DPCM A simplified source/filter model for percussive sounds
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1