{"title":"Scalable embedded zero tree wavelet packet audio coding","authors":"P. Chang, Jen-Hsin Lin","doi":"10.1109/SPAWC.2001.923932","DOIUrl":null,"url":null,"abstract":"Multimedia transmission over the Internet is getting popular and increasingly important. In particular, scalable coding is desirable for a heterogeneous network with varied bandwidths. We propose a scalable embedded zero tree wavelet packet (scalable EZWP) audio coding system that is a scalable audio compression system using wavelet packet decomposition and embedded zero-tree coding. We focus on multilayer low-bitrate coding which delivers high perceptual quality. In the base layer, the overlapped audio segment is first transformed by the wavelet packet. Then the local significant coefficients are extracted, quantized, and coded by variable length coding. In the enhancement layer and the full band layer, the residual signal that is the difference between the original and the output of the previous layer is coded via EZW with a psychoacoustic model and arithmetic coding. The target bit rates for three layers are 16, 32, and 64 Kbit/s, respectively. The performance of the proposed coding system is only slightly inferior to MPEG-1 layer 3 at 64 Kbit/s while it provides bitrate scalability that is suitable for multimedia distribution over the Internet with heterogeneous networks.","PeriodicalId":435867,"journal":{"name":"2001 IEEE Third Workshop on Signal Processing Advances in Wireless Communications (SPAWC'01). Workshop Proceedings (Cat. No.01EX471)","volume":"123 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2001 IEEE Third Workshop on Signal Processing Advances in Wireless Communications (SPAWC'01). Workshop Proceedings (Cat. No.01EX471)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPAWC.2001.923932","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Multimedia transmission over the Internet is getting popular and increasingly important. In particular, scalable coding is desirable for a heterogeneous network with varied bandwidths. We propose a scalable embedded zero tree wavelet packet (scalable EZWP) audio coding system that is a scalable audio compression system using wavelet packet decomposition and embedded zero-tree coding. We focus on multilayer low-bitrate coding which delivers high perceptual quality. In the base layer, the overlapped audio segment is first transformed by the wavelet packet. Then the local significant coefficients are extracted, quantized, and coded by variable length coding. In the enhancement layer and the full band layer, the residual signal that is the difference between the original and the output of the previous layer is coded via EZW with a psychoacoustic model and arithmetic coding. The target bit rates for three layers are 16, 32, and 64 Kbit/s, respectively. The performance of the proposed coding system is only slightly inferior to MPEG-1 layer 3 at 64 Kbit/s while it provides bitrate scalability that is suitable for multimedia distribution over the Internet with heterogeneous networks.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
可扩展的嵌入式零树小波包音频编码
互联网上的多媒体传输越来越流行,也越来越重要。特别是,可伸缩编码对于具有不同带宽的异构网络是理想的。我们提出了一种可扩展的嵌入式零树小波包音频编码系统(scalable EZWP),它是一种使用小波包分解和嵌入式零树编码的可扩展音频压缩系统。我们专注于多层低比特率编码,提供高感知质量。在基础层,首先对重叠的音频片段进行小波包变换。然后对局部有效系数进行提取、量化和变长编码。在增强层和全带层中,残差信号即原信号与前一层输出信号之差,通过EZW进行心理声学模型和算术编码。三层的目标比特率分别为16、32和64 Kbit/s。所提出的编码系统的性能仅略低于MPEG-1第3层的64 Kbit/s,而它提供了比特率的可扩展性,适合在具有异构网络的Internet上进行多媒体分发。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Realistic channel model considerations in UMTS downlink capacity with space-time block coding Signal detection and timing estimation via summation likelihood ratio test ESPAR antennas-based signal processing for DS-CDMA signal waveforms in ad hoc network systems New results on blind asynchronous CDMA receivers using code-constrained CMA A flexible RAKE receiver architecture for WCDMA mobile terminals
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1