What are good CGS/MGS configurations for H.264 quality scalable coding?

Shih-Hsuan Yang, Wei-Lune Tang
{"title":"What are good CGS/MGS configurations for H.264 quality scalable coding?","authors":"Shih-Hsuan Yang, Wei-Lune Tang","doi":"10.5220/0003608201040109","DOIUrl":null,"url":null,"abstract":"Scalable video coding (SVC) encodes image sequences into a single bit stream that can be adapted to various network and terminal capabilities. The H.264/AVC standard includes three kinds of video scalability, spatial scalability, temporal scalability, and quality scalability. Among them, quality scalability refers to image sequences of the same spatio-temporal resolution but with different fidelity levels. Two options of quality scalability are adopted in H.264/AVC, namely CGS (coarse-grain quality scalable coding) and MGS (medium-grain quality scalability), and they may be used in combinations. A refinement layer in CGS is obtained by re-quantizing the (residual) texture signal with a smaller quantization step size (QP). Using the CGS alone, however, may incur notable PSNR penalty and high encoding complexity if numerous rate points are required. MGS partitions the transform coefficients of a CGS layer into several MGS sub-layers and distributes them in different NAL units. The use of MGS may increase the adaptation flexibility, improve the coding efficiency, and reduce the coding complexity. In this paper, we investigate the CGS/MGS configurations that lead to good performance. From extensive experiments using the JSVM (Joint Scalable Video Model), however, we find that MGS should be carefully employed. Although MGS always reduces the encoding complexity as compared to using CGS alone, its rate-distortion is unstable. While MGS typically provides better or comparable rate-distortion performance for the cases with eight rate points or more, some configurations may cause an unexpected PSNR drop with an increased bit rate. This anomaly is currently under investigation.","PeriodicalId":103791,"journal":{"name":"Proceedings of the International Conference on Signal Processing and Multimedia Applications","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Conference on Signal Processing and Multimedia Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5220/0003608201040109","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Scalable video coding (SVC) encodes image sequences into a single bit stream that can be adapted to various network and terminal capabilities. The H.264/AVC standard includes three kinds of video scalability, spatial scalability, temporal scalability, and quality scalability. Among them, quality scalability refers to image sequences of the same spatio-temporal resolution but with different fidelity levels. Two options of quality scalability are adopted in H.264/AVC, namely CGS (coarse-grain quality scalable coding) and MGS (medium-grain quality scalability), and they may be used in combinations. A refinement layer in CGS is obtained by re-quantizing the (residual) texture signal with a smaller quantization step size (QP). Using the CGS alone, however, may incur notable PSNR penalty and high encoding complexity if numerous rate points are required. MGS partitions the transform coefficients of a CGS layer into several MGS sub-layers and distributes them in different NAL units. The use of MGS may increase the adaptation flexibility, improve the coding efficiency, and reduce the coding complexity. In this paper, we investigate the CGS/MGS configurations that lead to good performance. From extensive experiments using the JSVM (Joint Scalable Video Model), however, we find that MGS should be carefully employed. Although MGS always reduces the encoding complexity as compared to using CGS alone, its rate-distortion is unstable. While MGS typically provides better or comparable rate-distortion performance for the cases with eight rate points or more, some configurations may cause an unexpected PSNR drop with an increased bit rate. This anomaly is currently under investigation.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
什么是好的CGS/MGS配置为H.264质量可扩展编码?
可扩展视频编码(SVC)将图像序列编码成可以适应各种网络和终端功能的单个比特流。H.264/AVC标准包括三种视频可扩展性:空间可扩展性、时间可扩展性和质量可扩展性。其中,质量可扩展性是指具有相同时空分辨率但保真度不同的图像序列。H.264/AVC采用了两种质量可扩展性选项,即CGS(粗粒度质量可扩展性编码)和MGS(中粒度质量可扩展性编码),它们可以组合使用。采用较小的量化步长(QP)对(残余)纹理信号进行再量化,得到CGS中的细化层。然而,如果需要大量的速率点,单独使用CGS可能会导致显著的PSNR损失和高编码复杂性。MGS将一个CGS层的变换系数划分为几个MGS子层,并将它们分布在不同的NAL单位中。使用MGS可以增加自适应灵活性,提高编码效率,降低编码复杂度。在本文中,我们研究了能够带来良好性能的CGS/MGS配置。然而,从使用JSVM(联合可扩展视频模型)的大量实验中,我们发现应该谨慎使用MGS。虽然与单独使用CGS相比,MGS总能降低编码复杂度,但它的率失真是不稳定的。虽然MGS通常在8个或更多速率点的情况下提供更好或相当的速率失真性能,但随着比特率的增加,某些配置可能会导致意想不到的PSNR下降。这一异常现象目前正在调查中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Latent topic visual language model for object categorization Optimal combination of low-level features for surveillance object retrieval Managing multiple media streams in HTML5: The IEEE 1599-2008 case study Automatic sound restoration system concepts and design Visual AER-based processing with convolutions for a parallel supercomputer
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1