Multi-stage Locally and Long-range Correlated Feature Fusion for Learned In-loop Filter in VVC

B. Kathariya, Zhu Li, Hongtao Wang, G. V. D. Auwera
{"title":"Multi-stage Locally and Long-range Correlated Feature Fusion for Learned In-loop Filter in VVC","authors":"B. Kathariya, Zhu Li, Hongtao Wang, G. V. D. Auwera","doi":"10.1109/VCIP56404.2022.10008834","DOIUrl":null,"url":null,"abstract":"Versatile Video Coding (VVC)/H.266 is currently the state-of-the-art video coding standard with significant improvement in coding efficiency over its predecessor High Efficiency Video Coding (HEVC)/H.26S. Nonetheless, VVC is also block-based video coding technology where decoded pictures contain compression artifacts. In VVC, in-loop filters serve to suppress these compression artifacts. In this paper, convolution neural network (CNN) is utilized to better facilitate the suppression of compression artifacts over VVC. Nonetheless, our approach has uniqueness in obtaining better features by exploiting locally correlated spatial features in the pixel domain as well as long-range correlated spectral features in the discrete cosine transform (DCT) domain. In particular, we utilized CNN-features from DCT transformed input to extract high-frequency components and induce long-range correlation into the spatial CNN-features by employing multi-stage feature fusion. Our experimental result shows that the proposed approach achieves significant coding improvements up to 9.70% on average Bjantegaard Delta (BD)-Bitrate savings under AI configurations for luma (Y) components.","PeriodicalId":269379,"journal":{"name":"2022 IEEE International Conference on Visual Communications and Image Processing (VCIP)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Visual Communications and Image Processing (VCIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VCIP56404.2022.10008834","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Versatile Video Coding (VVC)/H.266 is currently the state-of-the-art video coding standard with significant improvement in coding efficiency over its predecessor High Efficiency Video Coding (HEVC)/H.26S. Nonetheless, VVC is also block-based video coding technology where decoded pictures contain compression artifacts. In VVC, in-loop filters serve to suppress these compression artifacts. In this paper, convolution neural network (CNN) is utilized to better facilitate the suppression of compression artifacts over VVC. Nonetheless, our approach has uniqueness in obtaining better features by exploiting locally correlated spatial features in the pixel domain as well as long-range correlated spectral features in the discrete cosine transform (DCT) domain. In particular, we utilized CNN-features from DCT transformed input to extract high-frequency components and induce long-range correlation into the spatial CNN-features by employing multi-stage feature fusion. Our experimental result shows that the proposed approach achieves significant coding improvements up to 9.70% on average Bjantegaard Delta (BD)-Bitrate savings under AI configurations for luma (Y) components.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
VVC学习环内滤波器的多阶段局部和远程相关特征融合
通用视频编码(VVC)/H266是目前最先进的视频编码标准,与之前的高效视频编码(HEVC)/H.26S相比,其编码效率有了显著提高。尽管如此,VVC也是基于块的视频编码技术,其中解码的图像包含压缩伪影。在VVC中,循环内滤波器用于抑制这些压缩伪影。本文利用卷积神经网络(CNN)来更好地抑制VVC上的压缩伪影。尽管如此,我们的方法在通过利用像素域的局部相关空间特征和离散余弦变换(DCT)域的远程相关光谱特征来获得更好的特征方面具有独特性。特别地,我们利用DCT变换后输入的cnn特征提取高频分量,并通过多阶段特征融合将远程相关性引入到空间cnn特征中。我们的实验结果表明,该方法在AI配置下对luma (Y)组件的平均Bjantegaard Delta (BD)比特率节省高达9.70%的显著编码改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
CdCLR: Clip-Driven Contrastive Learning for Skeleton-Based Action Recognition Spectral Analysis of Aerial Light Field for Optimization Sampling and Rendering of Unmanned Aerial Vehicle Near-lossless Point Cloud Geometry Compression Based on Adaptive Residual Compensation Efficient Interpolation Filters for Chroma Motion Compensation in Video Coding Rate Controllable Learned Image Compression Based on RFL Model
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1