Multi-stage Locally and Long-range Correlated Feature Fusion for Learned In-loop Filter in VVC

2022 IEEE International Conference on Visual Communications and Image Processing (VCIP) Pub Date : 2022-12-13 DOI:10.1109/VCIP56404.2022.10008834

B. Kathariya, Zhu Li, Hongtao Wang, G. V. D. Auwera

{"title":"Multi-stage Locally and Long-range Correlated Feature Fusion for Learned In-loop Filter in VVC","authors":"B. Kathariya, Zhu Li, Hongtao Wang, G. V. D. Auwera","doi":"10.1109/VCIP56404.2022.10008834","DOIUrl":null,"url":null,"abstract":"Versatile Video Coding (VVC)/H.266 is currently the state-of-the-art video coding standard with significant improvement in coding efficiency over its predecessor High Efficiency Video Coding (HEVC)/H.26S. Nonetheless, VVC is also block-based video coding technology where decoded pictures contain compression artifacts. In VVC, in-loop filters serve to suppress these compression artifacts. In this paper, convolution neural network (CNN) is utilized to better facilitate the suppression of compression artifacts over VVC. Nonetheless, our approach has uniqueness in obtaining better features by exploiting locally correlated spatial features in the pixel domain as well as long-range correlated spectral features in the discrete cosine transform (DCT) domain. In particular, we utilized CNN-features from DCT transformed input to extract high-frequency components and induce long-range correlation into the spatial CNN-features by employing multi-stage feature fusion. Our experimental result shows that the proposed approach achieves significant coding improvements up to 9.70% on average Bjantegaard Delta (BD)-Bitrate savings under AI configurations for luma (Y) components.","PeriodicalId":269379,"journal":{"name":"2022 IEEE International Conference on Visual Communications and Image Processing (VCIP)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Visual Communications and Image Processing (VCIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VCIP56404.2022.10008834","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Versatile Video Coding (VVC)/H.266 is currently the state-of-the-art video coding standard with significant improvement in coding efficiency over its predecessor High Efficiency Video Coding (HEVC)/H.26S. Nonetheless, VVC is also block-based video coding technology where decoded pictures contain compression artifacts. In VVC, in-loop filters serve to suppress these compression artifacts. In this paper, convolution neural network (CNN) is utilized to better facilitate the suppression of compression artifacts over VVC. Nonetheless, our approach has uniqueness in obtaining better features by exploiting locally correlated spatial features in the pixel domain as well as long-range correlated spectral features in the discrete cosine transform (DCT) domain. In particular, we utilized CNN-features from DCT transformed input to extract high-frequency components and induce long-range correlation into the spatial CNN-features by employing multi-stage feature fusion. Our experimental result shows that the proposed approach achieves significant coding improvements up to 9.70% on average Bjantegaard Delta (BD)-Bitrate savings under AI configurations for luma (Y) components.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

VVC学习环内滤波器的多阶段局部和远程相关特征融合

通用视频编码(VVC)/H266是目前最先进的视频编码标准，与之前的高效视频编码(HEVC)/H.26S相比，其编码效率有了显著提高。尽管如此，VVC也是基于块的视频编码技术，其中解码的图像包含压缩伪影。在VVC中，循环内滤波器用于抑制这些压缩伪影。本文利用卷积神经网络(CNN)来更好地抑制VVC上的压缩伪影。尽管如此，我们的方法在通过利用像素域的局部相关空间特征和离散余弦变换(DCT)域的远程相关光谱特征来获得更好的特征方面具有独特性。特别地，我们利用DCT变换后输入的cnn特征提取高频分量，并通过多阶段特征融合将远程相关性引入到空间cnn特征中。我们的实验结果表明，该方法在AI配置下对luma (Y)组件的平均Bjantegaard Delta (BD)比特率节省高达9.70%的显著编码改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2022 IEEE International Conference on Visual Communications and Image Processing (VCIP)

自引率

0.00%

发文量