An Efficient HEVC Intra Frame Coding Based on Deep Convolutional Neural Network

2021 IEEE/ACIS 22nd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD) Pub Date : 2021-11-24 DOI:10.1109/SNPD51163.2021.9704928

Tien-Yang Hsu, Yu Lu, Tung-Hung Hsieh, Chou-Chen Wang

{"title":"An Efficient HEVC Intra Frame Coding Based on Deep Convolutional Neural Network","authors":"Tien-Yang Hsu, Yu Lu, Tung-Hung Hsieh, Chou-Chen Wang","doi":"10.1109/SNPD51163.2021.9704928","DOIUrl":null,"url":null,"abstract":"High efficiency video coding (HEVC) is a very popular video coding standard. The HEVC can achieve high coding efficiency with a lower bitrate for intra frame coding. However, it still needs many bits to finish best rate-distortion (R-D) curve. Since there are only 35 directions prediction modes provided in intra prediction module (IPM), HEVC occurs a large distortion when the image contents are out of these prediction directions. In order to obtain a better R-D curve, Zhang et al. [3] recently proposed a simple convolutional neural network (S-CNN) to improve the encoding performance of HEVC. However, S-CNN has to consume more time to encode intra frame coding since it needs to perform more CNN enhancement mode. In order to further speed up S-CNN based intra frame coding, we propose an early termination algorithm to skip CNN. Because the natural images are generally homogenous, we find the mean square errors (MSE) of reconstructed CTU exist high spatial correlation at HEVC encoder. Therefore, a dynamic threshold of MSE is set according to three neighboring encoded CTU blocks to evaluate whether the current reconstructed CTU is useful for the CNN enhancement mode. Simulation results show that the proposed method can achieve faster HEVC encoding process than S-CNN by reducing time increase ratio (TIR) about 12% on an average.","PeriodicalId":235370,"journal":{"name":"2021 IEEE/ACIS 22nd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/ACIS 22nd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SNPD51163.2021.9704928","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

High efficiency video coding (HEVC) is a very popular video coding standard. The HEVC can achieve high coding efficiency with a lower bitrate for intra frame coding. However, it still needs many bits to finish best rate-distortion (R-D) curve. Since there are only 35 directions prediction modes provided in intra prediction module (IPM), HEVC occurs a large distortion when the image contents are out of these prediction directions. In order to obtain a better R-D curve, Zhang et al. [3] recently proposed a simple convolutional neural network (S-CNN) to improve the encoding performance of HEVC. However, S-CNN has to consume more time to encode intra frame coding since it needs to perform more CNN enhancement mode. In order to further speed up S-CNN based intra frame coding, we propose an early termination algorithm to skip CNN. Because the natural images are generally homogenous, we find the mean square errors (MSE) of reconstructed CTU exist high spatial correlation at HEVC encoder. Therefore, a dynamic threshold of MSE is set according to three neighboring encoded CTU blocks to evaluate whether the current reconstructed CTU is useful for the CNN enhancement mode. Simulation results show that the proposed method can achieve faster HEVC encoding process than S-CNN by reducing time increase ratio (TIR) about 12% on an average.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度卷积神经网络的高效HEVC帧内编码

高效视频编码(HEVC)是一种非常流行的视频编码标准。HEVC可以以较低的码率实现帧内编码的高效率。然而，它仍然需要很多比特来完成最佳率失真(R-D)曲线。由于图像内预测模块(IPM)只提供了35种方向预测模式，当图像内容不在这些预测方向时，HEVC会产生较大的失真。为了获得更好的R-D曲线，Zhang等[3]最近提出了一种简单卷积神经网络(S-CNN)来提高HEVC的编码性能。但是，S-CNN需要执行更多的CNN增强模式，因此需要花费更多的时间来编码帧内编码。为了进一步加快基于S-CNN的帧内编码速度，我们提出了一种跳过CNN的早期终止算法。由于自然图像一般都是同质的，我们发现重建CTU的均方误差(MSE)在HEVC编码器上具有很高的空间相关性。因此，根据三个相邻的编码CTU块设置MSE的动态阈值，以评估当前重构的CTU是否对CNN增强模式有用。仿真结果表明，该方法比S-CNN平均降低了12%左右的时间增长比，实现了更快的HEVC编码过程。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2021 IEEE/ACIS 22nd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)

自引率

0.00%

发文量