An Efficient HEVC Intra Frame Coding Based on Deep Convolutional Neural Network

Tien-Yang Hsu, Yu Lu, Tung-Hung Hsieh, Chou-Chen Wang
{"title":"An Efficient HEVC Intra Frame Coding Based on Deep Convolutional Neural Network","authors":"Tien-Yang Hsu, Yu Lu, Tung-Hung Hsieh, Chou-Chen Wang","doi":"10.1109/SNPD51163.2021.9704928","DOIUrl":null,"url":null,"abstract":"High efficiency video coding (HEVC) is a very popular video coding standard. The HEVC can achieve high coding efficiency with a lower bitrate for intra frame coding. However, it still needs many bits to finish best rate-distortion (R-D) curve. Since there are only 35 directions prediction modes provided in intra prediction module (IPM), HEVC occurs a large distortion when the image contents are out of these prediction directions. In order to obtain a better R-D curve, Zhang et al. [3] recently proposed a simple convolutional neural network (S-CNN) to improve the encoding performance of HEVC. However, S-CNN has to consume more time to encode intra frame coding since it needs to perform more CNN enhancement mode. In order to further speed up S-CNN based intra frame coding, we propose an early termination algorithm to skip CNN. Because the natural images are generally homogenous, we find the mean square errors (MSE) of reconstructed CTU exist high spatial correlation at HEVC encoder. Therefore, a dynamic threshold of MSE is set according to three neighboring encoded CTU blocks to evaluate whether the current reconstructed CTU is useful for the CNN enhancement mode. Simulation results show that the proposed method can achieve faster HEVC encoding process than S-CNN by reducing time increase ratio (TIR) about 12% on an average.","PeriodicalId":235370,"journal":{"name":"2021 IEEE/ACIS 22nd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/ACIS 22nd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SNPD51163.2021.9704928","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

High efficiency video coding (HEVC) is a very popular video coding standard. The HEVC can achieve high coding efficiency with a lower bitrate for intra frame coding. However, it still needs many bits to finish best rate-distortion (R-D) curve. Since there are only 35 directions prediction modes provided in intra prediction module (IPM), HEVC occurs a large distortion when the image contents are out of these prediction directions. In order to obtain a better R-D curve, Zhang et al. [3] recently proposed a simple convolutional neural network (S-CNN) to improve the encoding performance of HEVC. However, S-CNN has to consume more time to encode intra frame coding since it needs to perform more CNN enhancement mode. In order to further speed up S-CNN based intra frame coding, we propose an early termination algorithm to skip CNN. Because the natural images are generally homogenous, we find the mean square errors (MSE) of reconstructed CTU exist high spatial correlation at HEVC encoder. Therefore, a dynamic threshold of MSE is set according to three neighboring encoded CTU blocks to evaluate whether the current reconstructed CTU is useful for the CNN enhancement mode. Simulation results show that the proposed method can achieve faster HEVC encoding process than S-CNN by reducing time increase ratio (TIR) about 12% on an average.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度卷积神经网络的高效HEVC帧内编码
高效视频编码(HEVC)是一种非常流行的视频编码标准。HEVC可以以较低的码率实现帧内编码的高效率。然而,它仍然需要很多比特来完成最佳率失真(R-D)曲线。由于图像内预测模块(IPM)只提供了35种方向预测模式,当图像内容不在这些预测方向时,HEVC会产生较大的失真。为了获得更好的R-D曲线,Zhang等[3]最近提出了一种简单卷积神经网络(S-CNN)来提高HEVC的编码性能。但是,S-CNN需要执行更多的CNN增强模式,因此需要花费更多的时间来编码帧内编码。为了进一步加快基于S-CNN的帧内编码速度,我们提出了一种跳过CNN的早期终止算法。由于自然图像一般都是同质的,我们发现重建CTU的均方误差(MSE)在HEVC编码器上具有很高的空间相关性。因此,根据三个相邻的编码CTU块设置MSE的动态阈值,以评估当前重构的CTU是否对CNN增强模式有用。仿真结果表明,该方法比S-CNN平均降低了12%左右的时间增长比,实现了更快的HEVC编码过程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Quantum Annealing Approach for the Optimal Real-time Traffic Control using QUBO How to Enlighten Novice Users on Behavior of Machine Learning Models? Keynote Address: Deep Learning Networks for Medical Image Analysis: Its Past, Future, and Issues Web-based systems for inventory control in organizations: A Systematic Review Geometrical Schemes as Probabilistic and Entropic Tools to Estimate Duration and Peaks of Pandemic Waves
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1