{"title":"提高H.266/VVC编码效率的深度学习技术","authors":"J. Fang, Chen Ou, Ting-Chen Yeh, Yu-Yang Wang","doi":"10.1109/IS3C57901.2023.00059","DOIUrl":null,"url":null,"abstract":"H.266/VVC modifies the quadtree structure of HEVC and adopts the Quadtree with nested multi-type tree (QT-MTT) encoding structure to search for the best encoding unit. Although the QT-MTT encoding structure has better encoding efficiency, it also increases the computational complexity and encoding time. This paper mainly focuses on the QT-MTT structure of H.266/VVC intra-frame coding and proposes the use of convolutional neural networks (CNNs) based on deep learning to prematurely terminate the decision of the horizontal binary tree, horizontal ternary tree, vertical binary tree, or vertical ternary tree of $32\\times 32$ coding units, and skip the rate distortion optimization (RDO) step to save encoding time of H.266/VVC. Experiments show that this paper only approximately increases BDBR by 0.45 dB, but can reduce% of encoding time.","PeriodicalId":142483,"journal":{"name":"2023 Sixth International Symposium on Computer, Consumer and Control (IS3C)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Learning Technology to Improve the Coding Efficiency of H.266/VVC\",\"authors\":\"J. Fang, Chen Ou, Ting-Chen Yeh, Yu-Yang Wang\",\"doi\":\"10.1109/IS3C57901.2023.00059\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"H.266/VVC modifies the quadtree structure of HEVC and adopts the Quadtree with nested multi-type tree (QT-MTT) encoding structure to search for the best encoding unit. Although the QT-MTT encoding structure has better encoding efficiency, it also increases the computational complexity and encoding time. This paper mainly focuses on the QT-MTT structure of H.266/VVC intra-frame coding and proposes the use of convolutional neural networks (CNNs) based on deep learning to prematurely terminate the decision of the horizontal binary tree, horizontal ternary tree, vertical binary tree, or vertical ternary tree of $32\\\\times 32$ coding units, and skip the rate distortion optimization (RDO) step to save encoding time of H.266/VVC. Experiments show that this paper only approximately increases BDBR by 0.45 dB, but can reduce% of encoding time.\",\"PeriodicalId\":142483,\"journal\":{\"name\":\"2023 Sixth International Symposium on Computer, Consumer and Control (IS3C)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 Sixth International Symposium on Computer, Consumer and Control (IS3C)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IS3C57901.2023.00059\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Sixth International Symposium on Computer, Consumer and Control (IS3C)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IS3C57901.2023.00059","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Learning Technology to Improve the Coding Efficiency of H.266/VVC
H.266/VVC modifies the quadtree structure of HEVC and adopts the Quadtree with nested multi-type tree (QT-MTT) encoding structure to search for the best encoding unit. Although the QT-MTT encoding structure has better encoding efficiency, it also increases the computational complexity and encoding time. This paper mainly focuses on the QT-MTT structure of H.266/VVC intra-frame coding and proposes the use of convolutional neural networks (CNNs) based on deep learning to prematurely terminate the decision of the horizontal binary tree, horizontal ternary tree, vertical binary tree, or vertical ternary tree of $32\times 32$ coding units, and skip the rate distortion optimization (RDO) step to save encoding time of H.266/VVC. Experiments show that this paper only approximately increases BDBR by 0.45 dB, but can reduce% of encoding time.