Bouthaina Abdallah, Sonda Ben Jdidia, Fatma Belghith, Mohamed Ali Ben Ayed, N. Masmoudi
{"title":"A CNN-based QTMT partitioning decision for the VVC standard","authors":"Bouthaina Abdallah, Sonda Ben Jdidia, Fatma Belghith, Mohamed Ali Ben Ayed, N. Masmoudi","doi":"10.1109/DTS55284.2022.9809888","DOIUrl":null,"url":null,"abstract":"The Versatile Video Coding (VVC) standard enhances the coding performance in terms of bitrate and quality compared to its predecessor High Efficiency Video Coding (HEVC) at the expense of an additional coding complexity. This complexity is due to the new partitioning improvement named quadtree with nested multi-type tree (QTMT) which includes more partition shapes increasing the rate-distortion optimization (RDO) complexity. In this paper, a fast QTMT partition approach based on the Convolutional Neural Network (CNN) is proposed for the purpose of reducing the computational complexity. In fact, an intra QTMT decision tree method using a Convolutional Neural Network-binary tree horizontal (CNN-BTH) model is elaborated to determine the BTH decision depths at the 32×32 block size. The suggested fast partitioning algorithm achieves an important gain in the encoding time up to 64.26% compared to the original VVC software reference VTM-3.0. It allows a significant computational complexity reduction of 58.28% on average and an acceptable loss in the encoding performance.","PeriodicalId":290904,"journal":{"name":"2022 IEEE International Conference on Design & Test of Integrated Micro & Nano-Systems (DTS)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Design & Test of Integrated Micro & Nano-Systems (DTS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DTS55284.2022.9809888","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The Versatile Video Coding (VVC) standard enhances the coding performance in terms of bitrate and quality compared to its predecessor High Efficiency Video Coding (HEVC) at the expense of an additional coding complexity. This complexity is due to the new partitioning improvement named quadtree with nested multi-type tree (QTMT) which includes more partition shapes increasing the rate-distortion optimization (RDO) complexity. In this paper, a fast QTMT partition approach based on the Convolutional Neural Network (CNN) is proposed for the purpose of reducing the computational complexity. In fact, an intra QTMT decision tree method using a Convolutional Neural Network-binary tree horizontal (CNN-BTH) model is elaborated to determine the BTH decision depths at the 32×32 block size. The suggested fast partitioning algorithm achieves an important gain in the encoding time up to 64.26% compared to the original VVC software reference VTM-3.0. It allows a significant computational complexity reduction of 58.28% on average and an acceptable loss in the encoding performance.