{"title":"Fast Mode and CU Splitting Decision for Intra Prediction in VVC SCC","authors":"Dayong Wang;Junyi Yu;Xin Lu;Frederic Dufaux;Bo Hang;Hui Guo;Ce Zhu","doi":"10.1109/TBC.2024.3394288","DOIUrl":null,"url":null,"abstract":"Currently, screen content video applications are increasingly widespread in our daily lives. The latest Screen Content Coding (SCC) standard, known as Versatile Video Coding (VVC) SCC, employs a quad-tree plus multi-type tree (QTMT) coding structure for Coding Unit (CU) partitioning and screen content Coding Modes (CMs) selection. While VVC SCC achieves high coding efficiency, its coding complexity poses a significant obstacle to the further widespread adoption of screen content video. Hence, it is crucial to enhance the coding speed of VVC SCC. In this paper, we propose a fast mode and splitting decision for Intra prediction in VVC SCC. Specifically, we initially exploit deep learning techniques to predict content types for all CUs. Subsequently, we examine CM distributions of different content types to predict candidate CMs for CUs. We then introduce early skip and early terminate CM decisions for different content types of CUs to further eliminate unlikely CMs. Finally, we develop Block-based Differential Pulse-Code Modulation (BDPCM) early termination and CU splitting early termination to improve coding speed. Experimental results demonstrate that the proposed algorithm improves coding speed on average by 41.14%, with the BDBR increasing by 1.17%.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 3","pages":"872-883"},"PeriodicalIF":3.2000,"publicationDate":"2024-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Broadcasting","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10528669/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Currently, screen content video applications are increasingly widespread in our daily lives. The latest Screen Content Coding (SCC) standard, known as Versatile Video Coding (VVC) SCC, employs a quad-tree plus multi-type tree (QTMT) coding structure for Coding Unit (CU) partitioning and screen content Coding Modes (CMs) selection. While VVC SCC achieves high coding efficiency, its coding complexity poses a significant obstacle to the further widespread adoption of screen content video. Hence, it is crucial to enhance the coding speed of VVC SCC. In this paper, we propose a fast mode and splitting decision for Intra prediction in VVC SCC. Specifically, we initially exploit deep learning techniques to predict content types for all CUs. Subsequently, we examine CM distributions of different content types to predict candidate CMs for CUs. We then introduce early skip and early terminate CM decisions for different content types of CUs to further eliminate unlikely CMs. Finally, we develop Block-based Differential Pulse-Code Modulation (BDPCM) early termination and CU splitting early termination to improve coding speed. Experimental results demonstrate that the proposed algorithm improves coding speed on average by 41.14%, with the BDBR increasing by 1.17%.
目前,屏幕内容视频应用在我们的日常生活中越来越广泛。最新的屏幕内容编码(SCC)标准,即多功能视频编码(VVC)SCC,采用四叉树加多类型树(QTMT)编码结构进行编码单元(CU)划分和屏幕内容编码模式(CM)选择。虽然 VVC SCC 实现了较高的编码效率,但其编码复杂性对屏幕内容视频的进一步广泛应用构成了重大障碍。因此,提高 VVC SCC 的编码速度至关重要。本文提出了 VVC SCC 中内预测的快速模式和分割决策。具体来说,我们首先利用深度学习技术来预测所有 CU 的内容类型。随后,我们检查不同内容类型的 CM 分布,预测 CU 的候选 CM。然后,我们针对不同内容类型的 CU 引入早期跳过和早期终止 CM 的决策,以进一步消除不可能的 CM。最后,我们开发了基于块的差分脉冲编码调制(BDPCM)早期终止和 CU 分割早期终止,以提高编码速度。实验结果表明,所提算法的编码速度平均提高了 41.14%,BDBR 提高了 1.17%。
期刊介绍:
The Society’s Field of Interest is “Devices, equipment, techniques and systems related to broadcast technology, including the production, distribution, transmission, and propagation aspects.” In addition to this formal FOI statement, which is used to provide guidance to the Publications Committee in the selection of content, the AdCom has further resolved that “broadcast systems includes all aspects of transmission, propagation, and reception.”