{"title":"Investigations of Machine Learning Algorithms for High Efficiency Video Coding (HEVC)","authors":"N. Usha Bhanu, C. Saravanakumar","doi":"10.1109/IConSCEPT57958.2023.10170546","DOIUrl":null,"url":null,"abstract":"The growing demand of high-resolution video on portable devices, the applications require higher coding efficiency, high throughput and low power for handling heterogenous types of video signals. This paper presents a survey on possibility of applying Machine Learning (ML) models in H.265/ HEVC video encoder unit. Higher computational complexity with respect to motion estimation, coding, and parallel processing architectures are required for HEVC. The existing HEVC algorithms are based on spatial temporal relationship which requires dynamic video sequences handling for fast changes in scenes. This paper focuses on the possible realization of machine learning algorithms for Rate Control (RC) in video sequences, Coding Unit (CU) depth decision, Neural network-based Motion Estimation and Compensation, adaptive de-blocking filter for reducing blocking artifacts and task driven semantic coding for real time video applications. The algorithms are surveyed with respect to the learning process used in various units of HEVC encoders and summarized in terms of parameters achieved and datasets used in the existing literature.","PeriodicalId":240167,"journal":{"name":"2023 International Conference on Signal Processing, Computation, Electronics, Power and Telecommunication (IConSCEPT)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Signal Processing, Computation, Electronics, Power and Telecommunication (IConSCEPT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IConSCEPT57958.2023.10170546","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The growing demand of high-resolution video on portable devices, the applications require higher coding efficiency, high throughput and low power for handling heterogenous types of video signals. This paper presents a survey on possibility of applying Machine Learning (ML) models in H.265/ HEVC video encoder unit. Higher computational complexity with respect to motion estimation, coding, and parallel processing architectures are required for HEVC. The existing HEVC algorithms are based on spatial temporal relationship which requires dynamic video sequences handling for fast changes in scenes. This paper focuses on the possible realization of machine learning algorithms for Rate Control (RC) in video sequences, Coding Unit (CU) depth decision, Neural network-based Motion Estimation and Compensation, adaptive de-blocking filter for reducing blocking artifacts and task driven semantic coding for real time video applications. The algorithms are surveyed with respect to the learning process used in various units of HEVC encoders and summarized in terms of parameters achieved and datasets used in the existing literature.