{"title":"基于机器学习的视频压缩熵建模","authors":"M. A. Yakubenko, M. Gashnikov","doi":"10.1109/ITNT57377.2023.10139143","DOIUrl":null,"url":null,"abstract":"The paper investigates an entropy model of spatio-temporal characteristics in video compression using machine learning algorithms. A significant drawback of the vast majority of existing works on neural video codecs is the emphasis on the process of optimizing the creation of a latent representation through the study of various network structures, using well-studied models for image compression as an entropy model The model under study makes it possible to effectively evaluate both the spatial and temporal characteristics of compressed video data, which makes it possible to achieve a greater reduction in video redundancy. In addition, the universality of the entropy model also allows you to set the quantization step over the spatial channel. This content-adapted quantization mechanism, similar to rate control in standard codecs, not only helps achieve smooth compression ratio adjustment, but also improves final performance by dynamically distributing quantization intervals. The results of computational experiments on real video sequences confirm the efficiency of the studied video compression method.","PeriodicalId":296438,"journal":{"name":"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Entropy Modeling in Video Compression Based on Machine Learning\",\"authors\":\"M. A. Yakubenko, M. Gashnikov\",\"doi\":\"10.1109/ITNT57377.2023.10139143\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper investigates an entropy model of spatio-temporal characteristics in video compression using machine learning algorithms. A significant drawback of the vast majority of existing works on neural video codecs is the emphasis on the process of optimizing the creation of a latent representation through the study of various network structures, using well-studied models for image compression as an entropy model The model under study makes it possible to effectively evaluate both the spatial and temporal characteristics of compressed video data, which makes it possible to achieve a greater reduction in video redundancy. In addition, the universality of the entropy model also allows you to set the quantization step over the spatial channel. This content-adapted quantization mechanism, similar to rate control in standard codecs, not only helps achieve smooth compression ratio adjustment, but also improves final performance by dynamically distributing quantization intervals. The results of computational experiments on real video sequences confirm the efficiency of the studied video compression method.\",\"PeriodicalId\":296438,\"journal\":{\"name\":\"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITNT57377.2023.10139143\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITNT57377.2023.10139143","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Entropy Modeling in Video Compression Based on Machine Learning
The paper investigates an entropy model of spatio-temporal characteristics in video compression using machine learning algorithms. A significant drawback of the vast majority of existing works on neural video codecs is the emphasis on the process of optimizing the creation of a latent representation through the study of various network structures, using well-studied models for image compression as an entropy model The model under study makes it possible to effectively evaluate both the spatial and temporal characteristics of compressed video data, which makes it possible to achieve a greater reduction in video redundancy. In addition, the universality of the entropy model also allows you to set the quantization step over the spatial channel. This content-adapted quantization mechanism, similar to rate control in standard codecs, not only helps achieve smooth compression ratio adjustment, but also improves final performance by dynamically distributing quantization intervals. The results of computational experiments on real video sequences confirm the efficiency of the studied video compression method.