{"title":"基于机器学习的图像压缩方法在视频压缩中的推广","authors":"A. Maksimov, M. Gashnikov","doi":"10.1109/ITNT57377.2023.10139119","DOIUrl":null,"url":null,"abstract":"The article explores the adaptation of digital image compression methods based on machine learning to the case of video data compression. The generalized image compression method applies digital image generation and segmentation, pyramid-based digital image coding, and interpolation on hierarchically organized arrays of pixels based on machine learning. Image compression uses artificial convolutional neural networks and generative adversarial neural networks, super-resolution artificial neural network algorithms and autoencoders to implement the basic steps. The proposed generalization approach uses interframe dependencies to reduce information redundancy through a video frame approximator based on machine learning. Approximation can significantly reduce the entropy and variance of the encoded data, which results in a reduction in the size of data. The results of computational experiments on real video sequences prove the high efficiency of the approach proposed in this paper to generalize digital image coding methods based on machine learning for the case of video compression.","PeriodicalId":296438,"journal":{"name":"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Generalization of Machine Learning-Based Image Compression Methods for Video Compression\",\"authors\":\"A. Maksimov, M. Gashnikov\",\"doi\":\"10.1109/ITNT57377.2023.10139119\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The article explores the adaptation of digital image compression methods based on machine learning to the case of video data compression. The generalized image compression method applies digital image generation and segmentation, pyramid-based digital image coding, and interpolation on hierarchically organized arrays of pixels based on machine learning. Image compression uses artificial convolutional neural networks and generative adversarial neural networks, super-resolution artificial neural network algorithms and autoencoders to implement the basic steps. The proposed generalization approach uses interframe dependencies to reduce information redundancy through a video frame approximator based on machine learning. Approximation can significantly reduce the entropy and variance of the encoded data, which results in a reduction in the size of data. The results of computational experiments on real video sequences prove the high efficiency of the approach proposed in this paper to generalize digital image coding methods based on machine learning for the case of video compression.\",\"PeriodicalId\":296438,\"journal\":{\"name\":\"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITNT57377.2023.10139119\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITNT57377.2023.10139119","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Generalization of Machine Learning-Based Image Compression Methods for Video Compression
The article explores the adaptation of digital image compression methods based on machine learning to the case of video data compression. The generalized image compression method applies digital image generation and segmentation, pyramid-based digital image coding, and interpolation on hierarchically organized arrays of pixels based on machine learning. Image compression uses artificial convolutional neural networks and generative adversarial neural networks, super-resolution artificial neural network algorithms and autoencoders to implement the basic steps. The proposed generalization approach uses interframe dependencies to reduce information redundancy through a video frame approximator based on machine learning. Approximation can significantly reduce the entropy and variance of the encoded data, which results in a reduction in the size of data. The results of computational experiments on real video sequences prove the high efficiency of the approach proposed in this paper to generalize digital image coding methods based on machine learning for the case of video compression.