{"title":"卷积核在图像和视频压缩机器学习中的正交化和参数化","authors":"R. Yuzkiv, M. Gashnikov","doi":"10.1109/ITNT57377.2023.10139032","DOIUrl":null,"url":null,"abstract":"We study orthogonalization and parametrization of convolutional filters within the framework of the image and video compression method based on machine learning. We use the convolutional filters to interpolate less sparse video frame meshes based on sparser video frame meshes. We consider superresolution neural networks and decision trees as machine learning algorithms at the interpolation stage. Decision trees adaptively select an interpolating function from a predefined set of convolutional filters with parameterized orthogonal weights. The use of adaptive functions can significantly improve the accuracy of interpolation. Optimization of machine learning algorithms makes it possible to use the adaptability of interpolators in the most efficient way. We use orthogonalization and parametrization of convolution filter weights to increase the efficiency of the machine learning interpolation algorithm, which in turn leads to an increase in the efficiency of the image and video compression method in general. Computational experiments demonstrate the advantage of the proposed algorithm in real videos.","PeriodicalId":296438,"journal":{"name":"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Orthogonalization and Parameterization of Convolutional Kernels in Machine Learning for Image and Video Compression\",\"authors\":\"R. Yuzkiv, M. Gashnikov\",\"doi\":\"10.1109/ITNT57377.2023.10139032\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We study orthogonalization and parametrization of convolutional filters within the framework of the image and video compression method based on machine learning. We use the convolutional filters to interpolate less sparse video frame meshes based on sparser video frame meshes. We consider superresolution neural networks and decision trees as machine learning algorithms at the interpolation stage. Decision trees adaptively select an interpolating function from a predefined set of convolutional filters with parameterized orthogonal weights. The use of adaptive functions can significantly improve the accuracy of interpolation. Optimization of machine learning algorithms makes it possible to use the adaptability of interpolators in the most efficient way. We use orthogonalization and parametrization of convolution filter weights to increase the efficiency of the machine learning interpolation algorithm, which in turn leads to an increase in the efficiency of the image and video compression method in general. Computational experiments demonstrate the advantage of the proposed algorithm in real videos.\",\"PeriodicalId\":296438,\"journal\":{\"name\":\"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITNT57377.2023.10139032\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITNT57377.2023.10139032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Orthogonalization and Parameterization of Convolutional Kernels in Machine Learning for Image and Video Compression
We study orthogonalization and parametrization of convolutional filters within the framework of the image and video compression method based on machine learning. We use the convolutional filters to interpolate less sparse video frame meshes based on sparser video frame meshes. We consider superresolution neural networks and decision trees as machine learning algorithms at the interpolation stage. Decision trees adaptively select an interpolating function from a predefined set of convolutional filters with parameterized orthogonal weights. The use of adaptive functions can significantly improve the accuracy of interpolation. Optimization of machine learning algorithms makes it possible to use the adaptability of interpolators in the most efficient way. We use orthogonalization and parametrization of convolution filter weights to increase the efficiency of the machine learning interpolation algorithm, which in turn leads to an increase in the efficiency of the image and video compression method in general. Computational experiments demonstrate the advantage of the proposed algorithm in real videos.