基于离散高斯混合似然和注意模的学习图像压缩

G. Ranganathan, Bindhu
{"title":"基于离散高斯混合似然和注意模的学习图像压缩","authors":"G. Ranganathan, Bindhu","doi":"10.36548/JEEA.2020.4.004","DOIUrl":null,"url":null,"abstract":"There have been many compression standards developed during the past few decades and technological advances has resulted in introducing many methodologies with promising results. As far as PSNR metric is concerned, there is a performance gap between reigning compression standards and learned compression algorithms. Based on research, we experimented using an accurate entropy model on the learned compression algorithms to determine the rate-distortion performance. In this paper, discretized Gaussian Mixture likelihood is proposed to determine the latent code parameters in order to attain a more flexible and accurate model of entropy. Moreover, we have also enhanced the performance of the work by introducing recent attention modules in the network architecture. Simulation results indicate that when compared with the previously existing techniques using high-resolution and Kodak datasets, the proposed work achieves a higher rate of performance. When MS-SSIM is used for optimization, our work generates a more visually pleasant image.","PeriodicalId":20643,"journal":{"name":"Proposed for presentation at the 2020 Virtual MRS Fall Meeting & Exhibit held November 27 - December 4, 2020.","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Learned Image Compression with Discretized Gaussian Mixture Likelihoods and Attention Modules\",\"authors\":\"G. Ranganathan, Bindhu\",\"doi\":\"10.36548/JEEA.2020.4.004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There have been many compression standards developed during the past few decades and technological advances has resulted in introducing many methodologies with promising results. As far as PSNR metric is concerned, there is a performance gap between reigning compression standards and learned compression algorithms. Based on research, we experimented using an accurate entropy model on the learned compression algorithms to determine the rate-distortion performance. In this paper, discretized Gaussian Mixture likelihood is proposed to determine the latent code parameters in order to attain a more flexible and accurate model of entropy. Moreover, we have also enhanced the performance of the work by introducing recent attention modules in the network architecture. Simulation results indicate that when compared with the previously existing techniques using high-resolution and Kodak datasets, the proposed work achieves a higher rate of performance. When MS-SSIM is used for optimization, our work generates a more visually pleasant image.\",\"PeriodicalId\":20643,\"journal\":{\"name\":\"Proposed for presentation at the 2020 Virtual MRS Fall Meeting & Exhibit held November 27 - December 4, 2020.\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-02-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proposed for presentation at the 2020 Virtual MRS Fall Meeting & Exhibit held November 27 - December 4, 2020.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.36548/JEEA.2020.4.004\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proposed for presentation at the 2020 Virtual MRS Fall Meeting & Exhibit held November 27 - December 4, 2020.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36548/JEEA.2020.4.004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19

摘要

在过去的几十年里,已经开发了许多压缩标准,技术进步导致引入了许多有希望的结果的方法。就PSNR度量而言,主流压缩标准与学习压缩算法之间存在性能差距。在研究的基础上,我们对学习的压缩算法进行了精确熵模型的实验,以确定率失真的性能。为了得到更灵活、准确的熵模型,本文提出了离散高斯混合似然来确定隐码参数。此外,我们还通过在网络架构中引入最新的注意力模块来提高工作的性能。仿真结果表明,与先前使用高分辨率和柯达数据集的现有技术相比,所提出的工作实现了更高的性能。当使用MS-SSIM进行优化时,我们的工作生成了一个视觉上更令人愉快的图像。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Learned Image Compression with Discretized Gaussian Mixture Likelihoods and Attention Modules
There have been many compression standards developed during the past few decades and technological advances has resulted in introducing many methodologies with promising results. As far as PSNR metric is concerned, there is a performance gap between reigning compression standards and learned compression algorithms. Based on research, we experimented using an accurate entropy model on the learned compression algorithms to determine the rate-distortion performance. In this paper, discretized Gaussian Mixture likelihood is proposed to determine the latent code parameters in order to attain a more flexible and accurate model of entropy. Moreover, we have also enhanced the performance of the work by introducing recent attention modules in the network architecture. Simulation results indicate that when compared with the previously existing techniques using high-resolution and Kodak datasets, the proposed work achieves a higher rate of performance. When MS-SSIM is used for optimization, our work generates a more visually pleasant image.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Learned Image Compression with Discretized Gaussian Mixture Likelihoods and Attention Modules Analysis of Complex Non-Linear Environment Exploration in Speech Recognition by Hybrid Learning Technique Machine Learning Approach to Predictive Maintenance in Manufacturing Industry - A Comparative Study Data Elimination on Repetition using a Blockchain based Cyber Threat Intelligence Optimization of Citizen Broadband Radio Service Frequency Allocation for Dynamic Spectrum Access System
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1