基于VGG16的内容感知图像压缩分析

Alen Selimović, Blaž Meden, P. Peer, A. Hladnik
{"title":"基于VGG16的内容感知图像压缩分析","authors":"Alen Selimović, Blaž Meden, P. Peer, A. Hladnik","doi":"10.1109/IWOBI.2018.8464188","DOIUrl":null,"url":null,"abstract":"Content-aware compression based on the use of saliency maps aims to improve the interpretability of an image by encoding the more relevant image regions with a higher quality than the rest of the image. This paper revisits two convolutional neural network (CNN) models based on VGG16, multi-structure region of interest (MS-ROI) and class activation map (CAM), which enable the localization of salient image regions. While the MS-ROI model allows for the localization of multiple salient image regions, the CAM model, on the other hand, tends to localize only the most relevant class. We use the contextual information provided by the obtained saliency maps to guide the compression. By encoding more important image regions at a higher bitrate and less important ones at a lower bitrate, different qualities of compression for the regions of interest and the background are obtained, while also achieving smooth transitions from salient to non-salient regions. The performance of both models is evaluated on images from the MIT Saliency Benchmark dataset and the General-100 dataset, and the results of the compression are compared to the standard JPEG compression at different quality factors. Experimental results show that for the files of approximately same size, the compression methods based on the two CNN models outperform the standard JPEG compression. When comparing the compression based on the MS-ROI model to the compression based on the CAM model, the former is characterized by a higher PSNR and a better visual quality of the obtained images.","PeriodicalId":127078,"journal":{"name":"2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Analysis of Content-Aware Image Compression with VGG16\",\"authors\":\"Alen Selimović, Blaž Meden, P. Peer, A. Hladnik\",\"doi\":\"10.1109/IWOBI.2018.8464188\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Content-aware compression based on the use of saliency maps aims to improve the interpretability of an image by encoding the more relevant image regions with a higher quality than the rest of the image. This paper revisits two convolutional neural network (CNN) models based on VGG16, multi-structure region of interest (MS-ROI) and class activation map (CAM), which enable the localization of salient image regions. While the MS-ROI model allows for the localization of multiple salient image regions, the CAM model, on the other hand, tends to localize only the most relevant class. We use the contextual information provided by the obtained saliency maps to guide the compression. By encoding more important image regions at a higher bitrate and less important ones at a lower bitrate, different qualities of compression for the regions of interest and the background are obtained, while also achieving smooth transitions from salient to non-salient regions. The performance of both models is evaluated on images from the MIT Saliency Benchmark dataset and the General-100 dataset, and the results of the compression are compared to the standard JPEG compression at different quality factors. Experimental results show that for the files of approximately same size, the compression methods based on the two CNN models outperform the standard JPEG compression. When comparing the compression based on the MS-ROI model to the compression based on the CAM model, the former is characterized by a higher PSNR and a better visual quality of the obtained images.\",\"PeriodicalId\":127078,\"journal\":{\"name\":\"2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IWOBI.2018.8464188\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IWOBI.2018.8464188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

摘要

基于显著性映射的内容感知压缩旨在通过编码比图像其他部分质量更高的相关图像区域来提高图像的可解释性。本文重新研究了基于VGG16的两种卷积神经网络(CNN)模型,即多结构感兴趣区域(MS-ROI)和类激活图(CAM),这两种模型实现了显著图像区域的定位。MS-ROI模型允许对多个显著图像区域进行定位,而CAM模型则倾向于只定位最相关的类别。我们使用所获得的显著性映射提供的上下文信息来指导压缩。通过以较高的比特率编码较重要的图像区域,以较低的比特率编码较不重要的图像区域,对感兴趣的区域和背景进行不同质量的压缩,同时也实现了从显著区域到非显著区域的平滑过渡。在来自MIT Saliency Benchmark数据集和General-100数据集的图像上评估了这两种模型的性能,并将压缩结果与不同质量因素下的标准JPEG压缩结果进行了比较。实验结果表明,对于大小大致相同的文件,基于两种CNN模型的压缩方法优于标准JPEG压缩方法。将基于MS-ROI模型的压缩与基于CAM模型的压缩进行比较,前者具有更高的PSNR和更好的图像视觉质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Analysis of Content-Aware Image Compression with VGG16
Content-aware compression based on the use of saliency maps aims to improve the interpretability of an image by encoding the more relevant image regions with a higher quality than the rest of the image. This paper revisits two convolutional neural network (CNN) models based on VGG16, multi-structure region of interest (MS-ROI) and class activation map (CAM), which enable the localization of salient image regions. While the MS-ROI model allows for the localization of multiple salient image regions, the CAM model, on the other hand, tends to localize only the most relevant class. We use the contextual information provided by the obtained saliency maps to guide the compression. By encoding more important image regions at a higher bitrate and less important ones at a lower bitrate, different qualities of compression for the regions of interest and the background are obtained, while also achieving smooth transitions from salient to non-salient regions. The performance of both models is evaluated on images from the MIT Saliency Benchmark dataset and the General-100 dataset, and the results of the compression are compared to the standard JPEG compression at different quality factors. Experimental results show that for the files of approximately same size, the compression methods based on the two CNN models outperform the standard JPEG compression. When comparing the compression based on the MS-ROI model to the compression based on the CAM model, the former is characterized by a higher PSNR and a better visual quality of the obtained images.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Smart Placement of a Two-Arm Assembly for An Everyday Object Manipulation Humanoid Robot Based on Capability Maps Modules of Correlated Genes in a Gene Expression Regulatory Network of CDDP-Resistant Cancer Cells 2018 IEEE International Work Conference on Bioinspired Intelligence Parallelization of a Denoising Algorithm for Tonal Bioacoustic Signals Using OpenACC Directives Genome Copy Number Feature Selection Based on Chromosomal Regions Alterations and Chemosensitivity Subtypes
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1