Improving resolution of images using Generative Adversarial Networks

Sumit Dhawan, Shailender Kumar
Published in: 2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA)
Publication date: 2020-11-05
DOI: 10.1109/ICECA49313.2020.9297414 (https://doi.org/10.1109/ICECA49313.2020.9297414)
Citations: 2

Abstract

Despite advances in the accuracy and speed of image super-resolution models, such as better and more accurate Convolutional Neural Networks (CNNs), the results have not been satisfactory. The high-resolution images produced generally lack fine, high-frequency texture details. Most models in this area use objective functions that minimize the Mean Square Error (MSE). Although this yields images with a better Peak Signal-to-Noise Ratio (PSNR), such images are perceptually unsatisfying and lack fidelity and high-frequency detail when viewed at high resolution. Generative Adversarial Networks (GANs), a class of deep learning models, can be used for such problems. This article describes how a GAN works and shows that it produces perceptually satisfying images with a decent PSNR score and a good Perceptual Index (PI) compared to other models. In contrast to the existing Super Resolution GAN model, several modifications are introduced to improve image quality: replacing the batch normalization layer with a weight normalization layer, modifying the dense residual block, comparing features taken before they are fed into the activation layer, using a relativistic discriminator instead of the standard discriminator used in a vanilla GAN, and, finally, using Mean Absolute Error (MAE) in the model.
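The quantities named in the abstract can be illustrated with a minimal NumPy sketch. It shows how PSNR follows directly from MSE, how MAE differs, and one common way to write a relativistic discriminator loss. The abstract does not give the exact formulation, so the "relativistic average" form below is an assumption, and all function names are hypothetical rather than taken from the paper.

```python
import numpy as np


def mse(a, b):
    """Mean squared error between two images of the same shape."""
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    return float(np.mean((a - b) ** 2))


def mae(a, b):
    """Mean absolute error (L1 distance averaged over pixels)."""
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    return float(np.mean(np.abs(a - b)))


def psnr(a, b, max_val=255.0):
    """PSNR in dB. It is a monotone function of MSE, so minimizing
    MSE maximizes PSNR without guaranteeing perceptual quality."""
    err = mse(a, b)
    if err == 0.0:
        return float("inf")
    return float(10.0 * np.log10(max_val ** 2 / err))


def _log_sigmoid(x):
    """Numerically stable log(sigmoid(x))."""
    return -np.logaddexp(0.0, -x)


def relativistic_avg_d_loss(real_logits, fake_logits):
    """Assumed relativistic average discriminator loss: real samples
    should score higher than the average fake, and fakes lower than
    the average real. Inputs are raw (pre-sigmoid) critic outputs."""
    real_logits = np.asarray(real_logits, dtype=np.float64)
    fake_logits = np.asarray(fake_logits, dtype=np.float64)
    real_rel = real_logits - fake_logits.mean()
    fake_rel = fake_logits - real_logits.mean()
    # log(1 - sigmoid(x)) equals log(sigmoid(-x))
    return float(-np.mean(_log_sigmoid(real_rel))
                 - np.mean(_log_sigmoid(-fake_rel)))


def relativistic_avg_g_loss(real_logits, fake_logits):
    """Generator counterpart: the same terms with the roles swapped."""
    real_logits = np.asarray(real_logits, dtype=np.float64)
    fake_logits = np.asarray(fake_logits, dtype=np.float64)
    real_rel = real_logits - fake_logits.mean()
    fake_rel = fake_logits - real_logits.mean()
    return float(-np.mean(_log_sigmoid(-real_rel))
                 - np.mean(_log_sigmoid(fake_rel)))
```

In a sketch like this, the discriminator loss shrinks as real images score well above the average fake, while the generator loss grows in the same situation, which is what pushes the generator toward more realistic textures.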