Image Shooting Parameter-Guided Cascade Image Retouching Network: Think Like an Artist

IF 9.7 1区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Transactions on Multimedia Pub Date : 2024-12-23 DOI:10.1109/TMM.2024.3521779

Hailong Ma;Sibo Feng;Xi Xiao;Chenyu Dong;Xingyue Cheng

{"title":"Image Shooting Parameter-Guided Cascade Image Retouching Network: Think Like an Artist","authors":"Hailong Ma;Sibo Feng;Xi Xiao;Chenyu Dong;Xingyue Cheng","doi":"10.1109/TMM.2024.3521779","DOIUrl":null,"url":null,"abstract":"Photo retouching aims to adjust the hue, luminance, contrast, and saturation of the image to make it more human and aesthetically desirable. Based on researches on image imaging process and artists' retouching processes, we propose three improvements to existing automatic retouching methods. Firstly, in the past retouching methods, all the imaging conditions in EXIF were ignored. According to this, we design a simple module to introduce these imaging conditions into a network called ECM (EXIF Condition Module). This module can improve the performance of several existing auto-retouching methods with only a small parameter cost. Additionally, artists' operations also were ignored. By investigating artists' operations in retouching, we propose a two-stage network that brightens images first and then enriches them in the chrominance plane to mimic artists. Finally, we find that there is a color imbalance in the existing retouching dataset, thus, hue palette loss is designed to resolve the imbalance and make the image more vibrant. Experimental results show that our method is effective on the benchmark MIT-Adobe FiveK dataset and PPR10 K dataset, and achieves SOTA performance in both quantitative and qualitative evaluation.","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"27 ","pages":"1566-1573"},"PeriodicalIF":9.7000,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Multimedia","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10812845/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Photo retouching aims to adjust the hue, luminance, contrast, and saturation of the image to make it more human and aesthetically desirable. Based on researches on image imaging process and artists' retouching processes, we propose three improvements to existing automatic retouching methods. Firstly, in the past retouching methods, all the imaging conditions in EXIF were ignored. According to this, we design a simple module to introduce these imaging conditions into a network called ECM (EXIF Condition Module). This module can improve the performance of several existing auto-retouching methods with only a small parameter cost. Additionally, artists' operations also were ignored. By investigating artists' operations in retouching, we propose a two-stage network that brightens images first and then enriches them in the chrominance plane to mimic artists. Finally, we find that there is a color imbalance in the existing retouching dataset, thus, hue palette loss is designed to resolve the imbalance and make the image more vibrant. Experimental results show that our method is effective on the benchmark MIT-Adobe FiveK dataset and PPR10 K dataset, and achieves SOTA performance in both quantitative and qualitative evaluation.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

图像拍摄参数引导级联图像修饰网络：像艺术家一样思考

照片修饰的目的是调整图像的色调，亮度，对比度和饱和度，使其更人性化和美观。基于对图像成像过程和艺术家修图过程的研究，我们对现有的自动修图方法提出了三种改进。首先，在过去的修图方法中，忽略了EXIF中的所有成像条件。据此，我们设计了一个简单的模块，将这些成像条件引入到网络中，称为ECM （EXIF条件模块）。该模块以很小的参数成本提高了现有几种自动修图方法的性能。此外，艺术家的操作也被忽视了。通过研究艺术家在修图中的操作，我们提出了一个两阶段的网络，先亮化图像，然后在色度平面上丰富图像，以模仿艺术家。最后，我们发现在现有的修图数据集中存在色彩不平衡，因此，色调调色板损失被设计来解决不平衡，使图像更有活力。实验结果表明，我们的方法在MIT-Adobe FiveK和PPR10 K的基准数据集上是有效的，在定量和定性评价上都达到了SOTA的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Multimedia 工程技术-电信学

CiteScore

11.70

自引率

11.00%

发文量

576

审稿时长

5.5 months

期刊介绍： The IEEE Transactions on Multimedia delves into diverse aspects of multimedia technology and applications, covering circuits, networking, signal processing, systems, software, and systems integration. The scope aligns with the Fields of Interest of the sponsors, ensuring a comprehensive exploration of research in multimedia.