IAFormer: A Transformer Network for Image Aesthetic Evaluation and Cropping

Lei Wang, Yue Jin
{"title":"IAFormer: A Transformer Network for Image Aesthetic Evaluation and Cropping","authors":"Lei Wang, Yue Jin","doi":"10.1109/ACAIT56212.2022.10137804","DOIUrl":null,"url":null,"abstract":"Aesthetic quality evaluation of images has an important role in the field of visual analysis, and the widespread use of high-quality image editing has gradually increased the importance of image aesthetic evaluation in automatic image processing tasks. Previous researchers have mostly explored the mapping relationship between images and labeled scores using convolutional neural networks, but the aesthetic features of different regions on images have not been explored sufficiently, when an image is rich in background information and it is necessary to correlate the aesthetic features of different regions to evaluate the image, convolutional neural networks often cannot extract the aesthetic features of the image adequately due to the lack of the advantage of global feature modeling. We introduce a novel Transformer architecture for image aesthetic quality assessment(IAFormer), IAFormer can model the global aesthetic features of an image, and it is a framework that unifies the aesthetic quality assessment of images and the aesthetic cropping of images, while the aesthetic quality of the image is evaluated, the aesthetic weights on different patches within the image can be calculated to give valid reference information for the aesthetic cropping task.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACAIT56212.2022.10137804","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Aesthetic quality evaluation of images has an important role in the field of visual analysis, and the widespread use of high-quality image editing has gradually increased the importance of image aesthetic evaluation in automatic image processing tasks. Previous researchers have mostly explored the mapping relationship between images and labeled scores using convolutional neural networks, but the aesthetic features of different regions on images have not been explored sufficiently, when an image is rich in background information and it is necessary to correlate the aesthetic features of different regions to evaluate the image, convolutional neural networks often cannot extract the aesthetic features of the image adequately due to the lack of the advantage of global feature modeling. We introduce a novel Transformer architecture for image aesthetic quality assessment(IAFormer), IAFormer can model the global aesthetic features of an image, and it is a framework that unifies the aesthetic quality assessment of images and the aesthetic cropping of images, while the aesthetic quality of the image is evaluated, the aesthetic weights on different patches within the image can be calculated to give valid reference information for the aesthetic cropping task.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
IAFormer:一个用于图像审美评价和裁剪的变压器网络
图像的审美质量评价在视觉分析领域有着重要的作用,而高质量图像编辑的广泛使用也逐渐增加了图像审美评价在自动图像处理任务中的重要性。以往的研究大多是利用卷积神经网络来探索图像与标记分数之间的映射关系,但当图像背景信息丰富,需要将不同区域的审美特征联系起来评价图像时,对图像上不同区域的审美特征的探索还不够充分。由于缺乏全局特征建模的优势,卷积神经网络往往不能充分提取图像的美学特征。本文介绍了一种新的用于图像审美质量评价的Transformer架构(IAFormer), IAFormer可以对图像的全局审美特征进行建模,是一个将图像的审美质量评价与图像的审美裁剪统一起来的框架,在对图像的审美质量进行评价的同时,可以计算出图像内不同斑块的审美权重,为图像的审美裁剪任务提供有效的参考信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Transformer with Global and Local Interaction for Pedestrian Trajectory Prediction The Use of Explainable Artificial Intelligence in Music—Take Professor Nick Bryan-Kinns’ “XAI+Music” Research as a Perspective Playing Fight the Landlord with Tree Search and Hidden Information Evaluation Evaluation Method of Innovative Economic Benefits of Enterprise Human Capital Based on Deep Learning An Attribute Contribution-Based K-Nearest Neighbor Classifier
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1