L-CoIns: Language-based Colorization With Instance Awareness

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Pub Date : 2023-06-01 DOI:10.1109/CVPR52729.2023.01842

Zheng Chang, Shuchen Weng, Pei Zhang, Yu Li, Si Li, Boxin Shi

引用次数: 3

Abstract

Language-based colorization produces plausible colors consistent with the language description provided by the user. Recent studies introduce additional annotation to prevent color-object coupling and mismatch issues, but they still have difficulty in distinguishing instances corresponding to the same object words. In this paper, we propose a transformer-based framework to automatically aggregate similar image patches and achieve instance awareness without any additional knowledge. By applying our presented luminance augmentation and counter-color loss to break down the statistical correlation between luminance and color words, our model is driven to synthesize colors with better descriptive consistency. We further collect a dataset to provide distinctive visual characteristics and detailed language descriptions for multiple instances in the same image. Extensive experiments demonstrate our advantages of synthesizing visually pleasing and description-consistent results of instance-aware colorization.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

L-CoIns:基于实例感知的语言着色

基于语言的着色产生与用户提供的语言描述一致的可信颜色。最近的研究引入了额外的注释来防止颜色-对象耦合和不匹配问题，但它们仍然难以区分对应于相同对象词的实例。在本文中，我们提出了一个基于变换的框架来自动聚合相似的图像补丁并实现实例感知，而无需任何额外的知识。通过亮度增强和反颜色损失来打破亮度和颜色词之间的统计相关性，我们的模型被驱动来合成具有更好描述一致性的颜色。我们进一步收集数据集，为同一图像中的多个实例提供独特的视觉特征和详细的语言描述。大量的实验证明了我们在合成视觉愉悦和描述一致的实例感知着色结果方面的优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

自引率

0.00%

发文量

期刊最新文献

L-CoIns: Language-based Colorization With Instance Awareness Neural Texture Synthesis with Guided Correspondence LOGO: A Long-Form Video Dataset for Group Action Quality Assessment ERM-KTP: Knowledge-Level Machine Unlearning via Knowledge Transfer Target-referenced Reactive Grasping for Dynamic Objects