Method for automatic cartoon colorization

Vitaly Konovalov
{"title":"Method for automatic cartoon colorization","authors":"Vitaly Konovalov","doi":"10.1109/ITNT57377.2023.10139184","DOIUrl":null,"url":null,"abstract":"Colorization task consists of acquiring a full-color RGB image from grayscale image or a sketch. Article is concerned with the task of colorizing grayscale cartoon images and image sequences using neural networks. Efficiency of an existing prototype algorithm is reviewed with different modifications, as well as different combinations of loss functions. A new neural network loss function is proposed. It is based on a hypothesis that specifics of cartoons, such as clear object boundaries and color consistency within those boundaries can be used to improve colorization quality. Proposed loss function uses segmentation of cartoon images in the bilateral space, and minimizes difference between closest found segments and inside each segment, thus bringing closer predicted colors within the segment and between neighboring segments. Quantitative and qualitative experiments are conducted on efficiency as well as generalization ability of modified prototype algorithm with proposed loss function. Quantitative experiments consisted of measuring PSNR, LPIPS, MSE in Lab color space and CC, while qualitative focused on comparing temporal consistency, quality of colorization and quality of generalization.","PeriodicalId":296438,"journal":{"name":"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITNT57377.2023.10139184","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Colorization task consists of acquiring a full-color RGB image from grayscale image or a sketch. Article is concerned with the task of colorizing grayscale cartoon images and image sequences using neural networks. Efficiency of an existing prototype algorithm is reviewed with different modifications, as well as different combinations of loss functions. A new neural network loss function is proposed. It is based on a hypothesis that specifics of cartoons, such as clear object boundaries and color consistency within those boundaries can be used to improve colorization quality. Proposed loss function uses segmentation of cartoon images in the bilateral space, and minimizes difference between closest found segments and inside each segment, thus bringing closer predicted colors within the segment and between neighboring segments. Quantitative and qualitative experiments are conducted on efficiency as well as generalization ability of modified prototype algorithm with proposed loss function. Quantitative experiments consisted of measuring PSNR, LPIPS, MSE in Lab color space and CC, while qualitative focused on comparing temporal consistency, quality of colorization and quality of generalization.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
自动卡通上色方法
着色任务包括从灰度图像或草图中获取全彩色RGB图像。本文研究了利用神经网络对灰度卡通图像和图像序列进行着色的问题。通过不同的修改以及不同的损失函数组合,对现有原型算法的效率进行了评价。提出了一种新的神经网络损失函数。它基于一个假设,即卡通的特定特征,如清晰的对象边界和这些边界内的颜色一致性,可以用来提高着色质量。所提出的损失函数在双边空间中对卡通图像进行分割,并将最接近的发现段之间和每个段内的差异最小化,从而使段内和相邻段之间的预测颜色更接近。采用所提出的损失函数对改进的原型算法进行了效率和泛化能力的定量和定性实验。定量实验包括测量Lab色彩空间的PSNR、LPIPS、MSE和CC,定性实验主要比较时间一致性、着色质量和泛化质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Cooperative Application of Vehicular Traffic Rerouting Method and Adaptive Traffic Signal Control Method Analysis of the Influence of Space Weather Factors on the Telemetry Parameters of Small Spacecraft in Low Earth Orbit Correlations and Statistical Memory Effects as Markers of Age-related Changes in Complex Systems of Living Nature Visualization of feature spaces based on spectral and texture characteristics Electrically controlled optical spectral filters for WDM communication networks based on multilayer inhomogeneous holographic diffraction structures
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1