Method for automatic cartoon colorization

2023 IX International Conference on Information Technology and Nanotechnology (ITNT) Pub Date : 2023-04-17 DOI:10.1109/ITNT57377.2023.10139184

Vitaly Konovalov

{"title":"Method for automatic cartoon colorization","authors":"Vitaly Konovalov","doi":"10.1109/ITNT57377.2023.10139184","DOIUrl":null,"url":null,"abstract":"Colorization task consists of acquiring a full-color RGB image from grayscale image or a sketch. Article is concerned with the task of colorizing grayscale cartoon images and image sequences using neural networks. Efficiency of an existing prototype algorithm is reviewed with different modifications, as well as different combinations of loss functions. A new neural network loss function is proposed. It is based on a hypothesis that specifics of cartoons, such as clear object boundaries and color consistency within those boundaries can be used to improve colorization quality. Proposed loss function uses segmentation of cartoon images in the bilateral space, and minimizes difference between closest found segments and inside each segment, thus bringing closer predicted colors within the segment and between neighboring segments. Quantitative and qualitative experiments are conducted on efficiency as well as generalization ability of modified prototype algorithm with proposed loss function. Quantitative experiments consisted of measuring PSNR, LPIPS, MSE in Lab color space and CC, while qualitative focused on comparing temporal consistency, quality of colorization and quality of generalization.","PeriodicalId":296438,"journal":{"name":"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITNT57377.2023.10139184","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Colorization task consists of acquiring a full-color RGB image from grayscale image or a sketch. Article is concerned with the task of colorizing grayscale cartoon images and image sequences using neural networks. Efficiency of an existing prototype algorithm is reviewed with different modifications, as well as different combinations of loss functions. A new neural network loss function is proposed. It is based on a hypothesis that specifics of cartoons, such as clear object boundaries and color consistency within those boundaries can be used to improve colorization quality. Proposed loss function uses segmentation of cartoon images in the bilateral space, and minimizes difference between closest found segments and inside each segment, thus bringing closer predicted colors within the segment and between neighboring segments. Quantitative and qualitative experiments are conducted on efficiency as well as generalization ability of modified prototype algorithm with proposed loss function. Quantitative experiments consisted of measuring PSNR, LPIPS, MSE in Lab color space and CC, while qualitative focused on comparing temporal consistency, quality of colorization and quality of generalization.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

自动卡通上色方法

着色任务包括从灰度图像或草图中获取全彩色RGB图像。本文研究了利用神经网络对灰度卡通图像和图像序列进行着色的问题。通过不同的修改以及不同的损失函数组合，对现有原型算法的效率进行了评价。提出了一种新的神经网络损失函数。它基于一个假设，即卡通的特定特征，如清晰的对象边界和这些边界内的颜色一致性，可以用来提高着色质量。所提出的损失函数在双边空间中对卡通图像进行分割，并将最接近的发现段之间和每个段内的差异最小化，从而使段内和相邻段之间的预测颜色更接近。采用所提出的损失函数对改进的原型算法进行了效率和泛化能力的定量和定性实验。定量实验包括测量Lab色彩空间的PSNR、LPIPS、MSE和CC，定性实验主要比较时间一致性、着色质量和泛化质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2023 IX International Conference on Information Technology and Nanotechnology (ITNT)

自引率

0.00%

发文量