使用生成对抗网络和迁移学习的图像着色

Leila Kiani, Masoudnia Saeed, H. Nezamabadi-pour
{"title":"使用生成对抗网络和迁移学习的图像着色","authors":"Leila Kiani, Masoudnia Saeed, H. Nezamabadi-pour","doi":"10.1109/MVIP49855.2020.9116882","DOIUrl":null,"url":null,"abstract":"Automatic colorizing is one of the most interesting problems in computer graphics. During the colorization process, the gray one-dimensional images are converted to three-dimensional images with colored components. As a typical technique, Convolutional neural networks (CNNs) have been well studied and used for automatic coloring. In these networks, the information that is generalized over in the top layers is available in intermediate layers. Although the output of the last layer of CNNs is usually used in many applications, in this paper, we use a concept called \"Hypercolumn\" derived from neuroscience to exploit information at all levels to develop a fully automated image colorization system. There are not always millions of data available in the real world to train complex deep learning models. Therefore, the VGG19 model trained with the big data set of ImageNet is used as a pre-trained model in the generator network and the hypercolumn idea is implemented in it with DIV2K datasets. We train our model to predict each pixel’s color texture. The results obtained indicate that the proposed method is superior to competing models.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Image Colorization Using Generative Adversarial Networks and Transfer Learning\",\"authors\":\"Leila Kiani, Masoudnia Saeed, H. Nezamabadi-pour\",\"doi\":\"10.1109/MVIP49855.2020.9116882\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic colorizing is one of the most interesting problems in computer graphics. During the colorization process, the gray one-dimensional images are converted to three-dimensional images with colored components. As a typical technique, Convolutional neural networks (CNNs) have been well studied and used for automatic coloring. In these networks, the information that is generalized over in the top layers is available in intermediate layers. Although the output of the last layer of CNNs is usually used in many applications, in this paper, we use a concept called \\\"Hypercolumn\\\" derived from neuroscience to exploit information at all levels to develop a fully automated image colorization system. There are not always millions of data available in the real world to train complex deep learning models. Therefore, the VGG19 model trained with the big data set of ImageNet is used as a pre-trained model in the generator network and the hypercolumn idea is implemented in it with DIV2K datasets. We train our model to predict each pixel’s color texture. The results obtained indicate that the proposed method is superior to competing models.\",\"PeriodicalId\":255375,\"journal\":{\"name\":\"2020 International Conference on Machine Vision and Image Processing (MVIP)\",\"volume\":\"65 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Machine Vision and Image Processing (MVIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MVIP49855.2020.9116882\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Machine Vision and Image Processing (MVIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MVIP49855.2020.9116882","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

摘要

自动着色是计算机图形学中最有趣的问题之一。在着色过程中,将灰度一维图像转换为具有彩色分量的三维图像。卷积神经网络(Convolutional neural networks, cnn)作为一种典型的自动上色技术已经得到了广泛的研究和应用。在这些网络中,在顶层泛化的信息在中间层中可用。虽然cnn最后一层的输出通常用于许多应用中,但在本文中,我们使用源自神经科学的“Hypercolumn”概念来利用所有级别的信息来开发全自动图像着色系统。在现实世界中,并不总是有数百万的数据可以用来训练复杂的深度学习模型。因此,使用ImageNet大数据集训练的VGG19模型作为生成器网络中的预训练模型,并使用DIV2K数据集在其中实现超列思想。我们训练我们的模型来预测每个像素的颜色纹理。实验结果表明,该方法优于竞争模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Image Colorization Using Generative Adversarial Networks and Transfer Learning
Automatic colorizing is one of the most interesting problems in computer graphics. During the colorization process, the gray one-dimensional images are converted to three-dimensional images with colored components. As a typical technique, Convolutional neural networks (CNNs) have been well studied and used for automatic coloring. In these networks, the information that is generalized over in the top layers is available in intermediate layers. Although the output of the last layer of CNNs is usually used in many applications, in this paper, we use a concept called "Hypercolumn" derived from neuroscience to exploit information at all levels to develop a fully automated image colorization system. There are not always millions of data available in the real world to train complex deep learning models. Therefore, the VGG19 model trained with the big data set of ImageNet is used as a pre-trained model in the generator network and the hypercolumn idea is implemented in it with DIV2K datasets. We train our model to predict each pixel’s color texture. The results obtained indicate that the proposed method is superior to competing models.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Offline Handwritten Signature Verification and Recognition Based on Deep Transfer Learning A High-Accuracy, Cost-Effective People Counting Solution Based on Visual Depth Data Source Camera Identification Using WLBP Descriptor Convolutional Neural Network for Building Extraction from High-Resolution Remote Sensing Images PCB Defect Detection Using Denoising Convolutional Autoencoders
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1