DeepCap: A Deep Learning Model to Caption Black and White Images

2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence) Pub Date : 2020-01-01 DOI:10.1109/Confluence47617.2020.9058164

Vaibhav Pandit, Rishabh Gulati, Chaitanya Singla, Sandeep Kr. Singh

引用次数: 2

Abstract

Captioning of colored images has been around for quite some time now, it uses object detection and the spatial relation between the objects to generate captions. There have been numerous approaches to caption colorized images in the past, but there have been a very few. In this paper we present an approach to caption Black and white images without any attempt of colorization. We have used transfer learning to implement Inception V3, a CNN model developed by Google and a runner up in the ImageNet image classification challenge, to generate captions from Black and white images achieving an accuracy of 45.77% on the validation set.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

DeepCap:一种用于描述黑白图像的深度学习模型

彩色图像的字幕已经存在很长一段时间了，它使用对象检测和对象之间的空间关系来生成字幕。在过去，有许多方法可以为彩色图像添加标题，但很少。在本文中，我们提出了一种方法来说明黑白图像没有任何尝试着色。我们使用迁移学习来实现Inception V3，这是一个由Google开发的CNN模型，也是ImageNet图像分类挑战的亚军，它从黑白图像中生成字幕，在验证集上实现了45.77%的准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence)

自引率

0.00%

发文量