Scalable Logo Detection and Recognition with Minimal Labeling

D. M. Montserrat, Qian Lin, J. Allebach, E. Delp
{"title":"Scalable Logo Detection and Recognition with Minimal Labeling","authors":"D. M. Montserrat, Qian Lin, J. Allebach, E. Delp","doi":"10.1109/MIPR.2018.00034","DOIUrl":null,"url":null,"abstract":"In this paper we describe a new approach to detecting and locating brand logos in an image using machine learning methods and synthetic training data. Deep learning methods, particularly the use of Convolutional Neural Networks (CNN), have been very popular for extracting visual information, such as image shapes and objects, from images. A CNN has parameters and configuration information that are learned from training images. To obtain good accuracy usually a large amount of labeled (groundtruthed) images are required for training. Collecting the training images and labeling them can be expensive and time consuming. Methods that include data augmentation, image synthesis, and bootstrapping techniques provide useful alternatives to creating training images. In this paper, we present a logo detection method that requires minimum labeled images. First, we use synthetic images to train a CNN to detect logos. Then, this CNN is used to automatically detect and localize logos from images extracted from the web. Finally, these images are used to train a logo classifier. The combination of the logo detector and the classifier allows us to locate and classify multiple logos in a scene. While existing methods rely on manually labeled images, our method is fully trained with images obtained in an automated manner with minimal human supervision.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MIPR.2018.00034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

In this paper we describe a new approach to detecting and locating brand logos in an image using machine learning methods and synthetic training data. Deep learning methods, particularly the use of Convolutional Neural Networks (CNN), have been very popular for extracting visual information, such as image shapes and objects, from images. A CNN has parameters and configuration information that are learned from training images. To obtain good accuracy usually a large amount of labeled (groundtruthed) images are required for training. Collecting the training images and labeling them can be expensive and time consuming. Methods that include data augmentation, image synthesis, and bootstrapping techniques provide useful alternatives to creating training images. In this paper, we present a logo detection method that requires minimum labeled images. First, we use synthetic images to train a CNN to detect logos. Then, this CNN is used to automatically detect and localize logos from images extracted from the web. Finally, these images are used to train a logo classifier. The combination of the logo detector and the classifier allows us to locate and classify multiple logos in a scene. While existing methods rely on manually labeled images, our method is fully trained with images obtained in an automated manner with minimal human supervision.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
可扩展的标识检测和识别与最小的标签
在本文中,我们描述了一种使用机器学习方法和综合训练数据来检测和定位图像中的品牌标识的新方法。深度学习方法,特别是卷积神经网络(CNN)的使用,在从图像中提取视觉信息(如图像形状和物体)方面非常流行。CNN具有从训练图像中学习到的参数和配置信息。为了获得良好的准确性,通常需要大量的标记(ground - truth)图像进行训练。收集训练图像并对其进行标记既昂贵又耗时。包括数据增强、图像合成和引导技术在内的方法为创建训练图像提供了有用的替代方法。在本文中,我们提出了一种需要最小标记图像的标识检测方法。首先,我们使用合成图像来训练CNN来检测徽标。然后,该CNN用于从网络中提取的图像中自动检测和定位徽标。最后,这些图像被用来训练一个标识分类器。logo检测器和分类器的结合使我们能够对一个场景中的多个logo进行定位和分类。虽然现有的方法依赖于手动标记的图像,但我们的方法是用最少人工监督的自动化方式获得的图像进行全面训练的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Joint Estimation of Age and Gender from Unconstrained Face Images Using Lightweight Multi-Task CNN for Mobile Applications A Multimodal Approach to Predict Social Media Popularity Ownership Identification and Signaling of Multimedia Content Components Deep Learning of Path-Based Tree Classifiers for Large-Scale Plant Species Identification Understanding User Profiles on Social Media for Fake News Detection
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1