Tree-based Shape Descriptor for scalable logo detection

2013 Visual Communications and Image Processing (VCIP) Pub Date : 2013-11-01 DOI:10.1109/VCIP.2013.6706326

Chengde Wan, Zhicheng Zhao, Xin Guo, A. Cai

{"title":"Tree-based Shape Descriptor for scalable logo detection","authors":"Chengde Wan, Zhicheng Zhao, Xin Guo, A. Cai","doi":"10.1109/VCIP.2013.6706326","DOIUrl":null,"url":null,"abstract":"Detecting logos in real-world images is a great challenging task due to a variety of viewpoint or light condition changes and real-time requirements in practice. Conventional object detection methods, e.g., part-based model, may suffer from expensively computational cost if it was directly applied to this task. A promising alternative, triangle structural descriptor associated with matching strategy, offers an efficient way of recognizing logos. However, the descriptor fails to the rotation of logo images that often occurs when viewpoint changes. To overcome this shortcoming, we propose a new Tree-based Shape Descriptor (TSD) in this paper, which is strictly invariant to affine transformation in real-world images. The core of proposed descriptor is to encode the shape of logos by depicting both appearance and spatial information of four local key-points. In the training stage, an efficient algorithm is introduced to mine a discriminate subset of four tuples from all possible key-point combinations. Moreover, a root indexing scheme is designed to enable to detect multiple logos simultaneously. Extensive experiments on three benchmarks demonstrate the superiority of proposed approach over state-of-the-art methods.","PeriodicalId":407080,"journal":{"name":"2013 Visual Communications and Image Processing (VCIP)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 Visual Communications and Image Processing (VCIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VCIP.2013.6706326","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

Detecting logos in real-world images is a great challenging task due to a variety of viewpoint or light condition changes and real-time requirements in practice. Conventional object detection methods, e.g., part-based model, may suffer from expensively computational cost if it was directly applied to this task. A promising alternative, triangle structural descriptor associated with matching strategy, offers an efficient way of recognizing logos. However, the descriptor fails to the rotation of logo images that often occurs when viewpoint changes. To overcome this shortcoming, we propose a new Tree-based Shape Descriptor (TSD) in this paper, which is strictly invariant to affine transformation in real-world images. The core of proposed descriptor is to encode the shape of logos by depicting both appearance and spatial information of four local key-points. In the training stage, an efficient algorithm is introduced to mine a discriminate subset of four tuples from all possible key-point combinations. Moreover, a root indexing scheme is designed to enable to detect multiple logos simultaneously. Extensive experiments on three benchmarks demonstrate the superiority of proposed approach over state-of-the-art methods.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于可扩展徽标检测的基于树的形状描述符

在现实世界中，由于各种视点或光线条件的变化以及实践中的实时性要求，检测徽标是一项极具挑战性的任务。传统的目标检测方法，如基于零件的模型，如果直接应用于该任务，可能会带来昂贵的计算成本。一种很有前途的替代方法是与匹配策略相关联的三角形结构描述符，它提供了一种有效的标识识别方法。但是，描述符无法在视点更改时经常发生的徽标图像旋转。为了克服这一缺点，本文提出了一种新的基于树的形状描述子(TSD)，该描述子对真实图像的仿射变换严格不变性。该描述符的核心是通过描述四个局部关键点的外观和空间信息来编码标识的形状。在训练阶段，引入了一种有效的算法，从所有可能的键点组合中挖掘出四个元组的区别子集。此外，还设计了一个根索引方案，可以同时检测多个徽标。在三个基准上进行的广泛实验表明，所提出的方法优于最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2013 Visual Communications and Image Processing (VCIP)

自引率

0.00%

发文量

期刊最新文献

New motherwavelet for pattern detection in IR image Improved disparity vector derivation in 3D-HEVC Learning non-negative locality-constrained Linear Coding for human action recognition Wavelet based smoke detection method with RGB Contrast-image and shape constrain Joint image denoising using self-similarity based low-rank approximations