Combining words and object-based visual features in image retrieval

Akihiko Nakagawa, Andrea Kutics, Kiyotaka Tanaka, Masaomi Nakajima

12th International Conference on Image Analysis and Processing, 2003. Proceedings.
Published: 2003-09-17
DOI: 10.1109/ICIAP.2003.1234075
Citations: 8
Abstract
The paper presents a novel approach to image retrieval that combines textual and object-based visual features in order to reduce the inconsistency between a user's subjective interpretation of similarity and the retrieval results produced by objective similarity models. A novel multi-scale segmentation framework is proposed to detect prominent image objects. These objects are clustered according to their visual features and mapped to related words determined by psychophysical studies. Furthermore, a hierarchy of words expressing higher-level meaning is built on the basis of natural language processing and user evaluation. Experiments on a large set of natural images showed that this two-layer word association, together with support for varied query specifications and options, yields higher retrieval precision with respect to the user's intended retrieval semantics.
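The pipeline the abstract describes (segment objects, characterize them by visual features, map feature clusters to words, then retrieve by word) can be sketched in miniature. The sketch below is illustrative only and is not the authors' implementation: it stands in mean-colour features for the paper's object features and a small hand-written colour lexicon for the psychophysically derived word mapping, then builds an inverted word-to-image index for keyword queries.

```python
# Hedged sketch, NOT the paper's method: objects are reduced to mean-colour
# feature vectors, each vector is mapped to the nearest word in a small
# hypothetical colour lexicon, and an inverted index supports word queries.
import math

# Hypothetical word -> RGB prototype table; the paper instead derives its
# object-to-word mapping from psychophysical studies.
COLOR_WORDS = {
    "red":   (220, 40, 40),
    "green": (40, 180, 70),
    "blue":  (50, 80, 200),
    "sky":   (135, 206, 235),
    "grass": (90, 160, 60),
}

def nearest_word(feature):
    """Map an object's mean-colour feature to the closest lexicon word."""
    return min(COLOR_WORDS, key=lambda w: math.dist(feature, COLOR_WORDS[w]))

def index_images(images):
    """images: {image_id: [object feature vectors]} -> {word: set of ids}."""
    index = {}
    for image_id, objects in images.items():
        for feat in objects:
            index.setdefault(nearest_word(feat), set()).add(image_id)
    return index

def query(index, word):
    """Return the sorted ids of images containing an object for `word`."""
    return sorted(index.get(word, set()))

if __name__ == "__main__":
    images = {
        "img1": [(200, 50, 50), (60, 170, 65)],  # reddish + greenish objects
        "img2": [(130, 200, 230)],               # sky-like object
    }
    idx = index_images(images)
    print(query(idx, "red"))  # -> ['img1']
```

The paper's second layer (a word hierarchy expressing higher-level meaning) would sit on top of such an index, e.g. expanding a query for "landscape" into lower-level words like "sky" and "grass" before lookup.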