基于多页差分哈希和二叉树搜索多对象桶的容差图像检索技术

2016 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA) Pub Date : 2016-06-27 DOI:10.1109/CIVEMSA.2016.7524250

P. Mack, D. Megherbi

{"title":"基于多页差分哈希和二叉树搜索多对象桶的容差图像检索技术","authors":"P. Mack, D. Megherbi","doi":"10.1109/CIVEMSA.2016.7524250","DOIUrl":null,"url":null,"abstract":"Involve manually tagging/annotating each image, and using traditional discrete data sorting techniques, such as hashing, to search images using the tags/annotations/groups of annotations. As of 2009, Flickr had 3.4 Billion Images, PhotoBucket had 7.2 Billion, Facebook had 15 Billion, and ImageShack had 20 Billion. However, none of these sites allow searching by image content and use other technologies, such as textual tags or the like. Image automatic annotation is still in its infancy. Google does allow searching in general by some kind of image-content description tagging, using some kind of limited-dictionary for image textual annotation. Although several high performance computing and data storage libraries exist, such as Hadoop and Spark, few are designed for fuzzy-with-some degree-of-similarity image non-textual content data-content retrieval. In this paper we use a multi-page hashing scheme to search images using the image itself to not only be efficient for identical images, but similar images to some degree of fuzziness and degree of similarity as well. The proposed technique uses Fourier descriptors as one representation of image objects as inputs to an evenly distributed and differentiable hashing scheme. One of the challenges in content-based retrieval schemes is the problem of overflow, usually expected in large databases. In the proposed method a Binary-Search-Tree (BST) scheme is used to decrease the search time within buckets and across pages when overflow occurs. Additionally, the proposed method allows for image retrieval based on either image object boundary contours (we call here Lambda search) or on object textures (we call here Lambda2 search) with identical or varying degrees of similarity. Benchmarking Results are presented to show the potential of the proposed method.","PeriodicalId":244122,"journal":{"name":"2016 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A content-based image retrieval technique with tolerance via multi-page differentiate hashing and binary-tree searching multi-object buckets\",\"authors\":\"P. Mack, D. Megherbi\",\"doi\":\"10.1109/CIVEMSA.2016.7524250\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Involve manually tagging/annotating each image, and using traditional discrete data sorting techniques, such as hashing, to search images using the tags/annotations/groups of annotations. As of 2009, Flickr had 3.4 Billion Images, PhotoBucket had 7.2 Billion, Facebook had 15 Billion, and ImageShack had 20 Billion. However, none of these sites allow searching by image content and use other technologies, such as textual tags or the like. Image automatic annotation is still in its infancy. Google does allow searching in general by some kind of image-content description tagging, using some kind of limited-dictionary for image textual annotation. Although several high performance computing and data storage libraries exist, such as Hadoop and Spark, few are designed for fuzzy-with-some degree-of-similarity image non-textual content data-content retrieval. In this paper we use a multi-page hashing scheme to search images using the image itself to not only be efficient for identical images, but similar images to some degree of fuzziness and degree of similarity as well. The proposed technique uses Fourier descriptors as one representation of image objects as inputs to an evenly distributed and differentiable hashing scheme. One of the challenges in content-based retrieval schemes is the problem of overflow, usually expected in large databases. In the proposed method a Binary-Search-Tree (BST) scheme is used to decrease the search time within buckets and across pages when overflow occurs. Additionally, the proposed method allows for image retrieval based on either image object boundary contours (we call here Lambda search) or on object textures (we call here Lambda2 search) with identical or varying degrees of similarity. Benchmarking Results are presented to show the potential of the proposed method.\",\"PeriodicalId\":244122,\"journal\":{\"name\":\"2016 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIVEMSA.2016.7524250\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIVEMSA.2016.7524250","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

包括手动标记/注释每个图像，并使用传统的离散数据排序技术，如散列，使用标记/注释/注释组搜索图像。截至2009年，Flickr有34亿张图片，PhotoBucket有72亿张，Facebook有150亿张，ImageShack有200亿张。然而，这些网站都不允许通过图像内容进行搜索，而是使用其他技术，如文本标签或类似的技术。图像自动标注仍处于起步阶段。谷歌通常允许通过某种图像内容描述标记进行搜索，使用某种有限字典进行图像文本注释。虽然有一些高性能的计算和数据存储库，如Hadoop和Spark，但很少有设计用于具有一定相似度的模糊图像非文本内容数据-内容检索。在本文中，我们使用多页哈希方案来搜索图像本身，不仅对相同的图像有效，而且对一定程度的模糊和相似程度的相似图像也有效。提出的技术使用傅里叶描述子作为图像对象的一种表示，作为均匀分布和可微哈希方案的输入。基于内容的检索方案面临的挑战之一是溢出问题，这通常发生在大型数据库中。该方法采用了二叉搜索树(BST)方案，在发生溢出时减少了桶内和页间的搜索时间。此外，所提出的方法允许基于图像对象边界轮廓(我们在这里称为Lambda搜索)或基于具有相同或不同相似性程度的对象纹理(我们在这里称为Lambda2搜索)进行图像检索。基准测试结果显示了所提出方法的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A content-based image retrieval technique with tolerance via multi-page differentiate hashing and binary-tree searching multi-object buckets

Involve manually tagging/annotating each image, and using traditional discrete data sorting techniques, such as hashing, to search images using the tags/annotations/groups of annotations. As of 2009, Flickr had 3.4 Billion Images, PhotoBucket had 7.2 Billion, Facebook had 15 Billion, and ImageShack had 20 Billion. However, none of these sites allow searching by image content and use other technologies, such as textual tags or the like. Image automatic annotation is still in its infancy. Google does allow searching in general by some kind of image-content description tagging, using some kind of limited-dictionary for image textual annotation. Although several high performance computing and data storage libraries exist, such as Hadoop and Spark, few are designed for fuzzy-with-some degree-of-similarity image non-textual content data-content retrieval. In this paper we use a multi-page hashing scheme to search images using the image itself to not only be efficient for identical images, but similar images to some degree of fuzziness and degree of similarity as well. The proposed technique uses Fourier descriptors as one representation of image objects as inputs to an evenly distributed and differentiable hashing scheme. One of the challenges in content-based retrieval schemes is the problem of overflow, usually expected in large databases. In the proposed method a Binary-Search-Tree (BST) scheme is used to decrease the search time within buckets and across pages when overflow occurs. Additionally, the proposed method allows for image retrieval based on either image object boundary contours (we call here Lambda search) or on object textures (we call here Lambda2 search) with identical or varying degrees of similarity. Benchmarking Results are presented to show the potential of the proposed method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)

自引率

0.00%

发文量