{"title":"基于多页差分哈希和二叉树搜索多对象桶的容差图像检索技术","authors":"P. Mack, D. Megherbi","doi":"10.1109/CIVEMSA.2016.7524250","DOIUrl":null,"url":null,"abstract":"Involve manually tagging/annotating each image, and using traditional discrete data sorting techniques, such as hashing, to search images using the tags/annotations/groups of annotations. As of 2009, Flickr had 3.4 Billion Images, PhotoBucket had 7.2 Billion, Facebook had 15 Billion, and ImageShack had 20 Billion. However, none of these sites allow searching by image content and use other technologies, such as textual tags or the like. Image automatic annotation is still in its infancy. Google does allow searching in general by some kind of image-content description tagging, using some kind of limited-dictionary for image textual annotation. Although several high performance computing and data storage libraries exist, such as Hadoop and Spark, few are designed for fuzzy-with-some degree-of-similarity image non-textual content data-content retrieval. In this paper we use a multi-page hashing scheme to search images using the image itself to not only be efficient for identical images, but similar images to some degree of fuzziness and degree of similarity as well. The proposed technique uses Fourier descriptors as one representation of image objects as inputs to an evenly distributed and differentiable hashing scheme. One of the challenges in content-based retrieval schemes is the problem of overflow, usually expected in large databases. In the proposed method a Binary-Search-Tree (BST) scheme is used to decrease the search time within buckets and across pages when overflow occurs. Additionally, the proposed method allows for image retrieval based on either image object boundary contours (we call here Lambda search) or on object textures (we call here Lambda2 search) with identical or varying degrees of similarity. Benchmarking Results are presented to show the potential of the proposed method.","PeriodicalId":244122,"journal":{"name":"2016 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A content-based image retrieval technique with tolerance via multi-page differentiate hashing and binary-tree searching multi-object buckets\",\"authors\":\"P. Mack, D. Megherbi\",\"doi\":\"10.1109/CIVEMSA.2016.7524250\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Involve manually tagging/annotating each image, and using traditional discrete data sorting techniques, such as hashing, to search images using the tags/annotations/groups of annotations. As of 2009, Flickr had 3.4 Billion Images, PhotoBucket had 7.2 Billion, Facebook had 15 Billion, and ImageShack had 20 Billion. However, none of these sites allow searching by image content and use other technologies, such as textual tags or the like. Image automatic annotation is still in its infancy. Google does allow searching in general by some kind of image-content description tagging, using some kind of limited-dictionary for image textual annotation. Although several high performance computing and data storage libraries exist, such as Hadoop and Spark, few are designed for fuzzy-with-some degree-of-similarity image non-textual content data-content retrieval. In this paper we use a multi-page hashing scheme to search images using the image itself to not only be efficient for identical images, but similar images to some degree of fuzziness and degree of similarity as well. The proposed technique uses Fourier descriptors as one representation of image objects as inputs to an evenly distributed and differentiable hashing scheme. One of the challenges in content-based retrieval schemes is the problem of overflow, usually expected in large databases. In the proposed method a Binary-Search-Tree (BST) scheme is used to decrease the search time within buckets and across pages when overflow occurs. Additionally, the proposed method allows for image retrieval based on either image object boundary contours (we call here Lambda search) or on object textures (we call here Lambda2 search) with identical or varying degrees of similarity. Benchmarking Results are presented to show the potential of the proposed method.\",\"PeriodicalId\":244122,\"journal\":{\"name\":\"2016 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIVEMSA.2016.7524250\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIVEMSA.2016.7524250","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A content-based image retrieval technique with tolerance via multi-page differentiate hashing and binary-tree searching multi-object buckets
Involve manually tagging/annotating each image, and using traditional discrete data sorting techniques, such as hashing, to search images using the tags/annotations/groups of annotations. As of 2009, Flickr had 3.4 Billion Images, PhotoBucket had 7.2 Billion, Facebook had 15 Billion, and ImageShack had 20 Billion. However, none of these sites allow searching by image content and use other technologies, such as textual tags or the like. Image automatic annotation is still in its infancy. Google does allow searching in general by some kind of image-content description tagging, using some kind of limited-dictionary for image textual annotation. Although several high performance computing and data storage libraries exist, such as Hadoop and Spark, few are designed for fuzzy-with-some degree-of-similarity image non-textual content data-content retrieval. In this paper we use a multi-page hashing scheme to search images using the image itself to not only be efficient for identical images, but similar images to some degree of fuzziness and degree of similarity as well. The proposed technique uses Fourier descriptors as one representation of image objects as inputs to an evenly distributed and differentiable hashing scheme. One of the challenges in content-based retrieval schemes is the problem of overflow, usually expected in large databases. In the proposed method a Binary-Search-Tree (BST) scheme is used to decrease the search time within buckets and across pages when overflow occurs. Additionally, the proposed method allows for image retrieval based on either image object boundary contours (we call here Lambda search) or on object textures (we call here Lambda2 search) with identical or varying degrees of similarity. Benchmarking Results are presented to show the potential of the proposed method.