{"title":"Content-based image retrieval using gradient projections","authors":"J. Rose, M. Shah","doi":"10.1109/SECON.1998.673306","DOIUrl":null,"url":null,"abstract":"Content-based image retrieval (CBIR) enables a user to extract an image, based on a query, from a database containing a vast amount of pictures. This concept may be applied to many fields of interest including forensic science and image archiving. Current CBIR systems, however, are inaccurate. The purpose of this research project was to improve the accuracy of CBIR. The image's structural properties were examined to distinguish one image from another. By examining the specific gray level of an image, a gradient can be computed at each pixel. Pixels with a magnitude larger than the thresholds are assigned a value of 1. These binary digits are added across the horizontal, vertical, and diagonal directions to compute three projections. These vectors are then compared with the vectors of the image to be matched using the Euclidean distance formula. These numbers are then stored in a bookmark so that the image needs only be examined once. A program has been developed for Matlab on a Sun Sparc Computer with Unix Open Windows that performs this method of projecting gradients. Three databases were amassed for the testing of the proposed system's accuracy: 82 digital camera pictures, 1000 photographic images, and a set of object orientated photos. The program was tested with 100% accuracy with all submitted images to the database, and was able to distinguish between pictures that fooled previous CBIR engines. More importantly, though, was the program's ability to find certain similar scenarios in the database. This CBIR approach has significantly increased the accuracy in obtaining results for image retrieval.","PeriodicalId":281991,"journal":{"name":"Proceedings IEEE Southeastcon '98 'Engineering for a New Era'","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings IEEE Southeastcon '98 'Engineering for a New Era'","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SECON.1998.673306","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
Content-based image retrieval (CBIR) enables a user to extract an image, based on a query, from a database containing a vast amount of pictures. This concept may be applied to many fields of interest including forensic science and image archiving. Current CBIR systems, however, are inaccurate. The purpose of this research project was to improve the accuracy of CBIR. The image's structural properties were examined to distinguish one image from another. By examining the specific gray level of an image, a gradient can be computed at each pixel. Pixels with a magnitude larger than the thresholds are assigned a value of 1. These binary digits are added across the horizontal, vertical, and diagonal directions to compute three projections. These vectors are then compared with the vectors of the image to be matched using the Euclidean distance formula. These numbers are then stored in a bookmark so that the image needs only be examined once. A program has been developed for Matlab on a Sun Sparc Computer with Unix Open Windows that performs this method of projecting gradients. Three databases were amassed for the testing of the proposed system's accuracy: 82 digital camera pictures, 1000 photographic images, and a set of object orientated photos. The program was tested with 100% accuracy with all submitted images to the database, and was able to distinguish between pictures that fooled previous CBIR engines. More importantly, though, was the program's ability to find certain similar scenarios in the database. This CBIR approach has significantly increased the accuracy in obtaining results for image retrieval.
基于内容的图像检索(CBIR)使用户能够根据查询从包含大量图片的数据库中提取图像。这个概念可以应用于许多感兴趣的领域,包括法医学和图像存档。然而,目前的CBIR系统并不准确。本研究项目的目的是提高CBIR的准确性。检查图像的结构属性以区分不同的图像。通过检查图像的特定灰度级,可以在每个像素处计算梯度。大小大于阈值的像素被赋值为1。这些二进制数字在水平、垂直和对角线方向上相加,以计算三个投影。然后将这些矢量与使用欧几里得距离公式进行匹配的图像的矢量进行比较。然后将这些数字存储在书签中,这样图像只需要检查一次。在Sun Sparc计算机和Unix Open Windows上用Matlab编写了一个程序,实现了这种投影梯度的方法。为了测试系统的准确性,收集了三个数据库:82张数码相机照片、1000张摄影图像和一组面向对象的照片。该程序对所有提交到数据库的图像进行了100%的准确率测试,并且能够区分那些欺骗了以前的CBIR引擎的图片。不过,更重要的是,该程序能够在数据库中找到某些类似的场景。该方法显著提高了图像检索结果的准确性。