二维部分可见的对象识别使用高效的多维范围查询

Proceedings. 1987 IEEE International Conference on Robotics and Automation Pub Date : 1987-03-01 DOI:10.1109/ROBOT.1987.1087796

P. Gottschalk, J. L. Turney, T. Mudge

{"title":"二维部分可见的对象识别使用高效的多维范围查询","authors":"P. Gottschalk, J. L. Turney, T. Mudge","doi":"10.1109/ROBOT.1987.1087796","DOIUrl":null,"url":null,"abstract":"An important task in computer vision is the recognition of partially visible two-dimensional objects in a gray scale image. Recent works addressing this problem have attempted to match spatially local features from the image to features generated by models of the objects. However, many algorithms are less efficient than is possible. This is due primarily to insufficient attention being paid to the issues of reducing the data in features and feature matching. In this paper we discuss an algorithm that addresses both of these problems. Our algorithm uses the local shape of contour segments near critical points, represented in slope angle-arclength space (θ-s space), as the fundamental feature vectors. These fundamental feature vectors are further processed by projecting them onto a subspace of θ-s space that is obtained by applying the Karhunen-Loève expansion to all critical points in the model set to obtain the final feature vectors. This allows the data needed to store the features to be reduced, while retaining nearly all their recognitive information. The resultant set of feature vectors from the image are matched to the model set using multidimensional range queries to a database of model feature vectors. The database is implemented using an efficient data-structure called a k-d tree. The entire recognition procedure for one image has complexity O(IlogI + IlogN), where I is the number of features in the image, and N is the number of model features. Experimental results showing our algorithm's performance on a number of test images are presented.","PeriodicalId":438447,"journal":{"name":"Proceedings. 1987 IEEE International Conference on Robotics and Automation","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1987-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Two-dimensional partially visible object recognition using efficient multidimensional range queries\",\"authors\":\"P. Gottschalk, J. L. Turney, T. Mudge\",\"doi\":\"10.1109/ROBOT.1987.1087796\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An important task in computer vision is the recognition of partially visible two-dimensional objects in a gray scale image. Recent works addressing this problem have attempted to match spatially local features from the image to features generated by models of the objects. However, many algorithms are less efficient than is possible. This is due primarily to insufficient attention being paid to the issues of reducing the data in features and feature matching. In this paper we discuss an algorithm that addresses both of these problems. Our algorithm uses the local shape of contour segments near critical points, represented in slope angle-arclength space (θ-s space), as the fundamental feature vectors. These fundamental feature vectors are further processed by projecting them onto a subspace of θ-s space that is obtained by applying the Karhunen-Loève expansion to all critical points in the model set to obtain the final feature vectors. This allows the data needed to store the features to be reduced, while retaining nearly all their recognitive information. The resultant set of feature vectors from the image are matched to the model set using multidimensional range queries to a database of model feature vectors. The database is implemented using an efficient data-structure called a k-d tree. The entire recognition procedure for one image has complexity O(IlogI + IlogN), where I is the number of features in the image, and N is the number of model features. Experimental results showing our algorithm's performance on a number of test images are presented.\",\"PeriodicalId\":438447,\"journal\":{\"name\":\"Proceedings. 1987 IEEE International Conference on Robotics and Automation\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1987-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings. 1987 IEEE International Conference on Robotics and Automation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ROBOT.1987.1087796\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. 1987 IEEE International Conference on Robotics and Automation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROBOT.1987.1087796","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

摘要

计算机视觉的一个重要任务是在灰度图像中识别部分可见的二维物体。最近解决这个问题的工作试图将图像的空间局部特征与物体模型生成的特征相匹配。然而，许多算法的效率比可能的要低。这主要是由于对特征中数据的减少和特征匹配问题重视不够。在本文中，我们讨论了一种解决这两个问题的算法。我们的算法使用斜率角-弧长空间(θ-s空间)表示的临界点附近轮廓段的局部形状作为基本特征向量。对这些基本特征向量进行进一步处理，将其投影到θ-s空间的子空间上，该子空间通过对模型集中的所有临界点应用karhunen - lo展开得到，从而得到最终的特征向量。这样可以减少存储特征所需的数据，同时保留几乎所有的识别信息。通过对模型特征向量数据库的多维范围查询，将图像的结果特征向量集与模型集进行匹配。该数据库使用一种称为k-d树的高效数据结构实现。对一张图像的整个识别过程复杂度为O(IlogI + IlogN)，其中I为图像中特征的个数，N为模型特征的个数。实验结果显示了该算法在大量测试图像上的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Two-dimensional partially visible object recognition using efficient multidimensional range queries

An important task in computer vision is the recognition of partially visible two-dimensional objects in a gray scale image. Recent works addressing this problem have attempted to match spatially local features from the image to features generated by models of the objects. However, many algorithms are less efficient than is possible. This is due primarily to insufficient attention being paid to the issues of reducing the data in features and feature matching. In this paper we discuss an algorithm that addresses both of these problems. Our algorithm uses the local shape of contour segments near critical points, represented in slope angle-arclength space (θ-s space), as the fundamental feature vectors. These fundamental feature vectors are further processed by projecting them onto a subspace of θ-s space that is obtained by applying the Karhunen-Loève expansion to all critical points in the model set to obtain the final feature vectors. This allows the data needed to store the features to be reduced, while retaining nearly all their recognitive information. The resultant set of feature vectors from the image are matched to the model set using multidimensional range queries to a database of model feature vectors. The database is implemented using an efficient data-structure called a k-d tree. The entire recognition procedure for one image has complexity O(IlogI + IlogN), where I is the number of features in the image, and N is the number of model features. Experimental results showing our algorithm's performance on a number of test images are presented.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助