首页 > 最新文献

CVGIP: Image Understanding最新文献

英文 中文
A zero-crossing-based optimal three-dimensional edge detector 基于零交叉的最优三维边缘检测器
Pub Date : 1994-03-01 DOI: 10.1006/CIUN.1994.1016
Shiming Zhan, R. Mehrotra
Abstract Three-dimensional (3D) image processing and interpretation is very important in many medical and industrial applications. Detection of 3D boundaries is an essential step in most of the 3D image analysis tasks. In this paper a new computational approach to 3D edge detection is proposed. Optimality criteria such as signal-to-noise ratio, localization, and spurious response for zero-crossing-based, rotationally invariant 3D step edge detectors are derived. An optimal 3D step edge detector is obtained by optimizing a penalty function which combines all the three criteria. The closed form solution to the optimization problem yields the optimal detector. The detector is the Laplacian of a rotationally invariant function, which has a finite spatial support. The behavior of the proposed detector is theoretically analyzed and compared to that of the 3D Laplacian of Gaussian detector. Experimental results with some synthetic and real images are presented.
三维(3D)图像处理和解释在许多医疗和工业应用中非常重要。在大多数三维图像分析任务中,三维边界检测是必不可少的一步。本文提出了一种新的三维边缘检测计算方法。优化标准,如信噪比,定位和杂散响应的零交叉为基础,旋转不变的三维阶跃边缘检测器导出。通过对三个准则结合的罚函数进行优化,得到了最优的三维阶跃边缘检测器。优化问题的封闭解产生最优检测器。检测器是具有有限空间支持的旋转不变函数的拉普拉斯算子。从理论上分析了该探测器的性能,并与三维拉普拉斯高斯探测器的性能进行了比较。给出了合成图像和真实图像的实验结果。
{"title":"A zero-crossing-based optimal three-dimensional edge detector","authors":"Shiming Zhan, R. Mehrotra","doi":"10.1006/CIUN.1994.1016","DOIUrl":"https://doi.org/10.1006/CIUN.1994.1016","url":null,"abstract":"Abstract Three-dimensional (3D) image processing and interpretation is very important in many medical and industrial applications. Detection of 3D boundaries is an essential step in most of the 3D image analysis tasks. In this paper a new computational approach to 3D edge detection is proposed. Optimality criteria such as signal-to-noise ratio, localization, and spurious response for zero-crossing-based, rotationally invariant 3D step edge detectors are derived. An optimal 3D step edge detector is obtained by optimizing a penalty function which combines all the three criteria. The closed form solution to the optimization problem yields the optimal detector. The detector is the Laplacian of a rotationally invariant function, which has a finite spatial support. The behavior of the proposed detector is theoretically analyzed and compared to that of the 3D Laplacian of Gaussian detector. Experimental results with some synthetic and real images are presented.","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"15 1","pages":"242-253"},"PeriodicalIF":0.0,"publicationDate":"1994-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78867888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
A Zero-Crossing-Based Optimal Three-Dimensional Edge Detector 基于零交叉的最优三维边缘检测器
Pub Date : 1994-03-01 DOI: 10.1006/ciun.1994.1016
Zhan S.M., Mehrotra R.

Three-dimensional (3D) image processing and interpretation is very important in many medical and industrial applications. Detection of 3D boundaries is an essential step in most of the 3D image analysis tasks. In this paper a new computational approach to 3D edge detection is proposed. Optimality criteria such as signal-to-noise ratio, localization, and spurious response for zero-crossing-based, rotationally invariant 3D step edge detectors are derived. An optimal 3D step edge detector is obtained by optimizing a penalty function which combines all the three criteria. The closed form solution to the optimization problem yields the optimal detector. The detector is the Laplacian of a rotationally invariant function, which has a finite spatial support. The behavior of the proposed detector is theoretically analyzed and compared to that of the 3D Laplacian of Gaussian detector. Experimental results with some synthetic and real images are presented.

三维(3D)图像处理和解释在许多医疗和工业应用中非常重要。在大多数三维图像分析任务中,三维边界检测是必不可少的一步。本文提出了一种新的三维边缘检测计算方法。优化标准,如信噪比,定位和杂散响应的零交叉为基础,旋转不变的三维阶跃边缘检测器导出。通过对三个准则结合的罚函数进行优化,得到了最优的三维阶跃边缘检测器。优化问题的封闭解产生最优检测器。检测器是具有有限空间支持的旋转不变函数的拉普拉斯算子。从理论上分析了该探测器的性能,并与三维拉普拉斯高斯探测器的性能进行了比较。给出了合成图像和真实图像的实验结果。
{"title":"A Zero-Crossing-Based Optimal Three-Dimensional Edge Detector","authors":"Zhan S.M.,&nbsp;Mehrotra R.","doi":"10.1006/ciun.1994.1016","DOIUrl":"https://doi.org/10.1006/ciun.1994.1016","url":null,"abstract":"<div><p>Three-dimensional (3D) image processing and interpretation is very important in many medical and industrial applications. Detection of 3D boundaries is an essential step in most of the 3D image analysis tasks. In this paper a new computational approach to 3D edge detection is proposed. Optimality criteria such as signal-to-noise ratio, localization, and spurious response for zero-crossing-based, rotationally invariant 3D step edge detectors are derived. An optimal 3D step edge detector is obtained by optimizing a penalty function which combines all the three criteria. The closed form solution to the optimization problem yields the optimal detector. The detector is the Laplacian of a rotationally invariant function, which has a finite spatial support. The behavior of the proposed detector is theoretically analyzed and compared to that of the 3D Laplacian of Gaussian detector. Experimental results with some synthetic and real images are presented.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"59 2","pages":"Pages 242-253"},"PeriodicalIF":0.0,"publicationDate":"1994-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1016","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91687637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Recursive-Batch Estimation of Motion and Structure from Monocular Image Sequences 单眼图像序列中运动和结构的递归批估计
Pub Date : 1994-03-01 DOI: 10.1006/ciun.1994.1010
Cui N., Weng J.J., Cohen P.

This paper addresses the issue of optimal motion and structure estimation from monocular image sequences of a rigid scene. The new method has the following characteristics: (1) the dimension of the search space in the nonlinear optimization is drastically reduced by exploiting the relationship between structure and motion parameters; (2) the degree of reliability of the observations and estimates is effectively taken into account; (3) the proposed formulation allows arbitrary interframe motion; (4) the information about the structure of the scene, acquired from previous images, is systematically integrated into the new estimations; (5) the integration of multiple views using this method gives a large 2.5D visual map, much larger than that covered by any single view. It is shown also that the scale factor associated with any two consecutive images in a monocular sequence is determined by the scale factor of the first two images. Our simulation results and experiments with long image sequences of real world scenes indicate that the optimization method developed in this paper not only greatly reduces the computational complexity but also substantially improves the motion and structure estimates over those produced by the linear algorithms.

本文研究了刚性场景单目图像序列的最优运动和结构估计问题。该方法具有以下特点:(1)利用结构与运动参数之间的关系,大大降低了非线性优化中搜索空间的维数;(2)观测值和估计值的可靠程度得到有效考虑;(3)所提公式允许任意帧间运动;(4)将从先前图像中获取的场景结构信息系统地整合到新的估计中;(5)采用该方法将多个视图整合在一起,可以得到比任何单一视图所覆盖的大得多的大的2.5D视觉地图。还表明,与单眼序列中任意两幅连续图像相关的比例因子由前两幅图像的比例因子决定。我们的仿真结果和对真实世界场景的长图像序列的实验表明,本文提出的优化方法不仅大大降低了计算复杂度,而且大大提高了线性算法产生的运动和结构估计。
{"title":"Recursive-Batch Estimation of Motion and Structure from Monocular Image Sequences","authors":"Cui N.,&nbsp;Weng J.J.,&nbsp;Cohen P.","doi":"10.1006/ciun.1994.1010","DOIUrl":"10.1006/ciun.1994.1010","url":null,"abstract":"<div><p>This paper addresses the issue of optimal motion and structure estimation from monocular image sequences of a rigid scene. The new method has the following characteristics: (1) the dimension of the search space in the nonlinear optimization is drastically reduced by exploiting the relationship between structure and motion parameters; (2) the degree of reliability of the observations and estimates is effectively taken into account; (3) the proposed formulation allows arbitrary interframe motion; (4) the information about the structure of the scene, acquired from previous images, is systematically integrated into the new estimations; (5) the integration of multiple views using this method gives a large 2.5D visual map, much larger than that covered by any single view. It is shown also that the scale factor associated with any two consecutive images in a monocular sequence is determined by the scale factor of the first two images. Our simulation results and experiments with long image sequences of real world scenes indicate that the optimization method developed in this paper not only greatly reduces the computational complexity but also substantially improves the motion and structure estimates over those produced by the linear algorithms.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"59 2","pages":"Pages 154-170"},"PeriodicalIF":0.0,"publicationDate":"1994-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1010","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90809189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 35
Calibration of a computer controlled robotic vision sensor with a zoom lens 用变焦镜头标定计算机控制的机器人视觉传感器
Pub Date : 1994-03-01 DOI: 10.1006/CIUN.1994.1015
K. Tarabanis, R. Tsai, D. Goodman
Abstract Active vision sensors are increasingly being employed in vision systems for their greater flexibility. For example, vision sensors in hand-eye configurations with computer controllable lenses (e.g., zoom lenses) can be set to values which satisfy the sensing situation at hand. For such applications, it is essential to determine the mapping between the parameters that can actually be controlled in a reconfigurable vision system (e.g., the robot arm pose, the zoom setting of the lens) and the higher-level viewpoint parameters that must be set to desired values (e.g., the viewpoint location, focal length). In this paper we present calibration techniques to determine this mapping. In addition, we discuss how to use these relationships in order to achieve the desired values of the viewpoint parameters by setting the controllable parameters to the appropriate values. The sensor setup that is considered consists of a camera in a hand-eye arrangement equipped with a lens that has zoom, focus, and aperture control. The calibration techniques are applied to the H6 × 12.5R Fujinon zoom lens and the experimental results are shown.
主动视觉传感器因其更大的灵活性在视觉系统中得到越来越多的应用。例如,具有计算机控制镜头(例如变焦镜头)的手眼配置的视觉传感器可以设置为满足手头感知情况的值。对于此类应用,确定可重构视觉系统中实际可控制的参数(例如,机械臂姿势,镜头的变焦设置)与必须设置为所需值的高级视点参数(例如,视点位置,焦距)之间的映射是至关重要的。在本文中,我们提出了校准技术来确定这种映射。此外,我们还讨论了如何利用这些关系,通过将可控参数设置为适当的值来实现视点参数的期望值。所考虑的传感器设置包括一个手眼布局的相机,配备有变焦、对焦和光圈控制的镜头。将标定技术应用于H6 × 12.5R富士能变焦镜头,并给出了实验结果。
{"title":"Calibration of a computer controlled robotic vision sensor with a zoom lens","authors":"K. Tarabanis, R. Tsai, D. Goodman","doi":"10.1006/CIUN.1994.1015","DOIUrl":"https://doi.org/10.1006/CIUN.1994.1015","url":null,"abstract":"Abstract Active vision sensors are increasingly being employed in vision systems for their greater flexibility. For example, vision sensors in hand-eye configurations with computer controllable lenses (e.g., zoom lenses) can be set to values which satisfy the sensing situation at hand. For such applications, it is essential to determine the mapping between the parameters that can actually be controlled in a reconfigurable vision system (e.g., the robot arm pose, the zoom setting of the lens) and the higher-level viewpoint parameters that must be set to desired values (e.g., the viewpoint location, focal length). In this paper we present calibration techniques to determine this mapping. In addition, we discuss how to use these relationships in order to achieve the desired values of the viewpoint parameters by setting the controllable parameters to the appropriate values. The sensor setup that is considered consists of a camera in a hand-eye arrangement equipped with a lens that has zoom, focus, and aperture control. The calibration techniques are applied to the H6 × 12.5R Fujinon zoom lens and the experimental results are shown.","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"23 1","pages":"226-241"},"PeriodicalIF":0.0,"publicationDate":"1994-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84340386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Robust Consensus Based Edge Detection 基于鲁棒一致性的边缘检测
Pub Date : 1994-03-01 DOI: 10.1006/ciun.1994.1009
Mintz D.

We present a new robust algorithm for edge detection. The algorithm detects both roof and step type edges. A pixel is declared as an edge pixel if there is a consensus between different processes that try to determine if the pixel lies on a discontinuity. We use robust estimation methods to estimate local fits to windows in the pixel′s neighborhood and accumulate votes from each fit. The use of robust estimators enables us to transform any window possibly containing a discontinuity to a binary window containing a step edge in the location of the discontinuity. We then employ conventional methods to detect this step edge. We show experimental results on simulated edges and synthetic images with varying Gaussian and random noise levels and analyze the probability of detection. The algorithm is also applied to several real intensity and range images and it is shown to perform well. A comparison with the Canny edge detector is given when applicable.

提出了一种新的鲁棒边缘检测算法。该算法同时检测顶型和阶跃型边缘。如果在试图确定像素是否位于不连续上的不同进程之间存在共识,则将像素声明为边缘像素。我们使用鲁棒估计方法来估计像素邻域窗口的局部拟合,并从每个拟合中累积投票。鲁棒估计的使用使我们能够将任何可能包含不连续点的窗口转换为在不连续点位置包含阶跃边缘的二值窗口。然后,我们采用常规方法来检测该阶跃边缘。我们展示了不同高斯和随机噪声水平的模拟边缘和合成图像的实验结果,并分析了检测的概率。将该算法应用于多幅真实强度和距离图像,取得了良好的效果。在适用的情况下,与Canny边缘检测器进行了比较。
{"title":"Robust Consensus Based Edge Detection","authors":"Mintz D.","doi":"10.1006/ciun.1994.1009","DOIUrl":"https://doi.org/10.1006/ciun.1994.1009","url":null,"abstract":"<div><p>We present a new robust algorithm for edge detection. The algorithm detects both roof and step type edges. A pixel is declared as an edge pixel if there is a consensus between different processes that try to determine if the pixel lies on a discontinuity. We use robust estimation methods to estimate local fits to windows in the pixel′s neighborhood and accumulate votes from each fit. The use of robust estimators enables us to transform any window possibly containing a discontinuity to a binary window containing a step edge in the location of the discontinuity. We then employ conventional methods to detect this step edge. We show experimental results on simulated edges and synthetic images with varying Gaussian and random noise levels and analyze the probability of detection. The algorithm is also applied to several real intensity and range images and it is shown to perform well. A comparison with the Canny edge detector is given when applicable.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"59 2","pages":"Pages 137-153"},"PeriodicalIF":0.0,"publicationDate":"1994-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1009","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"137087699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Volumetric Shapes of Solids of Revolution from a Single-View Range Image 单视距图像中旋转固体的体积形状
Pub Date : 1994-01-01 DOI: 10.1006/ciun.1994.1003
Yokoya N., Levine M.D.

Many man-made objects such as industrial parts are partially constructed of surfaces of revolution, as well as planar surfaces. We have studied the problem of finding and recovering solids of revolution in range data which are potentially useful for modeling and recognizing 3D objects. We propose an approach to the problem which is based on the fact that at least one of two focal surfaces for a surface of revolution degenerates into the axis of rotation. First, by computing the surface normal and principal curvatures, the centers of principal curvature which construct the focal surfaces are obtained for each point in the range image. Then, using the Hough transform, the axes of rotation are detected by finding the centers of principal curvature which lie on straight lines in space. Finally, the solid of revolution is completely determined by estimating the radius function of cross-section along each rotational axis. The proposed method can be used even in situations where occlusion or truncation is a problem because it does not require the visibility of entire surfaces. Experiments have been successfully carried out with real range data obtained from laser rangefinders.

许多人造物体,如工业零件,部分由旋转表面和平面构成。我们研究了在距离数据中寻找和恢复旋转固体的问题,这对三维物体的建模和识别有潜在的帮助。我们提出了一种解决这一问题的方法,该方法是基于一个公转曲面的两个焦点面中至少有一个退化为旋转轴的事实。首先,通过计算曲面法曲率和主曲率,得到构成焦面的主曲率中心;然后,利用霍夫变换,通过寻找位于空间直线上的主曲率中心来检测旋转轴。最后,通过估算沿各旋转轴的截面半径函数,完全确定了旋转体。所提出的方法甚至可以在遮挡或截断是一个问题的情况下使用,因为它不需要整个表面的可见性。用激光测距仪获得的实际测距数据成功地进行了实验。
{"title":"Volumetric Shapes of Solids of Revolution from a Single-View Range Image","authors":"Yokoya N.,&nbsp;Levine M.D.","doi":"10.1006/ciun.1994.1003","DOIUrl":"10.1006/ciun.1994.1003","url":null,"abstract":"<div><p>Many man-made objects such as industrial parts are partially constructed of <em>surfaces of revolution</em>, as well as planar surfaces. We have studied the problem of finding and recovering <em>solids of revolution</em> in range data which are potentially useful for modeling and recognizing 3D objects. We propose an approach to the problem which is based on the fact that at least one of two <em>focal surfaces</em> for a surface of revolution degenerates into the axis of rotation. First, by computing the surface normal and principal curvatures, the centers of principal curvature which construct the focal surfaces are obtained for each point in the range image. Then, using the Hough transform, the axes of rotation are detected by finding the centers of principal curvature which lie on straight lines in space. Finally, the solid of revolution is completely determined by estimating the radius function of cross-section along each rotational axis. The proposed method can be used even in situations where occlusion or truncation is a problem because it does not require the visibility of entire surfaces. Experiments have been successfully carried out with real range data obtained from laser rangefinders.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"59 1","pages":"Pages 43-52"},"PeriodicalIF":0.0,"publicationDate":"1994-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78354860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Function-Based Generic Recognition for Multiple Object Categories 基于函数的多目标分类通用识别
Pub Date : 1994-01-01 DOI: 10.1006/ciun.1994.1001
Stark L., Bowyer K.
Abstract In function-based object recognition an object category is represented by knowledge about object function. Function-based approaches are important because they provide a principled means of constructing generic recognition systems. Our work concentrates specifically on the relation between shape and function of rigid 3D objects. Recognition of an observed object shape is performed by reasoning about the function that the shape might serve. Recent research has demonstrated the feasibility of function-based shape recognition. However, previous efforts have dealt with only a single basic level object category. A number of important issues arise in extending a function-based approach to handle multiple basic level categories. One issue is whether the knowledge about function can be organized into general primitive chunks that are reusable across different categories. Another issue is how to efficiently index the knowledge base so as to avoid exhaustive testing of an object shape against each known category. In order to explore these issues, we have implemented a second-generation function-based recognition system that handles a collection of basic level object categories within the superordinate category furniture. The recognition capabilities and indexing performance of this system have been evaluated on a database of over 250 shapes. We also show recognition results from some 3D shape descriptions acquired from laser range finder data. To our knowledge, this is the first (only) work in function-based recognition to address the recognition of multiple object categories.
在基于函数的对象识别中,对象类别由对象函数的知识表示。基于函数的方法很重要,因为它们提供了构建通用识别系统的原则方法。我们的工作主要集中在刚性三维物体的形状和功能之间的关系。对观察到的物体形状的识别是通过对该形状可能服务的功能进行推理来完成的。最近的研究已经证明了基于功能的形状识别的可行性。然而,以前的工作只处理了一个基本级别的对象类别。在扩展基于函数的方法来处理多个基本级别类别时,会出现许多重要问题。一个问题是,关于函数的知识是否可以组织成通用的基元块,这些基元块可以跨不同的类别重用。另一个问题是如何有效地索引知识库,以避免针对每个已知类别对对象形状进行详尽的测试。为了探索这些问题,我们实现了第二代基于函数的识别系统,该系统处理上级类别家具中的基本级别对象类别集合。该系统的识别能力和索引性能在250多个形状的数据库上进行了评估。我们还展示了从激光测距仪数据中获得的一些三维形状描述的识别结果。据我们所知,这是第一个(唯一的)基于函数的识别工作,以解决多个对象类别的识别问题。
{"title":"Function-Based Generic Recognition for Multiple Object Categories","authors":"Stark L.,&nbsp;Bowyer K.","doi":"10.1006/ciun.1994.1001","DOIUrl":"10.1006/ciun.1994.1001","url":null,"abstract":"Abstract In function-based object recognition an object category is represented by knowledge about object function. Function-based approaches are important because they provide a principled means of constructing generic recognition systems. Our work concentrates specifically on the relation between shape and function of rigid 3D objects. Recognition of an observed object shape is performed by reasoning about the function that the shape might serve. Recent research has demonstrated the feasibility of function-based shape recognition. However, previous efforts have dealt with only a single basic level object category. A number of important issues arise in extending a function-based approach to handle multiple basic level categories. One issue is whether the knowledge about function can be organized into general primitive chunks that are reusable across different categories. Another issue is how to efficiently index the knowledge base so as to avoid exhaustive testing of an object shape against each known category. In order to explore these issues, we have implemented a second-generation function-based recognition system that handles a collection of basic level object categories within the superordinate category furniture. The recognition capabilities and indexing performance of this system have been evaluated on a database of over 250 shapes. We also show recognition results from some 3D shape descriptions acquired from laser range finder data. To our knowledge, this is the first (only) work in function-based recognition to address the recognition of multiple object categories.","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"59 1","pages":"Pages 1-21"},"PeriodicalIF":0.0,"publicationDate":"1994-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81886198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 84
Multiscale Minimization of Global Energy Functions in Some Visual Recovery Problems 若干视觉恢复问题中全局能量函数的多尺度最小化
Pub Date : 1994-01-01 DOI: 10.1006/ciun.1994.1008
Heitz F., Perez P., Bouthemy P.

Many image analysis and computer vision problems have been expressed as the minimization of global energy functions describing the interactions between the observed data and the image representations to be extracted in a given task. In this note, we investigate a new comprehensive approach to minimize global energy functions using a multiscale relaxation algorithm. The energy function is minimized over nested subspaces of the original space of possible solutions. These subspaces consist of solutions which are constrained at different scales. The constrained relaxation is implemented via a coarse-to-fine multiresolution algorithm that yields fast convergence towards high quality estimates when compared to standard monoresolution or multigrid relaxation schemes. It also appears to be far less sensitive to local minima than standard relaxation algorithms. The efficiency of the approach is demonstrated on a highly nonlinear combinatorial problem which consists of estimating long-range motion in an image sequence on a discrete label space. The method is compared to standard relaxation algorithms on real world and synthetic image sequences.

许多图像分析和计算机视觉问题已经被表示为描述观测数据和在给定任务中提取的图像表示之间相互作用的全局能量函数的最小化。在本文中,我们研究了一种利用多尺度松弛算法最小化全局能量函数的新的综合方法。能量函数在可能解的原始空间的嵌套子空间上被最小化。这些子空间由不同尺度约束下的解组成。约束松弛是通过一个从粗到精的多分辨率算法实现的,与标准的单分辨率或多网格松弛方案相比,该算法可以快速收敛到高质量的估计。与标准松弛算法相比,它对局部最小值的敏感性也要低得多。在一个高度非线性的组合问题上证明了该方法的有效性,该问题包括在离散标记空间上估计图像序列的远程运动。将该方法与标准松弛算法在真实世界和合成图像序列上进行了比较。
{"title":"Multiscale Minimization of Global Energy Functions in Some Visual Recovery Problems","authors":"Heitz F.,&nbsp;Perez P.,&nbsp;Bouthemy P.","doi":"10.1006/ciun.1994.1008","DOIUrl":"10.1006/ciun.1994.1008","url":null,"abstract":"<div><p>Many image analysis and computer vision problems have been expressed as the minimization of global energy functions describing the interactions between the observed data and the image representations to be extracted in a given task. In this note, we investigate a new comprehensive approach to minimize global energy functions using a multiscale relaxation algorithm. The energy function is minimized over nested subspaces of the original space of possible solutions. These subspaces consist of solutions which are constrained at different scales. The constrained relaxation is implemented via a coarse-to-fine multiresolution algorithm that yields fast convergence towards high quality estimates when compared to standard monoresolution or multigrid relaxation schemes. It also appears to be far less sensitive to local minima than standard relaxation algorithms. The efficiency of the approach is demonstrated on a highly nonlinear combinatorial problem which consists of estimating long-range motion in an image sequence on a discrete label space. The method is compared to standard relaxation algorithms on real world and synthetic image sequences.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"59 1","pages":"Pages 125-134"},"PeriodicalIF":0.0,"publicationDate":"1994-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1008","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76853775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 209
Towards Model-Based Recognition of Human Movements in Image Sequences 基于模型的图像序列人体运动识别研究
Pub Date : 1994-01-01 DOI: 10.1006/ciun.1994.1006
Rohr K.

The interpretation of the movements of articulated bodies in image sequences is one of the most challenging problems in computer vision. In this contribution, we introduce a model-based approach for the recognition of pedestrians. We represent the human body by a 3D-model consisting of cylinders, whereas for modelling the movement of walking we use data from medical motion studies. The estimation of model parameters in consecutive images is done by applying a Kalman filter. Experimental results are shown for synthetic as well as for real image data.

图像序列中关节体运动的解释是计算机视觉中最具挑战性的问题之一。在这篇文章中,我们介绍了一种基于模型的行人识别方法。我们通过一个由圆柱体组成的3d模型来表示人体,而对于步行运动的建模,我们使用来自医学运动研究的数据。利用卡尔曼滤波对连续图像进行模型参数估计。实验结果显示了合成和真实图像数据。
{"title":"Towards Model-Based Recognition of Human Movements in Image Sequences","authors":"Rohr K.","doi":"10.1006/ciun.1994.1006","DOIUrl":"10.1006/ciun.1994.1006","url":null,"abstract":"<div><p>The interpretation of the movements of articulated bodies in image sequences is one of the most challenging problems in computer vision. In this contribution, we introduce a model-based approach for the recognition of pedestrians. We represent the human body by a 3D-model consisting of cylinders, whereas for modelling the movement of walking we use data from medical motion studies. The estimation of model parameters in consecutive images is done by applying a Kalman filter. Experimental results are shown for synthetic as well as for real image data.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"59 1","pages":"Pages 94-115"},"PeriodicalIF":0.0,"publicationDate":"1994-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1006","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80892138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 487
Convergence and Continuity Criteria for Discrete Approximations of the Continuous Planar Skeleton 连续平面骨架离散逼近的收敛性和连续性准则
Pub Date : 1994-01-01 DOI: 10.1006/ciun.1994.1007
Brandt J.W.

It has been proposed recently that the skeleton of a shape can be computed using the Voronoi diagram of a discrete sample set of the shape boundary. This method avoids many of the complications encountered when computing the skeleton directly from an image because it is based on a continuous-domain model for shapes. In order to make better use of this new approach, it is necessary to establish a bridge between the continuous domain skeleton and its approximation obtained from the discrete boundary sample set. In this paper, the skeleton and Voronoi diagram formulations are briefly reviewed and elaborated upon to establish criteria for the functions to be continuous. Then the new continuity results are related to the discrete sample set model in order to establish conditions under which the skeleton approximation converges to the exact continuous skeleton.

最近有人提出,可以使用形状边界的离散样本集的Voronoi图来计算形状的骨架。这种方法避免了直接从图像计算骨架时遇到的许多复杂性,因为它是基于形状的连续域模型。为了更好地利用这种新方法,有必要在连续的域骨架和从离散边界样本集得到的近似之间建立一座桥梁。在本文中,简要回顾和阐述了骨架和Voronoi图的公式,以建立函数连续的准则。然后将新的连续性结果与离散样本集模型联系起来,以建立骨架近似收敛于精确连续骨架的条件。
{"title":"Convergence and Continuity Criteria for Discrete Approximations of the Continuous Planar Skeleton","authors":"Brandt J.W.","doi":"10.1006/ciun.1994.1007","DOIUrl":"10.1006/ciun.1994.1007","url":null,"abstract":"<div><p>It has been proposed recently that the skeleton of a shape can be computed using the Voronoi diagram of a discrete sample set of the shape boundary. This method avoids many of the complications encountered when computing the skeleton directly from an image because it is based on a continuous-domain model for shapes. In order to make better use of this new approach, it is necessary to establish a bridge between the continuous domain skeleton and its approximation obtained from the discrete boundary sample set. In this paper, the skeleton and Voronoi diagram formulations are briefly reviewed and elaborated upon to establish criteria for the functions to be continuous. Then the new continuity results are related to the discrete sample set model in order to establish conditions under which the skeleton approximation converges to the exact continuous skeleton.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"59 1","pages":"Pages 116-124"},"PeriodicalIF":0.0,"publicationDate":"1994-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1007","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79242663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 80
期刊
CVGIP: Image Understanding
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1