首页 > 最新文献

CVGIP: Image Understanding最新文献

英文 中文
Phase-Based Binocular Vergence Control and Depth Reconstruction Using Active Vision 基于相位的双目收敛控制与主动视觉深度重建
Pub Date : 1994-11-01 DOI: 10.1006/ciun.1994.1061
Theimer W.M., Mallot H.A.

We present a technique for guiding vergence movements for an active stereo camera system and for calculating dense disparity maps. Both processes are described in the same theoretical framework based on phase differences in complex Gabor filter responses, modeling receptive field properties in the visual cortex. While the camera movements are computed with input images of coarse spatial resolution, the disparity map calculation uses a finer resolution in the scale space. The correspondence problem is solved implicitly by restricting the disparity range around zero disparity (Panum′s area in the human visual system). The vergence process is interpreted as a mechanism to minimize global disparity, thereby setting a 3D region of interest for subsequent disparity detection. The disparity map represents smaller local disparities as an important cue for depth perception. Experimental data for the integrated performance of vergence in natural scenes followed by disparity map calculations are presented.

我们提出了一种用于引导主动立体相机系统的收敛运动和计算密集视差图的技术。基于复杂Gabor过滤反应的相位差,这两个过程在相同的理论框架中被描述,模拟了视觉皮层的接受野特性。相机运动是用粗空间分辨率的输入图像来计算的,而视差图的计算则是在比例空间中使用更精细的分辨率。通过将视差范围限制在零视差附近(人类视觉系统中的Panum区域),隐式地解决了对应问题。收敛过程被解释为一种最小化全局视差的机制,从而为后续视差检测设置感兴趣的3D区域。视差图表示较小的局部差异,作为深度感知的重要线索。给出了基于视差图计算的自然场景收敛综合性能实验数据。
{"title":"Phase-Based Binocular Vergence Control and Depth Reconstruction Using Active Vision","authors":"Theimer W.M.,&nbsp;Mallot H.A.","doi":"10.1006/ciun.1994.1061","DOIUrl":"10.1006/ciun.1994.1061","url":null,"abstract":"<div><p>We present a technique for guiding vergence movements for an active stereo camera system and for calculating dense disparity maps. Both processes are described in the same theoretical framework based on phase differences in complex Gabor filter responses, modeling receptive field properties in the visual cortex. While the camera movements are computed with input images of coarse spatial resolution, the disparity map calculation uses a finer resolution in the scale space. The correspondence problem is solved implicitly by restricting the disparity range around zero disparity (Panum′s area in the human visual system). The vergence process is interpreted as a mechanism to minimize <em>global disparity</em>, thereby setting a 3D region of interest for subsequent disparity detection. The disparity map represents smaller <em>local disparities</em> as an important cue for depth perception. Experimental data for the integrated performance of vergence in natural scenes followed by disparity map calculations are presented.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"60 3","pages":"Pages 343-358"},"PeriodicalIF":0.0,"publicationDate":"1994-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1061","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"51091674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 104
Default Shape Theory: With Application to the Computation of the Direction of the Light Source 默认形状理论:在光源方向计算中的应用
Pub Date : 1994-11-01 DOI: 10.1006/ciun.1994.1058
Vega O.E., Yang Y.H.

Humans appear to have the ability to infer the three-dimensional shape of an object from its outline or occluding contour. In computer vision, an occluding contour is an important clue in the shape recovery process. So far, only a small family of shapes, in particular, shapes generated by generalized cylinders, and shapes of polyhedrons, has been investigated in computer vision. However, there are many instances where these types of surfaces are not applicable, e.g., a solid with holes or a solid with no planar surfaces. This paper proposes a new concept called the default shape theory which includes the solid of revolution as a special case. A default shape is a function whose domain is a closed contour and whose range is a vector representation of a three-dimensional solid. The application of this new concept to recover the direction of the light source is demonstrated in this paper. Experimental results show that the default-shape-based method has comparable performance as the Pentland light source determination algorithm. This paper also proves the relationship between one particular type of default shape with the solid of revolution. Other possible definitions of default shape are also discussed.

人类似乎有能力从物体的轮廓或遮挡轮廓推断出物体的三维形状。在计算机视觉中,遮挡轮廓是形状恢复过程中的重要线索。到目前为止,在计算机视觉中只研究了一小部分形状,特别是由广义圆柱体生成的形状和多面体的形状。然而,在许多情况下,这些类型的表面不适用,例如,有孔的实体或没有平面的实体。本文提出了一个新的概念,称为默认形状理论,其中包含了作为特例的旋转固体。默认形状是一个函数,其域是封闭轮廓,其范围是三维实体的矢量表示。本文演示了这一新概念在恢复光源方向中的应用。实验结果表明,基于默认形状的方法与Pentland光源确定算法具有相当的性能。本文还证明了一种特殊类型的默认形状与旋转立体之间的关系。还讨论了默认形状的其他可能定义。
{"title":"Default Shape Theory: With Application to the Computation of the Direction of the Light Source","authors":"Vega O.E.,&nbsp;Yang Y.H.","doi":"10.1006/ciun.1994.1058","DOIUrl":"10.1006/ciun.1994.1058","url":null,"abstract":"<div><p>Humans appear to have the ability to infer the three-dimensional shape of an object from its outline or occluding contour. In computer vision, an occluding contour is an important clue in the shape recovery process. So far, only a small family of shapes, in particular, shapes generated by generalized cylinders, and shapes of polyhedrons, has been investigated in computer vision. However, there are many instances where these types of surfaces are not applicable, e.g., a solid with holes or a solid with no planar surfaces. This paper proposes a new concept called the <em>default shape theory</em> which includes the solid of revolution as a special case. A default shape is a function whose domain is a closed contour and whose range is a vector representation of a three-dimensional solid. The application of this new concept to recover the direction of the light source is demonstrated in this paper. Experimental results show that the default-shape-based method has comparable performance as the Pentland light source determination algorithm. This paper also proves the relationship between one particular type of default shape with the solid of revolution. Other possible definitions of default shape are also discussed.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"60 3","pages":"Pages 285-299"},"PeriodicalIF":0.0,"publicationDate":"1994-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1058","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79298436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Finding Planes and Clusters of Objects from 3D Line Segments with Application to 3D Motion Determination 从三维线段中寻找平面和物体簇及其在三维运动确定中的应用
Pub Date : 1994-11-01 DOI: 10.1006/ciun.1994.1057
Zhang Z.Y., Faugeras O.D.

We address in this paper how to find clusters based on proximity and planar facets based on coplanarity from 3D line segments obtained from stereo. The proposed methods are efficient and have been tested with many real stereo data. These procedures are indispensable in many applications including scene interpretation, object modeling, and object recognition. We show their application to 3D motion determination. We have developed an algorithm based on the hypothesize-and-verify paradigm to register two consecutive 3D frames obtained from stereo and estimate their transformation/motion. By grouping 3D line segments in each frame into clusters and planes, we can reduce effectively the complexity of the hypothesis generation phase.

本文讨论了如何从立体图像中获得的三维线段中基于接近度的聚类和基于共平面的平面切面。所提出的方法是有效的,并已在大量的真实立体数据中得到验证。这些程序在许多应用程序中是必不可少的,包括场景解释,对象建模和对象识别。我们展示了它们在三维运动确定中的应用。我们开发了一种基于假设-验证范式的算法,用于注册从立体图像中获得的两个连续3D帧并估计它们的变换/运动。通过将每帧中的三维线段分组为聚类和平面,可以有效地降低假设生成阶段的复杂度。
{"title":"Finding Planes and Clusters of Objects from 3D Line Segments with Application to 3D Motion Determination","authors":"Zhang Z.Y.,&nbsp;Faugeras O.D.","doi":"10.1006/ciun.1994.1057","DOIUrl":"10.1006/ciun.1994.1057","url":null,"abstract":"<div><p>We address in this paper how to find clusters based on proximity and planar facets based on coplanarity from 3D line segments obtained from stereo. The proposed methods are efficient and have been tested with many real stereo data. These procedures are indispensable in many applications including scene interpretation, object modeling, and object recognition. We show their application to 3D motion determination. We have developed an algorithm based on the hypothesize-and-verify paradigm to register two consecutive 3D frames obtained from stereo and estimate their transformation/motion. By grouping 3D line segments in each frame into clusters and planes, we can reduce effectively the complexity of the hypothesis generation phase.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"60 3","pages":"Pages 267-284"},"PeriodicalIF":0.0,"publicationDate":"1994-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1057","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90417940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Robust Methods for Estimating Pose and a Sensitivity Analysis 姿态估计的鲁棒方法及灵敏度分析
Pub Date : 1994-11-01 DOI: 10.1006/ciun.1994.1060
Kumar R., Hanson A.R.

This paper mathematically analyzes and proposes new solutions for the problem of estimating the camera 3D location and orientation (pose determination) from a matched set of 3D model and 2D image landmark features. Least-squares techniques for line tokens, which minimize both rotation and translation simultaneously, are developed and shown to be far superior to the earlier techniques which solved for rotation first and then translation. However, least-squares techniques fail catastrophically when outliers (or gross errors) are present in the match data. Outliers arise frequently due to incorrect correspondences or gross errors in the 3D model. Robust techniques for pose determination are developed to handle data contaminated by fewer than 50.0% outliers. Finally, the sensitivity of pose determination to incorrect estimates of camera parameters is analyzed. It is shown that for small field of view systems, offsets in the image center do not significantly affect the location of the camera in a world coordinate system. Errors in the focal length significantly affect only the component of translation along the optical axis in the pose computation.

本文从数学上分析并提出了从一组匹配的三维模型和二维图像地标特征估计相机三维位置和方向(姿态确定)问题的新解决方案。线标记的最小二乘技术,同时最小化旋转和平移,被开发出来,并被证明远远优于早期的技术,先解决旋转,再解决平移。然而,当匹配数据中存在异常值(或严重错误)时,最小二乘技术就会失败。在三维模型中,由于不正确的对应或严重的错误,经常出现异常值。开发了健壮的姿态确定技术,以处理被少于50.0%的异常值污染的数据。最后,分析了姿态确定对相机参数不正确估计的敏感性。结果表明,对于小视场系统,图像中心的偏移量不会显著影响相机在世界坐标系中的位置。在位姿计算中,焦距误差只显著影响沿光轴方向的平移分量。
{"title":"Robust Methods for Estimating Pose and a Sensitivity Analysis","authors":"Kumar R.,&nbsp;Hanson A.R.","doi":"10.1006/ciun.1994.1060","DOIUrl":"10.1006/ciun.1994.1060","url":null,"abstract":"<div><p>This paper mathematically analyzes and proposes new solutions for the problem of estimating the camera 3D location and orientation (<em>pose determination</em>) from a matched set of 3D model and 2D image landmark features. Least-squares techniques for line tokens, which minimize both rotation and translation simultaneously, are developed and shown to be far superior to the earlier techniques which solved for rotation first and then translation. However, least-squares techniques fail catastrophically when outliers (or gross errors) are present in the match data. Outliers arise frequently due to incorrect correspondences or gross errors in the 3D model. Robust techniques for pose determination are developed to handle data contaminated by fewer than 50.0% outliers. Finally, the sensitivity of pose determination to incorrect estimates of camera parameters is analyzed. It is shown that for small field of view systems, offsets in the image center do not significantly affect the location of the camera in a world coordinate system. Errors in the focal length significantly affect only the component of translation along the optical axis in the pose computation.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"60 3","pages":"Pages 313-342"},"PeriodicalIF":0.0,"publicationDate":"1994-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1060","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87995181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 243
Refining 3D Reconstructions: A Theoretical and Experimental Study of the Effect of Cross-Correlations 细化三维重建:相互关联效应的理论和实验研究
Pub Date : 1994-11-01 DOI: 10.1006/ciun.1994.1062
Thomas J.I., Hanson A., Oliensis J.

In robot navigation a model of the environment needs to be reconstructed for various applications, including path planning, obstacle avoidance, and determining where the robot is located. Traditionally, the model was acquired using two images (two-frame structure from motion) but the acquired models were unreliable and inaccurate. Recently, research has shifted to using several frames (multiframe structure from motion) instead of just two frames. However, almost none of the reported multiframe algorithms have produced accurate and stable reconstructions for general robot motion. The main reason seems to be that the primary source of error in the reconstruction-the error in the underlying motion-has been mostly ignored. Intuitively, if a reconstruction of the scene is made up of points, this motion error affects each reconstructed point in a systematic way. For example, if the translation of the robot is erroneous in a certain direction, all the reconstructed points would be shifted along the same direction. The contributions of this paper include mathematically isolating the effect of the motion error (as correlations in the structure error) and showing theoretically that these correlations can improve existing multiframe structure from motion techniques. Finally it is shown that new experimental results and previously reported work confirm the theoretical predictions.

在机器人导航中,需要重建环境模型,用于各种应用,包括路径规划、避障和确定机器人的位置。传统的模型获取方法是采用两幅运动图像(两帧结构),但获取的模型不可靠且不准确。最近,研究已经转向使用几个帧(多帧结构从运动),而不是仅仅两个帧。然而,几乎没有报道的多帧算法产生准确和稳定的一般机器人运动重建。主要原因似乎是重建中的主要误差来源——潜在运动中的误差——大多被忽略了。直观地说,如果场景的重建是由点组成的,这种运动误差会系统地影响每个重建点。例如,如果机器人在某个方向上的平移错误,则所有重构点将沿同一方向移动。本文的贡献包括在数学上隔离运动误差的影响(作为结构误差中的相关性),并从理论上表明这些相关性可以从运动技术中改进现有的多帧结构。最后表明,新的实验结果和先前报道的工作证实了理论预测。
{"title":"Refining 3D Reconstructions: A Theoretical and Experimental Study of the Effect of Cross-Correlations","authors":"Thomas J.I.,&nbsp;Hanson A.,&nbsp;Oliensis J.","doi":"10.1006/ciun.1994.1062","DOIUrl":"https://doi.org/10.1006/ciun.1994.1062","url":null,"abstract":"<div><p>In robot navigation a model of the environment needs to be reconstructed for various applications, including path planning, obstacle avoidance, and determining where the robot is located. Traditionally, the model was acquired using two images (two-frame structure from motion) but the acquired models were unreliable and inaccurate. Recently, research has shifted to using several frames (multiframe structure from motion) instead of just two frames. However, almost none of the reported multiframe algorithms have produced accurate and stable reconstructions for general robot motion. The main reason seems to be that the primary source of error in the reconstruction-the error in the underlying motion-has been mostly ignored. Intuitively, if a reconstruction of the scene is made up of points, this motion error affects each reconstructed point in a systematic way. For example, if the translation of the robot is erroneous in a certain direction, all the reconstructed points would be shifted along the same direction. The contributions of this paper include mathematically isolating the effect of the motion error (as correlations in the structure error) and showing theoretically that these correlations can improve existing multiframe structure from motion techniques. Finally it is shown that new experimental results and previously reported work confirm the theoretical predictions.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"60 3","pages":"Pages 359-370"},"PeriodicalIF":0.0,"publicationDate":"1994-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1062","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91983810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Author Index for Volume 60 第60卷作者索引
Pub Date : 1994-11-01 DOI: 10.1006/ciun.1994.1066
{"title":"Author Index for Volume 60","authors":"","doi":"10.1006/ciun.1994.1066","DOIUrl":"https://doi.org/10.1006/ciun.1994.1066","url":null,"abstract":"","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"60 3","pages":"Page 399"},"PeriodicalIF":0.0,"publicationDate":"1994-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1066","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92027842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Refining 3D reconstruction: a theoretical and experimental study of the effect of cross-correlations 细化三维重建:相互关系影响的理论和实验研究
Pub Date : 1994-11-01 DOI: 10.1006/CIUN.1994.1062
J. I. Thomas, A. Hanson, J. Oliensis
Abstract In robot navigation a model of the environment needs to be reconstructed for various applications, including path planning, obstacle avoidance, and determining where the robot is located. Traditionally, the model was acquired using two images (two-frame structure from motion) but the acquired models were unreliable and inaccurate. Recently, research has shifted to using several frames (multiframe structure from motion) instead of just two frames. However, almost none of the reported multiframe algorithms have produced accurate and stable reconstructions for general robot motion. The main reason seems to be that the primary source of error in the reconstruction-the error in the underlying motion-has been mostly ignored. Intuitively, if a reconstruction of the scene is made up of points, this motion error affects each reconstructed point in a systematic way. For example, if the translation of the robot is erroneous in a certain direction, all the reconstructed points would be shifted along the same direction. The contributions of this paper include mathematically isolating the effect of the motion error (as correlations in the structure error) and showing theoretically that these correlations can improve existing multiframe structure from motion techniques. Finally it is shown that new experimental results and previously reported work confirm the theoretical predictions.
在机器人导航中,需要重建环境模型,用于各种应用,包括路径规划、避障和确定机器人的位置。传统的模型获取方法是采用两幅运动图像(两帧结构),但获取的模型不可靠且不准确。最近,研究已经转向使用几个帧(多帧结构从运动),而不是仅仅两个帧。然而,几乎没有报道的多帧算法产生准确和稳定的一般机器人运动重建。主要原因似乎是重建中的主要误差来源——潜在运动中的误差——大多被忽略了。直观地说,如果场景的重建是由点组成的,这种运动误差会系统地影响每个重建点。例如,如果机器人在某个方向上的平移错误,则所有重构点将沿同一方向移动。本文的贡献包括在数学上隔离运动误差的影响(作为结构误差中的相关性),并从理论上表明这些相关性可以从运动技术中改进现有的多帧结构。最后表明,新的实验结果和先前报道的工作证实了理论预测。
{"title":"Refining 3D reconstruction: a theoretical and experimental study of the effect of cross-correlations","authors":"J. I. Thomas, A. Hanson, J. Oliensis","doi":"10.1006/CIUN.1994.1062","DOIUrl":"https://doi.org/10.1006/CIUN.1994.1062","url":null,"abstract":"Abstract In robot navigation a model of the environment needs to be reconstructed for various applications, including path planning, obstacle avoidance, and determining where the robot is located. Traditionally, the model was acquired using two images (two-frame structure from motion) but the acquired models were unreliable and inaccurate. Recently, research has shifted to using several frames (multiframe structure from motion) instead of just two frames. However, almost none of the reported multiframe algorithms have produced accurate and stable reconstructions for general robot motion. The main reason seems to be that the primary source of error in the reconstruction-the error in the underlying motion-has been mostly ignored. Intuitively, if a reconstruction of the scene is made up of points, this motion error affects each reconstructed point in a systematic way. For example, if the translation of the robot is erroneous in a certain direction, all the reconstructed points would be shifted along the same direction. The contributions of this paper include mathematically isolating the effect of the motion error (as correlations in the structure error) and showing theoretically that these correlations can improve existing multiframe structure from motion techniques. Finally it is shown that new experimental results and previously reported work confirm the theoretical predictions.","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"1 1","pages":"359-370"},"PeriodicalIF":0.0,"publicationDate":"1994-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82205917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
3D Structure Reconstruction from Point Correspondences between two Perspective Projections 从两个透视投影之间的点对应中重建三维结构
Pub Date : 1994-11-01 DOI: 10.1006/ciun.1994.1065
Kara A., Wilkes D.M., Kawamura K.

An iterative algorithm for 3D structure reconstruction from two perspective projections is proposed. The basis of the method is the eight-point algorithm (Longuet-Higgins, Nature 293(10), 1981, 133-135; Tsai and Huang, IEEE Trans. PAMI 6, 1984, 13-27). A drawback of the eight-point algorithm is that it requires at least eight point correspondences. Further, there are certain point configurations for which the algorithm fails. For example, the eight corners of a cube on a quadratic surface passing through the focal points of the cameras form such a degenerate configuration. By combining the eight-point algorithm with an SVD (singular value decomposition) characterization of the so-called E-matrix (Faugeras and Maybank, Internat. J. Comput. Vision 4, 1990, 225-246; Huang and Faugeras, IEEE Trans. PAMI 11, 1989, 1310-1312), the proposed iterative algorithm solves the 3D reconstruction problem even from less than eight points. The algorithm is also free from the artificial degeneracy problem inherent to the eight-point algorithm. The iteration in the algorithm takes place only if the configuration is degenerate or violates the SVD characterization due to measurement error. Otherwise the computation is O(N) as in the eight-point algorithm.

提出了一种基于两视角投影的三维结构重建迭代算法。该方法的基础是8点算法(Longuet-Higgins, Nature 293(10), 1981,133 -135;蔡和黄,IEEE译。Pami 6, 1984, 13-27)。8点算法的一个缺点是它需要至少8个点对应。此外,对于某些点的配置,该算法是失败的。例如,一个二次曲面上的立方体的八个角通过相机的焦点形成了这样一个简并构型。通过将8点算法与所谓的e矩阵的SVD(奇异值分解)表征相结合(Faugeras和Maybank, Internat)。j .第一版。愿景4,1990,225-246;Huang和Faugeras, IEEE译。PAMI 11, 1989, 1310-1312),所提出的迭代算法即使在小于8个点的情况下也能解决三维重建问题。该算法还避免了八点算法固有的人工退化问题。只有当构型退化或由于测量误差而违反SVD特征时,算法才会进行迭代。否则计算是O(N),在8点算法。
{"title":"3D Structure Reconstruction from Point Correspondences between two Perspective Projections","authors":"Kara A.,&nbsp;Wilkes D.M.,&nbsp;Kawamura K.","doi":"10.1006/ciun.1994.1065","DOIUrl":"10.1006/ciun.1994.1065","url":null,"abstract":"<div><p>An iterative algorithm for 3D structure reconstruction from two perspective projections is proposed. The basis of the method is the <em>eight-point algorithm</em> (Longuet-Higgins, <em>Nature</em> 293(10), 1981, 133-135; Tsai and Huang, <em>IEEE Trans. PAMI</em> 6, 1984, 13-27). A drawback of the eight-point algorithm is that it requires at least eight point correspondences. Further, there are certain point configurations for which the algorithm fails. For example, the eight corners of a cube on a quadratic surface passing through the focal points of the cameras form such a degenerate configuration. By combining the eight-point algorithm with an SVD (singular value decomposition) characterization of the so-called <em>E</em>-matrix (Faugeras and Maybank, <em>Internat. J. Comput. Vision</em> 4, 1990, 225-246; Huang and Faugeras, <em>IEEE Trans. PAMI</em> 11, 1989, 1310-1312), the proposed iterative algorithm solves the 3D reconstruction problem even from less than eight points. The algorithm is also free from the artificial degeneracy problem inherent to the eight-point algorithm. The iteration in the algorithm takes place only if the configuration is degenerate or violates the SVD characterization due to measurement error. Otherwise the computation is <em>O</em>(<em>N</em>) as in the eight-point algorithm.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"60 3","pages":"Pages 392-397"},"PeriodicalIF":0.0,"publicationDate":"1994-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1065","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72611953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Computational Cross Ratio for Computer Vision 计算机视觉的计算交叉比
Pub Date : 1994-11-01 DOI: 10.1006/ciun.1994.1063
Kanatani K.

A new "computational" formulation of cross ratio is presented with a view to applications to computer vision problems by extending the framework of "computational projective geometry" of Kanatani (Image Understand. 54, 1991, 333-348). As typical examples, we construct procedures for computing the 3-D orientation of a planar shape from its 2-D projection image and the focus of expansion from an image trajectory of a single point by taking advantage of the perspective invariance of cross ratio and "projective coordinates," and the resulting 3-D interpretation of "harmonic range."

通过扩展Kanatani的“计算射影几何”框架,提出了一种新的“计算”交叉比公式,以期将其应用于计算机视觉问题(Image Understand. 54,1991,333 -348)。作为典型的例子,我们构建了从平面形状的二维投影图像计算三维方向的程序,并利用交叉比和“射影坐标”的透视不变性,从单点的图像轨迹计算扩展焦点,以及由此产生的“谐波范围”的三维解释。
{"title":"Computational Cross Ratio for Computer Vision","authors":"Kanatani K.","doi":"10.1006/ciun.1994.1063","DOIUrl":"10.1006/ciun.1994.1063","url":null,"abstract":"<div><p>A new \"computational\" formulation of cross ratio is presented with a view to applications to computer vision problems by extending the framework of \"computational projective geometry\" of Kanatani (<em>Image Understand.</em> 54, 1991, 333-348). As typical examples, we construct procedures for computing the 3-D orientation of a planar shape from its 2-D projection image and the focus of expansion from an image trajectory of a single point by taking advantage of the perspective invariance of cross ratio and \"projective coordinates,\" and the resulting 3-D interpretation of \"harmonic range.\"</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"60 3","pages":"Pages 371-381"},"PeriodicalIF":0.0,"publicationDate":"1994-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1063","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82060116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Statistical Foundation for Hypothesis Testing of Image Data 图像数据假设检验的统计基础
Pub Date : 1994-11-01 DOI: 10.1006/ciun.1994.1064
Kanatani K.

A statistical foundation is given to the problem of hypothesizing and testing geometric properties of image data heuristically derived by Kanatani (CVGIP: Image Understanding54 (1991), 333-348). Points and lines in the image are represented by "N-vectors" and their reliability is evaluated by their "covariance matrices". Under a Gaussian approximation of the distribution, the test takes the form of a χ2 test. Test criteria are explicitly stated for model matching and testing edge groupings, vanishing points, focuses of expansion, and vanishing lines.

统计基础给出了假设和测试图像数据的几何特性的问题,这是由Kanatani (CVGIP: image Understanding54(1991), 333-348)启发式推导出来的。图像中的点和线用“n向量”表示,它们的可靠性由它们的“协方差矩阵”来评估。在分布的高斯近似下,检验采用χ2检验的形式。明确地说明了模型匹配和测试边缘分组、消失点、扩展焦点和消失线的测试标准。
{"title":"Statistical Foundation for Hypothesis Testing of Image Data","authors":"Kanatani K.","doi":"10.1006/ciun.1994.1064","DOIUrl":"10.1006/ciun.1994.1064","url":null,"abstract":"<div><p>A statistical foundation is given to the problem of hypothesizing and testing geometric properties of image data heuristically derived by Kanatani (<em>CVGIP: Image Understanding</em><em>54</em> (1991), 333-348). Points and lines in the image are represented by \"N-vectors\" and their reliability is evaluated by their \"covariance matrices\". Under a Gaussian approximation of the distribution, the test takes the form of a χ<sup>2</sup> test. Test criteria are explicitly stated for model matching and testing edge groupings, vanishing points, focuses of expansion, and vanishing lines.</p></div>","PeriodicalId":100350,"journal":{"name":"CVGIP: Image Understanding","volume":"60 3","pages":"Pages 382-391"},"PeriodicalIF":0.0,"publicationDate":"1994-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/ciun.1994.1064","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88197717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
CVGIP: Image Understanding
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1