首页 > 最新文献

Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)最新文献

英文 中文
Constrained self-calibration 受约束的自校准
J. Mendelsohn, Kostas Daniilidis
This paper focuses on the estimation of the intrinsic camera parameters and the trajectory of the camera from an image sequence. Intrinsic camera calibration and pose estimation are the prerequisites for many applications involving navigation tasks, scene reconstruction, and merging of virtual and real environments. Proposed and evaluated is a technical solution to decrease the sensitivity of self-calibration by placing easily identifiable targets of known shape in the environment. The relative position of the targets need not be known a priori. Assuming an appropriate ratio of size to distance these targets resolve known ambiguities. Constraints on the target placement and the cameras' motions are explored. The algorithm is extensively tested in a variety of real-world scenarios.
本文主要研究了从图像序列中估计相机的固有参数和相机的运动轨迹。内置相机校准和姿态估计是涉及导航任务,场景重建以及虚拟和真实环境合并的许多应用程序的先决条件。提出并评估了一种通过在环境中放置易于识别的已知形状的目标来降低自校准灵敏度的技术解决方案。不需要先验地知道目标的相对位置。假设大小与距离的适当比例,这些目标解决了已知的模糊性。探讨了目标位置和摄像机运动的约束条件。该算法在各种现实世界的场景中进行了广泛的测试。
{"title":"Constrained self-calibration","authors":"J. Mendelsohn, Kostas Daniilidis","doi":"10.1109/CVPR.1999.784974","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784974","url":null,"abstract":"This paper focuses on the estimation of the intrinsic camera parameters and the trajectory of the camera from an image sequence. Intrinsic camera calibration and pose estimation are the prerequisites for many applications involving navigation tasks, scene reconstruction, and merging of virtual and real environments. Proposed and evaluated is a technical solution to decrease the sensitivity of self-calibration by placing easily identifiable targets of known shape in the environment. The relative position of the targets need not be known a priori. Assuming an appropriate ratio of size to distance these targets resolve known ambiguities. Constraints on the target placement and the cameras' motions are explored. The algorithm is extensively tested in a variety of real-world scenarios.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"10 1","pages":"581-587 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73990453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Indexing using a spectral encoding of topological structure 索引使用频谱编码的拓扑结构
A. Shokoufandeh, Sven J. Dickinson, Kaleem Siddiqi, S. Zucker
In an object recognition system, if the extracted image features are multilevel or multiscale, the indexing structure may take the form of a tree. Such structures are not only common in computer vision, but also appear in linguistics, graphics, computational biology, and a wide range of other domains. In this paper, we develop an indexing mechanism that maps the topological structure of a tree into a low-dimensional vector space. Based on a novel eigenvalue characterization of a tree, this topological signature allows us to efficiently retrieve a small set of candidates from a database of models. To accommodate occlusion and local deformation, local evidence is accumulated in each of the tree's topological subspaces. We demonstrate the approach with a series of indexing experiments in the domain of 2-D object recognition.
在目标识别系统中,如果提取的图像特征是多层次或多尺度的,则索引结构可以采用树的形式。这种结构不仅在计算机视觉中很常见,而且出现在语言学、图形学、计算生物学和其他广泛的领域。在本文中,我们开发了一种索引机制,将树的拓扑结构映射到低维向量空间。基于一种新的树的特征值特征,这种拓扑特征允许我们从模型数据库中有效地检索一小组候选模型。为了适应遮挡和局部变形,在树的每个拓扑子空间中积累局部证据。我们通过一系列二维目标识别领域的索引实验来证明该方法。
{"title":"Indexing using a spectral encoding of topological structure","authors":"A. Shokoufandeh, Sven J. Dickinson, Kaleem Siddiqi, S. Zucker","doi":"10.1109/CVPR.1999.784726","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784726","url":null,"abstract":"In an object recognition system, if the extracted image features are multilevel or multiscale, the indexing structure may take the form of a tree. Such structures are not only common in computer vision, but also appear in linguistics, graphics, computational biology, and a wide range of other domains. In this paper, we develop an indexing mechanism that maps the topological structure of a tree into a low-dimensional vector space. Based on a novel eigenvalue characterization of a tree, this topological signature allows us to efficiently retrieve a small set of candidates from a database of models. To accommodate occlusion and local deformation, local evidence is accumulated in each of the tree's topological subspaces. We demonstrate the approach with a series of indexing experiments in the domain of 2-D object recognition.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"73 1","pages":"491-497 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74645857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 177
Interpolating view and scene motion by dynamic view morphing 通过动态视图变形插值视图和场景运动
R. A. Manning, C. Dyer
We introduce the problem of view interpolation for dynamic scenes. Our solution to this problem extends the concept of view morphing and retains the practical advantages of that method. We are specifically concerned with interpolating between two reference views captured at different times, so that there is a missing interval of time between when the views were taken. The synthetic interpolations produced by our algorithm portray one possible physically-valid version of what transpired in the scene during the missing time. It is assumed that each object in the original scene underwent a series of rigid translations. Dynamic view morphing can work with widely-spaced reference views, sparse point correspondences, and uncalibrated cameras. When the camera-to-camera transformation can be determined, the synthetic interpolation will portray scene objects moving along straight-line, constant-velocity trajectories in world space.
介绍了动态场景的视图插值问题。我们对这个问题的解决方案扩展了视图变形的概念,并保留了该方法的实际优点。我们特别关注在不同时间捕获的两个参考视图之间的插值,以便在视图被捕获之间存在缺失的时间间隔。由我们的算法产生的合成插值描绘了一个可能的物理有效版本,在消失的时间里,场景中发生了什么。假设原始场景中的每个物体都经历了一系列的刚性平移。动态视图变形可以与宽间隔的参考视图、稀疏点对应和未校准的相机一起工作。当摄像机到摄像机的变换确定后,合成插值将在世界空间中描绘沿直线匀速轨迹运动的场景物体。
{"title":"Interpolating view and scene motion by dynamic view morphing","authors":"R. A. Manning, C. Dyer","doi":"10.1109/CVPR.1999.786968","DOIUrl":"https://doi.org/10.1109/CVPR.1999.786968","url":null,"abstract":"We introduce the problem of view interpolation for dynamic scenes. Our solution to this problem extends the concept of view morphing and retains the practical advantages of that method. We are specifically concerned with interpolating between two reference views captured at different times, so that there is a missing interval of time between when the views were taken. The synthetic interpolations produced by our algorithm portray one possible physically-valid version of what transpired in the scene during the missing time. It is assumed that each object in the original scene underwent a series of rigid translations. Dynamic view morphing can work with widely-spaced reference views, sparse point correspondences, and uncalibrated cameras. When the camera-to-camera transformation can be determined, the synthetic interpolation will portray scene objects moving along straight-line, constant-velocity trajectories in world space.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"7 1","pages":"388-394 Vol. 1"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74284480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 83
Efficient bundle adjustment with virtual key frames: a hierarchical approach to multi-frame structure from motion 基于虚拟关键帧的有效束调整:一种从运动到多帧结构的分层方法
H. Shum, Zhengyou Zhang, Qifa Ke
In this paper we present an efficient hierarchical approach to structure from motion for long image sequences. There are two key elements to our approach: accurate 3D reconstruction for each segment and efficient bundle adjustment for the whole sequence. The image sequence is first divided into a number of segments so that feature points can be reliably tracked across each segment. Each segment has a long baseline to ensure accurate 3D reconstruction. To efficiently bundle adjust 3D structures from ail segments, we reduce the number of frames in each segment by introducing "virtual keyframes". The virtual frames encode the 3D structure of each segment along with its uncertainty but they form a small subset of the original frames. Our method achieves significant speedup over conventional bundle adjustment methods.
在本文中,我们提出了一种有效的从运动中构造长图像序列的分层方法。我们的方法有两个关键要素:每个片段的精确3D重建和整个序列的有效束调整。首先将图像序列划分为多个片段,以便在每个片段上可靠地跟踪特征点。每个部分都有很长的基线,以确保准确的3D重建。为了有效地从所有片段中捆绑调整3D结构,我们通过引入“虚拟关键帧”来减少每个片段中的帧数。虚拟帧编码了每个片段的三维结构及其不确定性,但它们只是原始帧的一小部分。与传统的束调整方法相比,我们的方法实现了显著的加速。
{"title":"Efficient bundle adjustment with virtual key frames: a hierarchical approach to multi-frame structure from motion","authors":"H. Shum, Zhengyou Zhang, Qifa Ke","doi":"10.1109/CVPR.1999.784733","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784733","url":null,"abstract":"In this paper we present an efficient hierarchical approach to structure from motion for long image sequences. There are two key elements to our approach: accurate 3D reconstruction for each segment and efficient bundle adjustment for the whole sequence. The image sequence is first divided into a number of segments so that feature points can be reliably tracked across each segment. Each segment has a long baseline to ensure accurate 3D reconstruction. To efficiently bundle adjust 3D structures from ail segments, we reduce the number of frames in each segment by introducing \"virtual keyframes\". The virtual frames encode the 3D structure of each segment along with its uncertainty but they form a small subset of the original frames. Our method achieves significant speedup over conventional bundle adjustment methods.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"316 1","pages":"538-543 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77623309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 81
Robot localization using uncalibrated camera invariants 利用未标定的相机不变量进行机器人定位
M. Werman, M. Qiu, Subhashis Banerjee, Sumantra Dutta Roy
We describe a set of image measurements which are invariant to the camera internals but are location variant. We show that using these measurements it is possible to calculate the self-localization of a robot using known landmarks and uncalibrated cameras. We also show that it is possible to compute, using uncalibrated cameras, the Euclidean structure of 3-D world points using multiple views from known positions. We are free to alter the internal parameters of the camera during these operations. Our initial experiments demonstrate the applicability of the method.
我们描述了一组图像测量值,这些测量值与相机内部不变,但与位置不同。我们表明,使用这些测量值,可以使用已知地标和未校准的相机计算机器人的自定位。我们还表明,使用未校准的摄像机,可以使用来自已知位置的多个视图计算三维世界点的欧几里得结构。在这些操作过程中,我们可以自由地改变相机的内部参数。我们的初步实验证明了该方法的适用性。
{"title":"Robot localization using uncalibrated camera invariants","authors":"M. Werman, M. Qiu, Subhashis Banerjee, Sumantra Dutta Roy","doi":"10.1109/CVPR.1999.784658","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784658","url":null,"abstract":"We describe a set of image measurements which are invariant to the camera internals but are location variant. We show that using these measurements it is possible to calculate the self-localization of a robot using known landmarks and uncalibrated cameras. We also show that it is possible to compute, using uncalibrated cameras, the Euclidean structure of 3-D world points using multiple views from known positions. We are free to alter the internal parameters of the camera during these operations. Our initial experiments demonstrate the applicability of the method.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"69 1","pages":"353-359 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79833492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Robust and efficient image alignment with spatially varying illumination models 鲁棒和有效的图像对齐与空间变化的照明模型
S. Lai, M. Fang
Image alignment is one of the most important task in computer vision. In this paper, we explicitly model spatial illumination variations by low-order polynomial functions in an energy minimization framework. Data constraints for the alignment and illumination parameters are derived from the first-order Taylor approximation of a generalized brightness assumption. We formulate the parameter estimation problem in a weighted least-square framework by using the influence function from robust estimation to derive an iterative re-weighted least-square algorithm. A dynamic weighting scheme, which combines the factors from influence function, consistency of image gradients and nonlinear image intensity sensing, is used to improve the robustness of the image matching. In addition, a constraint sampling scheme and an estimation-warping alternative strategy are used in the proposed algorithm to improve its efficiency and accuracy. Experimental results are shown to demonstrate the robustness, efficiency and accuracy of the algorithm.
图像对齐是计算机视觉中的重要任务之一。本文利用能量最小化框架下的低阶多项式函数对空间光照变化进行了显式建模。从广义亮度假设的一阶泰勒近似推导出对准和照明参数的数据约束。在加权最小二乘框架下,利用鲁棒估计的影响函数,推导出一种迭代的重加权最小二乘算法。结合影响函数、图像梯度一致性和非线性图像强度感知等因素,采用动态加权方案提高图像匹配的鲁棒性。此外,为了提高算法的效率和精度,该算法还采用了约束采样方案和估计-翘曲替代策略。实验结果证明了该算法的鲁棒性、有效性和准确性。
{"title":"Robust and efficient image alignment with spatially varying illumination models","authors":"S. Lai, M. Fang","doi":"10.1109/CVPR.1999.784625","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784625","url":null,"abstract":"Image alignment is one of the most important task in computer vision. In this paper, we explicitly model spatial illumination variations by low-order polynomial functions in an energy minimization framework. Data constraints for the alignment and illumination parameters are derived from the first-order Taylor approximation of a generalized brightness assumption. We formulate the parameter estimation problem in a weighted least-square framework by using the influence function from robust estimation to derive an iterative re-weighted least-square algorithm. A dynamic weighting scheme, which combines the factors from influence function, consistency of image gradients and nonlinear image intensity sensing, is used to improve the robustness of the image matching. In addition, a constraint sampling scheme and an estimation-warping alternative strategy are used in the proposed algorithm to improve its efficiency and accuracy. Experimental results are shown to demonstrate the robustness, efficiency and accuracy of the algorithm.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"14 1","pages":"167-172 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81262952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 45
Sensor planning for a trinocular active vision system 三眼主动视觉系统的传感器规划
Peter Lehel, E. Hemayed, A. Farag
We present an algorithm to solve the sensor planning problem for a trinocular, active vision system. This algorithm uses an iterative optimization method to first solve for the translation between the three cameras and then uses this result to solve for parameters such as pan, tilt angles of the cameras and zoom setting.
提出了一种求解三眼主动视觉系统传感器规划问题的算法。该算法采用迭代优化的方法,首先求解三台相机之间的平移,然后利用该结果求解相机的平移、倾斜角度和变焦设置等参数。
{"title":"Sensor planning for a trinocular active vision system","authors":"Peter Lehel, E. Hemayed, A. Farag","doi":"10.1109/CVPR.1999.784649","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784649","url":null,"abstract":"We present an algorithm to solve the sensor planning problem for a trinocular, active vision system. This algorithm uses an iterative optimization method to first solve for the translation between the three cameras and then uses this result to solve for parameters such as pan, tilt angles of the cameras and zoom setting.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"3 1 1","pages":"306-312 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85114174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Extending superquadrics with exponent functions: modeling and reconstruction 用指数函数扩展超二次曲面:建模与重构
Lin Zhou, C. Kambhamettu
Superquadrics are a family of parametric shapes which can model a diverse set of objects. They have received significant attention because of their compact representation and robust methods for recovery of 3D models. However, their assumption of intrinsical symmetry fails in modeling numerous real-world examples such as human, body, animals, and other naturally occurring objects. In this paper, we present a novel approach, which is called extended superquadric to extend superquadric's representation power with exponent functions. An extended superquadric model can be deformed in any direction because it extends the exponents of superquadrics from constants to functions of the latitude and longitude angles in the spherical coordinate system. Thus extended superquadrics can model more complex shapes than superquadrics. It also maintains many desired properties of superquadrics such as compactness controllability, and intuitive meaning, which are all advantageous for shape modeling, recognition, and reconstruction. In this paper, besides the use of extended superquadrics for modeling, we also discuss our research into the recovery of extended superquadrics from 3D information (reconstruction). Experimental results of fitting extended superquadrics to 3D real data are presented. Our results are very encouraging and indicate that the use of extended superquadric is a promising paradigm for shape representation and recovery in computers vision and has potential benefits for the generation of synthetic images for computer graphics.
超二次曲面是一组参数形状,可以模拟各种各样的物体。由于其紧凑的表示和强大的3D模型恢复方法,它们受到了极大的关注。然而,他们对内在对称性的假设在模拟人类、身体、动物和其他自然发生的物体等许多现实世界的例子时失败了。本文提出了一种新的方法——扩展超二次曲线,用指数函数来扩展超二次曲线的表示能力。扩展的超二次曲面模型可以在任何方向上变形,因为它将超二次曲面的指数从常数扩展到球坐标系中经纬度角的函数。因此,扩展超二次曲面可以模拟比超二次曲面更复杂的形状。它还保持了超二次曲面的紧凑性、可控性和直观意义等特性,有利于形状建模、识别和重构。在本文中,除了使用扩展超二次曲面进行建模之外,我们还讨论了从三维信息中恢复扩展超二次曲面(重建)的研究。给出了扩展超二次曲面拟合三维实际数据的实验结果。我们的结果非常令人鼓舞,并表明扩展超二次曲面的使用是计算机视觉中形状表示和恢复的一个有前途的范例,并且对计算机图形学合成图像的生成具有潜在的好处。
{"title":"Extending superquadrics with exponent functions: modeling and reconstruction","authors":"Lin Zhou, C. Kambhamettu","doi":"10.1109/CVPR.1999.784611","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784611","url":null,"abstract":"Superquadrics are a family of parametric shapes which can model a diverse set of objects. They have received significant attention because of their compact representation and robust methods for recovery of 3D models. However, their assumption of intrinsical symmetry fails in modeling numerous real-world examples such as human, body, animals, and other naturally occurring objects. In this paper, we present a novel approach, which is called extended superquadric to extend superquadric's representation power with exponent functions. An extended superquadric model can be deformed in any direction because it extends the exponents of superquadrics from constants to functions of the latitude and longitude angles in the spherical coordinate system. Thus extended superquadrics can model more complex shapes than superquadrics. It also maintains many desired properties of superquadrics such as compactness controllability, and intuitive meaning, which are all advantageous for shape modeling, recognition, and reconstruction. In this paper, besides the use of extended superquadrics for modeling, we also discuss our research into the recovery of extended superquadrics from 3D information (reconstruction). Experimental results of fitting extended superquadrics to 3D real data are presented. Our results are very encouraging and indicate that the use of extended superquadric is a promising paradigm for shape representation and recovery in computers vision and has potential benefits for the generation of synthetic images for computer graphics.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"12 1","pages":"73-78 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81213883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 52
A dependency-based framework of combining multiple experts for the recognition of unconstrained handwritten numerals 结合多个专家的基于依赖的框架,用于无约束手写数字的识别
H. Kang, Seong-Whan Lee
Although Behavior-Knowledge Space (BKS) method does not need any assumptions in combining multiple experts, it should build theoretically exponential storage spaces for storing and managing jointly observed K decisions from K experts. That is, combining K experts needs a (K+1)st-order probability distribution. However, it is well known that the distribution becomes unmanageable in storing and estimating, even for a small K. In order to overcome such weakness, it would be attractive to decompose the distribution into a number of component distributions and to approximate the distribution with a product of the component distributions. One of such previous works is to apply a conditional independence assumption to the distribution. Another work is to approximate the distribution with a product of only first-order tree dependencies or second-order distributions. In this paper, a dependency-based framework is proposed to optimality approximate a probability distribution with a product set of dth-order dependencies where 1
虽然行为知识空间(BKS)方法在组合多个专家时不需要任何假设,但它应该建立理论上的指数存储空间来存储和管理K个专家共同观察到的K个决策。也就是说,组合K个专家需要一个(K+1)个一阶概率分布。然而,众所周知,即使对于一个很小的k,分布在存储和估计方面也变得难以管理。为了克服这种弱点,将分布分解为许多分量分布,并用分量分布的乘积来近似分布是很有吸引力的。以前的工作之一是对分布应用条件独立假设。另一项工作是用一阶树依赖关系或二阶分布的乘积来近似分布。本文提出了一种基于依赖关系的框架,用于最优逼近具有1
{"title":"A dependency-based framework of combining multiple experts for the recognition of unconstrained handwritten numerals","authors":"H. Kang, Seong-Whan Lee","doi":"10.1109/CVPR.1999.784619","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784619","url":null,"abstract":"Although Behavior-Knowledge Space (BKS) method does not need any assumptions in combining multiple experts, it should build theoretically exponential storage spaces for storing and managing jointly observed K decisions from K experts. That is, combining K experts needs a (K+1)st-order probability distribution. However, it is well known that the distribution becomes unmanageable in storing and estimating, even for a small K. In order to overcome such weakness, it would be attractive to decompose the distribution into a number of component distributions and to approximate the distribution with a product of the component distributions. One of such previous works is to apply a conditional independence assumption to the distribution. Another work is to approximate the distribution with a product of only first-order tree dependencies or second-order distributions. In this paper, a dependency-based framework is proposed to optimality approximate a probability distribution with a product set of dth-order dependencies where 1<d<K, and to combine multiple experts based on the product set using the Bayesian formalism. This framework was experimented and evaluated with a standardized CEN-PARIMl data base.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"2 1","pages":"124-129 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82250588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Torque-based recursive filtering approach to the recovery of 3D articulated motion from image sequences 基于转矩的递归滤波方法从图像序列中恢复三维关节运动
Hiroyuki Segawa, T. Totsuka
In this paper we introduce a recursive filtering method to recover the 3D articulated motion from image sequences. In recursive filtering frameworks, the quality of the results heavily depends on the choice of state variables and the determination of the process model; which models a real object whose motion is to be estimated. Our approach employs robotics dynamics into the recursive filtering framework. And the key strategy is to incorporate joint torques into the model state variables. In addition, we assumed the variations of the joint torques are Gaussian noises. We describe how to integrate dynamics equations into Kalman filters, and with the experimental results our method is shown to be effective.
本文介绍了一种从图像序列中恢复三维关节运动的递归滤波方法。在递归滤波框架中,结果的质量很大程度上取决于状态变量的选择和过程模型的确定;它模拟一个要估计其运动的真实物体。我们的方法将机器人动力学应用到递归滤波框架中。关键策略是将关节力矩纳入模型状态变量。此外,我们假设关节力矩的变化是高斯噪声。本文描述了如何将动力学方程集成到卡尔曼滤波器中,实验结果表明该方法是有效的。
{"title":"Torque-based recursive filtering approach to the recovery of 3D articulated motion from image sequences","authors":"Hiroyuki Segawa, T. Totsuka","doi":"10.1109/CVPR.1999.784656","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784656","url":null,"abstract":"In this paper we introduce a recursive filtering method to recover the 3D articulated motion from image sequences. In recursive filtering frameworks, the quality of the results heavily depends on the choice of state variables and the determination of the process model; which models a real object whose motion is to be estimated. Our approach employs robotics dynamics into the recursive filtering framework. And the key strategy is to incorporate joint torques into the model state variables. In addition, we assumed the variations of the joint torques are Gaussian noises. We describe how to integrate dynamics equations into Kalman filters, and with the experimental results our method is shown to be effective.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"331 1","pages":"340-345 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76564085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1