首页 > 最新文献

Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)最新文献

英文 中文
Fast, robust, and consistent camera motion estimation 快速,稳健,一致的相机运动估计
Tong Zhang, Carlo Tomasi
Previous algorithms that recover camera motion from image velocities suffer from both bias and excessive variance in the results. We propose a robust estimator of camera motion that is statistically consistent when image noise is isotropic. Consistency means that the estimated motion converges in probability, to the true value as the number of image points increases. An algorithm based on reweighted Gauss-Newton iterations handles 100 velocity measurements in about 50 milliseconds on a workstation.
以前从图像速度中恢复相机运动的算法在结果中存在偏差和过大的方差。我们提出了一种鲁棒的相机运动估计,当图像噪声是各向同性时,该估计在统计上是一致的。一致性是指随着图像点数量的增加,估计的运动在概率上收敛到真实值。一种基于重加权高斯-牛顿迭代的算法在工作站上处理100次速度测量大约50毫秒。
{"title":"Fast, robust, and consistent camera motion estimation","authors":"Tong Zhang, Carlo Tomasi","doi":"10.1109/CVPR.1999.786934","DOIUrl":"https://doi.org/10.1109/CVPR.1999.786934","url":null,"abstract":"Previous algorithms that recover camera motion from image velocities suffer from both bias and excessive variance in the results. We propose a robust estimator of camera motion that is statistically consistent when image noise is isotropic. Consistency means that the estimated motion converges in probability, to the true value as the number of image points increases. An algorithm based on reweighted Gauss-Newton iterations handles 100 velocity measurements in about 50 milliseconds on a workstation.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"16 1","pages":"164-170 Vol. 1"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87388283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 60
Algebraic curves that work better 代数曲线效果更好
T. Tasdizen, Jean-Philippe Tarel, D. Cooper
An algebraic curve is defined as the zero set of a polynomial in two variables. Algebraic curves are practical for modeling shapes much more complicated than conics or superquadrics. The main drawback in representing shapes by algebraic curves has been the lack of repeatability in fitting algebraic curves to data. A regularized fast linear fitting method based on ridge regression and restricting the representation to well behaved subsets of polynomials is proposed, and its properties are investigated. The fitting algorithm is of sufficient stability for very fast position-invariant shape recognition, position estimation, and shape tracking, based on new invariants and representations, and is appropriate to open as well as closed curves of unorganized data. Among appropriate applications are shape-based indexing into image databases.
代数曲线被定义为两个变量的多项式的零集。代数曲线对于比二次曲线或超二次曲线更复杂的形状建模是实用的。用代数曲线表示形状的主要缺点是在拟合代数曲线到数据时缺乏可重复性。提出了一种基于岭回归的正则化快速线性拟合方法,并对其性质进行了研究。该拟合算法基于新的不变量和表示,对快速的位置不变形状识别、位置估计和形状跟踪具有足够的稳定性,适用于无组织数据的开放曲线和封闭曲线。合适的应用程序包括基于形状的图像数据库索引。
{"title":"Algebraic curves that work better","authors":"T. Tasdizen, Jean-Philippe Tarel, D. Cooper","doi":"10.1109/CVPR.1999.784605","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784605","url":null,"abstract":"An algebraic curve is defined as the zero set of a polynomial in two variables. Algebraic curves are practical for modeling shapes much more complicated than conics or superquadrics. The main drawback in representing shapes by algebraic curves has been the lack of repeatability in fitting algebraic curves to data. A regularized fast linear fitting method based on ridge regression and restricting the representation to well behaved subsets of polynomials is proposed, and its properties are investigated. The fitting algorithm is of sufficient stability for very fast position-invariant shape recognition, position estimation, and shape tracking, based on new invariants and representations, and is appropriate to open as well as closed curves of unorganized data. Among appropriate applications are shape-based indexing into image databases.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"1 1","pages":"35-41 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75012848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Using a linear subspace approach for invariant subpixel material identification in airborne hyperspectral imagery 基于线性子空间方法的航空高光谱图像不变亚像素材料识别
Bea Thai, G. Healey
We present an algorithm for subpixel material identification that is invariant to the illumination and atmospheric conditions. The target material spectral reflectance is the only prior information required by the algorithm. A target material subspace model is constructed from the reflectance using a physical model and a background subspace model is estimated directly from the image. These two subspace models are used to compute maximum likelihood estimates for the target material component and the background component at each image pixel. These estimates form the basis of a generalized likelihood ratio test for subpixel material identification. We present experimental results using HYDICE imagery that demonstrate the utility of the algorithm for subpixel material identification under varying illumination and atmospheric conditions.
我们提出了一种不受光照和大气条件影响的亚像素材料识别算法。该算法只需要目标材料的光谱反射率作为先验信息。利用物理模型从反射率构造目标材料子空间模型,直接从图像估计背景子空间模型。这两个子空间模型用于计算每个图像像素上目标材料成分和背景成分的最大似然估计。这些估计构成了亚像素材料识别的广义似然比检验的基础。我们展示了使用HYDICE图像的实验结果,证明了该算法在不同照明和大气条件下用于亚像素材料识别的实用性。
{"title":"Using a linear subspace approach for invariant subpixel material identification in airborne hyperspectral imagery","authors":"Bea Thai, G. Healey","doi":"10.1109/CVPR.1999.786995","DOIUrl":"https://doi.org/10.1109/CVPR.1999.786995","url":null,"abstract":"We present an algorithm for subpixel material identification that is invariant to the illumination and atmospheric conditions. The target material spectral reflectance is the only prior information required by the algorithm. A target material subspace model is constructed from the reflectance using a physical model and a background subspace model is estimated directly from the image. These two subspace models are used to compute maximum likelihood estimates for the target material component and the background component at each image pixel. These estimates form the basis of a generalized likelihood ratio test for subpixel material identification. We present experimental results using HYDICE imagery that demonstrate the utility of the algorithm for subpixel material identification under varying illumination and atmospheric conditions.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"96 1","pages":"567-572 Vol. 1"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75146055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Integrating shape from shading and range data using neural networks 利用神经网络从阴影和距离数据中整合形状
M. Mostafa, S. Yamany, A. Farag
This paper presents a framework for integrating multiple sensory data, sparse range data and dense depth maps from shape from shading in order to improve the 3D reconstruction of visible surfaces of 3D objects. The integration process is based on propagating the error difference between the two data sets by fitting a surface to that difference and using it to correct the visible surface obtained from shape from shading. A feedforward neural network is used to fit a surface to the sparse data. We also study the use of the extended Kalman filter for supervised learning and compare it with the backpropagation algorithm. A performance analysis is done to obtain the best neural network architecture and learning algorithm. It is found that the integration of sparse depth measurements has greatly enhanced the 3D visible surface obtained from shape from shading in terms of metric measurements.
为了提高三维物体可见表面的三维重建,本文提出了一种从形状到阴影的多感官数据、稀疏距离数据和密集深度图的集成框架。积分过程是基于传播两个数据集之间的误差差,通过拟合一个表面到该差异,并用它来校正从阴影中获得的形状可见表面。采用前馈神经网络对稀疏数据拟合曲面。我们还研究了扩展卡尔曼滤波在监督学习中的应用,并将其与反向传播算法进行了比较。为了获得最佳的神经网络结构和学习算法,进行了性能分析。研究发现,稀疏深度测量值的整合,极大地增强了从阴影形状获得的三维可见表面的度量值。
{"title":"Integrating shape from shading and range data using neural networks","authors":"M. Mostafa, S. Yamany, A. Farag","doi":"10.1109/CVPR.1999.784602","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784602","url":null,"abstract":"This paper presents a framework for integrating multiple sensory data, sparse range data and dense depth maps from shape from shading in order to improve the 3D reconstruction of visible surfaces of 3D objects. The integration process is based on propagating the error difference between the two data sets by fitting a surface to that difference and using it to correct the visible surface obtained from shape from shading. A feedforward neural network is used to fit a surface to the sparse data. We also study the use of the extended Kalman filter for supervised learning and compare it with the backpropagation algorithm. A performance analysis is done to obtain the best neural network architecture and learning algorithm. It is found that the integration of sparse depth measurements has greatly enhanced the 3D visible surface obtained from shape from shading in terms of metric measurements.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"24 1","pages":"15-20 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74266117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
Automatic reconstruction of piecewise planar models from multiple views 从多个视图中自动重建分段平面模型
C. Baillard, Andrew Zisserman
A new method is described for automatically reconstructing 3D planar faces from multiple images of a scene. The novelty of the approach lies in the use of inter-image homographies to validate and best estimate the plane, and in the minimal initialization requirements-only a single 3D line with a textured neighbourhood is required to generate a plane hypothesis. The planar facets enable line grouping and also the construction of parts of the wireframe which were missed due to the inevitable shortcomings of feature detection and matching. The method allows a piecewise planar model of a scene to be built completely automatically, with no user intervention at any stage, given only the images and camera projection matrices as input. The robustness and reliability of the method are illustrated on several examples, from both aerial and interior views.
提出了一种从多幅场景图像中自动重建三维平面人脸的新方法。该方法的新颖之处在于使用图像间的同形异构词来验证和最佳估计平面,并且在最小的初始化要求中-仅需要一条具有纹理邻居的3D线来生成平面假设。平面面可以实现线分组,也可以构建由于特征检测和匹配不可避免的缺陷而错过的部分线框。该方法允许完全自动地构建场景的分段平面模型,在任何阶段都不需要用户干预,只给出图像和相机投影矩阵作为输入。实例分析表明,该方法具有较好的鲁棒性和可靠性。
{"title":"Automatic reconstruction of piecewise planar models from multiple views","authors":"C. Baillard, Andrew Zisserman","doi":"10.1109/CVPR.1999.784966","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784966","url":null,"abstract":"A new method is described for automatically reconstructing 3D planar faces from multiple images of a scene. The novelty of the approach lies in the use of inter-image homographies to validate and best estimate the plane, and in the minimal initialization requirements-only a single 3D line with a textured neighbourhood is required to generate a plane hypothesis. The planar facets enable line grouping and also the construction of parts of the wireframe which were missed due to the inevitable shortcomings of feature detection and matching. The method allows a piecewise planar model of a scene to be built completely automatically, with no user intervention at any stage, given only the images and camera projection matrices as input. The robustness and reliability of the method are illustrated on several examples, from both aerial and interior views.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"17 1","pages":"559-565 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74556896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 226
A new visualization paradigm for multispectral imagery and data fusion 一种新的多光谱图像和数据融合可视化范例
Diego A. Socolinsky, L. B. Wolff
We present a new formalism for the treatment and understanding of multispectral images and multisensor imagery based on first order contrast information. Although little attention has been paid to the utility of multispectral contrast, we develop a theory for multispectral contrast that enables us to produce an optimal grayscale visualization of the first order contrast of an image with an arbitrary number of bands. We demonstrate how our technique can reveal significantly more interpretive information to an image analyst, who can use it in a number of image understanding algorithms. Existing grayscale visualization strategies are reviewed and a discussion is given as to why our algorithm is optimal and outperforms them. A variety of experimental results are presented.
我们提出了一种新的基于一阶对比度信息的多光谱图像和多传感器图像处理和理解的形式。虽然很少有人注意到多光谱对比度的效用,但我们开发了一种多光谱对比度理论,使我们能够产生具有任意数量波段的图像的一阶对比度的最佳灰度可视化。我们展示了我们的技术如何能够向图像分析师揭示更多的解释性信息,他们可以在许多图像理解算法中使用它。回顾了现有的灰度可视化策略,并讨论了为什么我们的算法是最优的,并且优于它们。给出了各种实验结果。
{"title":"A new visualization paradigm for multispectral imagery and data fusion","authors":"Diego A. Socolinsky, L. B. Wolff","doi":"10.1109/CVPR.1999.786958","DOIUrl":"https://doi.org/10.1109/CVPR.1999.786958","url":null,"abstract":"We present a new formalism for the treatment and understanding of multispectral images and multisensor imagery based on first order contrast information. Although little attention has been paid to the utility of multispectral contrast, we develop a theory for multispectral contrast that enables us to produce an optimal grayscale visualization of the first order contrast of an image with an arbitrary number of bands. We demonstrate how our technique can reveal significantly more interpretive information to an image analyst, who can use it in a number of image understanding algorithms. Existing grayscale visualization strategies are reviewed and a discussion is given as to why our algorithm is optimal and outperforms them. A variety of experimental results are presented.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"43 1","pages":"319-324 Vol. 1"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74881637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
Histogram clustering for unsupervised image segmentation 无监督图像分割的直方图聚类
J. Puzicha, J. Buhmann, Thomas Hofmann
This paper introduces a novel statistical mixture model for probabilistic grouping of distributional (histogram) data. Adopting the Bayesian framework, we propose to perform annealed maximum a posteriori estimation to compute optimal clustering solutions. In order to accelerate the optimization process, an efficient multiscale formulation is developed. We present a prototypical application of this method for the unsupervised segmentation of textured images based on local distributions of Gabor coefficients. Benchmark results indicate superior performance compared to K-means clustering and proximity-based algorithms.
本文介绍了一种新的分布(直方图)数据概率分组统计混合模型。采用贝叶斯框架,我们提出了退火最大后验估计来计算最优聚类解。为了加快优化过程,开发了一种高效的多尺度配方。我们提出了一个基于Gabor系数局部分布的纹理图像无监督分割的原型应用。基准测试结果表明,与K-means聚类和基于接近度的算法相比,该算法具有更好的性能。
{"title":"Histogram clustering for unsupervised image segmentation","authors":"J. Puzicha, J. Buhmann, Thomas Hofmann","doi":"10.1109/CVPR.1999.784981","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784981","url":null,"abstract":"This paper introduces a novel statistical mixture model for probabilistic grouping of distributional (histogram) data. Adopting the Bayesian framework, we propose to perform annealed maximum a posteriori estimation to compute optimal clustering solutions. In order to accelerate the optimization process, an efficient multiscale formulation is developed. We present a prototypical application of this method for the unsupervised segmentation of textured images based on local distributions of Gabor coefficients. Benchmark results indicate superior performance compared to K-means clustering and proximity-based algorithms.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"22 1","pages":"602-608 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85048144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 94
Vision-based speaker detection using Bayesian networks 基于视觉的贝叶斯网络说话人检测
James M. Rehg, Kevin P. Murphy, P. Fieguth
The development of user interfaces based on vision and speech requires the solution of a challenging statistical inference problem: The intentions and actions of multiple individuals must be inferred from noisy and ambiguous data. We argue that Bayesian network models are an attractive statistical framework for cue fusion in these applications. Bayes nets combine a natural mechanism for expressing contextual information with efficient algorithms for learning and inference. We illustrate these points through the development of a Bayes net model for detecting when a user is speaking. The model combines four simple vision sensors: face detection, skin color, skin texture, and mouth motion. We present some promising experimental results.
基于视觉和语音的用户界面的开发需要解决一个具有挑战性的统计推断问题:必须从嘈杂和模糊的数据中推断出多个个体的意图和行为。我们认为贝叶斯网络模型是这些应用中线索融合的一个有吸引力的统计框架。贝叶斯网络结合了表达上下文信息的自然机制和高效的学习和推理算法。我们通过开发用于检测用户何时说话的贝叶斯网络模型来说明这些要点。该模型结合了四种简单的视觉传感器:面部检测、肤色、皮肤纹理和口腔运动。我们提出了一些有希望的实验结果。
{"title":"Vision-based speaker detection using Bayesian networks","authors":"James M. Rehg, Kevin P. Murphy, P. Fieguth","doi":"10.1109/CVPR.1999.784617","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784617","url":null,"abstract":"The development of user interfaces based on vision and speech requires the solution of a challenging statistical inference problem: The intentions and actions of multiple individuals must be inferred from noisy and ambiguous data. We argue that Bayesian network models are an attractive statistical framework for cue fusion in these applications. Bayes nets combine a natural mechanism for expressing contextual information with efficient algorithms for learning and inference. We illustrate these points through the development of a Bayes net model for detecting when a user is speaking. The model combines four simple vision sensors: face detection, skin color, skin texture, and mouth motion. We present some promising experimental results.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"54 1","pages":"110-116 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79449288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 79
Toward a scale-space aspect graph: solids of revolution 朝向一个尺度空间方面图:旋转立体
S. Pae, J. Ponce
This paper addresses the problem of constructing the scale-space aspect graph of a solid of revolution whose surface is the zero set of a polynomial volumetric density undergoing a Gaussian diffusion process. Equations for the associated visual event surfaces are derived, and polynomial curve tracing techniques are used to delineate these surfaces. An implementation and examples are presented, and limitations as well as extensions of the proposed approach are discussed.
本文研究了一个旋转固体的尺度空间面向图的构造问题,该旋转固体的表面是经过高斯扩散过程的多项式体积密度的零集。推导了相关视觉事件曲面的方程,并使用多项式曲线跟踪技术来描绘这些曲面。给出了一个实现和示例,并讨论了该方法的局限性和扩展。
{"title":"Toward a scale-space aspect graph: solids of revolution","authors":"S. Pae, J. Ponce","doi":"10.1109/CVPR.1999.784629","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784629","url":null,"abstract":"This paper addresses the problem of constructing the scale-space aspect graph of a solid of revolution whose surface is the zero set of a polynomial volumetric density undergoing a Gaussian diffusion process. Equations for the associated visual event surfaces are derived, and polynomial curve tracing techniques are used to delineate these surfaces. An implementation and examples are presented, and limitations as well as extensions of the proposed approach are discussed.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"19 2 1","pages":"196-201 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82564166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Independent motion: the importance of history 独立运动:历史的重要性
Robert Pless, T. Brodský, Y. Aloimonos
We consider a problem central in aerial visual surveillance applications-detection and tracking of small, independently moving objects in long and noisy video sequences. We directly use spatiotemporal image intensity gradient measurements to compute an exact model of background motion. This allows the creation of accurate mosaics over many frames and the definition of a constraint violation function which acts as an indication of independent motion. A novel temporal integration method maintains confidence measures over long subsequences without computing the optic flow, requiring object models, or using a Kalman filler. The mosaic acts as a stable feature frame, allowing precise localization of the independently moving objects. We present a statistical analysis of the effects of image noise on the constraint violation measure and find a good match between the predicted probability distribution function and the measured sample frequencies in a test sequence.
我们考虑了航空视觉监视应用中的一个核心问题——在长而有噪声的视频序列中检测和跟踪小的、独立运动的物体。我们直接使用时空图像强度梯度测量来计算背景运动的精确模型。这允许在许多帧上创建精确的马赛克,并定义约束违反函数,作为独立运动的指示。一种新的时间积分方法在不计算光流,不需要对象模型或使用卡尔曼填充的情况下保持长子序列的置信度。马赛克作为一个稳定的特征框架,允许精确定位独立移动的物体。我们对图像噪声对约束违逆测度的影响进行了统计分析,发现在一个测试序列中,预测的概率分布函数与实测的样本频率之间有很好的匹配。
{"title":"Independent motion: the importance of history","authors":"Robert Pless, T. Brodský, Y. Aloimonos","doi":"10.1109/CVPR.1999.784614","DOIUrl":"https://doi.org/10.1109/CVPR.1999.784614","url":null,"abstract":"We consider a problem central in aerial visual surveillance applications-detection and tracking of small, independently moving objects in long and noisy video sequences. We directly use spatiotemporal image intensity gradient measurements to compute an exact model of background motion. This allows the creation of accurate mosaics over many frames and the definition of a constraint violation function which acts as an indication of independent motion. A novel temporal integration method maintains confidence measures over long subsequences without computing the optic flow, requiring object models, or using a Kalman filler. The mosaic acts as a stable feature frame, allowing precise localization of the independently moving objects. We present a statistical analysis of the effects of image noise on the constraint violation measure and find a good match between the predicted probability distribution function and the measured sample frequencies in a test sequence.","PeriodicalId":20644,"journal":{"name":"Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)","volume":"07 1","pages":"92-97 Vol. 2"},"PeriodicalIF":0.0,"publicationDate":"1999-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85854501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
期刊
Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1