首页 > 最新文献

Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)最新文献

英文 中文
Maintaining multiple motion model hypotheses over many views to recover matching and structure 在多个视图上保持多个运动模型假设,以恢复匹配和结构
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710762
P. Torr, A. Fitzgibbon, Andrew Zisserman
In order to recover structure from images it is desirable to use many views to obtain the best possible estimates. However, whilst recovering projective structure and motion from such extended sequences problems arise that are not apparent from a general view-point/structure approach. Foremost amongst these are (a) maintaining image correspondences consistently through many images, and (b) identifying images, within the sequence, for which structure cannot be reliably recovery. Within this paper the use of multiple motion model hypotheses is explored as an aid to solve both of these problems.
为了从图像中恢复结构,需要使用多个视图来获得最佳估计。然而,当从这种扩展序列中恢复投影结构和运动时,从一般的观点/结构方法中出现的问题并不明显。其中最重要的是(a)通过许多图像一致地保持图像对应,以及(b)识别序列中结构无法可靠恢复的图像。本文探讨了利用多运动模型假设来帮助解决这两个问题。
{"title":"Maintaining multiple motion model hypotheses over many views to recover matching and structure","authors":"P. Torr, A. Fitzgibbon, Andrew Zisserman","doi":"10.1109/ICCV.1998.710762","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710762","url":null,"abstract":"In order to recover structure from images it is desirable to use many views to obtain the best possible estimates. However, whilst recovering projective structure and motion from such extended sequences problems arise that are not apparent from a general view-point/structure approach. Foremost amongst these are (a) maintaining image correspondences consistently through many images, and (b) identifying images, within the sequence, for which structure cannot be reliably recovery. Within this paper the use of multiple motion model hypotheses is explored as an aid to solve both of these problems.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"186 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134330211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 125
Retrieving images by appearance 根据外观检索图像
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710780
S. Ravela, R. Manmatha
A system to retrieve images using a description of the image intensity surface is presented. Gaussian derivative filters at several scales are applied to the image and low order 2D differential invariants are computed. The resulting multi-scale representation is indexed for rapid retrieval. Queries are designed by the users from an example image by selecting appropriate regions. The invariant vectors corresponding to these regions are matched with the database counterparts both in feature and coordinate space. This yields a match score per image. Images are sorted by the match score and displayed. Experiments conducted with over 1500 images of objects embedded in arbitrary backgrounds are described. It is observed that images similar in appearance and whose viewpoint is within small view variations of the query can be retrieved with an average precision of 50%.
提出了一种基于图像强度表面描述的图像检索系统。将不同尺度的高斯导数滤波器应用于图像,计算了低阶二维微分不变量。生成的多尺度表示被索引以便快速检索。查询由用户从示例图像中选择适当的区域来设计。这些区域对应的不变向量在特征空间和坐标空间上与数据库对应的不变向量进行匹配。这将生成每个图像的匹配分数。图像按比赛分数排序并显示。描述了在任意背景中嵌入1500多幅物体图像的实验。观察到,外观相似且视点在查询的小视图变化范围内的图像可以以平均50%的精度检索。
{"title":"Retrieving images by appearance","authors":"S. Ravela, R. Manmatha","doi":"10.1109/ICCV.1998.710780","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710780","url":null,"abstract":"A system to retrieve images using a description of the image intensity surface is presented. Gaussian derivative filters at several scales are applied to the image and low order 2D differential invariants are computed. The resulting multi-scale representation is indexed for rapid retrieval. Queries are designed by the users from an example image by selecting appropriate regions. The invariant vectors corresponding to these regions are matched with the database counterparts both in feature and coordinate space. This yields a match score per image. Images are sorted by the match score and displayed. Experiments conducted with over 1500 images of objects embedded in arbitrary backgrounds are described. It is observed that images similar in appearance and whose viewpoint is within small view variations of the query can be retrieved with an average precision of 50%.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132589516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 44
Recognition and interpretation of parametric gesture 参数手势的识别和解释
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710739
Andrew D. Wilson, A. Bobick
A new method for the representation, recognition, and interpretation of parameterized gesture is presented. By parameterized gesture. We mean gestures that exhibit a meaningful variation; one example is a point gesture where the important parameter is the 2-dimensional direction. Our approach is to extend the standard hidden Markov model method of gesture recognition by including a global parametric variation in the output probabilities of the states of the HMM. Using a linear model to derive the theory, we formulated an expectation-maximization (EM) method for training the parametric HMM. During testing, the parametric HMM simultaneously recognizes the gesture and estimates the quantifying parameters. Using visually derived and directly measured 3-dimensional hand position measurements as input, we present results on two. Different movements-a size gesture and a point gesture-and show robustness with respect to noise in the input features.
提出了一种参数化手势的表示、识别和解释新方法。通过参数化手势。我们指的是表现出有意义变化的手势;一个例子是点手势,其中重要的参数是二维方向。我们的方法是通过在HMM状态的输出概率中包含一个全局参数变化来扩展手势识别的标准隐马尔可夫模型方法。利用线性模型推导理论,提出了一种期望最大化(EM)方法来训练参数HMM。在测试过程中,参数HMM同时识别手势并估计量化参数。使用视觉推导和直接测量的三维手部位置测量作为输入,我们给出了两个结果。不同的动作——大小手势和点手势——对输入特征中的噪声表现出鲁棒性。
{"title":"Recognition and interpretation of parametric gesture","authors":"Andrew D. Wilson, A. Bobick","doi":"10.1109/ICCV.1998.710739","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710739","url":null,"abstract":"A new method for the representation, recognition, and interpretation of parameterized gesture is presented. By parameterized gesture. We mean gestures that exhibit a meaningful variation; one example is a point gesture where the important parameter is the 2-dimensional direction. Our approach is to extend the standard hidden Markov model method of gesture recognition by including a global parametric variation in the output probabilities of the states of the HMM. Using a linear model to derive the theory, we formulated an expectation-maximization (EM) method for training the parametric HMM. During testing, the parametric HMM simultaneously recognizes the gesture and estimates the quantifying parameters. Using visually derived and directly measured 3-dimensional hand position measurements as input, we present results on two. Different movements-a size gesture and a point gesture-and show robustness with respect to noise in the input features.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116659289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 124
Signfinder: using color to detect, localize and identify informational signs Signfinder:用颜色来检测、定位和识别信息标志
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710783
A. Yuille, Daniel Snow, Mark Nitzberg
We describe an approach to detecting, locating and normalizing road signs. The approach will apply provided: (i) the signs have stereotypical boundary shapes (i.e. rectangular, or hexagonal-of course, we allow for these shapes to be distorted by projection to unknown viewpoint), (ii) the writing on the sign has one uniform color and the rest of the sign has a second uniform color (we allow for the color of the illuminant to be unknown). We show that the approach works even under significant illuminant color changes, viewpoint direction, shadowing, and occlusion. This work is part of a project intended to help people who are blind, or whose sight is impaired.
我们描述了一种检测、定位和规范化道路标志的方法。该方法适用于以下情况:(i)标志具有典型的边界形状(即矩形或六边形-当然,我们允许这些形状因投影到未知视点而扭曲),(ii)标志上的文字具有一种统一的颜色,而标志的其余部分具有第二种统一的颜色(我们允许光源的颜色未知)。我们证明,即使在显著的光源颜色变化、视点方向、阴影和遮挡下,该方法也有效。这项工作是一个旨在帮助盲人或视力受损人士的项目的一部分。
{"title":"Signfinder: using color to detect, localize and identify informational signs","authors":"A. Yuille, Daniel Snow, Mark Nitzberg","doi":"10.1109/ICCV.1998.710783","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710783","url":null,"abstract":"We describe an approach to detecting, locating and normalizing road signs. The approach will apply provided: (i) the signs have stereotypical boundary shapes (i.e. rectangular, or hexagonal-of course, we allow for these shapes to be distorted by projection to unknown viewpoint), (ii) the writing on the sign has one uniform color and the rest of the sign has a second uniform color (we allow for the color of the illuminant to be unknown). We show that the approach works even under significant illuminant color changes, viewpoint direction, shadowing, and occlusion. This work is part of a project intended to help people who are blind, or whose sight is impaired.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122233547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 35
Color- and texture-based image segmentation using EM and its application to content-based image retrieval 基于颜色和纹理的EM图像分割及其在基于内容的图像检索中的应用
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710790
Serge J. Belongie, C. Carson, H. Greenspan, Jitendra Malik
Retrieving images from large and varied collections using image content as a key is a challenging and important problem. In this paper we present a new image representation which provides a transformation from the raw pixel data to a small set of image regions which are coherent in color and texture space. This so-called "blobworld" representation is based on segmentation using the expectation-maximization algorithm on combined color and texture features. The texture features we use for the segmentation arise from a new approach to texture description and scale selection. We describe a system that uses the blobworld representation to retrieve images. An important and unique aspect of the system is that, in the context of similarity-based querying, the user is allowed to view the internal representation of the submitted image and the query results. Similar systems do not offer the user this view into the workings of the system; consequently, the outcome of many queries on these systems can be quite inexplicable, despite the availability of knobs for adjusting the similarity metric.
使用图像内容作为键从大型和不同的集合中检索图像是一个具有挑战性和重要的问题。在本文中,我们提出了一种新的图像表示,它提供了从原始像素数据到在颜色和纹理空间上一致的一小组图像区域的转换。这种所谓的“blobworld”表示是基于对组合颜色和纹理特征使用期望最大化算法的分割。我们用于分割的纹理特征来源于一种新的纹理描述和尺度选择方法。我们描述了一个使用blobworld表示来检索图像的系统。该系统的一个重要且独特的方面是,在基于相似性的查询上下文中,允许用户查看提交的图像的内部表示和查询结果。类似的系统不会向用户提供这种进入系统工作的视图;因此,尽管有调节相似度度量的旋钮可用,但在这些系统上的许多查询的结果可能相当难以解释。
{"title":"Color- and texture-based image segmentation using EM and its application to content-based image retrieval","authors":"Serge J. Belongie, C. Carson, H. Greenspan, Jitendra Malik","doi":"10.1109/ICCV.1998.710790","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710790","url":null,"abstract":"Retrieving images from large and varied collections using image content as a key is a challenging and important problem. In this paper we present a new image representation which provides a transformation from the raw pixel data to a small set of image regions which are coherent in color and texture space. This so-called \"blobworld\" representation is based on segmentation using the expectation-maximization algorithm on combined color and texture features. The texture features we use for the segmentation arise from a new approach to texture description and scale selection. We describe a system that uses the blobworld representation to retrieve images. An important and unique aspect of the system is that, in the context of similarity-based querying, the user is allowed to view the internal representation of the submitted image and the query results. Similar systems do not offer the user this view into the workings of the system; consequently, the outcome of many queries on these systems can be quite inexplicable, despite the availability of knobs for adjusting the similarity metric.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124653029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 564
Multidimensional morphable models 多维变形模型
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710791
Michael J. Jones, T. Poggio
We describe a flexible model for representing images of objects of a certain class, known a priori, such as faces, and introduce a new algorithm for matching it to a novel image and thereby performing image analysis. We call this model a multidimensional morphable model or just a, morphable model. The morphable model is learned from example images (called prototypes) of objects of a class. In this paper we introduce an effective stochastic gradient descent algorithm that automaticaIly matches a model to a novel image by finding the parameters that minimize the error between the image generated by the model and the novel image. Two examples demonstrate the robustness and the broad range of applicability of the matching algorithm and the underlying morphable model. Our approach can provide novel solutions to several vision tasks, including the computation of image correspondence, object verification, image synthesis and image compression.
我们描述了一个灵活的模型来表示特定类别的物体的图像,已知的先验,如人脸,并引入了一个新的算法来匹配它到一个新的图像,从而执行图像分析。我们称这个模型为多维可变形模型或者只是一个可变形模型。变形模型是从类对象的示例图像(称为原型)中学习的。本文介绍了一种有效的随机梯度下降算法,该算法通过寻找使模型生成的图像与新图像之间的误差最小的参数,自动将模型与新图像进行匹配。两个实例证明了该匹配算法和底层可变形模型的鲁棒性和广泛的适用性。我们的方法可以为图像对应计算、目标验证、图像合成和图像压缩等视觉任务提供新颖的解决方案。
{"title":"Multidimensional morphable models","authors":"Michael J. Jones, T. Poggio","doi":"10.1109/ICCV.1998.710791","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710791","url":null,"abstract":"We describe a flexible model for representing images of objects of a certain class, known a priori, such as faces, and introduce a new algorithm for matching it to a novel image and thereby performing image analysis. We call this model a multidimensional morphable model or just a, morphable model. The morphable model is learned from example images (called prototypes) of objects of a class. In this paper we introduce an effective stochastic gradient descent algorithm that automaticaIly matches a model to a novel image by finding the parameters that minimize the error between the image generated by the model and the novel image. Two examples demonstrate the robustness and the broad range of applicability of the matching algorithm and the underlying morphable model. Our approach can provide novel solutions to several vision tasks, including the computation of image correspondence, object verification, image synthesis and image compression.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125147543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 145
Bilinear voting 双线性投票
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710716
G. Sapiro
A geometric-vision approach to solve bilinear problems in general, and the color constancy and illuminant estimation problem in particular, is presented in this paper. We show a general framework, based on ideas from the generalized (probabilistic) Hough transform, to estimate the unknown variables in the bilinear form. In the case of illuminant and reflectance estimation in natural images, each image pixel "votes" for possible illuminants (or reflectance), and the estimation is based on cumulative votes. In the general case, the voting is for the parameters of the bilinear model. The framework is natural for the introduction of physical constraints. For the case of illuminant estimation, we briefly show the relation of this work with previous algorithms for color constancy, and present examples.
本文提出了一种几何视觉方法来解决一般的双线性问题,特别是颜色常数和光源估计问题。我们展示了一个基于广义(概率)霍夫变换思想的一般框架,以双线性形式估计未知变量。在自然图像中的光源和反射率估计的情况下,每个图像像素“投票”可能的光源(或反射率),估计是基于累积投票。在一般情况下,投票是针对双线性模型的参数。该框架对于引入物理约束是很自然的。对于光源估计的情况,我们简要地展示了这项工作与以前的颜色常数算法的关系,并给出了例子。
{"title":"Bilinear voting","authors":"G. Sapiro","doi":"10.1109/ICCV.1998.710716","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710716","url":null,"abstract":"A geometric-vision approach to solve bilinear problems in general, and the color constancy and illuminant estimation problem in particular, is presented in this paper. We show a general framework, based on ideas from the generalized (probabilistic) Hough transform, to estimate the unknown variables in the bilinear form. In the case of illuminant and reflectance estimation in natural images, each image pixel \"votes\" for possible illuminants (or reflectance), and the estimation is based on cumulative votes. In the general case, the voting is for the parameters of the bilinear model. The framework is natural for the introduction of physical constraints. For the case of illuminant estimation, we briefly show the relation of this work with previous algorithms for color constancy, and present examples.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128822729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
A unified factorization algorithm for points, line segments and planes with uncertainty models 具有不确定模型的点、线段和平面的统一分解算法
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710793
Daniel Morris, T. Kanade
In this paper we present a unified factorization algorithm for recovering structure and motion from image sequences by using point features, line segments and planes. This new formulation is based on directional uncertainty model for features. Points and line segments are both described by the same probabilistic models and so can be recovered in the same way. Prior information on the coplanarity of features is shown to fit naturally into the new factorization formulation and provides additional constraints for the shape recovery. This formulation leads to a weighted least squares motion and shape recovery problem which is solved by an efficient quasi-linear algorithm. The statistical uncertainty model also enables us to recover uncertainty estimates for the reconstructed three dimensional feature locations.
本文提出了一种利用点特征、线段和平面从图像序列中恢复结构和运动的统一分解算法。该公式基于特征的方向性不确定性模型。点和线段都由相同的概率模型描述,因此可以用相同的方法恢复。关于特征共平面性的先验信息被证明可以自然地拟合到新的分解公式中,并为形状恢复提供了额外的约束。该公式导致加权最小二乘运动和形状恢复问题,该问题由有效的准线性算法解决。统计不确定性模型还使我们能够恢复重建的三维特征位置的不确定性估计。
{"title":"A unified factorization algorithm for points, line segments and planes with uncertainty models","authors":"Daniel Morris, T. Kanade","doi":"10.1109/ICCV.1998.710793","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710793","url":null,"abstract":"In this paper we present a unified factorization algorithm for recovering structure and motion from image sequences by using point features, line segments and planes. This new formulation is based on directional uncertainty model for features. Points and line segments are both described by the same probabilistic models and so can be recovered in the same way. Prior information on the coplanarity of features is shown to fit naturally into the new factorization formulation and provides additional constraints for the shape recovery. This formulation leads to a weighted least squares motion and shape recovery problem which is solved by an efficient quasi-linear algorithm. The statistical uncertainty model also enables us to recover uncertainty estimates for the reconstructed three dimensional feature locations.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122567841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 138
Motion segmentation and tracking using normalized cuts 运动分割和跟踪使用归一化切割
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710861
Jianbo Shi, Jitendra Malik
We propose a motion segmentation algorithm that aims to break a scene into its most prominent moving groups. A weighted graph is constructed on the image sequence by connecting pixels that are in the spatiotemporal neighborhood of each other. At each pixel, we define motion profile vectors which capture the probability distribution of the image velocity. The distance between motion profiles is used to assign a weight on the graph edges. Using normalised cuts we find the most salient partitions of the spatiotemporal graph formed by the image sequence. For segmenting long image sequences, we have developed a recursive update procedure that incorporates knowledge of segmentation in previous frames for efficiently finding the group correspondence in the new frame.
我们提出了一种运动分割算法,旨在将场景分解为最突出的运动组。通过连接彼此处于时空邻域的像素,在图像序列上构建加权图。在每个像素处,我们定义了运动轮廓向量,它捕获了图像速度的概率分布。运动轮廓之间的距离用于在图边上分配权重。使用归一化切割,我们找到由图像序列形成的时空图中最显著的分区。对于分割长图像序列,我们开发了一种递归更新程序,该程序结合了前帧的分割知识,以便有效地找到新帧中的组对应关系。
{"title":"Motion segmentation and tracking using normalized cuts","authors":"Jianbo Shi, Jitendra Malik","doi":"10.1109/ICCV.1998.710861","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710861","url":null,"abstract":"We propose a motion segmentation algorithm that aims to break a scene into its most prominent moving groups. A weighted graph is constructed on the image sequence by connecting pixels that are in the spatiotemporal neighborhood of each other. At each pixel, we define motion profile vectors which capture the probability distribution of the image velocity. The distance between motion profiles is used to assign a weight on the graph edges. Using normalised cuts we find the most salient partitions of the spatiotemporal graph formed by the image sequence. For segmenting long image sequences, we have developed a recursive update procedure that incorporates knowledge of segmentation in previous frames for efficiently finding the group correspondence in the new frame.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"15 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120906188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 436
Detecting changes in aerial views of man-made structures 探测人造建筑鸟瞰图的变化
Pub Date : 1998-01-04 DOI: 10.1109/ICCV.1998.710703
A. Huertas, R. Nevatia
Many applications require detecting structural changes in a scene over a period of time. Comparing intensity values of successive images is not effective as such changes don't necessarily reflect actual changes at a site but might be caused by changes in the view point, illumination and seasons. We take the approach of comparing a 3-D model of the site, prepared from previous images, with new images to infer significant changes. This task is difficult as the images and the models have very different levels of abstract representations. Our approach consists of several steps: registering a site model to a new image, model validation to confirm the presence of model objects in the image; structural change detection seeks to resolve matching problems and indicate possibly changed structures; and finally updating models to reflect the changes. Our system is able to detect missing (or mis-modeled) buildings, changes in model dimensions, and new buildings under some conditions.
许多应用程序需要在一段时间内检测场景中的结构变化。比较连续图像的强度值是无效的,因为这种变化不一定反映一个地点的实际变化,而可能是由视点、光照和季节的变化引起的。我们采用的方法是将该地点的三维模型与新图像进行比较,以推断出重大变化。这个任务很困难,因为图像和模型具有非常不同的抽象表示级别。我们的方法包括几个步骤:将站点模型注册到新图像,模型验证以确认图像中模型对象的存在;结构变化检测旨在解决匹配问题并指出可能发生变化的结构;最后更新模型以反映变化。我们的系统能够检测缺失(或错误建模)的建筑物,模型尺寸的变化,以及在某些条件下的新建筑物。
{"title":"Detecting changes in aerial views of man-made structures","authors":"A. Huertas, R. Nevatia","doi":"10.1109/ICCV.1998.710703","DOIUrl":"https://doi.org/10.1109/ICCV.1998.710703","url":null,"abstract":"Many applications require detecting structural changes in a scene over a period of time. Comparing intensity values of successive images is not effective as such changes don't necessarily reflect actual changes at a site but might be caused by changes in the view point, illumination and seasons. We take the approach of comparing a 3-D model of the site, prepared from previous images, with new images to infer significant changes. This task is difficult as the images and the models have very different levels of abstract representations. Our approach consists of several steps: registering a site model to a new image, model validation to confirm the presence of model objects in the image; structural change detection seeks to resolve matching problems and indicate possibly changed structures; and finally updating models to reflect the changes. Our system is able to detect missing (or mis-modeled) buildings, changes in model dimensions, and new buildings under some conditions.","PeriodicalId":270671,"journal":{"name":"Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)","volume":"25 1-2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120930000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 116
期刊
Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1