Using temporal coherence to build models of animals

Proceedings Ninth IEEE International Conference on Computer Vision Pub Date : 2003-10-13 DOI:10.1109/ICCV.2003.1238364

Deva Ramanan, D. Forsyth

{"title":"Using temporal coherence to build models of animals","authors":"Deva Ramanan, D. Forsyth","doi":"10.1109/ICCV.2003.1238364","DOIUrl":null,"url":null,"abstract":"We describe a system that can build appearance models of animals automatically from a video sequence of the relevant animal with no explicit supervisory information. The video sequence need not have any form of special background. Animals are modeled as a 2D kinematic chain of rectangular segments, where the number of segments and the topology of the chain are unknown. The system detects possible segments, clusters segments whose appearance is coherent over time, and then builds a spatial model of such segment clusters. The resulting representation of the spatial configuration of the animal in each frame can be seen either as a track - in which case the system described should be viewed as a generalized tracker, that is capable of modeling objects while tracking them - or as the source of an appearance model which can be used to build detectors for the particular animal. This is because knowing a video sequence is temporally coherent - i.e. that a particular animal is present through the sequence - is a strong supervisory signal. The method is shown to be successful as a tracker on video sequences of real scenes showing three different animals. For the same reason it is successful as a tracker, the method results in detectors that can be used to find each animal fairly reliably within the Corel collection of images.","PeriodicalId":131580,"journal":{"name":"Proceedings Ninth IEEE International Conference on Computer Vision","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"73","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Ninth IEEE International Conference on Computer Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCV.2003.1238364","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 73

Abstract

We describe a system that can build appearance models of animals automatically from a video sequence of the relevant animal with no explicit supervisory information. The video sequence need not have any form of special background. Animals are modeled as a 2D kinematic chain of rectangular segments, where the number of segments and the topology of the chain are unknown. The system detects possible segments, clusters segments whose appearance is coherent over time, and then builds a spatial model of such segment clusters. The resulting representation of the spatial configuration of the animal in each frame can be seen either as a track - in which case the system described should be viewed as a generalized tracker, that is capable of modeling objects while tracking them - or as the source of an appearance model which can be used to build detectors for the particular animal. This is because knowing a video sequence is temporally coherent - i.e. that a particular animal is present through the sequence - is a strong supervisory signal. The method is shown to be successful as a tracker on video sequences of real scenes showing three different animals. For the same reason it is successful as a tracker, the method results in detectors that can be used to find each animal fairly reliably within the Corel collection of images.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用时间一致性建立动物模型

我们描述了一个系统，该系统可以在没有明确监管信息的情况下，从相关动物的视频序列自动构建动物的外观模型。视频序列不需要有任何形式的特殊背景。动物被建模为矩形段的二维运动链，其中段的数量和链的拓扑结构是未知的。该系统检测可能的片段，集群片段，其外观随着时间的推移是一致的，然后建立这样的片段集群的空间模型。在每一帧中，动物的空间结构的最终表示可以被看作是一个轨迹——在这种情况下，所描述的系统应该被看作是一个通用的跟踪器，它能够在跟踪它们的同时建模对象——或者是一个外观模型的来源，它可以用来为特定的动物建立检测器。这是因为知道视频序列在时间上是连贯的——也就是说，一个特定的动物在序列中出现——是一个很强的监督信号。该方法被证明是成功的跟踪视频序列显示三种不同的动物的真实场景。由于同样的原因，它是一个成功的跟踪器，该方法产生检测器，可用于在Corel图像集合中相当可靠地找到每个动物。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊