首页 > 最新文献

2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance最新文献

英文 中文
Validation of blind region learning and tracking 盲区学习与跟踪的验证
J. Black, Dimitrios Makris, T. Ellis
Multi view tracking systems enable an object's identity to be preserved as it moves through a wide area surveillance network of cameras. One limitation of these systems is an inability to track objects between blind regions, i.e. pans of the scene that are not observable by the network of cameras. Recent interest has been shown in blind region learning and tracking but not much work has been reported on the systematic performance evaluation of these algorithms. The main contribution of this paper is to define a set of novel techniques that can be employed to validate a camera topology model, and a blind region multi view tracking algorithm.
多视图跟踪系统使物体在通过广域摄像机监控网络时能够保持其身份。这些系统的一个限制是无法跟踪盲区之间的物体,即摄像机网络无法观察到的场景。近年来,人们对盲区学习和盲区跟踪产生了兴趣,但对这些算法的系统性能评估的研究还不多。本文的主要贡献是定义了一套新的技术,可用于验证摄像机拓扑模型和盲区多视图跟踪算法。
{"title":"Validation of blind region learning and tracking","authors":"J. Black, Dimitrios Makris, T. Ellis","doi":"10.1109/VSPETS.2005.1570892","DOIUrl":"https://doi.org/10.1109/VSPETS.2005.1570892","url":null,"abstract":"Multi view tracking systems enable an object's identity to be preserved as it moves through a wide area surveillance network of cameras. One limitation of these systems is an inability to track objects between blind regions, i.e. pans of the scene that are not observable by the network of cameras. Recent interest has been shown in blind region learning and tracking but not much work has been reported on the systematic performance evaluation of these algorithms. The main contribution of this paper is to define a set of novel techniques that can be employed to validate a camera topology model, and a blind region multi view tracking algorithm.","PeriodicalId":435841,"journal":{"name":"2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114824111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A Contour-Based Moving Object Detection and Tracking 基于轮廓的运动目标检测与跟踪
Masayuki Yokoyama, T. Poggio
We propose a fast and robust approach to the detection and tracking of moving objects. Our method is based on using lines computed by a gradient-based optical flow and an edge detector. While it is known among researchers that gradient-based optical flow and edges are well matched for accurate computation of velocity, not much attention is paid to creating systems for detecting and tracking objects using this feature. In our method, extracted edges by using optical flow and the edge detector are restored as lines, and background lines of the previous frame are subtracted. Contours of objects are obtained by using snakes to clustered lines. Detected objects are tracked, and each tracked object has a state for handling occlusion and interference. The experimental results on outdoor-scenes show fast and robust performance of our method. The computation time of our method is 0.089 s/frame on a 900 MHz processor.
我们提出了一种快速和鲁棒的方法来检测和跟踪运动物体。我们的方法是基于使用基于梯度的光流和边缘检测器计算的线。虽然研究人员都知道,基于梯度的光流和边缘可以很好地匹配精确的速度计算,但利用这一特征创建检测和跟踪物体的系统却没有得到太多的关注。该方法将光流和边缘检测器提取的边缘恢复为直线,并减去前一帧的背景线。物体的轮廓是用蛇形线聚类得到的。检测到的对象被跟踪,每个被跟踪的对象都有一个处理遮挡和干扰的状态。室外场景的实验结果表明,该方法具有快速、鲁棒性好。该方法在900mhz处理器上的计算时间为0.089 s/帧。
{"title":"A Contour-Based Moving Object Detection and Tracking","authors":"Masayuki Yokoyama, T. Poggio","doi":"10.1109/VSPETS.2005.1570925","DOIUrl":"https://doi.org/10.1109/VSPETS.2005.1570925","url":null,"abstract":"We propose a fast and robust approach to the detection and tracking of moving objects. Our method is based on using lines computed by a gradient-based optical flow and an edge detector. While it is known among researchers that gradient-based optical flow and edges are well matched for accurate computation of velocity, not much attention is paid to creating systems for detecting and tracking objects using this feature. In our method, extracted edges by using optical flow and the edge detector are restored as lines, and background lines of the previous frame are subtracted. Contours of objects are obtained by using snakes to clustered lines. Detected objects are tracked, and each tracked object has a state for handling occlusion and interference. The experimental results on outdoor-scenes show fast and robust performance of our method. The computation time of our method is 0.089 s/frame on a 900 MHz processor.","PeriodicalId":435841,"journal":{"name":"2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115837577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 183
Towards Interactive Generation of "Ground-truth" in Background Subtraction from Partially Labeled Examples 部分标记例子背景减法中“基础真理”的交互生成
E. Grossmann, A. Kale, C. Jaynes
Ground truth segmentation of foreground and background is important for performance evaluation of existing techniques and can guide principled development of video analysis algorithms. Unfortunately, generating ground truth data is a cumbersome and incurs a high cost in human labor. In this paper, we propose an interactive method to produce foreground/background segmentation of video sequences captured by a stationary camera, that requires comparatively little human labor, while still producing high quality results. Given a sequence, the user indicates, with a few clicks in a GUI, a few rectangular regions that contain only foreground or background pixels. Adaboost then builds a classifier that combines the output of a set of weak classifiers. The resulting classifier is run on the remainder of the sequence. Based on the results and the accuracy requirements, the user can then select more example regions for training. This cycle of hand-labeling, training and automatic classification steps leads to a high-quality segmentation with little effort. Our experiments show promising results, raise new issues and provide some insight on possible improvements.
前景和背景的真值分割对现有技术的性能评价具有重要意义,可以指导视频分析算法的原则发展。不幸的是,生成地面真实数据是一项繁琐的工作,并且需要耗费大量人力。在本文中,我们提出了一种交互式方法来生成由固定摄像机捕获的视频序列的前景/背景分割,该方法需要相对较少的人力,同时仍然产生高质量的结果。给定一个序列,用户只需在GUI中单击几下,就可以指示几个仅包含前景或背景像素的矩形区域。然后Adaboost构建一个分类器,将一组弱分类器的输出结合起来。生成的分类器在序列的剩余部分上运行。根据结果和精度要求,用户可以选择更多的示例区域进行训练。这种手工标记、训练和自动分类步骤的循环导致了高质量的分割。我们的实验显示了有希望的结果,提出了新的问题,并为可能的改进提供了一些见解。
{"title":"Towards Interactive Generation of \"Ground-truth\" in Background Subtraction from Partially Labeled Examples","authors":"E. Grossmann, A. Kale, C. Jaynes","doi":"10.1109/VSPETS.2005.1570932","DOIUrl":"https://doi.org/10.1109/VSPETS.2005.1570932","url":null,"abstract":"Ground truth segmentation of foreground and background is important for performance evaluation of existing techniques and can guide principled development of video analysis algorithms. Unfortunately, generating ground truth data is a cumbersome and incurs a high cost in human labor. In this paper, we propose an interactive method to produce foreground/background segmentation of video sequences captured by a stationary camera, that requires comparatively little human labor, while still producing high quality results. Given a sequence, the user indicates, with a few clicks in a GUI, a few rectangular regions that contain only foreground or background pixels. Adaboost then builds a classifier that combines the output of a set of weak classifiers. The resulting classifier is run on the remainder of the sequence. Based on the results and the accuracy requirements, the user can then select more example regions for training. This cycle of hand-labeling, training and automatic classification steps leads to a high-quality segmentation with little effort. Our experiments show promising results, raise new issues and provide some insight on possible improvements.","PeriodicalId":435841,"journal":{"name":"2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115407434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Robust object matching for persistent tracking with heterogeneous features 基于异构特征的持久跟踪鲁棒对象匹配
Yanlin Guol, H. Sawhney, Rakesh Kumar, Steve Hsu
Tracking objects over a long period of time in realistic environments remains a challenging problem for ground and aerial video surveillance. Matching objects and verifying their identities across multiple spatial and temporal gaps proves to be an effective way to extend tracking range. When an object track is lost due to occlusion or other reasons, we need to learn the object signature and use it to confirm the object's identity against a set of active objects when it appears again. In order to deal with poor image quality and large variations in aerial video tracking, we present in this paper a unified framework that employs a heterogeneous collection of features such as lines, points and regions for robust vehicle matching under variations in illumination, aspect and camera poses. Our approach fully utilizes the characteristics of vehicular objects that consist of relatively large textureless areas delimited by line like features, and demonstrates the important usage of heterogeneous features for different stages of vehicle matching. Experiments demonstrate the enhancement in performance of vehicle identification across multiple sightings using the heterogeneous feature set.
在现实环境中长时间跟踪目标仍然是地面和空中视频监控的一个具有挑战性的问题。跨越多个时空间隙对目标进行匹配和身份验证是扩大目标跟踪范围的有效方法。当物体轨迹由于遮挡或其他原因丢失时,我们需要学习物体的签名,并在它再次出现时使用它来确认一组活动物体的身份。为了解决航拍视频跟踪中图像质量差和变化大的问题,本文提出了一个统一的框架,该框架采用了线、点和区域等异构特征集合,在光照、角度和相机姿态变化下进行鲁棒的车辆匹配。我们的方法充分利用了由线状特征划分的相对较大的无纹理区域组成的车辆对象的特征,并证明了异构特征在车辆匹配的不同阶段的重要用途。实验表明,使用异构特征集可以提高车辆识别的性能。
{"title":"Robust object matching for persistent tracking with heterogeneous features","authors":"Yanlin Guol, H. Sawhney, Rakesh Kumar, Steve Hsu","doi":"10.1109/VSPETS.2005.1570901","DOIUrl":"https://doi.org/10.1109/VSPETS.2005.1570901","url":null,"abstract":"Tracking objects over a long period of time in realistic environments remains a challenging problem for ground and aerial video surveillance. Matching objects and verifying their identities across multiple spatial and temporal gaps proves to be an effective way to extend tracking range. When an object track is lost due to occlusion or other reasons, we need to learn the object signature and use it to confirm the object's identity against a set of active objects when it appears again. In order to deal with poor image quality and large variations in aerial video tracking, we present in this paper a unified framework that employs a heterogeneous collection of features such as lines, points and regions for robust vehicle matching under variations in illumination, aspect and camera poses. Our approach fully utilizes the characteristics of vehicular objects that consist of relatively large textureless areas delimited by line like features, and demonstrates the important usage of heterogeneous features for different stages of vehicle matching. Experiments demonstrate the enhancement in performance of vehicle identification across multiple sightings using the heterogeneous feature set.","PeriodicalId":435841,"journal":{"name":"2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance","volume":"2010 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128212280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Ontology-based hierarchical conceptual model for semantic representation of events in dynamic scenes 基于本体的动态场景事件语义表示层次概念模型
Lun Xin, T. Tan
There is an increasing interest in semantic analysis of events in dynamic scenes in recent years, and many different methods have been reported for this challenging problem. A new approach towards event modeling and analysis with semantic representations is proposed in this paper. Our method is inspired by the entity-relation model in software engineering. It integrates all related information into a hierarchical conceptual model by the name of ontology, and defines events as significant changes and mappings of conceptual units in the mode. All concepts are represented by three basic components, an entity, a word, and a set of attributes. The lower level of our framework achieves the task of feature extraction, and in the upper level, semantically meaningful representations of events are received by using these words. So our framework is data-driven and provides semantic outputs. Semantic similarity measurement of concepts is another important problem. In this paper we propose a method that uses conceptual status vector (CSV) and weighted semantic distance (WSD) to deal with it. Experimental results are presented which demonstrate the effectiveness of our approach on real-world videos captured from different scenes.
近年来,人们对动态场景中事件的语义分析越来越感兴趣,并报道了许多不同的方法来解决这一具有挑战性的问题。本文提出了一种基于语义表示的事件建模与分析新方法。该方法的灵感来源于软件工程中的实体-关系模型。它将所有相关信息以本体的名义集成到一个层次概念模型中,并将事件定义为模式中概念单元的重大变化和映射。所有概念都由三个基本组件表示:一个实体、一个词和一组属性。我们的框架在底层完成特征提取的任务,在上层通过使用这些词接收事件的语义上有意义的表示。因此,我们的框架是数据驱动的,并提供语义输出。概念的语义相似度度量是另一个重要问题。本文提出了一种使用概念状态向量(CSV)和加权语义距离(WSD)来处理它的方法。实验结果证明了我们的方法对从不同场景捕获的真实视频的有效性。
{"title":"Ontology-based hierarchical conceptual model for semantic representation of events in dynamic scenes","authors":"Lun Xin, T. Tan","doi":"10.1109/VSPETS.2005.1570898","DOIUrl":"https://doi.org/10.1109/VSPETS.2005.1570898","url":null,"abstract":"There is an increasing interest in semantic analysis of events in dynamic scenes in recent years, and many different methods have been reported for this challenging problem. A new approach towards event modeling and analysis with semantic representations is proposed in this paper. Our method is inspired by the entity-relation model in software engineering. It integrates all related information into a hierarchical conceptual model by the name of ontology, and defines events as significant changes and mappings of conceptual units in the mode. All concepts are represented by three basic components, an entity, a word, and a set of attributes. The lower level of our framework achieves the task of feature extraction, and in the upper level, semantically meaningful representations of events are received by using these words. So our framework is data-driven and provides semantic outputs. Semantic similarity measurement of concepts is another important problem. In this paper we propose a method that uses conceptual status vector (CSV) and weighted semantic distance (WSD) to deal with it. Experimental results are presented which demonstrate the effectiveness of our approach on real-world videos captured from different scenes.","PeriodicalId":435841,"journal":{"name":"2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129166432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Integrating component cues for human pose tracking 为人体姿态跟踪集成组件线索
M. Lee, R. Nevatia
Tracking human body pose in monocular video in the presence of image noise, imperfect foreground extraction and partial occlusion of the human body is important for many video analysis applications. Human pose tracking can be made more robust by integrating the detection of components such as face and limbs. We proposed an approach based on data-driven Markov chain Monte Carlo (DD-MCMC) where component detection results are used to generate state proposals for pose estimation and initialization. Experimental results on a realistic indoor video sequence show that the method is able to track a person during turning and sitting movements.
在存在图像噪声、前景提取不完美和人体部分遮挡的情况下,对单眼视频中的人体姿态进行跟踪是许多视频分析应用的重要内容。人体姿态跟踪可以通过整合面部和肢体等成分的检测来增强鲁棒性。我们提出了一种基于数据驱动的马尔可夫链蒙特卡罗(DD-MCMC)方法,利用分量检测结果生成姿态估计和初始化的状态建议。在一个真实的室内视频序列上的实验结果表明,该方法能够跟踪人的转身和坐姿运动。
{"title":"Integrating component cues for human pose tracking","authors":"M. Lee, R. Nevatia","doi":"10.1109/VSPETS.2005.1570896","DOIUrl":"https://doi.org/10.1109/VSPETS.2005.1570896","url":null,"abstract":"Tracking human body pose in monocular video in the presence of image noise, imperfect foreground extraction and partial occlusion of the human body is important for many video analysis applications. Human pose tracking can be made more robust by integrating the detection of components such as face and limbs. We proposed an approach based on data-driven Markov chain Monte Carlo (DD-MCMC) where component detection results are used to generate state proposals for pose estimation and initialization. Experimental results on a realistic indoor video sequence show that the method is able to track a person during turning and sitting movements.","PeriodicalId":435841,"journal":{"name":"2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115495584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
On calibrating a camera network using parabolic trajectories of a bouncing ball 用弹跳球的抛物线轨迹标定摄像机网络
Kuan-Wen Chen, Y. Hung, Yong-Sheng Chen
Camera networks are often used in visual surveillance systems for wide-range monitoring. In this paper, we present a novel method for calibrating a camera network, which uses the trajectory of a bouncing ball as the calibration data. An important feature of our method is the use of the parabolic property of a ball's bouncing trajectory. This parabolic trajectory lies on a plane, called the parabolic trajectory plane (PT-plane), so that the relationship between the trajectory's points and their corresponding image points is a homography. Combining the vertical velocity determined by the earth's gravity and the horizontal velocity calculated from the homography, we can compute the 2D coordinates of the trajectory points on the PT-plane. By throwing the ball multiple times, we obtain calibration points on multiple planes for calibrating both intrinsic and extrinsic parameters of the networked cameras. Experimental results have demonstrated the feasibility and accuracy of the proposed method.
摄像机网络常用于大范围监控的视觉监控系统中。本文提出了一种利用弹跳球轨迹作为标定数据的摄像机网络标定方法。我们方法的一个重要特点是利用了球弹跳轨迹的抛物线特性。这条抛物线轨迹位于一个平面上,称为抛物线轨迹平面(pt平面),因此轨迹点与其对应的图像点之间是一种单应性关系。结合由地球重力确定的垂直速度和由单应性计算的水平速度,我们可以计算出pt平面上轨迹点的二维坐标。通过多次抛球,得到多个平面上的标定点,对网络摄像机的内外参数进行标定。实验结果证明了该方法的可行性和准确性。
{"title":"On calibrating a camera network using parabolic trajectories of a bouncing ball","authors":"Kuan-Wen Chen, Y. Hung, Yong-Sheng Chen","doi":"10.1109/VSPETS.2005.1570914","DOIUrl":"https://doi.org/10.1109/VSPETS.2005.1570914","url":null,"abstract":"Camera networks are often used in visual surveillance systems for wide-range monitoring. In this paper, we present a novel method for calibrating a camera network, which uses the trajectory of a bouncing ball as the calibration data. An important feature of our method is the use of the parabolic property of a ball's bouncing trajectory. This parabolic trajectory lies on a plane, called the parabolic trajectory plane (PT-plane), so that the relationship between the trajectory's points and their corresponding image points is a homography. Combining the vertical velocity determined by the earth's gravity and the horizontal velocity calculated from the homography, we can compute the 2D coordinates of the trajectory points on the PT-plane. By throwing the ball multiple times, we obtain calibration points on multiple planes for calibrating both intrinsic and extrinsic parameters of the networked cameras. Experimental results have demonstrated the feasibility and accuracy of the proposed method.","PeriodicalId":435841,"journal":{"name":"2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance","volume":"131 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114098686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
On-line Conservative Learning for Person Detection 基于在线保守学习的人检测
P. Roth, H. Grabner, D. Skočaj, Horst Bischof, A. Leonardis
We present a novel on-line conservative learning framework for an object detection system. All algorithms operate in an on-line mode, in particular we also present a novel on-line AdaBoost method. The basic idea is to start with a very simple object detection system and to exploit a huge amount of unlabeled video data by being very conservative in selecting training examples. The key idea is to use reconstructive and discriminative classifiers in an iterative co-training fashion to arrive at increasingly better object detectors. We demonstrate the framework on a surveillance task where we learn person detectors that are tested on two surveillance video sequences. We start with a simple moving object classifier and proceed with incremental PCA (on shape and appearance) as a reconstructive classifier, which in turn generates a training set for a discriminative on-line AdaBoost classifier
提出了一种用于目标检测系统的在线保守学习框架。所有算法都在在线模式下运行,特别是我们还提出了一种新的在线AdaBoost方法。基本思路是从一个非常简单的目标检测系统开始,通过非常保守地选择训练样本来利用大量未标记的视频数据。关键思想是在迭代的共同训练方式中使用重构和判别分类器来达到越来越好的目标检测器。我们在一个监控任务中演示了该框架,其中我们学习了在两个监控视频序列上进行测试的人员检测器。我们从一个简单的移动对象分类器开始,并继续使用增量PCA(在形状和外观上)作为重建分类器,这反过来又为判别在线AdaBoost分类器生成训练集
{"title":"On-line Conservative Learning for Person Detection","authors":"P. Roth, H. Grabner, D. Skočaj, Horst Bischof, A. Leonardis","doi":"10.1109/VSPETS.2005.1570919","DOIUrl":"https://doi.org/10.1109/VSPETS.2005.1570919","url":null,"abstract":"We present a novel on-line conservative learning framework for an object detection system. All algorithms operate in an on-line mode, in particular we also present a novel on-line AdaBoost method. The basic idea is to start with a very simple object detection system and to exploit a huge amount of unlabeled video data by being very conservative in selecting training examples. The key idea is to use reconstructive and discriminative classifiers in an iterative co-training fashion to arrive at increasingly better object detectors. We demonstrate the framework on a surveillance task where we learn person detectors that are tested on two surveillance video sequences. We start with a simple moving object classifier and proceed with incremental PCA (on shape and appearance) as a reconstructive classifier, which in turn generates a training set for a discriminative on-line AdaBoost classifier","PeriodicalId":435841,"journal":{"name":"2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124968006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 83
Online learning of region confidences for object tracking 用于目标跟踪的区域置信度在线学习
Datong Chen, Jie Yang
This paper presents an online learning method for object tracking. Motivated by the attention shifting among local regions of a human vision system during tracking, we propose to allow different regions of an object to have different confidences. The confidence of each region is learned online to reflect the discriminative power of the region in feature space and the probability of occlusion. The distribution of region confidences is employed to guide a tracking algorithm to find correspondences in adjacent frames of video images. Only high confidence regions are tracked instead of the entire object. We demonstrate feasibility of the proposed method in video surveillance applications. The method can be combined with many other existing tracking systems to enhance robustness of these systems.
提出了一种用于目标跟踪的在线学习方法。基于人类视觉系统在跟踪过程中注意力在局部区域之间的转移,我们提出允许物体的不同区域具有不同的置信度。在线学习每个区域的置信度,以反映该区域在特征空间中的判别能力和遮挡概率。利用区域置信度的分布来指导跟踪算法在视频图像的相邻帧中寻找对应关系。只跟踪高置信度区域,而不是整个对象。我们证明了该方法在视频监控应用中的可行性。该方法可以与许多其他现有跟踪系统相结合,以增强这些系统的鲁棒性。
{"title":"Online learning of region confidences for object tracking","authors":"Datong Chen, Jie Yang","doi":"10.1109/VSPETS.2005.1570891","DOIUrl":"https://doi.org/10.1109/VSPETS.2005.1570891","url":null,"abstract":"This paper presents an online learning method for object tracking. Motivated by the attention shifting among local regions of a human vision system during tracking, we propose to allow different regions of an object to have different confidences. The confidence of each region is learned online to reflect the discriminative power of the region in feature space and the probability of occlusion. The distribution of region confidences is employed to guide a tracking algorithm to find correspondences in adjacent frames of video images. Only high confidence regions are tracked instead of the entire object. We demonstrate feasibility of the proposed method in video surveillance applications. The method can be combined with many other existing tracking systems to enhance robustness of these systems.","PeriodicalId":435841,"journal":{"name":"2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance","volume":"2017 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123256096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Vehicle Categorization: Parts for Speed and Accuracy 车辆分类:速度和准确性零件
Eric Nowak, F. Jurie
In this paper we propose a framework for categorization of different types of vehicles. The difficulty comes from the high inter-class similarity and the high intra-class variability. We address this problem using a part-based recognition system. We particularly focus on the trade-off between the number of parts included in the vehicle models and the recognition rate, i.e the trade-off between fast computation and high accuracy. We propose a high-level data transformation algorithm and a feature selection scheme adapted to hierarchical SVM classifiers to improve the performance of part-based vehicle models. We have tested the proposed framework on real data acquired by infrared surveillance cameras, and on visible images too. On the infrared dataset, with the same speedup factor of 100, our accuracy is 12% better than the standard one-versus-one SVM.
本文提出了一种对不同类型车辆进行分类的框架。难度在于类间相似性高,类内变异性大。我们使用基于零件的识别系统来解决这个问题。我们特别关注车辆模型中包含的零件数量与识别率之间的权衡,即快速计算与高精度之间的权衡。为了提高基于零件的汽车模型的性能,我们提出了一种高级数据转换算法和一种适应分层支持向量机分类器的特征选择方案。我们已经在红外监控摄像机获取的真实数据和可见图像上测试了所提出的框架。在红外数据集上,同样的加速系数为100,我们的准确率比标准的1对1 SVM提高了12%。
{"title":"Vehicle Categorization: Parts for Speed and Accuracy","authors":"Eric Nowak, F. Jurie","doi":"10.1109/VSPETS.2005.1570926","DOIUrl":"https://doi.org/10.1109/VSPETS.2005.1570926","url":null,"abstract":"In this paper we propose a framework for categorization of different types of vehicles. The difficulty comes from the high inter-class similarity and the high intra-class variability. We address this problem using a part-based recognition system. We particularly focus on the trade-off between the number of parts included in the vehicle models and the recognition rate, i.e the trade-off between fast computation and high accuracy. We propose a high-level data transformation algorithm and a feature selection scheme adapted to hierarchical SVM classifiers to improve the performance of part-based vehicle models. We have tested the proposed framework on real data acquired by infrared surveillance cameras, and on visible images too. On the infrared dataset, with the same speedup factor of 100, our accuracy is 12% better than the standard one-versus-one SVM.","PeriodicalId":435841,"journal":{"name":"2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128361602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
期刊
2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1