首页 > 最新文献

2003 Conference on Computer Vision and Pattern Recognition Workshop最新文献

英文 中文
Automated Posture Analysis for Detecting Learner's Interest Level 用于检测学习者兴趣水平的自动姿势分析
Pub Date : 2003-06-16 DOI: 10.1109/CVPRW.2003.10047
Selene Mota, Rosalind W. Picard
This paper presents a system for recognizing naturally occurring postures and associated affective states related to a child's interest level while performing a learning task on a computer. Postures are gathered using two matrices of pressure sensors mounted on the seat and back of a chair. Subsequently, posture features are extracted using a mixture of four gaussians, and input to a 3-layer feed-forward neural network. The neural network classifies nine postures in real time and achieves an overall accuracy of 87.6% when tested with postures coming from new subjects. A set of independent Hidden Markov Models (HMMs) is used to analyze temporal patterns among these posture sequences in order to determine three categories related to a child's level of interest, as rated by human observers. The system reaches an overall performance of 82.3% with posture sequences coming from known subjects and 76.5% with unknown subjects.
本文提出了一种识别自然发生的姿势和与儿童在计算机上执行学习任务时的兴趣水平相关的情感状态的系统。姿势是通过安装在座椅和椅背上的两个压力传感器矩阵来收集的。随后,使用四高斯混合提取姿态特征,并将其输入到三层前馈神经网络中。该神经网络实时对9种姿势进行分类,在对来自新受试者的姿势进行测试时,总体准确率达到87.6%。一组独立的隐马尔可夫模型(hmm)用于分析这些姿势序列之间的时间模式,以确定与儿童兴趣水平相关的三种类别,并由人类观察者评分。对于来自已知对象的姿态序列,系统的总体性能为82.3%,对于未知对象的姿态序列,系统的总体性能为76.5%。
{"title":"Automated Posture Analysis for Detecting Learner's Interest Level","authors":"Selene Mota, Rosalind W. Picard","doi":"10.1109/CVPRW.2003.10047","DOIUrl":"https://doi.org/10.1109/CVPRW.2003.10047","url":null,"abstract":"This paper presents a system for recognizing naturally occurring postures and associated affective states related to a child's interest level while performing a learning task on a computer. Postures are gathered using two matrices of pressure sensors mounted on the seat and back of a chair. Subsequently, posture features are extracted using a mixture of four gaussians, and input to a 3-layer feed-forward neural network. The neural network classifies nine postures in real time and achieves an overall accuracy of 87.6% when tested with postures coming from new subjects. A set of independent Hidden Markov Models (HMMs) is used to analyze temporal patterns among these posture sequences in order to determine three categories related to a child's level of interest, as rated by human observers. The system reaches an overall performance of 82.3% with posture sequences coming from known subjects and 76.5% with unknown subjects.","PeriodicalId":121249,"journal":{"name":"2003 Conference on Computer Vision and Pattern Recognition Workshop","volume":"172 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116698491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 351
Robust LDA Classification by Subsampling 基于子抽样的鲁棒LDA分类
Pub Date : 2003-06-16 DOI: 10.1109/CVPRW.2003.10089
S. Fidler, A. Leonardis
In this paper we present a new method which enables a robust calculation of the LDA classification rule, thus making the recognition of objects under non-ideal conditions possible, i.e., in situations when objects are occluded or they appear on a varying background, or when their images are corrupted by outliers. The main idea behind the method is to translate the task of calculating the LDA classification rule into the problem of determining the coefficients of an augmented generative model (PCA). Specifically, we construct an augmented PCA basis which, on the one hand, contains information necessary for the classification (in the LDA sense), and, on the other hand, enables us to calculate the necessary coefficients by means of a subsampling approach resulting in a high breakdown point classification. The theoretical results are evaluated on the ORL face database showing that the proposed method significantly outperforms the standard LDA.
在本文中,我们提出了一种新的方法,可以鲁棒地计算LDA分类规则,从而使非理想条件下的目标识别成为可能,即当目标被遮挡或它们出现在不同的背景中,或者当它们的图像被异常值损坏时。该方法的主要思想是将计算LDA分类规则的任务转化为确定增广生成模型(PCA)系数的问题。具体来说,我们构建了一个增强型PCA基础,一方面包含分类所需的信息(在LDA意义上),另一方面,使我们能够通过子抽样方法计算必要的系数,从而获得高分解点分类。在ORL人脸数据库上对理论结果进行了评估,结果表明该方法明显优于标准LDA方法。
{"title":"Robust LDA Classification by Subsampling","authors":"S. Fidler, A. Leonardis","doi":"10.1109/CVPRW.2003.10089","DOIUrl":"https://doi.org/10.1109/CVPRW.2003.10089","url":null,"abstract":"In this paper we present a new method which enables a robust calculation of the LDA classification rule, thus making the recognition of objects under non-ideal conditions possible, i.e., in situations when objects are occluded or they appear on a varying background, or when their images are corrupted by outliers. The main idea behind the method is to translate the task of calculating the LDA classification rule into the problem of determining the coefficients of an augmented generative model (PCA). Specifically, we construct an augmented PCA basis which, on the one hand, contains information necessary for the classification (in the LDA sense), and, on the other hand, enables us to calculate the necessary coefficients by means of a subsampling approach resulting in a high breakdown point classification. The theoretical results are evaluated on the ORL face database showing that the proposed method significantly outperforms the standard LDA.","PeriodicalId":121249,"journal":{"name":"2003 Conference on Computer Vision and Pattern Recognition Workshop","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124941330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Enforcing Constraints for Human Body Tracking 强制约束人体跟踪
Pub Date : 2003-06-16 DOI: 10.1109/CVPRW.2003.10101
D. Demirdjian
A novel approach for tracking 3D articulated human bodies in stereo images is presented. We present a projection-based method for enforcing articulated constraints. We define the articulated motion space as the space in which the motions of the limbs of a body belong. We show that around the origin, the articulated motion space can be approximated by a linear space estimated directly from the previous body pose. Articulated constraints are enforced by projecting unconstrained motions onto the linearized articulated motion space in an optimal way. Our paper also addresses the problem of accounting for other constraints on body pose and dynamics (e.g. joint angle bounds, maximum speed). We present here an approach to guarantee these constraints while tracking people.
提出了一种在立体图像中跟踪三维关节人体的新方法。我们提出了一种基于投影的方法来执行铰接约束。我们将关节运动空间定义为物体四肢运动所属的空间。我们证明了在原点周围,关节运动空间可以通过直接从前一个身体姿势估计的线性空间来近似。铰接约束是通过将无约束运动以最优方式投射到线性化的铰接运动空间来实现的。我们的论文还解决了考虑身体姿势和动力学的其他约束的问题(例如关节角度界限,最大速度)。我们在这里提出了一种在跟踪人员时保证这些约束的方法。
{"title":"Enforcing Constraints for Human Body Tracking","authors":"D. Demirdjian","doi":"10.1109/CVPRW.2003.10101","DOIUrl":"https://doi.org/10.1109/CVPRW.2003.10101","url":null,"abstract":"A novel approach for tracking 3D articulated human bodies in stereo images is presented. We present a projection-based method for enforcing articulated constraints. We define the articulated motion space as the space in which the motions of the limbs of a body belong. We show that around the origin, the articulated motion space can be approximated by a linear space estimated directly from the previous body pose. Articulated constraints are enforced by projecting unconstrained motions onto the linearized articulated motion space in an optimal way. Our paper also addresses the problem of accounting for other constraints on body pose and dynamics (e.g. joint angle bounds, maximum speed). We present here an approach to guarantee these constraints while tracking people.","PeriodicalId":121249,"journal":{"name":"2003 Conference on Computer Vision and Pattern Recognition Workshop","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125360759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
A Statistical Assessment of Subject Factors in the PCA Recognition of Human Faces 人脸主成分分析中主体因素的统计分析
Pub Date : 2003-06-16 DOI: 10.1109/CVPRW.2003.10088
G. Givens, Ross Beveridge, B. Draper, D. Bolme
Some people's faces are easier to recognize than others, but it is not obvious what subject-specific factors make individual faces easy or difficult to recognize. This study considers 11 factors that might make recognition easy or difficult for 1,072 human subjects in the FERET dataset. The specific factors are: race (white, Asian, African-American, or other), gender, age (young or old), glasses (present or absent), facial hair (present or absent), bangs (present or absent), mouth (closed or other), eyes (open or other), complexion (clear or other), makeup (present or absent), and expression (neutral or other). An ANOVA is used to determine the relationship between these subject covariates and the distance between pairs of images of the same subject in a standard Eigenfaces subspace. Some results are not terribly surprising. For example, the distance between pairs of images of the same subject increases for people who change their appearance, e.g., open and close their eyes, open and close their mouth or change expression. Thus changing appearance makes recognition harder. Other findings are surprising. Distance between pairs of images for subjects decreases for people who consistently wear glasses, so wearing glasses makes subjects more recognizable. Pairwise distance also decreases for people who are either Asian or African-American rather than white. A possible shortcoming of our analysis is that minority classifications such as African-Americans and wearers-of-glasses are underrepresented in training. Followup experiments with balanced training addresses this concern and corroborates the original findings. Another possible shortcoming of this analysis is the novel use of pairwise distance between images of a single person as the predictor of recognition difficulty. A separate experiment confirms that larger distances between pairs of subject images implies a larger recognition rank for that same pair of images, thus confirming that the subject is harder to recognize.
有些人的脸比其他人的脸更容易识别,但不清楚是什么特定于受试者的因素使个人的脸容易或难以识别。这项研究考虑了11个因素,这些因素可能会使FERET数据集中的1072名人类受试者的识别变得容易或困难。具体因素有:种族(白人、亚洲人、非裔美国人或其他)、性别、年龄(年轻或年老)、眼镜(戴或不戴)、面部毛发(戴或不留)、刘海(留或不留)、嘴巴(闭或其他)、眼睛(睁开或其他)、肤色(清澈或其他)、妆容(戴或不戴)和表情(中性或其他)。方差分析用于确定这些主体协变量与标准特征面子空间中同一主体的图像对之间的距离之间的关系。有些结果并不特别令人惊讶。例如,对于改变外表的人来说,例如,睁眼和闭眼、张嘴和闭嘴或改变表情,同一对象的成对图像之间的距离会增加。因此,改变外表会增加识别难度。其他发现令人惊讶。对于经常戴眼镜的人来说,被试的图像之间的距离会缩短,所以戴眼镜会让被试更容易被识别。与白人相比,亚裔或非裔美国人的两两距离也会减少。我们分析的一个可能的缺点是,像非洲裔美国人和戴眼镜的人这样的少数族裔在培训中的代表性不足。平衡训练的后续实验解决了这一问题,并证实了最初的发现。这种分析的另一个可能的缺点是,它新颖地使用了单个人的图像之间的成对距离作为识别难度的预测因子。一项单独的实验证实,两组被试图像之间的距离越大,意味着同一对图像的识别等级越高,从而证实了被试更难被识别。
{"title":"A Statistical Assessment of Subject Factors in the PCA Recognition of Human Faces","authors":"G. Givens, Ross Beveridge, B. Draper, D. Bolme","doi":"10.1109/CVPRW.2003.10088","DOIUrl":"https://doi.org/10.1109/CVPRW.2003.10088","url":null,"abstract":"Some people's faces are easier to recognize than others, but it is not obvious what subject-specific factors make individual faces easy or difficult to recognize. This study considers 11 factors that might make recognition easy or difficult for 1,072 human subjects in the FERET dataset. The specific factors are: race (white, Asian, African-American, or other), gender, age (young or old), glasses (present or absent), facial hair (present or absent), bangs (present or absent), mouth (closed or other), eyes (open or other), complexion (clear or other), makeup (present or absent), and expression (neutral or other). An ANOVA is used to determine the relationship between these subject covariates and the distance between pairs of images of the same subject in a standard Eigenfaces subspace. Some results are not terribly surprising. For example, the distance between pairs of images of the same subject increases for people who change their appearance, e.g., open and close their eyes, open and close their mouth or change expression. Thus changing appearance makes recognition harder. Other findings are surprising. Distance between pairs of images for subjects decreases for people who consistently wear glasses, so wearing glasses makes subjects more recognizable. Pairwise distance also decreases for people who are either Asian or African-American rather than white. A possible shortcoming of our analysis is that minority classifications such as African-Americans and wearers-of-glasses are underrepresented in training. Followup experiments with balanced training addresses this concern and corroborates the original findings. Another possible shortcoming of this analysis is the novel use of pairwise distance between images of a single person as the predictor of recognition difficulty. A separate experiment confirms that larger distances between pairs of subject images implies a larger recognition rank for that same pair of images, thus confirming that the subject is harder to recognize.","PeriodicalId":121249,"journal":{"name":"2003 Conference on Computer Vision and Pattern Recognition Workshop","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123861541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 83
Drawing Accurate Ground Plans Using Optical Triangulation Data 使用光学三角测量数据绘制精确的地面平面图
Pub Date : 2003-06-16 DOI: 10.1109/CVPRW.2003.10009
Kevin Cain, Philippe Martinez
Here we consider optical triangulation scanning as a means of creating permanent architectural archives in the form of accurate ground plans and other orthographic views. We present plan drawings created with laser scan data and use these documents to make comparisons with existing documents. Finally, we present a new technique for decreasing the laser scanning field time required to create plans and other views.
在这里,我们考虑将光学三角测量扫描作为一种以精确的平面图和其他正射影视图的形式创建永久建筑档案的手段。我们展示了用激光扫描数据创建的平面图,并使用这些文件与现有文件进行比较。最后,我们提出了一种新的技术,以减少激光扫描所需的场时间,以创建平面图和其他视图。
{"title":"Drawing Accurate Ground Plans Using Optical Triangulation Data","authors":"Kevin Cain, Philippe Martinez","doi":"10.1109/CVPRW.2003.10009","DOIUrl":"https://doi.org/10.1109/CVPRW.2003.10009","url":null,"abstract":"Here we consider optical triangulation scanning as a means of creating permanent architectural archives in the form of accurate ground plans and other orthographic views. We present plan drawings created with laser scan data and use these documents to make comparisons with existing documents. Finally, we present a new technique for decreasing the laser scanning field time required to create plans and other views.","PeriodicalId":121249,"journal":{"name":"2003 Conference on Computer Vision and Pattern Recognition Workshop","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125452577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Multi Target Track Before Detect Application 一种多目标检测前跟踪的应用
Pub Date : 2003-06-16 DOI: 10.1109/CVPRW.2003.10100
Y. Boers, J. Driessen, F. Verschure, W. Heemels, A. Juloski
This paper deals with a radar track before detect application in a multi target setting. Track before detect is a method to track weak objects (targets) on the basis raw radar measurements, e.g. the reflected target power. In classical target tracking, the tracking process is performed on the basis of pre-processed measurements, that are constructed from the original measurement data every time step. In this way no integration over time takes place and information is lost. In this paper we will give a modelling setup and a particle filter based algorithm to deal with a multiple target track before detect situation. In simulations we show that, using this method, it is possible to track multiple, closely spaced, (weak) targets.
研究了一种雷达探测前航迹在多目标环境下的应用。先探测后跟踪是一种基于原始雷达测量值(如目标反射功率)对弱小物体(目标)进行跟踪的方法。在经典的目标跟踪中,跟踪过程是在每个时间步长由原始测量数据构建的预处理测量的基础上进行的。这样,随着时间的推移,集成就不会发生,信息也会丢失。在本文中,我们将给出一种建模方法和一种基于粒子滤波的算法来处理检测前的多目标轨迹。在模拟中,我们表明,使用这种方法,可以跟踪多个,紧密间隔,(弱)目标。
{"title":"A Multi Target Track Before Detect Application","authors":"Y. Boers, J. Driessen, F. Verschure, W. Heemels, A. Juloski","doi":"10.1109/CVPRW.2003.10100","DOIUrl":"https://doi.org/10.1109/CVPRW.2003.10100","url":null,"abstract":"This paper deals with a radar track before detect application in a multi target setting. Track before detect is a method to track weak objects (targets) on the basis raw radar measurements, e.g. the reflected target power. In classical target tracking, the tracking process is performed on the basis of pre-processed measurements, that are constructed from the original measurement data every time step. In this way no integration over time takes place and information is lost. In this paper we will give a modelling setup and a particle filter based algorithm to deal with a multiple target track before detect situation. In simulations we show that, using this method, it is possible to track multiple, closely spaced, (weak) targets.","PeriodicalId":121249,"journal":{"name":"2003 Conference on Computer Vision and Pattern Recognition Workshop","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125482269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Automatic 3D modeling of archaeological objects 考古对象的自动3D建模
Pub Date : 2003-06-16 DOI: 10.1109/CVPRW.2003.10006
M. Andreetto, N. Brusco, G. Cortelazzo
A wide-spread use of 3D models in archeology application requires low cost equipment and technically simple modeling procedures. In this context methods for automatic 3D modeling based on fully automatic techniques for 3D views registration will play a central role. This paper proposes a very robust procedure which does not require special equipment or skill in order to make 3D models. The results of this paper, originally conceived to address the costs issues of heritage's modeling, can be profitably exploited also in other modeling applications.
三维模型在考古应用中的广泛应用需要低成本的设备和技术上简单的建模程序。在这种情况下,基于全自动三维视图注册技术的自动三维建模方法将发挥核心作用。本文提出了一种非常稳健的程序,它不需要特殊的设备或技能来制作3D模型。本文的结果,最初是为了解决遗产建模的成本问题,也可以在其他建模应用中得到有益的利用。
{"title":"Automatic 3D modeling of archaeological objects","authors":"M. Andreetto, N. Brusco, G. Cortelazzo","doi":"10.1109/CVPRW.2003.10006","DOIUrl":"https://doi.org/10.1109/CVPRW.2003.10006","url":null,"abstract":"A wide-spread use of 3D models in archeology application requires low cost equipment and technically simple modeling procedures. In this context methods for automatic 3D modeling based on fully automatic techniques for 3D views registration will play a central role. This paper proposes a very robust procedure which does not require special equipment or skill in order to make 3D models. The results of this paper, originally conceived to address the costs issues of heritage's modeling, can be profitably exploited also in other modeling applications.","PeriodicalId":121249,"journal":{"name":"2003 Conference on Computer Vision and Pattern Recognition Workshop","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125756779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
A Factorization Approach for Activity Recognition 一种活动识别的因子分解方法
Pub Date : 2003-06-16 DOI: 10.1109/CVPRW.2003.10040
A. Roy-Chowdhury, R. Chellappa
Understanding activities arising out of the interactions of a configuration of moving objects is an important problem in video understanding, with applications in surveillance and monitoring. A special situation is when the objects are small enough to be represented as points on a 2D plane. In this paper, we introduce a novel method of representing the activity by the deformations of the point configuration in a properly defined shape space. Instead of inferring about the activity directly from the motion tracks of the individual points, we propose to model an activity by the polygonal shape formed by joining the locations of these point masses at any time t, and its deformation as the activity unfolds. Given the locations of the 2D points over a sequence of frames in the video, the factorization theorem for matrices is used to obtain a set of basis shapes for each activity. An unknown activity can now be recognized by projecting onto these basis shapes. Also, once a specific activity is recognized, the deviations from it can be modeled by the deformations from the basis shape. This is used to identify an abnormal activity. We demonstrate the applicability of our algorithm using real-life video sequences in an airport surveillance environment. We are able to identify the major activities that take place in that setting and detect abnormal ones.
在视频理解中,理解由一组移动物体的相互作用引起的活动是一个重要问题,在监视和监控中有应用。一种特殊情况是,当对象足够小,可以在二维平面上表示为点。在本文中,我们引入了一种用适当定义的形状空间中点的形态变形来表示活动的新方法。我们不是直接从单个点的运动轨迹推断活动,而是建议通过在任何时间t连接这些点质量的位置形成的多边形形状来模拟活动,以及活动展开时的变形。给定视频中一系列帧上2D点的位置,使用矩阵的分解定理来获得每个活动的一组基形状。一个未知的活动现在可以通过投射到这些基本形状来识别。此外,一旦识别出特定的活动,就可以通过基本形状的变形来模拟与之相关的偏差。这用于识别异常活动。我们用机场监控环境中的真实视频序列证明了算法的适用性。我们能够识别出在这种环境下发生的主要活动,并检测出异常活动。
{"title":"A Factorization Approach for Activity Recognition","authors":"A. Roy-Chowdhury, R. Chellappa","doi":"10.1109/CVPRW.2003.10040","DOIUrl":"https://doi.org/10.1109/CVPRW.2003.10040","url":null,"abstract":"Understanding activities arising out of the interactions of a configuration of moving objects is an important problem in video understanding, with applications in surveillance and monitoring. A special situation is when the objects are small enough to be represented as points on a 2D plane. In this paper, we introduce a novel method of representing the activity by the deformations of the point configuration in a properly defined shape space. Instead of inferring about the activity directly from the motion tracks of the individual points, we propose to model an activity by the polygonal shape formed by joining the locations of these point masses at any time t, and its deformation as the activity unfolds. Given the locations of the 2D points over a sequence of frames in the video, the factorization theorem for matrices is used to obtain a set of basis shapes for each activity. An unknown activity can now be recognized by projecting onto these basis shapes. Also, once a specific activity is recognized, the deviations from it can be modeled by the deformations from the basis shape. This is used to identify an abnormal activity. We demonstrate the applicability of our algorithm using real-life video sequences in an airport surveillance environment. We are able to identify the major activities that take place in that setting and detect abnormal ones.","PeriodicalId":121249,"journal":{"name":"2003 Conference on Computer Vision and Pattern Recognition Workshop","volume":"24 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113938395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
Comparative Studies of Line-based Panoramic Camera Calibration 基于线的全景相机标定的比较研究
Pub Date : 2003-06-16 DOI: 10.1109/CVPRW.2003.10086
F. Huang, Shou-Kang Wei, R. Klette
The calibration of a line-based panoramic camera can be split into two independent subtasks: first calibrate the effective focal length and the principal row, and second, calibrate the off-axis distance and the principal angle. The paper provides solutions for three different methods, and compares these methods based on experiments using a superhigh resolution line-based panoramic camera. It turns out that the second subtask is solved best if a straight-segment based approach is used, compared to point-based or correspondence-based calibration methods, all already known for traditional (planar) pinhole cameras, but not yet previously discussed for panoramic cameras.
基于线的全景相机的标定可分为两个独立的子任务:一是标定有效焦距和主行,二是标定离轴距离和主角度。本文给出了三种不同方法的解决方案,并通过超高分辨率线全景相机的实验对三种方法进行了比较。事实证明,与基于点或基于对应的校准方法相比,如果使用基于直线段的方法,则第二个子任务的解决效果最好,这些方法都已用于传统(平面)针孔相机,但尚未讨论全景相机。
{"title":"Comparative Studies of Line-based Panoramic Camera Calibration","authors":"F. Huang, Shou-Kang Wei, R. Klette","doi":"10.1109/CVPRW.2003.10086","DOIUrl":"https://doi.org/10.1109/CVPRW.2003.10086","url":null,"abstract":"The calibration of a line-based panoramic camera can be split into two independent subtasks: first calibrate the effective focal length and the principal row, and second, calibrate the off-axis distance and the principal angle. The paper provides solutions for three different methods, and compares these methods based on experiments using a superhigh resolution line-based panoramic camera. It turns out that the second subtask is solved best if a straight-segment based approach is used, compared to point-based or correspondence-based calibration methods, all already known for traditional (planar) pinhole cameras, but not yet previously discussed for panoramic cameras.","PeriodicalId":121249,"journal":{"name":"2003 Conference on Computer Vision and Pattern Recognition Workshop","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114645856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Image Based Hand Tracking via Interacting Multiple Model and Probabilistic Data Association (IMM-PDA) Algorithm 基于交互多模型和概率数据关联(IMM-PDA)算法的图像手部跟踪
Pub Date : 2003-06-16 DOI: 10.1109/CVPRW.2003.10050
Shunguang Wu, L. Hong, Francis K. H. Quek
Traditional image based hand tracking uses a single Kalman filter to estimate and predict the hand state (position, velocity, and acceleration). However, this approach may fail in the case of large maneuvers and cluttered measurements. In this paper we propose to use the interacting multiple model (IMM) filter to catch a maneuver and the probabilistic data association (PDA) method to process noisy measurements and false alarms. A theoretical framework of image based hand tracking by IMM-PDA is set up. Experiment results from several video segments show that IMM-PDA can successfully track hand motions in a natural conversational environment.
传统的基于图像的手部跟踪使用单个卡尔曼滤波器来估计和预测手部状态(位置、速度和加速度)。然而,这种方法在大型机动和杂乱测量的情况下可能会失败。本文提出用交互多模型(IMM)滤波捕捉机动,用概率数据关联(PDA)方法处理噪声测量和虚警。建立了基于图像的IMM-PDA手部跟踪的理论框架。来自多个视频片段的实验结果表明,IMM-PDA可以成功地跟踪自然会话环境中的手部动作。
{"title":"Image Based Hand Tracking via Interacting Multiple Model and Probabilistic Data Association (IMM-PDA) Algorithm","authors":"Shunguang Wu, L. Hong, Francis K. H. Quek","doi":"10.1109/CVPRW.2003.10050","DOIUrl":"https://doi.org/10.1109/CVPRW.2003.10050","url":null,"abstract":"Traditional image based hand tracking uses a single Kalman filter to estimate and predict the hand state (position, velocity, and acceleration). However, this approach may fail in the case of large maneuvers and cluttered measurements. In this paper we propose to use the interacting multiple model (IMM) filter to catch a maneuver and the probabilistic data association (PDA) method to process noisy measurements and false alarms. A theoretical framework of image based hand tracking by IMM-PDA is set up. Experiment results from several video segments show that IMM-PDA can successfully track hand motions in a natural conversational environment.","PeriodicalId":121249,"journal":{"name":"2003 Conference on Computer Vision and Pattern Recognition Workshop","volume":"257 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120868434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2003 Conference on Computer Vision and Pattern Recognition Workshop
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1