首页 > 最新文献

Object recognition supported by user interaction for service robots最新文献

英文 中文
Multiple complex object tracking using a combined technique 基于组合技术的多复杂目标跟踪
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048402
E. Polat, M. Yeasin, Rajeev Sharma
We present a multiple object tracking framework that employs two common methods for tracking and image matching, namely Multiple Hypothesis Tracking (MHT) and Hausdorff image matching. We use the MHT algorithm to track image edges simultaneously. This algorithm is capable of tracking multiple edges with limited occlusions and is suitable for resolving any data association uncertainty caused by background clutter and closely-spaced edges. We use the Hausdorff matching algorithm to organize individual edges into objects given their two-dimensional models. The combined technique provides a robust probabilistic tracking framework which is capable of tracking complex objects in cluttered background in video sequences.
我们提出了一个多目标跟踪框架,该框架采用了两种常见的跟踪和图像匹配方法,即多假设跟踪(MHT)和Hausdorff图像匹配。我们使用MHT算法同时跟踪图像边缘。该算法能够在有限遮挡的情况下跟踪多个边缘,适用于解决背景杂波和密集边缘引起的数据关联不确定性。我们使用Hausdorff匹配算法将单个边缘组织成给定二维模型的对象。该方法提供了一种鲁棒的概率跟踪框架,能够对视频序列中杂乱背景下的复杂目标进行跟踪。
{"title":"Multiple complex object tracking using a combined technique","authors":"E. Polat, M. Yeasin, Rajeev Sharma","doi":"10.1109/ICPR.2002.1048402","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048402","url":null,"abstract":"We present a multiple object tracking framework that employs two common methods for tracking and image matching, namely Multiple Hypothesis Tracking (MHT) and Hausdorff image matching. We use the MHT algorithm to track image edges simultaneously. This algorithm is capable of tracking multiple edges with limited occlusions and is suitable for resolving any data association uncertainty caused by background clutter and closely-spaced edges. We use the Hausdorff matching algorithm to organize individual edges into objects given their two-dimensional models. The combined technique provides a robust probabilistic tracking framework which is capable of tracking complex objects in cluttered background in video sequences.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"103 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128689290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Step acceleration based training algorithm for feedforward neural networks 基于阶跃加速的前馈神经网络训练算法
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048243
Yanlai Li, Kuanquan Wang, David Zhang
This paper presents a very fast step acceleration based training algorithm (SATA) for multilayer feedforward neural network training. The most outstanding virtue of this algorithm is that it does not need to calculate the gradient of the target function. In each iteration step, the computation only concentrates on the corresponding varied part. The proposed algorithm has attributes in simplicity, flexibility and feasibility, as well as high speed of convergence. Compared with the other methods, including the conventional backpropagation (BP), conjugate gradient, and weight extrapolation based BP, many simulations confirmed the superiority of this algorithm in terms of converging speed and computation time required.
提出了一种用于多层前馈神经网络训练的快速阶跃加速训练算法(SATA)。该算法最突出的优点是不需要计算目标函数的梯度。在每个迭代步骤中,计算只集中在相应的变化部分。该算法具有简单、灵活、可行、收敛速度快等特点。与传统的反向传播(BP)、共轭梯度和基于权外推的BP等方法相比,许多仿真验证了该算法在收敛速度和计算时间方面的优越性。
{"title":"Step acceleration based training algorithm for feedforward neural networks","authors":"Yanlai Li, Kuanquan Wang, David Zhang","doi":"10.1109/ICPR.2002.1048243","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048243","url":null,"abstract":"This paper presents a very fast step acceleration based training algorithm (SATA) for multilayer feedforward neural network training. The most outstanding virtue of this algorithm is that it does not need to calculate the gradient of the target function. In each iteration step, the computation only concentrates on the corresponding varied part. The proposed algorithm has attributes in simplicity, flexibility and feasibility, as well as high speed of convergence. Compared with the other methods, including the conventional backpropagation (BP), conjugate gradient, and weight extrapolation based BP, many simulations confirmed the superiority of this algorithm in terms of converging speed and computation time required.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127337852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Dependence characteristics of face recognition algorithms 人脸识别算法的依赖特性
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048230
A. Rukhin, P. Grother, P. Phillips, Stefan Leigh, A. Heckert, E. Newton
Nonparametric statistics for quantifying dependence between the output rankings of face recognition algorithms are described Analysis of the archived results of a large face recognition study shows that even the better algorithms exhibit significantly different behaviors. It is found that there is significant dependence in the rankings given by two algorithms to similar and dissimilar faces but that other samples are ranked independently. A class of functions known as copulas is used; it is shown that the correlations arise from a mixture of two copulas.
对一项大型人脸识别研究的存档结果的分析表明,即使是较好的算法也表现出显著不同的行为。研究发现,两种算法对相似和不相似的人脸给出的排名有显著的依赖性,而其他样本的排名是独立的。使用了一类称为copulas的函数;结果表明,这种相关性是由两种联结的混合产生的。
{"title":"Dependence characteristics of face recognition algorithms","authors":"A. Rukhin, P. Grother, P. Phillips, Stefan Leigh, A. Heckert, E. Newton","doi":"10.1109/ICPR.2002.1048230","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048230","url":null,"abstract":"Nonparametric statistics for quantifying dependence between the output rankings of face recognition algorithms are described Analysis of the archived results of a large face recognition study shows that even the better algorithms exhibit significantly different behaviors. It is found that there is significant dependence in the rankings given by two algorithms to similar and dissimilar faces but that other samples are ranked independently. A class of functions known as copulas is used; it is shown that the correlations arise from a mixture of two copulas.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"124 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116647218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Improved stereo image matching using mutual information and hierarchical prior probabilities 利用互信息和分层先验概率改进立体图像匹配
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048459
C. Fookes, Bennamoun, A. Lamanna
Mutual information (MI) has shown promise as an effective stereo matching measure for images affected by radiometric distortion. This is due to the robustness of MI against changes in illumination. However MI-based approaches are particularly prone to the generation of false matches due to the small statistical power of the matching windows. The paper proposes extensions to MI-based stereo matching in order to increase the robustness of the algorithm. Firstly, prior probabilities are incorporated into the MI measure in order to considerably increase the statistical power of the matching windows. These prior probabilities, which are calculated from the global joint histogram between the stereo pair, are tuned to a two level hierarchical approach. A 2D match surface, in which the match score is computed for every possible combination of template and matching window, is also utilised. This enforces left-right consistency and uniqueness constraints. These additions to MI-based stereo matching significantly enhance the algorithm's ability to detect correct matches while decreasing computation time and improving the accuracy.
互信息(MI)作为一种有效的立体匹配方法,对受辐射畸变影响的图像有很大的应用前景。这是由于MI对光照变化的鲁棒性。然而,由于匹配窗口的统计能力较小,基于mi的方法特别容易产生错误匹配。为了提高算法的鲁棒性,本文对基于mi的立体匹配进行了扩展。首先,将先验概率纳入到MI测度中,以显著提高匹配窗口的统计能力。这些先验概率是从立体对之间的全局联合直方图计算出来的,被调整为两级分层方法。还使用了一个二维匹配曲面,其中计算了模板和匹配窗口的每种可能组合的匹配分数。这强制了左右一致性和唯一性约束。这些添加到基于mi的立体匹配中,显著增强了算法检测正确匹配的能力,同时减少了计算时间,提高了精度。
{"title":"Improved stereo image matching using mutual information and hierarchical prior probabilities","authors":"C. Fookes, Bennamoun, A. Lamanna","doi":"10.1109/ICPR.2002.1048459","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048459","url":null,"abstract":"Mutual information (MI) has shown promise as an effective stereo matching measure for images affected by radiometric distortion. This is due to the robustness of MI against changes in illumination. However MI-based approaches are particularly prone to the generation of false matches due to the small statistical power of the matching windows. The paper proposes extensions to MI-based stereo matching in order to increase the robustness of the algorithm. Firstly, prior probabilities are incorporated into the MI measure in order to considerably increase the statistical power of the matching windows. These prior probabilities, which are calculated from the global joint histogram between the stereo pair, are tuned to a two level hierarchical approach. A 2D match surface, in which the match score is computed for every possible combination of template and matching window, is also utilised. This enforces left-right consistency and uniqueness constraints. These additions to MI-based stereo matching significantly enhance the algorithm's ability to detect correct matches while decreasing computation time and improving the accuracy.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"136 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131206586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Real-time MPEG2 video watermarking in the VLC domain 实时MPEG2视频水印在VLC域
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048363
Chun-Shien Lu, Jan-Ru Chen, H. M. Liao, Kuo-Chin Fan
This paper proposes a compressed domain video watermarking scheme for copyright protection. Our scheme is designed based on the concept of communications with side information. For making the real-time detection a reality, the watermark is directly embedded and detected in the VLC domain. The typical problems of video watermarking such as preservation of bit rate, video attacks, real-time detection is examined. The performance of the new watermarking scheme is examined by checking its robustness capability against attacks together with false positive analysis.
提出了一种用于版权保护的压缩域视频水印方案。我们的方案是基于侧信息通信的概念设计的。为了实现水印的实时检测,水印被直接嵌入到VLC域中进行检测。研究了视频水印的典型问题,如码率保持、视频攻击、实时检测等。通过对攻击的鲁棒性和误报分析来检验新水印方案的性能。
{"title":"Real-time MPEG2 video watermarking in the VLC domain","authors":"Chun-Shien Lu, Jan-Ru Chen, H. M. Liao, Kuo-Chin Fan","doi":"10.1109/ICPR.2002.1048363","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048363","url":null,"abstract":"This paper proposes a compressed domain video watermarking scheme for copyright protection. Our scheme is designed based on the concept of communications with side information. For making the real-time detection a reality, the watermark is directly embedded and detected in the VLC domain. The typical problems of video watermarking such as preservation of bit rate, video attacks, real-time detection is examined. The performance of the new watermarking scheme is examined by checking its robustness capability against attacks together with false positive analysis.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131867060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 38
Robust face analysis using convolutional neural networks 基于卷积神经网络的鲁棒人脸分析
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048231
B. Fasel
Automatic face analysis has to cope with pose and lighting variations. Especially pose variations are difficult to tackle and many face analysis methods require the use of sophisticated normalization procedures. We propose a data-driven face analysis approach that is not only capable of extracting features relevant to a given face analysis task but is also robust with regard to face location changes and scale variations. This is achieved by deploying convolutional neural networks, which are either trained for facial expression recognition or face identity recognition. Combining the outputs of these networks allows us to obtain a subject dependent or personalized recognition of facial expressions.
自动面部分析必须处理姿势和光照变化。特别是位姿变化很难处理,许多人脸分析方法需要使用复杂的归一化程序。我们提出了一种数据驱动的人脸分析方法,该方法不仅能够提取与给定人脸分析任务相关的特征,而且对于人脸位置变化和尺度变化也具有鲁棒性。这是通过部署卷积神经网络来实现的,卷积神经网络被训练用于面部表情识别或面部身份识别。结合这些网络的输出,我们可以获得一个对象依赖或个性化的面部表情识别。
{"title":"Robust face analysis using convolutional neural networks","authors":"B. Fasel","doi":"10.1109/ICPR.2002.1048231","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048231","url":null,"abstract":"Automatic face analysis has to cope with pose and lighting variations. Especially pose variations are difficult to tackle and many face analysis methods require the use of sophisticated normalization procedures. We propose a data-driven face analysis approach that is not only capable of extracting features relevant to a given face analysis task but is also robust with regard to face location changes and scale variations. This is achieved by deploying convolutional neural networks, which are either trained for facial expression recognition or face identity recognition. Combining the outputs of these networks allows us to obtain a subject dependent or personalized recognition of facial expressions.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"392 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133052623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 109
A fast leading eigenvector approximation for segmentation and grouping 一种用于分割和分组的快速领先特征向量逼近
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048383
A. Robles-Kelly, Sudeep Sarkar, E. Hancock
We present a fast non-iterative method for approximating the leading eigenvector so as to render graph-spectral based grouping algorithms more efficient. The approximation is based on a linear perturbation analysis and applies to matrices that are non-sparse, non-negative and symmetric. For an N/spl times/N matrix, the approximation can be implemented with complexity as low as O(4N/sup 2/). We provide a performance analysis and demonstrate the usefulness of our method on image segmentation problems.
我们提出了一种快速的非迭代方法来逼近前导特征向量,从而使基于图谱的分组算法更加高效。该近似基于线性摄动分析,适用于非稀疏、非负和对称的矩阵。对于N/spl次/N矩阵,该近似可以以低至0 (4N/sup 2/)的复杂度实现。我们提供了性能分析,并证明了我们的方法在图像分割问题上的实用性。
{"title":"A fast leading eigenvector approximation for segmentation and grouping","authors":"A. Robles-Kelly, Sudeep Sarkar, E. Hancock","doi":"10.1109/ICPR.2002.1048383","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048383","url":null,"abstract":"We present a fast non-iterative method for approximating the leading eigenvector so as to render graph-spectral based grouping algorithms more efficient. The approximation is based on a linear perturbation analysis and applies to matrices that are non-sparse, non-negative and symmetric. For an N/spl times/N matrix, the approximation can be implemented with complexity as low as O(4N/sup 2/). We provide a performance analysis and demonstrate the usefulness of our method on image segmentation problems.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131736428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Supervised segmentation of textures in backscatter images 后向散射图像纹理的监督分割
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048345
P. Paclík, R. Duin, Geert M. P. van Kempen, R. Kohlus
In this paper we present an application of statistical pattern recognition for segmentation of backscatter images (BSE) in product analysis of laundry detergents. Currently, application experts segment BSE images interactively which is both time consuming and expert dependent. We present a new, automatic procedure for supervised BSE segmentation which is trained using additional multi-spectral EDX images. Each time a new feature selection procedure is employed to find a convenient feature subset for a particular segmentation problem. The performance of the presented algorithm is evaluated using ground-truth segmentation results. It is compared with that of interactive segmentation performed by the analyst.
本文提出了一种基于统计模式识别的后向散射图像分割(BSE)在洗衣粉产品分析中的应用。目前,应用专家对疯牛病图像进行交互式分割既耗时又依赖于专家。我们提出了一种新的自动过程,用于监督BSE分割,该过程使用额外的多光谱EDX图像进行训练。每次都使用一个新的特征选择过程来为特定的分割问题找到一个方便的特征子集。利用真值分割结果对该算法的性能进行了评价。将其与分析人员进行的交互式分割进行了比较。
{"title":"Supervised segmentation of textures in backscatter images","authors":"P. Paclík, R. Duin, Geert M. P. van Kempen, R. Kohlus","doi":"10.1109/ICPR.2002.1048345","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048345","url":null,"abstract":"In this paper we present an application of statistical pattern recognition for segmentation of backscatter images (BSE) in product analysis of laundry detergents. Currently, application experts segment BSE images interactively which is both time consuming and expert dependent. We present a new, automatic procedure for supervised BSE segmentation which is trained using additional multi-spectral EDX images. Each time a new feature selection procedure is employed to find a convenient feature subset for a particular segmentation problem. The performance of the presented algorithm is evaluated using ground-truth segmentation results. It is compared with that of interactive segmentation performed by the analyst.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131936883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Using MPEG-7 descriptors in image retrieval with self-organizing maps MPEG-7描述符在自组织映射图像检索中的应用
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048485
M. Koskela, Jorma T. Laaksonen, E. Oja
The MPEG-7 standard is emerging as both a general framework for content description and a collection of specific, agreed-upon content descriptors. We have developed a neural, self-organizing technique for content-based image retrieval. In this paper we apply the visual content descriptors provided by MPEG-7 in our PicSOM system and compare our own image indexing technique with a reference method based on vector quantization. The results of our experiments show that the MPEG-7 descriptors can be used as such in the PicSOM system.
MPEG-7标准既是内容描述的通用框架,也是一组特定的、商定的内容描述符。我们开发了一种基于内容的图像检索的神经自组织技术。本文将MPEG-7提供的视觉内容描述符应用于我们的PicSOM系统,并将我们自己的图像索引技术与基于矢量量化的参考方法进行了比较。实验结果表明,MPEG-7描述符可以在PicSOM系统中使用。
{"title":"Using MPEG-7 descriptors in image retrieval with self-organizing maps","authors":"M. Koskela, Jorma T. Laaksonen, E. Oja","doi":"10.1109/ICPR.2002.1048485","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048485","url":null,"abstract":"The MPEG-7 standard is emerging as both a general framework for content description and a collection of specific, agreed-upon content descriptors. We have developed a neural, self-organizing technique for content-based image retrieval. In this paper we apply the visual content descriptors provided by MPEG-7 in our PicSOM system and compare our own image indexing technique with a reference method based on vector quantization. The results of our experiments show that the MPEG-7 descriptors can be used as such in the PicSOM system.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133966430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Motion based event recognition using HMM 基于HMM的运动事件识别
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048431
Gu Xu, Yu-Fei Ma, HongJiang Zhang, Shiqiang Yang
Motion is an important cue for video understanding and is widely used in many semantic video analyses. We present a new motion representation scheme in which motion in a video is represented by the responses of frames to a set of motion filters. Each of these filters is designed to be most responsive to a type of dominant motion. Then we employ hidden Markov models (HMMs) to characterize the motion patterns based on these features and thus classify basketball video into 16 events. The evaluation by human satisfaction rate to classification result is 75%, demonstrating effectiveness of the proposed approach to recognizing semantic events in video.
运动是视频理解的重要线索,被广泛应用于许多语义视频分析中。我们提出了一种新的运动表示方案,其中视频中的运动由帧对一组运动滤波器的响应来表示。每一个过滤器都被设计成对一种主导运动最敏感。然后利用隐马尔可夫模型(hmm)基于这些特征来描述运动模式,从而将篮球视频分为16个事件。人类对分类结果的满意率评价为75%,证明了该方法对视频中语义事件识别的有效性。
{"title":"Motion based event recognition using HMM","authors":"Gu Xu, Yu-Fei Ma, HongJiang Zhang, Shiqiang Yang","doi":"10.1109/ICPR.2002.1048431","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048431","url":null,"abstract":"Motion is an important cue for video understanding and is widely used in many semantic video analyses. We present a new motion representation scheme in which motion in a video is represented by the responses of frames to a set of motion filters. Each of these filters is designed to be most responsive to a type of dominant motion. Then we employ hidden Markov models (HMMs) to characterize the motion patterns based on these features and thus classify basketball video into 16 events. The evaluation by human satisfaction rate to classification result is 75%, demonstrating effectiveness of the proposed approach to recognizing semantic events in video.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121224503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 49
期刊
Object recognition supported by user interaction for service robots
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1