{"title":"学习非负位置约束线性编码用于人体动作识别","authors":"Yuanbo Chen, Xin Guo","doi":"10.1109/VCIP.2013.6706432","DOIUrl":null,"url":null,"abstract":"Description methods based on interest points and Bag-of-Words (BOW) model have gained remarkable success in human action recognition. Despite their popularity, the existing interest point detectors always come with high computational complexity and lose their power when camera is moving. Additionally, vector quantization procedure in BOW model ignores the relationship between bases and is always with large reconstruction errors. In this paper, a spatio-temporal interest point detector based on flow vorticity is used, which can not only suppress most effects of camera motion but also provide prominent interest points around key positions of the moving foreground. Besides, by combining non-negativity constraints of patterns and average pooling function, a Non-negative Locality-constrained Linear Coding (NLLC) model is introduced into action recognition to provide better features representation than the traditional BOW model. Experimental results on two widely used action datasets demonstrate the effectiveness of the proposed approach.","PeriodicalId":407080,"journal":{"name":"2013 Visual Communications and Image Processing (VCIP)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Learning non-negative locality-constrained Linear Coding for human action recognition\",\"authors\":\"Yuanbo Chen, Xin Guo\",\"doi\":\"10.1109/VCIP.2013.6706432\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Description methods based on interest points and Bag-of-Words (BOW) model have gained remarkable success in human action recognition. Despite their popularity, the existing interest point detectors always come with high computational complexity and lose their power when camera is moving. Additionally, vector quantization procedure in BOW model ignores the relationship between bases and is always with large reconstruction errors. In this paper, a spatio-temporal interest point detector based on flow vorticity is used, which can not only suppress most effects of camera motion but also provide prominent interest points around key positions of the moving foreground. Besides, by combining non-negativity constraints of patterns and average pooling function, a Non-negative Locality-constrained Linear Coding (NLLC) model is introduced into action recognition to provide better features representation than the traditional BOW model. Experimental results on two widely used action datasets demonstrate the effectiveness of the proposed approach.\",\"PeriodicalId\":407080,\"journal\":{\"name\":\"2013 Visual Communications and Image Processing (VCIP)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 Visual Communications and Image Processing (VCIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/VCIP.2013.6706432\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 Visual Communications and Image Processing (VCIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VCIP.2013.6706432","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
摘要
基于兴趣点和词袋模型的描述方法在人体动作识别中取得了显著的成功。现有的兴趣点检测器虽然很受欢迎,但计算复杂度高,且在摄像机移动时性能下降。此外,BOW模型的矢量量化过程忽略了基间的关系,重构误差较大。本文提出了一种基于流涡度的时空兴趣点检测器,该检测器不仅可以抑制摄像机运动的大部分影响,而且可以在运动前景的关键位置周围提供突出的兴趣点。结合模式的非负性约束和平均池化函数,将非负性位置约束线性编码(Non-negative Locality-constrained Linear Coding, NLLC)模型引入到动作识别中,以提供比传统BOW模型更好的特征表示。在两个广泛使用的动作数据集上的实验结果证明了该方法的有效性。
Learning non-negative locality-constrained Linear Coding for human action recognition
Description methods based on interest points and Bag-of-Words (BOW) model have gained remarkable success in human action recognition. Despite their popularity, the existing interest point detectors always come with high computational complexity and lose their power when camera is moving. Additionally, vector quantization procedure in BOW model ignores the relationship between bases and is always with large reconstruction errors. In this paper, a spatio-temporal interest point detector based on flow vorticity is used, which can not only suppress most effects of camera motion but also provide prominent interest points around key positions of the moving foreground. Besides, by combining non-negativity constraints of patterns and average pooling function, a Non-negative Locality-constrained Linear Coding (NLLC) model is introduced into action recognition to provide better features representation than the traditional BOW model. Experimental results on two widely used action datasets demonstrate the effectiveness of the proposed approach.