基于R2.5D-GRU网络的动作识别

2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC) Pub Date : 2021-06-18 DOI:10.1109/IMCEC51613.2021.9482115

Xiaolin Ma, Yuying Xiao

{"title":"基于R2.5D-GRU网络的动作识别","authors":"Xiaolin Ma, Yuying Xiao","doi":"10.1109/IMCEC51613.2021.9482115","DOIUrl":null,"url":null,"abstract":"The field of body action recognition is a research hotspot in computer vision. Due to the complex calculation process of traditional recognition algorithms and the limitations of the data set to be processed, action recognition algorithms based on deep learning have gradually attracted attention. Various network frameworks have been proposed, which greatly improved the recognition Accuracy. In view of some problems in the action recognition algorithm of deep learning at this stage, this paper proposes a new R2.5D-GRU network. First, the 3D convolution is decomposed into a two-dimensional spatial convolution and a one-dimensional time convolution, and the low-level spatio-temporal features are extracted, and the high-level temporal features are extracted using GRU for temporal modeling. Experimental results show that the algorithm proposed in this paper performs better than some existing mainstream algorithms in the UCF101 data set.","PeriodicalId":240400,"journal":{"name":"2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)","volume":"10 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Action recognition based on R2.5D-GRU networks\",\"authors\":\"Xiaolin Ma, Yuying Xiao\",\"doi\":\"10.1109/IMCEC51613.2021.9482115\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The field of body action recognition is a research hotspot in computer vision. Due to the complex calculation process of traditional recognition algorithms and the limitations of the data set to be processed, action recognition algorithms based on deep learning have gradually attracted attention. Various network frameworks have been proposed, which greatly improved the recognition Accuracy. In view of some problems in the action recognition algorithm of deep learning at this stage, this paper proposes a new R2.5D-GRU network. First, the 3D convolution is decomposed into a two-dimensional spatial convolution and a one-dimensional time convolution, and the low-level spatio-temporal features are extracted, and the high-level temporal features are extracted using GRU for temporal modeling. Experimental results show that the algorithm proposed in this paper performs better than some existing mainstream algorithms in the UCF101 data set.\",\"PeriodicalId\":240400,\"journal\":{\"name\":\"2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)\",\"volume\":\"10 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IMCEC51613.2021.9482115\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMCEC51613.2021.9482115","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

人体动作识别是计算机视觉领域的一个研究热点。由于传统识别算法计算过程复杂以及处理数据集的局限性，基于深度学习的动作识别算法逐渐受到关注。提出了多种网络框架，大大提高了识别精度。针对现阶段深度学习动作识别算法中存在的一些问题，本文提出了一种新的R2.5D-GRU网络。首先，将三维卷积分解为二维空间卷积和一维时间卷积，提取低层时空特征，并利用GRU提取高层时间特征进行时间建模;实验结果表明，本文提出的算法在UCF101数据集上的性能优于现有的主流算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Action recognition based on R2.5D-GRU networks

The field of body action recognition is a research hotspot in computer vision. Due to the complex calculation process of traditional recognition algorithms and the limitations of the data set to be processed, action recognition algorithms based on deep learning have gradually attracted attention. Various network frameworks have been proposed, which greatly improved the recognition Accuracy. In view of some problems in the action recognition algorithm of deep learning at this stage, this paper proposes a new R2.5D-GRU network. First, the 3D convolution is decomposed into a two-dimensional spatial convolution and a one-dimensional time convolution, and the low-level spatio-temporal features are extracted, and the high-level temporal features are extracted using GRU for temporal modeling. Experimental results show that the algorithm proposed in this paper performs better than some existing mainstream algorithms in the UCF101 data set.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)

自引率

0.00%

发文量