{"title":"基于R2.5D-GRU网络的动作识别","authors":"Xiaolin Ma, Yuying Xiao","doi":"10.1109/IMCEC51613.2021.9482115","DOIUrl":null,"url":null,"abstract":"The field of body action recognition is a research hotspot in computer vision. Due to the complex calculation process of traditional recognition algorithms and the limitations of the data set to be processed, action recognition algorithms based on deep learning have gradually attracted attention. Various network frameworks have been proposed, which greatly improved the recognition Accuracy. In view of some problems in the action recognition algorithm of deep learning at this stage, this paper proposes a new R2.5D-GRU network. First, the 3D convolution is decomposed into a two-dimensional spatial convolution and a one-dimensional time convolution, and the low-level spatio-temporal features are extracted, and the high-level temporal features are extracted using GRU for temporal modeling. Experimental results show that the algorithm proposed in this paper performs better than some existing mainstream algorithms in the UCF101 data set.","PeriodicalId":240400,"journal":{"name":"2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)","volume":"10 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Action recognition based on R2.5D-GRU networks\",\"authors\":\"Xiaolin Ma, Yuying Xiao\",\"doi\":\"10.1109/IMCEC51613.2021.9482115\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The field of body action recognition is a research hotspot in computer vision. Due to the complex calculation process of traditional recognition algorithms and the limitations of the data set to be processed, action recognition algorithms based on deep learning have gradually attracted attention. Various network frameworks have been proposed, which greatly improved the recognition Accuracy. In view of some problems in the action recognition algorithm of deep learning at this stage, this paper proposes a new R2.5D-GRU network. First, the 3D convolution is decomposed into a two-dimensional spatial convolution and a one-dimensional time convolution, and the low-level spatio-temporal features are extracted, and the high-level temporal features are extracted using GRU for temporal modeling. Experimental results show that the algorithm proposed in this paper performs better than some existing mainstream algorithms in the UCF101 data set.\",\"PeriodicalId\":240400,\"journal\":{\"name\":\"2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)\",\"volume\":\"10 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IMCEC51613.2021.9482115\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMCEC51613.2021.9482115","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The field of body action recognition is a research hotspot in computer vision. Due to the complex calculation process of traditional recognition algorithms and the limitations of the data set to be processed, action recognition algorithms based on deep learning have gradually attracted attention. Various network frameworks have been proposed, which greatly improved the recognition Accuracy. In view of some problems in the action recognition algorithm of deep learning at this stage, this paper proposes a new R2.5D-GRU network. First, the 3D convolution is decomposed into a two-dimensional spatial convolution and a one-dimensional time convolution, and the low-level spatio-temporal features are extracted, and the high-level temporal features are extracted using GRU for temporal modeling. Experimental results show that the algorithm proposed in this paper performs better than some existing mainstream algorithms in the UCF101 data set.