{"title":"Person-Frame Dynamic Feature Graph Network for Group Activity Recognition","authors":"Dongli Wang, JiaLiu, Yan Zhou","doi":"10.1109/ACAIT56212.2022.10137953","DOIUrl":null,"url":null,"abstract":"Dynamic modeling of different dimensional features in video is the key element of group activity recognition. In the past years, a lot of work has been devoted to the modeling of character features, these methods have achieved good results, but most of them ignore that group activity is a continuous motion closely related to the scene, and underestimated the importance of the relationship between frames. This paper proposes a Person-Frame Dynamic Feature Graph Network to model group activity information from two levels: video frame level and individual level: Temporal Semantic sub-Graph (TSG) channel constructs temporal semantic relation subgraph for video frame features, and Person-level Dynamic Feature Map (PDFM) models personal dynamic characteristics. In addition, in order to alleviate the problem of slow training speed of group activity model, we use lightweight mobilenet-v2 as the backbone, and embed the Initial Feature Preprocessing Module (IFPM) in it to improve the training efficiency while maintaining the recognition accuracy. A lot of experiments have been done on this model with the most widely used dataset in the field of group activity recognition, and excellent results are obtained, which proves the effectiveness of the model.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACAIT56212.2022.10137953","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Dynamic modeling of different dimensional features in video is the key element of group activity recognition. In the past years, a lot of work has been devoted to the modeling of character features, these methods have achieved good results, but most of them ignore that group activity is a continuous motion closely related to the scene, and underestimated the importance of the relationship between frames. This paper proposes a Person-Frame Dynamic Feature Graph Network to model group activity information from two levels: video frame level and individual level: Temporal Semantic sub-Graph (TSG) channel constructs temporal semantic relation subgraph for video frame features, and Person-level Dynamic Feature Map (PDFM) models personal dynamic characteristics. In addition, in order to alleviate the problem of slow training speed of group activity model, we use lightweight mobilenet-v2 as the backbone, and embed the Initial Feature Preprocessing Module (IFPM) in it to improve the training efficiency while maintaining the recognition accuracy. A lot of experiments have been done on this model with the most widely used dataset in the field of group activity recognition, and excellent results are obtained, which proves the effectiveness of the model.