{"title":"Aggressive Action Recognition Using 3D CNN Architectures","authors":"A. Saveliev, M. Uzdiaev, D. Malov","doi":"10.1109/DeSE.2019.00165","DOIUrl":null,"url":null,"abstract":"This paper discusses an application of the transfer learning approach concerning human aggressive actions recognition task in video content. Comparative analysis of this approach was performed using various three-dimensional convolutional network architectures (3D CNN): Convolutional 3D Neural Network (C3D), Inception 3D Neural Network (I3D), Residual 3D Neural Network (R3D) based only on RGB frames processing. These 3D CNNs have trained on a composite aggressive action video dataset, that includes benchmark datasets for aggression recognition. The neural networks were evaluated in terms of accuracy, precision, recall, f1-score and loss function values metrics. The aggressive action recognition transfer learning approach using 3D CNNs showed impressive results on the considered metrics. Moreover, learning time in context of this approach was relatively short.","PeriodicalId":6632,"journal":{"name":"2019 12th International Conference on Developments in eSystems Engineering (DeSE)","volume":"1 1","pages":"890-895"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 12th International Conference on Developments in eSystems Engineering (DeSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DeSE.2019.00165","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
This paper discusses an application of the transfer learning approach concerning human aggressive actions recognition task in video content. Comparative analysis of this approach was performed using various three-dimensional convolutional network architectures (3D CNN): Convolutional 3D Neural Network (C3D), Inception 3D Neural Network (I3D), Residual 3D Neural Network (R3D) based only on RGB frames processing. These 3D CNNs have trained on a composite aggressive action video dataset, that includes benchmark datasets for aggression recognition. The neural networks were evaluated in terms of accuracy, precision, recall, f1-score and loss function values metrics. The aggressive action recognition transfer learning approach using 3D CNNs showed impressive results on the considered metrics. Moreover, learning time in context of this approach was relatively short.