{"title":"使用3D CNN架构的攻击性动作识别","authors":"A. Saveliev, M. Uzdiaev, D. Malov","doi":"10.1109/DeSE.2019.00165","DOIUrl":null,"url":null,"abstract":"This paper discusses an application of the transfer learning approach concerning human aggressive actions recognition task in video content. Comparative analysis of this approach was performed using various three-dimensional convolutional network architectures (3D CNN): Convolutional 3D Neural Network (C3D), Inception 3D Neural Network (I3D), Residual 3D Neural Network (R3D) based only on RGB frames processing. These 3D CNNs have trained on a composite aggressive action video dataset, that includes benchmark datasets for aggression recognition. The neural networks were evaluated in terms of accuracy, precision, recall, f1-score and loss function values metrics. The aggressive action recognition transfer learning approach using 3D CNNs showed impressive results on the considered metrics. Moreover, learning time in context of this approach was relatively short.","PeriodicalId":6632,"journal":{"name":"2019 12th International Conference on Developments in eSystems Engineering (DeSE)","volume":"1 1","pages":"890-895"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Aggressive Action Recognition Using 3D CNN Architectures\",\"authors\":\"A. Saveliev, M. Uzdiaev, D. Malov\",\"doi\":\"10.1109/DeSE.2019.00165\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper discusses an application of the transfer learning approach concerning human aggressive actions recognition task in video content. Comparative analysis of this approach was performed using various three-dimensional convolutional network architectures (3D CNN): Convolutional 3D Neural Network (C3D), Inception 3D Neural Network (I3D), Residual 3D Neural Network (R3D) based only on RGB frames processing. These 3D CNNs have trained on a composite aggressive action video dataset, that includes benchmark datasets for aggression recognition. The neural networks were evaluated in terms of accuracy, precision, recall, f1-score and loss function values metrics. The aggressive action recognition transfer learning approach using 3D CNNs showed impressive results on the considered metrics. Moreover, learning time in context of this approach was relatively short.\",\"PeriodicalId\":6632,\"journal\":{\"name\":\"2019 12th International Conference on Developments in eSystems Engineering (DeSE)\",\"volume\":\"1 1\",\"pages\":\"890-895\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 12th International Conference on Developments in eSystems Engineering (DeSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DeSE.2019.00165\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 12th International Conference on Developments in eSystems Engineering (DeSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DeSE.2019.00165","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Aggressive Action Recognition Using 3D CNN Architectures
This paper discusses an application of the transfer learning approach concerning human aggressive actions recognition task in video content. Comparative analysis of this approach was performed using various three-dimensional convolutional network architectures (3D CNN): Convolutional 3D Neural Network (C3D), Inception 3D Neural Network (I3D), Residual 3D Neural Network (R3D) based only on RGB frames processing. These 3D CNNs have trained on a composite aggressive action video dataset, that includes benchmark datasets for aggression recognition. The neural networks were evaluated in terms of accuracy, precision, recall, f1-score and loss function values metrics. The aggressive action recognition transfer learning approach using 3D CNNs showed impressive results on the considered metrics. Moreover, learning time in context of this approach was relatively short.