{"title":"Video Analysis For Human Action Recognition Using Deep Convolutional Neural Networks","authors":"Nehal N. Mostafa, M. F. Alrahmawy, O. Nomair","doi":"10.21608/mjcis.2018.311989","DOIUrl":null,"url":null,"abstract":"In the last few years, human action recognition potential applications have been studied in many fields such as robotics, human computer interaction, and video surveillance systems and it has been evaluated as an active research area. This paper presents a recognition system using deep learning to recognize and identify human actions from video input. The proposed system has been fine-tuned by partial training and dropout of the classification layer of Alexnet and replacing it by another one that use SVM. The performance of the network is boosted by using key frames that were extracted via applying Kalman filter during dataset augmentation. The proposed system resulted in oromising performance compared to the state of the art approaches. The classification accuracy reached 92.35%.","PeriodicalId":253950,"journal":{"name":"Mansoura Journal for Computer and Information Sciences","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mansoura Journal for Computer and Information Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21608/mjcis.2018.311989","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In the last few years, human action recognition potential applications have been studied in many fields such as robotics, human computer interaction, and video surveillance systems and it has been evaluated as an active research area. This paper presents a recognition system using deep learning to recognize and identify human actions from video input. The proposed system has been fine-tuned by partial training and dropout of the classification layer of Alexnet and replacing it by another one that use SVM. The performance of the network is boosted by using key frames that were extracted via applying Kalman filter during dataset augmentation. The proposed system resulted in oromising performance compared to the state of the art approaches. The classification accuracy reached 92.35%.