V. Hemamalini, D. Jayasutha, V. R. Vinothini, R. Manjula Devi, Arun Kumar, E. Anitha
{"title":"Innovative Video Classification Method Based on Deep Learning Approach","authors":"V. Hemamalini, D. Jayasutha, V. R. Vinothini, R. Manjula Devi, Arun Kumar, E. Anitha","doi":"10.2174/0118722121248139231023111754","DOIUrl":null,"url":null,"abstract":"Background: The method includes: receiving a set of video data and labeling it into categories, segmenting the received videos into N segments, randomly selecting M frames for each video segment in the training phase, concatenating the video images into multi-channel images, and rolling. Methods: This work was developed in the Python programming language using the Keras library with Tensorflow as the back-end. The objective is to develop a network that presents performance compatible with the state of the art in terms of classifying videos according to the actions taken. Results: Given the hardware limitations, there is considerable distance between the implementation possibilities in this work and what is known as the state-of-the-art. Conclusion: Throughout the work, some aspects in which this limitation influenced the development are presented, but it is shown that this realization is feasible and that obtaining expressive results is possible. 98.6% accuracy is obtained in the UCF101 data set, compared to the 98 percentage points of the best result ever reported, using, however, considerably fewer resources. In addition, the importance of transfer learning in achieving expressive results as well as the different performances of each architecture are reviewed. Thus, this work may open doors to carry patent- based outcomes.","PeriodicalId":40022,"journal":{"name":"Recent Patents on Engineering","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Recent Patents on Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/0118722121248139231023111754","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The method includes: receiving a set of video data and labeling it into categories, segmenting the received videos into N segments, randomly selecting M frames for each video segment in the training phase, concatenating the video images into multi-channel images, and rolling. Methods: This work was developed in the Python programming language using the Keras library with Tensorflow as the back-end. The objective is to develop a network that presents performance compatible with the state of the art in terms of classifying videos according to the actions taken. Results: Given the hardware limitations, there is considerable distance between the implementation possibilities in this work and what is known as the state-of-the-art. Conclusion: Throughout the work, some aspects in which this limitation influenced the development are presented, but it is shown that this realization is feasible and that obtaining expressive results is possible. 98.6% accuracy is obtained in the UCF101 data set, compared to the 98 percentage points of the best result ever reported, using, however, considerably fewer resources. In addition, the importance of transfer learning in achieving expressive results as well as the different performances of each architecture are reviewed. Thus, this work may open doors to carry patent- based outcomes.
期刊介绍:
Recent Patents on Engineering publishes review articles by experts on recent patents in the major fields of engineering. A selection of important and recent patents on engineering is also included in the journal. The journal is essential reading for all researchers involved in engineering sciences.