{"title":"利用微调 CNN-LSTM 进行人类活动识别","authors":"Erdal Genc, M. E. Yıldırım, Y. B. Salman","doi":"10.2478/jee-2024-0002","DOIUrl":null,"url":null,"abstract":"\n Human activity recognition (HAR) by deep learning is a challenging and interesting topic. Although there are robust models, there is also a bunch of parameters and variables, which affect the performance such as the number of layers, pooling type. This study presents a new deep learning architecture that is obtained by fine-tuning of the conventional CNN-LSTM model, namely, CNN (+3)-LSTM. Three changes are made to the conventional model to increase the accuracy. Firstly, kernel size is set to 1×1 to extract more information. Secondly, three convolutional layers are added to the model. Lastly, average pooling is used instead of max-pooling. Performance analysis of the proposed model is conducted on the KTH dataset and implemented on Keras. In addition to the overall accuracy of the proposed model, the contribution of each change is observed individually. Results show that adding layers made the highest contribution followed by kernel size and pooling, respectively. The proposed model is compared with state-of-art and outperformed some of the recent studies with a 94.1% recognition rate.","PeriodicalId":508697,"journal":{"name":"Journal of Electrical Engineering","volume":"41 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Human activity recognition with fine-tuned CNN-LSTM\",\"authors\":\"Erdal Genc, M. E. Yıldırım, Y. B. Salman\",\"doi\":\"10.2478/jee-2024-0002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Human activity recognition (HAR) by deep learning is a challenging and interesting topic. Although there are robust models, there is also a bunch of parameters and variables, which affect the performance such as the number of layers, pooling type. This study presents a new deep learning architecture that is obtained by fine-tuning of the conventional CNN-LSTM model, namely, CNN (+3)-LSTM. Three changes are made to the conventional model to increase the accuracy. Firstly, kernel size is set to 1×1 to extract more information. Secondly, three convolutional layers are added to the model. Lastly, average pooling is used instead of max-pooling. Performance analysis of the proposed model is conducted on the KTH dataset and implemented on Keras. In addition to the overall accuracy of the proposed model, the contribution of each change is observed individually. Results show that adding layers made the highest contribution followed by kernel size and pooling, respectively. The proposed model is compared with state-of-art and outperformed some of the recent studies with a 94.1% recognition rate.\",\"PeriodicalId\":508697,\"journal\":{\"name\":\"Journal of Electrical Engineering\",\"volume\":\"41 6\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Electrical Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2478/jee-2024-0002\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/jee-2024-0002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Human activity recognition with fine-tuned CNN-LSTM
Human activity recognition (HAR) by deep learning is a challenging and interesting topic. Although there are robust models, there is also a bunch of parameters and variables, which affect the performance such as the number of layers, pooling type. This study presents a new deep learning architecture that is obtained by fine-tuning of the conventional CNN-LSTM model, namely, CNN (+3)-LSTM. Three changes are made to the conventional model to increase the accuracy. Firstly, kernel size is set to 1×1 to extract more information. Secondly, three convolutional layers are added to the model. Lastly, average pooling is used instead of max-pooling. Performance analysis of the proposed model is conducted on the KTH dataset and implemented on Keras. In addition to the overall accuracy of the proposed model, the contribution of each change is observed individually. Results show that adding layers made the highest contribution followed by kernel size and pooling, respectively. The proposed model is compared with state-of-art and outperformed some of the recent studies with a 94.1% recognition rate.