利用微调 CNN-LSTM 进行人类活动识别

Journal of Electrical Engineering Pub Date : 2024-02-01 DOI:10.2478/jee-2024-0002

Erdal Genc, M. E. Yıldırım, Y. B. Salman

{"title":"利用微调 CNN-LSTM 进行人类活动识别","authors":"Erdal Genc, M. E. Yıldırım, Y. B. Salman","doi":"10.2478/jee-2024-0002","DOIUrl":null,"url":null,"abstract":"\n Human activity recognition (HAR) by deep learning is a challenging and interesting topic. Although there are robust models, there is also a bunch of parameters and variables, which affect the performance such as the number of layers, pooling type. This study presents a new deep learning architecture that is obtained by fine-tuning of the conventional CNN-LSTM model, namely, CNN (+3)-LSTM. Three changes are made to the conventional model to increase the accuracy. Firstly, kernel size is set to 1×1 to extract more information. Secondly, three convolutional layers are added to the model. Lastly, average pooling is used instead of max-pooling. Performance analysis of the proposed model is conducted on the KTH dataset and implemented on Keras. In addition to the overall accuracy of the proposed model, the contribution of each change is observed individually. Results show that adding layers made the highest contribution followed by kernel size and pooling, respectively. The proposed model is compared with state-of-art and outperformed some of the recent studies with a 94.1% recognition rate.","PeriodicalId":508697,"journal":{"name":"Journal of Electrical Engineering","volume":"41 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Human activity recognition with fine-tuned CNN-LSTM\",\"authors\":\"Erdal Genc, M. E. Yıldırım, Y. B. Salman\",\"doi\":\"10.2478/jee-2024-0002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Human activity recognition (HAR) by deep learning is a challenging and interesting topic. Although there are robust models, there is also a bunch of parameters and variables, which affect the performance such as the number of layers, pooling type. This study presents a new deep learning architecture that is obtained by fine-tuning of the conventional CNN-LSTM model, namely, CNN (+3)-LSTM. Three changes are made to the conventional model to increase the accuracy. Firstly, kernel size is set to 1×1 to extract more information. Secondly, three convolutional layers are added to the model. Lastly, average pooling is used instead of max-pooling. Performance analysis of the proposed model is conducted on the KTH dataset and implemented on Keras. In addition to the overall accuracy of the proposed model, the contribution of each change is observed individually. Results show that adding layers made the highest contribution followed by kernel size and pooling, respectively. The proposed model is compared with state-of-art and outperformed some of the recent studies with a 94.1% recognition rate.\",\"PeriodicalId\":508697,\"journal\":{\"name\":\"Journal of Electrical Engineering\",\"volume\":\"41 6\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Electrical Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2478/jee-2024-0002\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/jee-2024-0002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

利用深度学习进行人类活动识别（HAR）是一个具有挑战性的有趣课题。虽然有稳健的模型，但也有一堆参数和变量会影响性能，如层数、池类型等。本研究提出了一种新的深度学习架构，它是通过微调传统的 CNN-LSTM 模型获得的，即 CNN (+3)-LSTM 模型。为提高准确性，对传统模型做了三处改动。首先，内核大小设为 1×1，以提取更多信息。其次，在模型中增加了三个卷积层。最后，使用平均池化代替最大池化。我们在 KTH 数据集上对所提模型进行了性能分析，并在 Keras 上实现了该模型。除了对所提模型的整体准确性进行分析外，还对每项变化的贡献进行了单独观察。结果显示，增加层数的贡献最大，其次分别是内核大小和池化。所提出的模型与最先进的模型进行了比较，其识别率达到 94.1%，优于最近的一些研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Human activity recognition with fine-tuned CNN-LSTM

Human activity recognition (HAR) by deep learning is a challenging and interesting topic. Although there are robust models, there is also a bunch of parameters and variables, which affect the performance such as the number of layers, pooling type. This study presents a new deep learning architecture that is obtained by fine-tuning of the conventional CNN-LSTM model, namely, CNN (+3)-LSTM. Three changes are made to the conventional model to increase the accuracy. Firstly, kernel size is set to 1×1 to extract more information. Secondly, three convolutional layers are added to the model. Lastly, average pooling is used instead of max-pooling. Performance analysis of the proposed model is conducted on the KTH dataset and implemented on Keras. In addition to the overall accuracy of the proposed model, the contribution of each change is observed individually. Results show that adding layers made the highest contribution followed by kernel size and pooling, respectively. The proposed model is compared with state-of-art and outperformed some of the recent studies with a 94.1% recognition rate.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Electrical Engineering

自引率

0.00%

发文量