Bin Jiang , Hao Wu , Qingling Xia , Gen Li , Hanguang Xiao , Yun Zhao
{"title":"NKDFF-CNN: A convolutional neural network with narrow kernel and dual-view feature fusion for multitype gesture recognition based on sEMG","authors":"Bin Jiang , Hao Wu , Qingling Xia , Gen Li , Hanguang Xiao , Yun Zhao","doi":"10.1016/j.dsp.2024.104772","DOIUrl":null,"url":null,"abstract":"<div><p>Deep learning algorithms have been widely applied to gesture recognition based on multi-channel surface electromyography (sEMG). However, the limitations in feature extraction capabilities of existing algorithms have restricted the performance of multitype gesture recognition. To address this challenge, we propose a novel sEMG-based gesture recognition algorithm, namely, Narrow Kernel and Dual-view Feature Fusion Convolutional Neural Network (NKDFF-CNN). Firstly, to overcome the issue of traditional square kernel convolution operation, which loses channel independence features, we employ the narrow kernel convolution in the model to learn time-related features in each independent channel of sEMG, resulting in obtaining representative correlation information between specific muscles and gestures. Then, the dual-view structure is used to capture both shallow and deep features, which are fused at the decision level. Thus, the multi-dimensional feature information is extracted. The NKDFF-CNN is further extended to ACC<img>NKDFF-CNN by introducing acceleration signals for multimodal feature integration. Experimental validation on the NinaPro DB2 dataset demonstrates the superior classification performance of NKDFF-CNN, achieving 88.03 % accuracy for 49 hand gestures, outperforming other state-of-the-art MSFF-net. In addition, the ACC<img>NKDFF-CNN model with multimodal feature information significantly improved the accuracy to 95.25 %. We also validated the proposed NKDFF-CNN on NinaPro DB3 with the disabled subjects and the NinaPro DB4 with healthy subjects. The results showcased that the NKDFF-CNN achieved advanced accuracies of 70.58 % and 85.91 % for the multitype hand gestures classification, respectively, showing the high generalization ability of the proposed model. As a consequence, the proposed NKDFF-CNN method achieved superior recognition performance in both accuracy and generality compared to other advanced models. Thus, it provides a reliable algorithm for research in fields such as rehabilitative medicine.</p></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104772"},"PeriodicalIF":2.9000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S105120042400397X","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Deep learning algorithms have been widely applied to gesture recognition based on multi-channel surface electromyography (sEMG). However, the limitations in feature extraction capabilities of existing algorithms have restricted the performance of multitype gesture recognition. To address this challenge, we propose a novel sEMG-based gesture recognition algorithm, namely, Narrow Kernel and Dual-view Feature Fusion Convolutional Neural Network (NKDFF-CNN). Firstly, to overcome the issue of traditional square kernel convolution operation, which loses channel independence features, we employ the narrow kernel convolution in the model to learn time-related features in each independent channel of sEMG, resulting in obtaining representative correlation information between specific muscles and gestures. Then, the dual-view structure is used to capture both shallow and deep features, which are fused at the decision level. Thus, the multi-dimensional feature information is extracted. The NKDFF-CNN is further extended to ACCNKDFF-CNN by introducing acceleration signals for multimodal feature integration. Experimental validation on the NinaPro DB2 dataset demonstrates the superior classification performance of NKDFF-CNN, achieving 88.03 % accuracy for 49 hand gestures, outperforming other state-of-the-art MSFF-net. In addition, the ACCNKDFF-CNN model with multimodal feature information significantly improved the accuracy to 95.25 %. We also validated the proposed NKDFF-CNN on NinaPro DB3 with the disabled subjects and the NinaPro DB4 with healthy subjects. The results showcased that the NKDFF-CNN achieved advanced accuracies of 70.58 % and 85.91 % for the multitype hand gestures classification, respectively, showing the high generalization ability of the proposed model. As a consequence, the proposed NKDFF-CNN method achieved superior recognition performance in both accuracy and generality compared to other advanced models. Thus, it provides a reliable algorithm for research in fields such as rehabilitative medicine.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,