Kun Chen , Wenhao Ruan , Quan Liu , Qingsong Ai , Li Ma
{"title":"A novel deep learning model combining 3DCNN-CapsNet and hierarchical attention mechanism for EEG emotion recognition","authors":"Kun Chen , Wenhao Ruan , Quan Liu , Qingsong Ai , Li Ma","doi":"10.1016/j.neunet.2025.107267","DOIUrl":null,"url":null,"abstract":"<div><div>Emotion recognition plays a key role in the field of human–computer interaction. Classifying and predicting human emotions using electroencephalogram (EEG) signals has consistently been a challenging research area. Recently, with the increasing application of deep learning methods such as convolutional neural network (CNN) and channel attention mechanism (CA). The recognition accuracy of emotion recognition methods has already reached an outstanding level. However, CNN and its derivatives have the defect that the sensory field of view is small and can only extract local features. The traditional channel attention mechanism only focuses on the correlation between different channels and assigns weights to each channel according to its contribution to the emotion recognition task, ignoring the fact that different EEG frequency bands in the same channel signal also have different contributions to the task. To address the above-mentioned problems , this paper propose HA-CapsNet, a novel end-to-end model combining 3DCNN-CapsNet with a Hierarchical Attention mechanism. This model captures both inter-channel correlations and the contribution of each frequency band. Additionally, the capsule network in 3DCNN-CapsNet extracts more spatial feature information compared to conventional CNNs. Our HA-CapsNet achieves recognition accuracies of 97.40%, 97.20%, and 97.60% on the DEAP dataset, and 95.80%, 96.10%, and 96.30% on the DREAMER dataset, outperforming state-of-the-art methods with the smallest variance. Furthermore, experiments removing channels from the DEAP and DREAMER datasets in ascending order of their hierarchical attention weights showed that even with fewer channels, the model maintained strong recognition performance. This demonstrates HA-CapsNet’s low dependence on large datasets and its suitability for lightweight EEG devices, promoting advancements in EEG device development.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"186 ","pages":"Article 107267"},"PeriodicalIF":6.0000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0893608025001467","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Emotion recognition plays a key role in the field of human–computer interaction. Classifying and predicting human emotions using electroencephalogram (EEG) signals has consistently been a challenging research area. Recently, with the increasing application of deep learning methods such as convolutional neural network (CNN) and channel attention mechanism (CA). The recognition accuracy of emotion recognition methods has already reached an outstanding level. However, CNN and its derivatives have the defect that the sensory field of view is small and can only extract local features. The traditional channel attention mechanism only focuses on the correlation between different channels and assigns weights to each channel according to its contribution to the emotion recognition task, ignoring the fact that different EEG frequency bands in the same channel signal also have different contributions to the task. To address the above-mentioned problems , this paper propose HA-CapsNet, a novel end-to-end model combining 3DCNN-CapsNet with a Hierarchical Attention mechanism. This model captures both inter-channel correlations and the contribution of each frequency band. Additionally, the capsule network in 3DCNN-CapsNet extracts more spatial feature information compared to conventional CNNs. Our HA-CapsNet achieves recognition accuracies of 97.40%, 97.20%, and 97.60% on the DEAP dataset, and 95.80%, 96.10%, and 96.30% on the DREAMER dataset, outperforming state-of-the-art methods with the smallest variance. Furthermore, experiments removing channels from the DEAP and DREAMER datasets in ascending order of their hierarchical attention weights showed that even with fewer channels, the model maintained strong recognition performance. This demonstrates HA-CapsNet’s low dependence on large datasets and its suitability for lightweight EEG devices, promoting advancements in EEG device development.
期刊介绍:
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.