Mutian Liu, Banghua Yang, Lin Meng, Yonghuai Zhang, Shouwei Gao, Peng Zan, Xinxing Xia
STA-Net: Spatial–temporal alignment network for hybrid EEG-fNIRS decoding
Information Fusion, vol. 119, Article 103023. Published 2025-02-15. DOI: 10.1016/j.inffus.2025.103023
Source: https://www.sciencedirect.com/science/article/pii/S156625352500096X
Abstract
Hybrid brain–computer interfaces (BCIs) have attracted attention because they can overcome the limitations of single-modality BCIs. Effective fusion methods are needed to exploit both the high temporal resolution of electroencephalography (EEG) and the high spatial resolution of functional near-infrared spectroscopy (fNIRS). We propose an end-to-end Spatial–Temporal Alignment Network (STA-Net) that aligns EEG and fNIRS in both space and time. STA-Net comprises two sub-layers: the fNIRS-guided Spatial Alignment (FGSA) layer and the EEG-guided Temporal Alignment (EGTA) layer. The FGSA layer computes spatial attention maps from fNIRS to identify sensitive brain regions and spatially aligns EEG with fNIRS by weighting the EEG channels. The EGTA layer generates temporal attention maps with a cross-attention mechanism, producing fNIRS signals that are temporally aligned with EEG; this resolves the temporal mismatch caused by the inherent delay of fNIRS. Finally, the spatio-temporally aligned EEG-fNIRS signals are fused to classify mental tasks: motor imagery (MI), mental arithmetic (MA), and word generation (WG). In subject-specific evaluations, STA-Net achieves average accuracies of 69.65% for MI, 85.14% for MA, and 79.03% for WG, outperforming state-of-the-art single-modality and multi-modality algorithms. Moreover, STA-Net shows less performance degradation in the early stages of tasks than the benchmark methods. Spatial–temporal alignment between EEG and fNIRS thus improves the decoding performance of hybrid EEG-fNIRS BCIs, and STA-Net has the potential to serve as a new backbone for them. The code is available at https://github.com/MutianLiu-SHU/STA-Net.
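The two alignment ideas described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation (see their repository for that); it only shows the general shape of fNIRS-derived channel weighting and EEG-queried cross-attention. The function names, the `proj` matrix mapping fNIRS channels to EEG channels, the mean-absolute-amplitude activity measure, and all array shapes are assumptions introduced for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def fnirs_guided_spatial_weights(fnirs, proj):
    """Derive one weight per EEG channel from fNIRS activity.

    fnirs: (n_fnirs_channels, n_fnirs_samples)
    proj:  (n_eeg_channels, n_fnirs_channels), a hypothetical mapping
           from fNIRS channels to nearby EEG channels (assumption).
    """
    activity = np.abs(fnirs).mean(axis=1)   # per-fNIRS-channel activity
    scores = proj @ activity                # one score per EEG channel
    return softmax(scores)                  # weights sum to 1


def eeg_guided_temporal_alignment(eeg_feat, fnirs_feat):
    """Cross-attention with EEG as queries and fNIRS as keys and values.

    eeg_feat:   (n_eeg_steps, d)
    fnirs_feat: (n_fnirs_steps, d)
    Returns fNIRS features resampled onto the EEG time axis, plus the
    temporal attention map of shape (n_eeg_steps, n_fnirs_steps).
    """
    d = eeg_feat.shape[1]
    attn = softmax(eeg_feat @ fnirs_feat.T / np.sqrt(d), axis=1)
    return attn @ fnirs_feat, attn


def align_and_fuse(eeg, fnirs, proj, w_eeg, w_fnirs):
    """Weight EEG channels, align fNIRS in time, then concatenate.

    w_eeg and w_fnirs are hypothetical linear embeddings that map each
    modality's channels into a shared d-dimensional feature space.
    """
    weights = fnirs_guided_spatial_weights(fnirs, proj)
    eeg_weighted = weights[:, None] * eeg            # scale each EEG channel
    eeg_feat = eeg_weighted.T @ w_eeg                # (n_eeg_steps, d)
    fnirs_feat = fnirs.T @ w_fnirs                   # (n_fnirs_steps, d)
    fnirs_aligned, attn = eeg_guided_temporal_alignment(eeg_feat, fnirs_feat)
    fused = np.concatenate([eeg_feat, fnirs_aligned], axis=1)
    return fused, weights, attn
```

With random inputs of, say, 8 EEG channels over 50 time steps and 6 fNIRS channels over 10 time steps, `align_and_fuse` returns a fused array with one row per EEG time step. In a trained network the projection and embedding matrices would be learned rather than fixed, and the fused features would feed a classifier for the MI, MA, or WG task.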
About the journal:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.