Xin Wan, Yongxiong Wang, Zhe Wang, Yiheng Tang, Benke Liu
{"title":"基于多模态生理信号的情绪识别的联合低阶张量融合和跨模态关注。","authors":"Xin Wan, Yongxiong Wang, Zhe Wang, Yiheng Tang, Benke Liu","doi":"10.1088/1361-6579/ad5bbc","DOIUrl":null,"url":null,"abstract":"<p><p><i>Objective</i>. Physiological signals based emotion recognition is a prominent research domain in the field of human-computer interaction. Previous studies predominantly focused on unimodal data, giving limited attention to the interplay among multiple modalities. Within the scope of multimodal emotion recognition, integrating the information from diverse modalities and leveraging the complementary information are the two essential issues to obtain the robust representations.<i>Approach</i>. Thus, we propose a intermediate fusion strategy for combining low-rank tensor fusion with the cross-modal attention to enhance the fusion of electroencephalogram, electrooculogram, electromyography, and galvanic skin response. Firstly, handcrafted features from distinct modalities are individually fed to corresponding feature extractors to obtain latent features. Subsequently, low-rank tensor is fused to integrate the information by the modality interaction representation. Finally, a cross-modal attention module is employed to explore the potential relationships between the distinct latent features and modality interaction representation, and recalibrate the weights of different modalities. And the resultant representation is adopted for emotion recognition.<i>Main results</i>. Furthermore, to validate the effectiveness of the proposed method, we execute subject-independent experiments within the DEAP dataset. The proposed method has achieved the accuracies of 73.82% and 74.55% for valence and arousal classification.<i>Significance</i>. The results of extensive experiments verify the outstanding performance of the proposed method.</p>","PeriodicalId":20047,"journal":{"name":"Physiological measurement","volume":null,"pages":null},"PeriodicalIF":2.3000,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Joint low-rank tensor fusion and cross-modal attention for multimodal physiological signals based emotion recognition.\",\"authors\":\"Xin Wan, Yongxiong Wang, Zhe Wang, Yiheng Tang, Benke Liu\",\"doi\":\"10.1088/1361-6579/ad5bbc\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p><i>Objective</i>. Physiological signals based emotion recognition is a prominent research domain in the field of human-computer interaction. Previous studies predominantly focused on unimodal data, giving limited attention to the interplay among multiple modalities. Within the scope of multimodal emotion recognition, integrating the information from diverse modalities and leveraging the complementary information are the two essential issues to obtain the robust representations.<i>Approach</i>. Thus, we propose a intermediate fusion strategy for combining low-rank tensor fusion with the cross-modal attention to enhance the fusion of electroencephalogram, electrooculogram, electromyography, and galvanic skin response. Firstly, handcrafted features from distinct modalities are individually fed to corresponding feature extractors to obtain latent features. Subsequently, low-rank tensor is fused to integrate the information by the modality interaction representation. Finally, a cross-modal attention module is employed to explore the potential relationships between the distinct latent features and modality interaction representation, and recalibrate the weights of different modalities. And the resultant representation is adopted for emotion recognition.<i>Main results</i>. Furthermore, to validate the effectiveness of the proposed method, we execute subject-independent experiments within the DEAP dataset. The proposed method has achieved the accuracies of 73.82% and 74.55% for valence and arousal classification.<i>Significance</i>. The results of extensive experiments verify the outstanding performance of the proposed method.</p>\",\"PeriodicalId\":20047,\"journal\":{\"name\":\"Physiological measurement\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physiological measurement\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1088/1361-6579/ad5bbc\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physiological measurement","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6579/ad5bbc","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOPHYSICS","Score":null,"Total":0}
Joint low-rank tensor fusion and cross-modal attention for multimodal physiological signals based emotion recognition.
Objective. Physiological signals based emotion recognition is a prominent research domain in the field of human-computer interaction. Previous studies predominantly focused on unimodal data, giving limited attention to the interplay among multiple modalities. Within the scope of multimodal emotion recognition, integrating the information from diverse modalities and leveraging the complementary information are the two essential issues to obtain the robust representations.Approach. Thus, we propose a intermediate fusion strategy for combining low-rank tensor fusion with the cross-modal attention to enhance the fusion of electroencephalogram, electrooculogram, electromyography, and galvanic skin response. Firstly, handcrafted features from distinct modalities are individually fed to corresponding feature extractors to obtain latent features. Subsequently, low-rank tensor is fused to integrate the information by the modality interaction representation. Finally, a cross-modal attention module is employed to explore the potential relationships between the distinct latent features and modality interaction representation, and recalibrate the weights of different modalities. And the resultant representation is adopted for emotion recognition.Main results. Furthermore, to validate the effectiveness of the proposed method, we execute subject-independent experiments within the DEAP dataset. The proposed method has achieved the accuracies of 73.82% and 74.55% for valence and arousal classification.Significance. The results of extensive experiments verify the outstanding performance of the proposed method.
期刊介绍:
Physiological Measurement publishes papers about the quantitative assessment and visualization of physiological function in clinical research and practice, with an emphasis on the development of new methods of measurement and their validation.
Papers are published on topics including:
applied physiology in illness and health
electrical bioimpedance, optical and acoustic measurement techniques
advanced methods of time series and other data analysis
biomedical and clinical engineering
in-patient and ambulatory monitoring
point-of-care technologies
novel clinical measurements of cardiovascular, neurological, and musculoskeletal systems.
measurements in molecular, cellular and organ physiology and electrophysiology
physiological modeling and simulation
novel biomedical sensors, instruments, devices and systems
measurement standards and guidelines.