Jian Li;Yuliang Zhao;Yinghao Liu;Huawei Zhang;Peng Shan;Yuanyi Wu;Wanyue Wang;Yulin Wang
{"title":"Sparse Emotion Dictionary and CWT Spectrogram Fusion With Multi-Head Self-Attention for Depression Recognition in Parkinson's Disease Patients","authors":"Jian Li;Yuliang Zhao;Yinghao Liu;Huawei Zhang;Peng Shan;Yuanyi Wu;Wanyue Wang;Yulin Wang","doi":"10.1109/TAFFC.2024.3498009","DOIUrl":null,"url":null,"abstract":"Depression is prevalent in patients with Parkinson's disease (PD), due to the dramatic negative impact that behavioral disorders have on daily life. Regrettably, most researchers in the past ignored the study of depression in PD patients, especially when depressive symptoms and PD symptoms are coupled together, it is difficult for researchers to recognize depression from the macro physiological signs of PD patients. Researchers are increasingly turning their attention to the subtle phenomena of emotional expression in conversation, using the textual and spectral features extracted from the audio of interviews as the primary support for understanding emotional states. However, there is still a lack of effective technical means to fuse these two features to recognize depression in PD patients. In this study, we proposed an innovative image fusion approach, fusing a sparse emotion dictionary with textual features and a Continuous Wavelet Transform (CWT) spectrogram with spectral features for the precise recognition of depression in PD patients. The fusion process integrates low-dimensional emotion-related textual cues, contributing to a more comprehensive extraction of emotionally relevant information. Subsequently, we introduce a High and Low Frequency Feature Fusion Multi-headed Self-Attention (HL-MSA) mechanism within a high and low frequency feature fusion network to amalgamate information across different frequency features within the images. The results underscore the efficacy of this novel fusion approach in effectively extracting depressive features in PD patients, attaining advanced recognition performance. Notably, this endeavor represents a pioneering stride in seamlessly fusing a sparse emotion dictionary and CWT spectrogram, exemplifying a promising and effective initiative for recognizing depression in PD patients.","PeriodicalId":13131,"journal":{"name":"IEEE Transactions on Affective Computing","volume":"16 2","pages":"1159-1176"},"PeriodicalIF":9.8000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Affective Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10753072/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Depression is prevalent in patients with Parkinson's disease (PD), due to the dramatic negative impact that behavioral disorders have on daily life. Regrettably, most researchers in the past ignored the study of depression in PD patients, especially when depressive symptoms and PD symptoms are coupled together, it is difficult for researchers to recognize depression from the macro physiological signs of PD patients. Researchers are increasingly turning their attention to the subtle phenomena of emotional expression in conversation, using the textual and spectral features extracted from the audio of interviews as the primary support for understanding emotional states. However, there is still a lack of effective technical means to fuse these two features to recognize depression in PD patients. In this study, we proposed an innovative image fusion approach, fusing a sparse emotion dictionary with textual features and a Continuous Wavelet Transform (CWT) spectrogram with spectral features for the precise recognition of depression in PD patients. The fusion process integrates low-dimensional emotion-related textual cues, contributing to a more comprehensive extraction of emotionally relevant information. Subsequently, we introduce a High and Low Frequency Feature Fusion Multi-headed Self-Attention (HL-MSA) mechanism within a high and low frequency feature fusion network to amalgamate information across different frequency features within the images. The results underscore the efficacy of this novel fusion approach in effectively extracting depressive features in PD patients, attaining advanced recognition performance. Notably, this endeavor represents a pioneering stride in seamlessly fusing a sparse emotion dictionary and CWT spectrogram, exemplifying a promising and effective initiative for recognizing depression in PD patients.
期刊介绍:
The IEEE Transactions on Affective Computing is an international and interdisciplinary journal. Its primary goal is to share research findings on the development of systems capable of recognizing, interpreting, and simulating human emotions and related affective phenomena. The journal publishes original research on the underlying principles and theories that explain how and why affective factors shape human-technology interactions. It also focuses on how techniques for sensing and simulating affect can enhance our understanding of human emotions and processes. Additionally, the journal explores the design, implementation, and evaluation of systems that prioritize the consideration of affect in their usability. We also welcome surveys of existing work that provide new perspectives on the historical and future directions of this field.