Gokul S Krishnan;Sarala Padi;Craig S. Greenberg;Balaraman Ravindran;Dinesh Manocha;Ram D. Sriram
{"title":"LineConGraphs: Line Conversation Graphs for Effective Emotion Recognition Using Graph Neural Networks","authors":"Gokul S Krishnan;Sarala Padi;Craig S. Greenberg;Balaraman Ravindran;Dinesh Manocha;Ram D. Sriram","doi":"10.1109/TAFFC.2025.3537538","DOIUrl":null,"url":null,"abstract":"Emotion Recognition in Conversations (ERC) is an important aspect of affective computing with practical applications in healthcare, education, chatbots, and social media platforms. Previous approaches to <inline-formula><tex-math>$\\text {ERC}$</tex-math></inline-formula> analysis involved using graph neural network architectures to model both speaker and long-term contextual information. In this paper, we introduce new models for <inline-formula><tex-math>$\\text {ERC}$</tex-math></inline-formula> analysis: the <i>LineConGCN</i> and <i>LineConGAT</i> models, which are constructed using a graph construction strategy for conversations called line conversational graphs (<i>LineConGraphs</i>). <i>LineConGraph</i> is designed to capture short-term conversational context using one previous and future utterance, while also capturing long-term context using GCN or GAT layers without explicitly integrating into the graph construction strategy. We evaluate the performance of our proposed models on two benchmark datasets, <inline-formula><tex-math>$\\text {IEMOCAP}$</tex-math></inline-formula> and <inline-formula><tex-math>$\\text {MELD}$</tex-math></inline-formula>, and show that our <i>LineConGAT</i> model outperforms the state-of-the-art methods with an F1-score of <inline-formula><tex-math>$\\text {64.58}\\%$</tex-math></inline-formula> and <inline-formula><tex-math>$\\text {76.50}\\%$</tex-math></inline-formula>. Furthermore, we demonstrate that incorporating sentiment shift information into line conversation graphs further enhances <inline-formula><tex-math>$\\text {ERC}$</tex-math></inline-formula> performance in the case of <i>LineConGCN</i> models. We also evaluate the performance of our proposed model by embedding speaker information into <i>LineConGCN</i> and <i>LineConGAT</i> models and show that <i>LineConGAT</i> and <i>LineConGAT</i> with speaker embeddings performed equally for ERC analysis.","PeriodicalId":13131,"journal":{"name":"IEEE Transactions on Affective Computing","volume":"16 3","pages":"1747-1759"},"PeriodicalIF":9.8000,"publicationDate":"2025-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Affective Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10858741/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Emotion Recognition in Conversations (ERC) is an important aspect of affective computing with practical applications in healthcare, education, chatbots, and social media platforms. Previous approaches to $\text {ERC}$ analysis involved using graph neural network architectures to model both speaker and long-term contextual information. In this paper, we introduce new models for $\text {ERC}$ analysis: the LineConGCN and LineConGAT models, which are constructed using a graph construction strategy for conversations called line conversational graphs (LineConGraphs). LineConGraph is designed to capture short-term conversational context using one previous and future utterance, while also capturing long-term context using GCN or GAT layers without explicitly integrating into the graph construction strategy. We evaluate the performance of our proposed models on two benchmark datasets, $\text {IEMOCAP}$ and $\text {MELD}$, and show that our LineConGAT model outperforms the state-of-the-art methods with an F1-score of $\text {64.58}\%$ and $\text {76.50}\%$. Furthermore, we demonstrate that incorporating sentiment shift information into line conversation graphs further enhances $\text {ERC}$ performance in the case of LineConGCN models. We also evaluate the performance of our proposed model by embedding speaker information into LineConGCN and LineConGAT models and show that LineConGAT and LineConGAT with speaker embeddings performed equally for ERC analysis.
期刊介绍:
The IEEE Transactions on Affective Computing is an international and interdisciplinary journal. Its primary goal is to share research findings on the development of systems capable of recognizing, interpreting, and simulating human emotions and related affective phenomena. The journal publishes original research on the underlying principles and theories that explain how and why affective factors shape human-technology interactions. It also focuses on how techniques for sensing and simulating affect can enhance our understanding of human emotions and processes. Additionally, the journal explores the design, implementation, and evaluation of systems that prioritize the consideration of affect in their usability. We also welcome surveys of existing work that provide new perspectives on the historical and future directions of this field.