{"title":"Hierarchical Knowledge Stripping for Multimodal Sentiment Analysis","authors":"Aolin Xiong;Ying Zeng;Haifeng Hu","doi":"10.1109/TAFFC.2024.3456117","DOIUrl":null,"url":null,"abstract":"Multimodal sentiment analysis (MSA) has emerged as a prominent research area that focuses on leveraging multimodal data to understand intention and sentiment signals. Despite significant progress, two major challenges remain in integrating diverse modalities: modal heterogeneity and interference information. To address these issues, we propose a novel framework called Multimodal Hierarchical Knowledge Stripping (MHKS), which enables the progressive extraction of informative knowledge. First, inspired by the information bottleneck (IB), we design a hierarchical disentanglement strategy to stepwise separate task-relevant and task-irrelevant information at the feature, attribute, and semantic levels. This enables MHKS to extract valuable knowledge in unimodal representations and eliminate interference information. Then, to mitigate the distribution gap across multiple modalities, we further design an adaptive alignment strategy based on contrastive learning. We utilize text modality as a bridge to connect other nonverbal modalities, which encourages adaptive alignment across modalities and facilitates the learning of more harmonized joint representations. Comprehensive experiments on three popular datasets demonstrate our method achieves excellent performance on MSA tasks.","PeriodicalId":13131,"journal":{"name":"IEEE Transactions on Affective Computing","volume":"16 2","pages":"669-682"},"PeriodicalIF":9.8000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Affective Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10669761/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Multimodal sentiment analysis (MSA) has emerged as a prominent research area that focuses on leveraging multimodal data to understand intention and sentiment signals. Despite significant progress, two major challenges remain in integrating diverse modalities: modal heterogeneity and interference information. To address these issues, we propose a novel framework called Multimodal Hierarchical Knowledge Stripping (MHKS), which enables the progressive extraction of informative knowledge. First, inspired by the information bottleneck (IB), we design a hierarchical disentanglement strategy to stepwise separate task-relevant and task-irrelevant information at the feature, attribute, and semantic levels. This enables MHKS to extract valuable knowledge in unimodal representations and eliminate interference information. Then, to mitigate the distribution gap across multiple modalities, we further design an adaptive alignment strategy based on contrastive learning. We utilize text modality as a bridge to connect other nonverbal modalities, which encourages adaptive alignment across modalities and facilitates the learning of more harmonized joint representations. Comprehensive experiments on three popular datasets demonstrate our method achieves excellent performance on MSA tasks.
期刊介绍:
The IEEE Transactions on Affective Computing is an international and interdisciplinary journal. Its primary goal is to share research findings on the development of systems capable of recognizing, interpreting, and simulating human emotions and related affective phenomena. The journal publishes original research on the underlying principles and theories that explain how and why affective factors shape human-technology interactions. It also focuses on how techniques for sensing and simulating affect can enhance our understanding of human emotions and processes. Additionally, the journal explores the design, implementation, and evaluation of systems that prioritize the consideration of affect in their usability. We also welcome surveys of existing work that provide new perspectives on the historical and future directions of this field.