{"title":"Multimodal Affective Analysis Using Hierarchical Attention Strategy with Word-Level Alignment.","authors":"Yue Gu, Kangning Yang, Shiyu Fu, Shuhong Chen, Xinyu Li, Ivan Marsic","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Multimodal affective computing, learning to recognize and interpret human affect and subjective information from multiple data sources, is still challenging because:(i) it is hard to extract informative features to represent human affects from heterogeneous inputs; (ii) current fusion strategies only fuse different modalities at abstract levels, ignoring time-dependent interactions between modalities. Addressing such issues, we introduce a hierarchical multimodal architecture with attention and word-level fusion to classify utterance-level sentiment and emotion from text and audio data. Our introduced model outperforms state-of-the-art approaches on published datasets, and we demonstrate that our model's synchronized attention over modalities offers visual interpretability.</p>","PeriodicalId":74541,"journal":{"name":"Proceedings of the conference. Association for Computational Linguistics. Meeting","volume":"2018 ","pages":"2225-2235"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6261375/pdf/nihms-993286.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the conference. Association for Computational Linguistics. Meeting","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
Multimodal affective computing, which learns to recognize and interpret human affect and subjective information from multiple data sources, remains challenging because: (i) it is hard to extract informative features that represent human affect from heterogeneous inputs; (ii) current fusion strategies only fuse different modalities at abstract levels, ignoring time-dependent interactions between modalities. To address these issues, we introduce a hierarchical multimodal architecture with attention and word-level fusion to classify utterance-level sentiment and emotion from text and audio data. Our model outperforms state-of-the-art approaches on published datasets, and we demonstrate that its synchronized attention over modalities offers visual interpretability.
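The abstract's core idea, fusing text and audio at the word level and attending over the fused word representations to form an utterance-level prediction, can be illustrated with the minimal sketch below. This is not the authors' implementation: the GRU encoders, concatenation fusion, single-layer attention, feature dimensions, and class count are all illustrative assumptions, and the audio stream is assumed to have already been segmented and pooled to one feature vector per word.

```python
# Minimal sketch of word-level fusion with attention (illustrative, not the
# paper's architecture). Assumes audio features are already word-aligned.
import torch
import torch.nn as nn
import torch.nn.functional as F


class WordLevelFusionClassifier(nn.Module):
    def __init__(self, text_dim=300, audio_dim=74, hidden=128, num_classes=4):
        super().__init__()
        # Per-modality encoders over the word-aligned sequences.
        self.text_rnn = nn.GRU(text_dim, hidden, batch_first=True, bidirectional=True)
        self.audio_rnn = nn.GRU(audio_dim, hidden, batch_first=True, bidirectional=True)
        # Attention scores computed on the fused (concatenated) word representation.
        self.attn = nn.Linear(4 * hidden, 1)
        self.classifier = nn.Linear(4 * hidden, num_classes)

    def forward(self, text_feats, audio_feats):
        # text_feats:  (batch, num_words, text_dim), e.g. word embeddings
        # audio_feats: (batch, num_words, audio_dim), per-word acoustic features
        h_text, _ = self.text_rnn(text_feats)      # (batch, num_words, 2*hidden)
        h_audio, _ = self.audio_rnn(audio_feats)   # (batch, num_words, 2*hidden)

        # Word-level fusion: concatenate the modalities at each word position.
        fused = torch.cat([h_text, h_audio], dim=-1)   # (batch, num_words, 4*hidden)

        # Attention over words: weight each fused word vector, then pool
        # into a single utterance-level representation.
        scores = self.attn(fused)                       # (batch, num_words, 1)
        weights = F.softmax(scores, dim=1)              # distribution over words
        utterance = (weights * fused).sum(dim=1)        # (batch, 4*hidden)

        # Returning the weights allows inspecting which words the model attends to.
        return self.classifier(utterance), weights.squeeze(-1)


if __name__ == "__main__":
    model = WordLevelFusionClassifier()
    text = torch.randn(2, 12, 300)   # 2 utterances, 12 words, 300-d embeddings
    audio = torch.randn(2, 12, 74)   # matching word-aligned acoustic features
    logits, attn = model(text, audio)
    print(logits.shape, attn.shape)  # torch.Size([2, 4]) torch.Size([2, 12])
```

Returning the attention weights alongside the logits mirrors the interpretability claim in the abstract: the per-word weights can be visualized against the transcript to see which words (and their aligned audio) drove the prediction.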