Conversational Memory Network for Emotion Recognition in Dyadic Dialogue Videos
Devamanyu Hazarika, Soujanya Poria, Amir Zadeh, Erik Cambria, Louis-Philippe Morency, Roger Zimmermann
Proceedings of NAACL-HLT 2018, pages 2122-2132, June 2018. DOI: 10.18653/v1/n18-1193
Abstract
Emotion recognition in conversations is crucial for the development of empathetic machines. Existing methods mostly ignore inter-speaker dependencies when classifying emotions in conversations. In this paper, we address recognizing utterance-level emotions in dyadic conversational videos. We propose a deep neural framework, termed the conversational memory network, which leverages contextual information from the conversation history. The framework takes a multimodal approach, combining audio, visual, and textual features, and uses gated recurrent units to model each speaker's past utterances as memories. These memories are then merged using attention-based hops to capture inter-speaker dependencies. Experiments show an accuracy improvement of 3-4% over the state of the art.
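For readers who want a concrete picture of the architecture the abstract describes, below is a minimal PyTorch sketch. It is not the authors' implementation: the feature and memory dimensions, the fused multimodal input, the residual query update, and all names (CMNSketch, feat_dim, etc.) are illustrative assumptions; only the overall structure, per-speaker GRU memories read through attention hops before classification, follows the abstract.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CMNSketch(nn.Module):
    """Sketch of a conversational memory network: per-speaker GRUs
    turn utterance histories into memories, which are read with
    attention-based hops to classify the current utterance's emotion."""

    def __init__(self, feat_dim=100, mem_dim=100, num_classes=6, hops=3):
        super().__init__()
        # One GRU per speaker models that speaker's utterance history.
        self.gru_a = nn.GRU(feat_dim, mem_dim, batch_first=True)
        self.gru_b = nn.GRU(feat_dim, mem_dim, batch_first=True)
        self.query_proj = nn.Linear(feat_dim, mem_dim)
        self.classifier = nn.Linear(mem_dim, num_classes)
        self.hops = hops

    def forward(self, utt, hist_a, hist_b):
        # utt: (batch, feat_dim) fused multimodal features of the
        #   current utterance (audio + visual + textual, assumed fused).
        # hist_a, hist_b: (batch, k, feat_dim) past utterances of each speaker.
        mem_a, _ = self.gru_a(hist_a)            # (batch, k, mem_dim)
        mem_b, _ = self.gru_b(hist_b)            # (batch, k, mem_dim)
        memories = torch.cat([mem_a, mem_b], 1)  # merge both speakers' memories
        q = self.query_proj(utt)                 # initial query from utterance
        for _ in range(self.hops):
            # One attention hop: score memories against the query,
            # read a weighted summary, and refine the query (residual
            # update is an assumption, not stated in the abstract).
            scores = torch.bmm(memories, q.unsqueeze(2)).squeeze(2)
            attn = F.softmax(scores, dim=1)
            read = torch.bmm(attn.unsqueeze(1), memories).squeeze(1)
            q = q + read
        return self.classifier(q)

# Usage with random stand-in features: a batch of 4 utterances,
# each with 5 past utterances per speaker.
model = CMNSketch()
utt = torch.randn(4, 100)
hist_a = torch.randn(4, 5, 100)
hist_b = torch.randn(4, 5, 100)
logits = model(utt, hist_a, hist_b)  # (4, 6) emotion logits
```

The two-GRU split is what lets attention weights over the concatenated memories express inter-speaker dependencies: the model can learn to attend to the other speaker's history as well as the current speaker's own.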