{"title":"Multi-turn Dialogue Model Based on the Improved Hierarchical Recurrent Attention Network","authors":"Jiawei Miao, Jiansheng Wu","doi":"10.31534/engmod.2021.2.ri.02d","DOIUrl":null,"url":null,"abstract":"When considering the multi-turn dialogue systems, the model needs to generate a natural and contextual response. At present, HRAN, one of the most advanced models for multi-turn dialogue problems, uses a hierarchical recurrent encoder-decoder combined with a hierarchical attention mechanism. However, for complex conversations, the traditional attention-based RNN does not fully understand the context, which results in attention to the wrong context that generates irrelevant responses. To solve this problem, we proposed an improved hierarchical recurrent attention network, a self-attention network (HSAN), instead of RNN, to learn word representations and utterances representations. Empirical studies on both Chinese and English datasets show that the proposed model has achieved significant improvement.","PeriodicalId":35748,"journal":{"name":"International Journal for Engineering Modelling","volume":"63 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal for Engineering Modelling","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31534/engmod.2021.2.ri.02d","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
Citations: 1
Abstract
In multi-turn dialogue systems, the model must generate natural, context-aware responses. HRAN, currently one of the most advanced models for multi-turn dialogue, combines a hierarchical recurrent encoder-decoder with a hierarchical attention mechanism. For complex conversations, however, its traditional attention-based RNN does not fully capture the context, so attention falls on the wrong parts of the dialogue history and irrelevant responses are generated. To solve this problem, we propose an improved hierarchical recurrent attention network, the hierarchical self-attention network (HSAN), which uses self-attention instead of RNNs to learn word and utterance representations. Empirical studies on both Chinese and English datasets show that the proposed model achieves significant improvements.
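The abstract gives no implementation details, so the following is only a minimal PyTorch sketch of the core idea it describes: replacing HRAN's word-level and utterance-level RNN encoders with self-attention layers. The class name, dimensions, layer counts, and the mean-pooling step are all assumptions for illustration, not the authors' actual architecture.

```python
import torch
import torch.nn as nn


class HierarchicalSelfAttentionEncoder(nn.Module):
    """Hypothetical sketch of a two-level HSAN-style encoder:
    self-attention replaces the word-level and utterance-level RNNs of HRAN."""

    def __init__(self, vocab_size: int, d_model: int = 256,
                 nhead: int = 4, num_layers: int = 2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model, padding_idx=0)
        # Word level: self-attention contextualizes tokens within one utterance.
        word_layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.word_encoder = nn.TransformerEncoder(word_layer, num_layers)
        # Utterance level: self-attention contextualizes utterance vectors
        # across the whole dialogue history.
        utt_layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.utt_encoder = nn.TransformerEncoder(utt_layer, num_layers)

    def forward(self, dialogue: torch.Tensor) -> torch.Tensor:
        # dialogue: (batch, num_utterances, num_tokens) of token ids
        b, u, t = dialogue.shape
        tokens = self.embed(dialogue.view(b * u, t))   # (b*u, t, d)
        words = self.word_encoder(tokens)              # (b*u, t, d)
        # Mean-pool token states into one vector per utterance (an assumption;
        # the paper may use attention pooling instead).
        utt_vecs = words.mean(dim=1).view(b, u, -1)    # (b, u, d)
        return self.utt_encoder(utt_vecs)              # (b, u, d)


# Usage: encode a toy batch of 2 dialogues, 3 utterances, 5 tokens each.
if __name__ == "__main__":
    enc = HierarchicalSelfAttentionEncoder(vocab_size=1000)
    ids = torch.randint(1, 1000, (2, 3, 5))
    print(enc(ids).shape)  # torch.Size([2, 3, 256])
```

A decoder would then attend over these utterance-level states to generate the response; that part is omitted since the abstract says nothing about how HSAN's decoder differs from HRAN's.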
Journal Introduction
Engineering Modelling is a refereed international journal providing an up-to-date reference for engineers and researchers engaged in computer-aided analysis, design, and research in the fields of computational mechanics, numerical methods, software development, and engineering modelling.