Proceedings of the First Workshop on NLP for Conversational AI最新文献

英文中文

Energy-Based Modelling for Dialogue State Tracking 基于能量的对话状态跟踪建模

Proceedings of the First Workshop on NLP for Conversational AI

Pub Date : 2019-08-01 DOI: 10.18653/v1/W19-4109

A. Trinh, R. Ross, John D. Kelleher

The uncertainties of language and the complexity of dialogue contexts make accurate dialogue state tracking one of the more challenging aspects of dialogue processing. To improve state tracking quality, we argue that relationships between different aspects of dialogue state must be taken into account as they can often guide a more accurate interpretation process. To this end, we present an energy-based approach to dialogue state tracking as a structured classification task. The novelty of our approach lies in the use of an energy network on top of a deep learning architecture to explore more signal correlations between network variables including input features and output labels. We demonstrate that the energy-based approach improves the performance of a deep learning dialogue state tracker towards state-of-the-art results without the need for many of the other steps required by current state-of-the-art methods.

语言的不确定性和对话语境的复杂性使得准确的对话状态跟踪成为对话处理中更具挑战性的方面之一。为了提高状态跟踪质量，我们认为必须考虑对话状态不同方面之间的关系，因为它们通常可以指导更准确的解释过程。为此，我们提出了一种基于能量的对话状态跟踪方法，将其作为结构化分类任务。我们方法的新颖之处在于在深度学习架构之上使用能量网络来探索网络变量(包括输入特征和输出标签)之间的更多信号相关性。我们证明，基于能量的方法可以提高深度学习对话状态跟踪器的性能，使其达到最先进的结果，而不需要当前最先进的方法所需的许多其他步骤。

引用次数: 3

Relevant and Informative Response Generation using Pointwise Mutual Information 利用点对相互信息生成相关和信息丰富的响应

Proceedings of the First Workshop on NLP for Conversational AI

Pub Date : 2019-08-01 DOI: 10.18653/v1/W19-4115

Junya Takayama, Yuki Arase

A sequence-to-sequence model tends to generate generic responses with little information for input utterances. To solve this problem, we propose a neural model that generates relevant and informative responses. Our model has simple architecture to enable easy application to existing neural dialogue models. Specifically, using positive pointwise mutual information, it first identifies keywords that frequently co-occur in responses given an utterance. Then, the model encourages the decoder to use the keywords for response generation. Experiment results demonstrate that our model successfully diversifies responses relative to previous models.

序列到序列模型倾向于在输入话语信息很少的情况下生成通用响应。为了解决这个问题，我们提出了一个神经模型来产生相关的信息响应。该模型结构简单，易于应用于已有的神经对话模型。具体来说，使用积极的点对点互信息，它首先识别出在给定话语的回答中经常同时出现的关键词。然后，该模型鼓励解码器使用关键字来生成响应。实验结果表明，相对于以前的模型，我们的模型成功地实现了响应的多样化。

引用次数: 8

Insights from Building an Open-Ended Conversational Agent 构建开放式会话代理的见解

Proceedings of the First Workshop on NLP for Conversational AI

Pub Date : 1900-01-01 DOI: 10.18653/v1/W19-4112

Khyatti Gupta, Meghana Joshi, Ankush Chatterjee, Sonam Damani, Kedhar Nath Narahari, Puneet Agrawal

Dialogue systems and conversational agents are becoming increasingly popular in modern society. We conceptualized one such conversational agent, Microsoft’s “Ruuh” with the promise to be able to talk to its users on any subject they choose. Building an open-ended conversational agent like Ruuh at onset seems like a daunting task, since the agent needs to think beyond the utilitarian notion of merely generating “relevant” responses and meet a wider range of user social needs, like expressing happiness when user’s favourite sports team wins, sharing a cute comment on showing the pictures of the user’s pet and so on. The agent also needs to detect and respond to abusive language, sensitive topics and trolling behaviour of the users. Many of these problems pose significant research challenges as well as product design limitations as one needs to circumnavigate the technical limitations to create an acceptable user experience. However, as the product reaches the real users the true test begins, and one realizes the challenges and opportunities that lie in the vast domain of conversations. With over 2.5 million real-world users till date who have generated over 300 million user conversations with Ruuh, there is a plethora of learning, insights and opportunities that we will talk about in this paper.

对话系统和会话代理在现代社会中越来越受欢迎。我们构想了一个这样的对话代理，微软的“Ruuh”，它承诺能够与用户就他们选择的任何主题进行对话。建立一个像Ruuh这样的开放式对话代理一开始似乎是一项艰巨的任务，因为代理需要超越仅仅产生“相关”响应的实用概念，满足更广泛的用户社交需求，比如在用户最喜欢的运动队获胜时表达快乐，在展示用户宠物的照片时分享可爱的评论等等。代理还需要检测和响应辱骂语言，敏感话题和用户的拖钓行为。许多这些问题构成了重大的研究挑战和产品设计限制，因为人们需要绕过技术限制来创建可接受的用户体验。然而，当产品接触到真正的用户时，真正的考验就开始了，人们意识到在广阔的对话领域中存在着挑战和机遇。到目前为止，Ruuh有超过250万的真实用户，他们与Ruuh产生了超过3亿的用户对话，我们将在本文中讨论大量的学习、见解和机会。

{"title":"Insights from Building an Open-Ended Conversational Agent","authors":"Khyatti Gupta, Meghana Joshi, Ankush Chatterjee, Sonam Damani, Kedhar Nath Narahari, Puneet Agrawal","doi":"10.18653/v1/W19-4112","DOIUrl":"https://doi.org/10.18653/v1/W19-4112","url":null,"abstract":"Dialogue systems and conversational agents are becoming increasingly popular in modern society. We conceptualized one such conversational agent, Microsoft’s “Ruuh” with the promise to be able to talk to its users on any subject they choose. Building an open-ended conversational agent like Ruuh at onset seems like a daunting task, since the agent needs to think beyond the utilitarian notion of merely generating “relevant” responses and meet a wider range of user social needs, like expressing happiness when user’s favourite sports team wins, sharing a cute comment on showing the pictures of the user’s pet and so on. The agent also needs to detect and respond to abusive language, sensitive topics and trolling behaviour of the users. Many of these problems pose significant research challenges as well as product design limitations as one needs to circumnavigate the technical limitations to create an acceptable user experience. However, as the product reaches the real users the true test begins, and one realizes the challenges and opportunities that lie in the vast domain of conversations. With over 2.5 million real-world users till date who have generated over 300 million user conversations with Ruuh, there is a plethora of learning, insights and opportunities that we will talk about in this paper.","PeriodicalId":178445,"journal":{"name":"Proceedings of the First Workshop on NLP for Conversational AI","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126665734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Responsive and Self-Expressive Dialogue Generation 反应和自我表达的对话生成

Proceedings of the First Workshop on NLP for Conversational AI

Pub Date : 1900-01-01 DOI: 10.18653/v1/W19-4116

Kozo Chikai, Junya Takayama, Yuki Arase

A neural conversation model is a promising approach to develop dialogue systems with the ability of chit-chat. It allows training a model in an end-to-end manner without complex rule design nor feature engineering. However, as a side effect, the neural model tends to generate safe but uninformative and insensitive responses like “OK” and “I don’t know.” Such replies are called generic responses and regarded as a critical problem for user-engagement of dialogue systems. For a more engaging chit-chat experience, we propose a neural conversation model that generates responsive and self-expressive replies. Specifically, our model generates domain-aware and sentiment-rich responses. Experiments empirically confirmed that our model outperformed the sequence-to-sequence model; 68.1% of our responses were domain-aware with sentiment polarities, which was only 2.7% for responses generated by the sequence-to-sequence model.

神经对话模型是开发具有聊天能力的对话系统的一种很有前途的方法。它允许以端到端方式训练模型，而无需复杂的规则设计或特征工程。然而，作为副作用，神经模型倾向于产生安全但缺乏信息和不敏感的反应，如“好”和“我不知道”。这种答复被称为一般答复，被认为是用户参与对话系统的一个关键问题。为了获得更有吸引力的聊天体验，我们提出了一种神经对话模型，该模型可以生成响应性和自我表达性的回复。具体来说，我们的模型生成领域感知和情感丰富的响应。实验经验证实，我们的模型优于序列到序列模型;68.1%的响应是具有情感极性的领域感知，而序列到序列模型生成的响应仅为2.7%。

引用次数: 1

DSTC7 Task 1: Noetic End-to-End Response Selection DSTC7任务1:理性的端到端响应选择

Proceedings of the First Workshop on NLP for Conversational AI

Pub Date : 1900-01-01 DOI: 10.18653/v1/W19-4107

Chulaka Gunasekara, Jonathan K. Kummerfeld, L. Polymenakos, Walter S. Lasecki

Goal-oriented dialogue in complex domains is an extremely challenging problem and there are relatively few datasets. This task provided two new resources that presented different challenges: one was focused but small, while the other was large but diverse. We also considered several new variations on the next utterance selection problem: (1) increasing the number of candidates, (2) including paraphrases, and (3) not including a correct option in the candidate set. Twenty teams participated, developing a range of neural network models, including some that successfully incorporated external data to boost performance. Both datasets have been publicly released, enabling future work to build on these results, working towards robust goal-oriented dialogue systems.

复杂领域的目标导向对话是一个极具挑战性的问题，而且数据集相对较少。这项任务提供了两种新的资源，它们提出了不同的挑战:一种是集中但规模小，另一种是规模大但种类繁多。我们还考虑了下一个话语选择问题的几个新变化:(1)增加候选者的数量，(2)包括释义，(3)在候选者集中不包括正确的选项。20个团队参与其中，开发了一系列神经网络模型，其中一些模型成功地整合了外部数据以提高性能。这两个数据集都已公开发布，使未来的工作能够以这些结果为基础，努力建立健全的目标导向的对话系统。

引用次数: 50

End-to-End Neural Context Reconstruction in Chinese Dialogue 中文对话的端到端神经语境重建

Proceedings of the First Workshop on NLP for Conversational AI

Pub Date : 1900-01-01 DOI: 10.18653/v1/W19-4108

Wei Yang, Rui Qiao, Haocheng Qin, Amy Sun, Luchen Tan, Kun Xiong, Ming Li

We tackle the problem of context reconstruction in Chinese dialogue, where the task is to replace pronouns, zero pronouns, and other referring expressions with their referent nouns so that sentences can be processed in isolation without context. Following a standard decomposition of the context reconstruction task into referring expression detection and coreference resolution, we propose a novel end-to-end architecture for separately and jointly accomplishing this task. Key features of this model include POS and position encoding using CNNs and a novel pronoun masking mechanism. One perennial problem in building such models is the paucity of training data, which we address by augmenting previously-proposed methods to generate a large amount of realistic training data. The combination of more data and better models yields accuracy higher than the state-of-the-art method in coreference resolution and end-to-end context reconstruction.

本文研究了汉语对话中的语境重建问题，其任务是用指代名词代替代词、零代词和其他指称表达，使句子可以在没有语境的情况下独立处理。在将上下文重建任务标准分解为引用表达式检测和共同引用解析之后，我们提出了一种新的端到端架构来分别和共同完成这一任务。该模型的主要特点包括使用cnn进行词性和位置编码，以及一种新颖的代词掩蔽机制。建立这种模型的一个长期问题是训练数据的缺乏，我们通过增加先前提出的方法来生成大量真实的训练数据来解决这个问题。更多的数据和更好的模型的结合在共参考分辨率和端到端上下文重建方面比最先进的方法具有更高的准确性。

引用次数: 7

Co-Operation as an Asymmetric Form of Human-Computer Creativity. Case: Peace Machine 合作是人机创造的一种不对称形式。案例:和平机器

Proceedings of the First Workshop on NLP for Conversational AI

Pub Date : 1900-01-01 DOI: 10.18653/v1/W19-4105

Mika Hämäläinen, T. Honkela

This theoretical paper identifies a need for a definition of asymmetric co-creativity where creativity is expected from the computational agent but not from the human user. Our co-operative creativity framework takes into account that the computational agent has a message to convey in a co-operative fashion, which introduces a trade-off on how creative the computer can be. The requirements of co-operation are identified from an interdisciplinary point of view. We divide co-operative creativity in message creativity, contextual creativity and communicative creativity. Finally these notions are applied in the context of the Peace Machine system concept.

这篇理论论文确定了对非对称共同创造力的定义的需求，其中创造力是来自计算代理而不是来自人类用户。我们的合作创造力框架考虑到计算代理以合作的方式传递信息，这就引入了对计算机创造力的权衡。合作的要求是从跨学科的角度来确定的。我们将合作创造力分为信息创造力、语境创造力和交际创造力。最后，这些概念在和平机器系统概念的背景下得到了应用。

引用次数: 3

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the First Workshop on NLP for Conversational AI

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀