Hybrid BiLSTM-Siamese network for FAQ Assistance

Proceedings of the 2017 ACM on Conference on Information and Knowledge Management Pub Date : 2017-11-06 DOI:10.1145/3132847.3132861

Prerna Khurana, P. Agarwal, Gautam M. Shroff, L. Vig, A. Srinivasan

{"title":"Hybrid BiLSTM-Siamese network for FAQ Assistance","authors":"Prerna Khurana, P. Agarwal, Gautam M. Shroff, L. Vig, A. Srinivasan","doi":"10.1145/3132847.3132861","DOIUrl":null,"url":null,"abstract":"We describe an automated assistant for answering frequently asked questions; our system has been deployed, and is currently answering HR-related queries in two different areas (leave management and health insurance) to a large number of users. The needs of a large global corporate lead us to model a frequently asked question (FAQ) to be an equivalence class of actually asked questions, for which there is a common answer (certified as being consistent with the organization's policy). When a new question is posed to our system, it finds the class of question, and responds with the answer for the class. At this point, the system is either correct (gives correct answer); or incorrect (gives wrong answer); or incomplete (says \"I don't know''). We employ a hybrid deep-learning architecture in which a BiLSTM-based classifier is combined with second BiLSTM-based Siamese network in an iterative manner: Questions for which the classifier makes an error during training are used to generate a set of misclassified question-question pairs. These, along with correct pairs, are used to train the Siamese network to drive apart the (hidden) representations of the misclassified pairs. We present experimental results from our deployment showing that our iteratively trained hybrid network: (a) results in better performance than using just a classifier network, or just a Siamese network; (b) performs better than state-of-the art sentence classifiers in the two areas in which it has been deployed, in terms of both accuracy as well as precision-recall tradeoff; and (c) also performs well on a benchmark public dataset. We also observe that using question-question pairs in our hybrid network, results in marginally better performance than using question-to-answer pairs. Finally, estimates of precision and recall from the deployment of our automated assistant suggest that we can expect the burden on our HR department to drop from answering about 6000 queries a day to about 1000.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"2 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3132847.3132861","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

Abstract

We describe an automated assistant for answering frequently asked questions; our system has been deployed, and is currently answering HR-related queries in two different areas (leave management and health insurance) to a large number of users. The needs of a large global corporate lead us to model a frequently asked question (FAQ) to be an equivalence class of actually asked questions, for which there is a common answer (certified as being consistent with the organization's policy). When a new question is posed to our system, it finds the class of question, and responds with the answer for the class. At this point, the system is either correct (gives correct answer); or incorrect (gives wrong answer); or incomplete (says "I don't know''). We employ a hybrid deep-learning architecture in which a BiLSTM-based classifier is combined with second BiLSTM-based Siamese network in an iterative manner: Questions for which the classifier makes an error during training are used to generate a set of misclassified question-question pairs. These, along with correct pairs, are used to train the Siamese network to drive apart the (hidden) representations of the misclassified pairs. We present experimental results from our deployment showing that our iteratively trained hybrid network: (a) results in better performance than using just a classifier network, or just a Siamese network; (b) performs better than state-of-the art sentence classifiers in the two areas in which it has been deployed, in terms of both accuracy as well as precision-recall tradeoff; and (c) also performs well on a benchmark public dataset. We also observe that using question-question pairs in our hybrid network, results in marginally better performance than using question-to-answer pairs. Finally, estimates of precision and recall from the deployment of our automated assistant suggest that we can expect the burden on our HR department to drop from answering about 6000 queries a day to about 1000.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

混合BiLSTM-Siamese网络常见问题解答

我们描述了一个用于回答常见问题的自动助手;我们的系统已经部署完毕，目前正在回答大量用户在两个不同领域(休假管理和健康保险)的人力资源相关查询。大型全球公司的需求导致我们将经常问的问题(FAQ)建模为实际问的问题的等价类，对于这些问题，有一个共同的答案(被认证为与组织的政策一致)。当向我们的系统提出一个新问题时，它会找到问题的类别，并根据类别给出答案。此时，系统要么是正确的(给出正确答案);或incorrect(给出错误的答案);或者不完整(说“我不知道”)。我们采用了一种混合深度学习架构，其中基于bilstm的分类器以迭代的方式与第二个基于bilstm的Siamese网络相结合:分类器在训练过程中出错的问题被用来生成一组错误分类的问题对。这些和正确的配对一起用于训练Siamese网络，以分离(隐藏的)错误分类配对的表示。我们展示了我们部署的实验结果，表明我们迭代训练的混合网络:(a)比仅使用分类器网络或仅使用暹罗网络具有更好的性能;(b)在准确率和准确率-查全率权衡方面，它比目前最先进的句子分类器表现更好;并且(c)在基准公共数据集上也表现良好。我们还观察到，在我们的混合网络中使用问题-问题对，结果比使用问题-答案对略微更好。最后，对部署自动助手的准确率和召回率的估计表明，我们可以预期人力资源部门的负担将从每天回答大约6000个查询下降到大约1000个。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the 2017 ACM on Conference on Information and Knowledge Management

自引率

0.00%

发文量

期刊最新文献

Query and Animate Multi-attribute Trajectory Data HyPerInsight: Data Exploration Deep Inside HyPer Algorithmic Bias: Do Good Systems Make Relevant Documents More Retrievable? NeuPL: Attention-based Semantic Matching and Pair-Linking for Entity Disambiguation Health Forum Thread Recommendation Using an Interest Aware Topic Model