A Method For Answer Selection Using DistilBERT And Important Words

2020 6th International Conference on Web Research (ICWR) Pub Date : 2020-04-01 DOI:10.1109/ICWR49608.2020.9122302

Jamshid Mozafari, A. Fatemi, P. Moradi

{"title":"A Method For Answer Selection Using DistilBERT And Important Words","authors":"Jamshid Mozafari, A. Fatemi, P. Moradi","doi":"10.1109/ICWR49608.2020.9122302","DOIUrl":null,"url":null,"abstract":"Question Answering is a hot topic in artificial intelligence and has many real-world applications. This field aims at generating an answer to the user's question by analyzing a massive volume of text documents. Answer Selection is a significant part of a question answering system and attempts to extract the most relevant answers to the user's question from the candidate answers pool. Recently, researchers have attempted to resolve the answer selection task by using deep neural networks. They first employed the recurrent neural networks and then gradually migrated to convolutional neural networks. Nevertheless, the use of language models, which is implemented by deep neural networks, has recently been considered. In this research, the DistilBERT language model was employed as the language model. The outputs of the Question Analysis part and Expected Answer Extraction component are also applied with [CLS] token output as the final feature vector. This operation leads to improving the method performance. Several experiments are performed to evaluate the effectiveness of the proposed method, and the results are reported based on the MAP and MRR metrics. The results show that the MAP values of the proposed method improved by 0.6%, and the MRR metric is improved by 0.2%. The results of our research show that using a heavy language model does not guarantee a more reliable method for answer selection problem. It also shows that the use of particular words, such as Question Word and Expected Answer word, can improve the performance of the method.","PeriodicalId":231982,"journal":{"name":"2020 6th International Conference on Web Research (ICWR)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 6th International Conference on Web Research (ICWR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWR49608.2020.9122302","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Question Answering is a hot topic in artificial intelligence and has many real-world applications. This field aims at generating an answer to the user's question by analyzing a massive volume of text documents. Answer Selection is a significant part of a question answering system and attempts to extract the most relevant answers to the user's question from the candidate answers pool. Recently, researchers have attempted to resolve the answer selection task by using deep neural networks. They first employed the recurrent neural networks and then gradually migrated to convolutional neural networks. Nevertheless, the use of language models, which is implemented by deep neural networks, has recently been considered. In this research, the DistilBERT language model was employed as the language model. The outputs of the Question Analysis part and Expected Answer Extraction component are also applied with [CLS] token output as the final feature vector. This operation leads to improving the method performance. Several experiments are performed to evaluate the effectiveness of the proposed method, and the results are reported based on the MAP and MRR metrics. The results show that the MAP values of the proposed method improved by 0.6%, and the MRR metric is improved by 0.2%. The results of our research show that using a heavy language model does not guarantee a more reliable method for answer selection problem. It also shows that the use of particular words, such as Question Word and Expected Answer word, can improve the performance of the method.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一种利用蒸馏酒和重要词进行答案选择的方法

问答是人工智能领域的一个热门话题，在现实世界中有着广泛的应用。该领域旨在通过分析大量文本文档来生成用户问题的答案。答案选择是问答系统的重要组成部分，它试图从候选答案池中提取与用户问题最相关的答案。最近，研究人员试图利用深度神经网络解决答案选择任务。他们首先使用循环神经网络，然后逐渐迁移到卷积神经网络。然而，语言模型的使用，这是由深度神经网络实现，最近被考虑。本研究采用蒸馏器语言模型作为语言模型。问题分析部分和期望答案提取组件的输出也应用[CLS]令牌输出作为最终特征向量。该操作可提高方法的性能。通过实验对该方法的有效性进行了评估，并基于MAP和MRR指标报告了结果。结果表明，该方法的MAP值提高了0.6%，MRR指标提高了0.2%。我们的研究结果表明，使用重语言模型并不能保证答案选择问题的方法更可靠。研究还表明，使用特定的词，如问题词和期望答案词，可以提高方法的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 6th International Conference on Web Research (ICWR)

自引率

0.00%

发文量

期刊最新文献

Hierarchical Three-module Method of Text Classification in Web Big Data RePersian:An Efficient Open Information Extraction Tool in Persian Personalization of E-Learning Environment Using the Kolb's Learning Style Model A Multiagent Approach To Web Service Composition Based On TROPOS Methodology Analyzing the Robustness of Web Service Networks