Label Distribution Augmented Maximum Likelihood Estimation for Reading Comprehension

Lixin Su, Jiafeng Guo, Yixing Fan, Yanyan Lan, Xueqi Cheng
{"title":"Label Distribution Augmented Maximum Likelihood Estimation for Reading Comprehension","authors":"Lixin Su, Jiafeng Guo, Yixing Fan, Yanyan Lan, Xueqi Cheng","doi":"10.1145/3336191.3371835","DOIUrl":null,"url":null,"abstract":"Reading comprehension (RC) aims to locate a text span from a context passage to answer the given question. Despite the effectiveness of modern neural RC models, most existing work relies on maximum likelihood estimation (MLE) and ignores the structure of the output space. That is during training, one treats all the text spans do not match the ground truth as equally poor, leading to overconfident predictions on ground truth labels and reduced generalization ability in test. One way to bridge the gap between training and test is to take into account the task reward of alternative outputs using the reinforcement learning (RL) algorithms, which is often deficient in optimization as compared with MLE. In this paper, we propose a new learning criterion for the RC task which combines the merits of both MLE and RL-based methods. Specifically, we show that we are able to derive the distribution of the outputs, i.e., label distribution, using their corresponding task rewards based on the decomposition property of the RC problem. We then optimize the RC model by directly learning towards the auxiliary label distribution, instead of the ground truth label, using the MLE framework. In this way, we can make use of the structure of the output space for better generalization (as RL) via efficient optimization (as MLE). We name our approach as Label Distribution augmented MLE (LD-MLE), which is a general learning criterion that could be adopted by almost all the existing RC models. Experiments on three representative benchmark datasets demonstrate that RC models learned with the LD-MLE criterion can achieve consistently improved results over those based on the traditional MLE and RL-based criteria.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"473 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 13th International Conference on Web Search and Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3336191.3371835","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Reading comprehension (RC) aims to locate a text span in a context passage that answers a given question. Despite the effectiveness of modern neural RC models, most existing work relies on maximum likelihood estimation (MLE) and ignores the structure of the output space. That is, during training, all text spans that do not match the ground truth are treated as equally poor, leading to overconfident predictions on the ground-truth labels and reduced generalization ability at test time. One way to bridge the gap between training and test is to take into account the task reward of alternative outputs using reinforcement learning (RL) algorithms, which are, however, often less efficient to optimize than MLE. In this paper, we propose a new learning criterion for the RC task that combines the merits of both MLE- and RL-based methods. Specifically, we show that we can derive the distribution of the outputs, i.e., the label distribution, from their corresponding task rewards based on the decomposition property of the RC problem. We then optimize the RC model by learning directly toward this auxiliary label distribution, instead of the ground-truth label, within the MLE framework. In this way, we can exploit the structure of the output space for better generalization (as in RL) via efficient optimization (as in MLE). We name our approach Label Distribution augmented MLE (LD-MLE); it is a general learning criterion that can be adopted by almost all existing RC models. Experiments on three representative benchmark datasets demonstrate that RC models trained with the LD-MLE criterion achieve consistently improved results over those based on traditional MLE and RL criteria.
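The core recipe described in the abstract — derive a soft label distribution over candidate answer spans from their task rewards, then train with ordinary cross-entropy toward that distribution instead of the one-hot ground truth — can be made concrete with a short sketch. The snippet below is a minimal illustration, not the authors' implementation: the token-level F1 reward, the exponential normalization with temperature `tau`, and the explicit enumeration of candidate spans are all assumptions chosen for clarity.

```python
# Minimal LD-MLE-style sketch (illustrative only, not the paper's code):
# build a soft target distribution q(y) over candidate spans from a task
# reward, then minimize cross-entropy between the model's span distribution
# and q, rather than toward a one-hot target on the gold span.

import torch
import torch.nn.functional as F


def span_f1(pred: tuple, gold: tuple) -> float:
    """Token-level F1 between two (start, end) spans (inclusive)."""
    pred_tokens = set(range(pred[0], pred[1] + 1))
    gold_tokens = set(range(gold[0], gold[1] + 1))
    overlap = len(pred_tokens & gold_tokens)
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


def label_distribution(spans, gold, tau: float = 0.1) -> torch.Tensor:
    """Soft target q(y) proportional to exp(reward(y, y*) / tau).
    The exp/tau form is an assumption; any reward-to-probability
    normalization would fit the same training scheme."""
    rewards = torch.tensor([span_f1(s, gold) for s in spans])
    return F.softmax(rewards / tau, dim=0)


def ld_mle_loss(span_logits: torch.Tensor, spans, gold) -> torch.Tensor:
    """Cross-entropy against the reward-derived label distribution
    (soft targets) instead of the one-hot ground-truth span."""
    q = label_distribution(spans, gold)
    log_p = F.log_softmax(span_logits, dim=0)
    return -(q * log_p).sum()


# Toy usage: three candidate spans scored by a hypothetical model.
spans = [(5, 7), (5, 8), (20, 22)]   # (start, end) candidates
gold = (5, 8)                        # ground-truth answer span
logits = torch.tensor([1.2, 0.4, -0.3], requires_grad=True)
loss = ld_mle_loss(logits, spans, gold)
loss.backward()
print(float(loss))
```

Note that near-miss spans such as (5, 7) receive substantial probability mass in q, while disjoint spans receive almost none, which is exactly the output-space structure a one-hot MLE target discards. Enumerating all spans is O(n^2) in passage length; since an extractive answer decomposes into a start and an end position, the paper's "decomposition property" presumably allows this distribution to be computed over start/end positions instead, though the sketch above does not implement that.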