Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies最新文献

英文中文

Active2 Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation Active2学习:主动减少序列标注和机器翻译学习的主动学习方法中的冗余:主动减少序列标注和机器翻译的主动学习方法中的冗余

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

Pub Date : 2021-03-11 DOI: 10.18653/V1/2021.NAACL-MAIN.159

Rishi Hazra, Parag Dutta, Shubham Gupta, Mohammed Abdul Qaathir, Ambedkar Dukkipati

While deep learning is a powerful tool for natural language processing (NLP) problems, successful solutions to these problems rely heavily on large amounts of annotated samples. However, manually annotating data is expensive and time-consuming. Active Learning (AL) strategies reduce the need for huge volumes of labeled data by iteratively selecting a small number of examples for manual annotation based on their estimated utility in training the given model. In this paper, we argue that since AL strategies choose examples independently, they may potentially select similar examples, all of which may not contribute significantly to the learning process. Our proposed approach, Activemathbf{^2} Learning (Amathbf{^2}L), actively adapts to the deep learning model being trained to eliminate such redundant examples chosen by an AL strategy. We show that Amathbf{^2}L is widely applicable by using it in conjunction with several different AL strategies and NLP tasks. We empirically demonstrate that the proposed approach is further able to reduce the data requirements of state-of-the-art AL strategies by ≈ mathbf{3-25%} on an absolute scale on multiple NLP tasks while achieving the same performance with virtually no additional computation overhead.

虽然深度学习是自然语言处理(NLP)问题的强大工具，但这些问题的成功解决方案在很大程度上依赖于大量带注释的样本。然而，手动标注数据既昂贵又耗时。主动学习(AL)策略通过基于训练给定模型的估计效用，迭代地选择少量示例进行手动标注，从而减少了对大量标记数据的需求。在本文中，我们认为，由于人工智能策略独立地选择示例，它们可能会选择相似的示例，所有这些示例都可能对学习过程没有显著贡献。我们提出的方法，主动mathbf{^2}学习(Amathbf{^2}L)，主动适应被训练的深度学习模型，以消除由人工智能策略选择的冗余示例。我们通过将Amathbf{^2}L与几种不同的人工智能策略和NLP任务结合使用，证明了它的广泛适用性。我们的经验证明，所提出的方法进一步能够在多个NLP任务上以绝对值≈mathbf{3-25%}的幅度减少最先进的人工智能策略的数据需求，同时在几乎没有额外计算开销的情况下实现相同的性能。

{"title":"Active2 Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation","authors":"Rishi Hazra, Parag Dutta, Shubham Gupta, Mohammed Abdul Qaathir, Ambedkar Dukkipati","doi":"10.18653/V1/2021.NAACL-MAIN.159","DOIUrl":"https://doi.org/10.18653/V1/2021.NAACL-MAIN.159","url":null,"abstract":"While deep learning is a powerful tool for natural language processing (NLP) problems, successful solutions to these problems rely heavily on large amounts of annotated samples. However, manually annotating data is expensive and time-consuming. Active Learning (AL) strategies reduce the need for huge volumes of labeled data by iteratively selecting a small number of examples for manual annotation based on their estimated utility in training the given model. In this paper, we argue that since AL strategies choose examples independently, they may potentially select similar examples, all of which may not contribute significantly to the learning process. Our proposed approach, Activemathbf{^2} Learning (Amathbf{^2}L), actively adapts to the deep learning model being trained to eliminate such redundant examples chosen by an AL strategy. We show that Amathbf{^2}L is widely applicable by using it in conjunction with several different AL strategies and NLP tasks. We empirically demonstrate that the proposed approach is further able to reduce the data requirements of state-of-the-art AL strategies by ≈ mathbf{3-25%} on an absolute scale on multiple NLP tasks while achieving the same performance with virtually no additional computation overhead.","PeriodicalId":251110,"journal":{"name":"Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","volume":"04 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127178970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models 文本模块化网络:学习用现有模型的语言分解任务

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

Pub Date : 1900-01-01 DOI: 10.18653/V1/2021.NAACL-MAIN.99

Tushar Khot, Daniel Khashabi, Kyle Richardson, Peter Clark, Ashish Sabharwal

We propose a general framework called Text Modular Networks(TMNs) for building interpretable systems that learn to solve complex tasks by decomposing them into simpler ones solvable by existing models. To ensure solvability of simpler tasks, TMNs learn the textual input-output behavior (i.e., language) of existing models through their datasets. This differs from prior decomposition-based approaches which, besides being designed specifically for each complex task, produce decompositions independent of existing sub-models. Specifically, we focus on Question Answering (QA) and show how to train a next-question generator to sequentially produce sub-questions targeting appropriate sub-models, without additional human annotation. These sub-questions and answers provide a faithful natural language explanation of the model’s reasoning. We use this framework to build ModularQA, a system that can answer multi-hop reasoning questions by decomposing them into sub-questions answerable by a neural factoid single-span QA model and a symbolic calculator. Our experiments show that ModularQA is more versatile than existing explainable systems for DROP and HotpotQA datasets, is more robust than state-of-the-art blackbox (uninterpretable) systems, and generates more understandable and trustworthy explanations compared to prior work.

我们提出了一个称为文本模块化网络(tmn)的通用框架，用于构建可解释的系统，该系统通过将复杂任务分解为可由现有模型解决的更简单的任务来学习解决复杂任务。为了确保简单任务的可解决性，tmn通过其数据集学习现有模型的文本输入输出行为(即语言)。这不同于先前的基于分解的方法，除了为每个复杂任务专门设计之外，还产生独立于现有子模型的分解。具体来说，我们将重点放在问答(QA)上，并展示如何训练下一个问题生成器，以便在不需要额外的人工注释的情况下，针对适当的子模型依次生成子问题。这些子问题和答案为模型的推理提供了忠实的自然语言解释。我们使用该框架构建了模块化QA系统，该系统可以通过神经因子单跨QA模型和符号计算器将多跳推理问题分解为可回答的子问题来回答多跳推理问题。我们的实验表明，对于DROP和HotpotQA数据集，ModularQA比现有的可解释系统更通用，比最先进的黑箱(不可解释)系统更健壮，并且与之前的工作相比，产生了更可理解和值得信赖的解释。

{"title":"Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models","authors":"Tushar Khot, Daniel Khashabi, Kyle Richardson, Peter Clark, Ashish Sabharwal","doi":"10.18653/V1/2021.NAACL-MAIN.99","DOIUrl":"https://doi.org/10.18653/V1/2021.NAACL-MAIN.99","url":null,"abstract":"We propose a general framework called Text Modular Networks(TMNs) for building interpretable systems that learn to solve complex tasks by decomposing them into simpler ones solvable by existing models. To ensure solvability of simpler tasks, TMNs learn the textual input-output behavior (i.e., language) of existing models through their datasets. This differs from prior decomposition-based approaches which, besides being designed specifically for each complex task, produce decompositions independent of existing sub-models. Specifically, we focus on Question Answering (QA) and show how to train a next-question generator to sequentially produce sub-questions targeting appropriate sub-models, without additional human annotation. These sub-questions and answers provide a faithful natural language explanation of the model’s reasoning. We use this framework to build ModularQA, a system that can answer multi-hop reasoning questions by decomposing them into sub-questions answerable by a neural factoid single-span QA model and a symbolic calculator. Our experiments show that ModularQA is more versatile than existing explainable systems for DROP and HotpotQA datasets, is more robust than state-of-the-art blackbox (uninterpretable) systems, and generates more understandable and trustworthy explanations compared to prior work.","PeriodicalId":251110,"journal":{"name":"Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116499146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 70

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀