Learning Label Modular Prompts for Text Classification in the Wild

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing Pub Date : 2022-11-30 DOI:10.48550/arXiv.2211.17142

Hailin Chen, Amrita Saha, Shafiq R. Joty, Steven C. H. Hoi

{"title":"Learning Label Modular Prompts for Text Classification in the Wild","authors":"Hailin Chen, Amrita Saha, Shafiq R. Joty, Steven C. H. Hoi","doi":"10.48550/arXiv.2211.17142","DOIUrl":null,"url":null,"abstract":"Machine learning models usually assume i.i.d data during training and testing, but data and tasks in real world often change over time. To emulate the transient nature of real world, we propose a challenging but practical task: text classification in-the-wild, which introduces different non-stationary training/testing stages. Decomposing a complex task into modular components can enable robust generalisation under such non-stationary environment. However, current modular approaches in NLP do not take advantage of recent advances in parameter efficient tuning of pretrained language models. To close this gap, we propose ModularPrompt, a label-modular prompt tuning framework for text classification tasks. In ModularPrompt, the input prompt consists of a sequence of soft label prompts, each encoding modular knowledge related to the corresponding class label. In two of most formidable settings, ModularPrompt outperforms relevant baselines by a large margin demonstrating strong generalisation ability. We also conduct comprehensive analysis to validate whether the learned prompts satisfy properties of a modular representation.","PeriodicalId":74540,"journal":{"name":"Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing","volume":"19 1","pages":"1677-1690"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2211.17142","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Machine learning models usually assume i.i.d data during training and testing, but data and tasks in real world often change over time. To emulate the transient nature of real world, we propose a challenging but practical task: text classification in-the-wild, which introduces different non-stationary training/testing stages. Decomposing a complex task into modular components can enable robust generalisation under such non-stationary environment. However, current modular approaches in NLP do not take advantage of recent advances in parameter efficient tuning of pretrained language models. To close this gap, we propose ModularPrompt, a label-modular prompt tuning framework for text classification tasks. In ModularPrompt, the input prompt consists of a sequence of soft label prompts, each encoding modular knowledge related to the corresponding class label. In two of most formidable settings, ModularPrompt outperforms relevant baselines by a large margin demonstrating strong generalisation ability. We also conduct comprehensive analysis to validate whether the learned prompts satisfy properties of a modular representation.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

学习标签模块化提示文本分类在野外

机器学习模型通常在训练和测试期间假设人工智能数据，但现实世界中的数据和任务通常会随着时间的推移而变化。为了模拟现实世界的瞬态性质，我们提出了一个具有挑战性但实用的任务:文本分类在野外，它引入了不同的非平稳训练/测试阶段。将复杂任务分解为模块组件可以实现这种非平稳环境下的鲁棒泛化。然而，目前NLP中的模块化方法并没有利用预训练语言模型的参数有效调优的最新进展。为了缩小这一差距，我们提出了ModularPrompt，这是一个用于文本分类任务的标签模块化提示调优框架。在ModularPrompt中，输入提示由一系列软标签提示组成，每个软标签提示编码与相应类标签相关的模块知识。在两个最令人生畏的设置中，ModularPrompt的表现远远超过相关基线，显示出强大的泛化能力。我们还进行了全面的分析，以验证学习到的提示是否满足模块化表示的属性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing

自引率

0.00%

发文量