Adaptive Meta-learner via Gradient Similarity for Few-shot Text Classification

Tianyi Lei, Honghui Hu, Qiaoyang Luo, Dezhong Peng, Xu Wang
{"title":"Adaptive Meta-learner via Gradient Similarity for Few-shot Text Classification","authors":"Tianyi Lei, Honghui Hu, Qiaoyang Luo, Dezhong Peng, Xu Wang","doi":"10.48550/arXiv.2209.04702","DOIUrl":null,"url":null,"abstract":"Few-shot text classification aims to classify the text under the few-shot scenario. Most of the previous methods adopt optimization-based meta learning to obtain task distribution. However, due to the neglect of matching between the few amount of samples and complicated models, as well as the distinction between useful and useless task features, these methods suffer from the overfitting issue. To address this issue, we propose a novel Adaptive Meta-learner via Gradient Similarity (AMGS) method to improve the model generalization ability to a new task. Specifically, the proposed AMGS alleviates the overfitting based on two aspects: (i) acquiring the potential semantic representation of samples and improving model generalization through the self-supervised auxiliary task in the inner loop, (ii) leveraging the adaptive meta-learner via gradient similarity to add constraints on the gradient obtained by base-learner in the outer loop. Moreover, we make a systematic analysis of the influence of regularization on the entire framework. Experimental results on several benchmarks demonstrate that the proposed AMGS consistently improves few-shot text classification performance compared with the state-of-the-art optimization-based meta-learning approaches. The code is available at: https://github.com/Tianyi-Lei.","PeriodicalId":91381,"journal":{"name":"Proceedings of COLING. International Conference on Computational Linguistics","volume":"56 1","pages":"4873-4882"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of COLING. International Conference on Computational Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2209.04702","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

Few-shot text classification aims to classify text in few-shot scenarios. Most previous methods adopt optimization-based meta-learning to capture the task distribution. However, because they neglect both the mismatch between the small number of samples and complicated models, and the distinction between useful and useless task features, these methods suffer from overfitting. To address this issue, we propose a novel Adaptive Meta-learner via Gradient Similarity (AMGS) method to improve the model's ability to generalize to a new task. Specifically, the proposed AMGS alleviates overfitting in two ways: (i) acquiring the latent semantic representation of samples and improving model generalization through a self-supervised auxiliary task in the inner loop, and (ii) using the adaptive meta-learner via gradient similarity to constrain the gradients obtained by the base-learner in the outer loop. Moreover, we systematically analyze the influence of regularization on the entire framework. Experimental results on several benchmarks demonstrate that the proposed AMGS consistently improves few-shot text classification performance compared with state-of-the-art optimization-based meta-learning approaches. The code is available at: https://github.com/Tianyi-Lei.
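The abstract only sketches the training procedure at a high level. The following minimal PyTorch sketch illustrates the general idea of a MAML-style inner/outer loop in which per-task outer-loop gradients are re-weighted by cosine similarity before the meta-update. It is not the authors' AMGS implementation (which also adds a self-supervised auxiliary task in the inner loop); all module, function, and hyperparameter names here (BaseLearner, adapted_params, inner_lr, outer_lr) are illustrative assumptions.

```python
# Hypothetical sketch: MAML-style meta-learning where each task's outer-loop (query-set)
# gradient is weighted by its cosine similarity to the mean gradient direction.
# This is NOT the authors' AMGS code; names and the similarity criterion are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class BaseLearner(nn.Module):
    def __init__(self, dim=16, n_classes=5):
        super().__init__()
        self.fc = nn.Linear(dim, n_classes)

    def forward(self, x, weight=None, bias=None):
        # Functional forward so the query set can be evaluated with adapted (fast) weights.
        if weight is None:
            weight, bias = self.fc.weight, self.fc.bias
        return F.linear(x, weight, bias)

def adapted_params(model, x_s, y_s, inner_lr=0.1):
    """One inner-loop step on the support set (classification loss only, for brevity)."""
    loss = F.cross_entropy(model(x_s), y_s)
    grads = torch.autograd.grad(loss, [model.fc.weight, model.fc.bias], create_graph=True)
    fast_w = model.fc.weight - inner_lr * grads[0]
    fast_b = model.fc.bias - inner_lr * grads[1]
    return fast_w, fast_b

def outer_step(model, tasks, outer_lr=1e-3):
    """Meta-update: weight each task's query gradient by its cosine similarity
    to the mean gradient direction, down-weighting conflicting tasks."""
    opt = torch.optim.SGD(model.parameters(), lr=outer_lr)
    per_task_grads = []
    for x_s, y_s, x_q, y_q in tasks:
        fast_w, fast_b = adapted_params(model, x_s, y_s)
        q_loss = F.cross_entropy(model(x_q, fast_w, fast_b), y_q)
        g = torch.autograd.grad(q_loss, [model.fc.weight, model.fc.bias])
        per_task_grads.append(torch.cat([t.reshape(-1) for t in g]))
    stacked = torch.stack(per_task_grads)                  # (n_tasks, n_params)
    mean_g = stacked.mean(dim=0, keepdim=True)
    weights = F.cosine_similarity(stacked, mean_g, dim=1).clamp(min=0.0)
    combined = (weights.unsqueeze(1) * stacked).sum(dim=0) / (weights.sum() + 1e-8)
    # Write the combined gradient back into the parameters and take one SGD step.
    opt.zero_grad()
    offset = 0
    for p in [model.fc.weight, model.fc.bias]:
        n = p.numel()
        p.grad = combined[offset:offset + n].view_as(p).clone()
        offset += n
    opt.step()
```

Down-weighting tasks whose query gradients point away from the consensus direction is one plausible reading of "adding constraints on the gradient obtained by the base-learner"; the paper and the released code at https://github.com/Tianyi-Lei define the actual similarity criterion used by AMGS.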