鲁棒语言模型的提示扰动一致性学习

Findings Pub Date : 2024-02-24 DOI:10.48550/arXiv.2402.15833

Yao Qiang, Subhrangshu Nandi, Ninareh Mehrabi, G. V. Steeg, Anoop Kumar, Anna Rumshisky, A. Galstyan

{"title":"鲁棒语言模型的提示扰动一致性学习","authors":"Yao Qiang, Subhrangshu Nandi, Ninareh Mehrabi, G. V. Steeg, Anoop Kumar, Anna Rumshisky, A. Galstyan","doi":"10.48550/arXiv.2402.15833","DOIUrl":null,"url":null,"abstract":"Large language models (LLMs) have demonstrated impressive performance on a number of natural language processing tasks, such as question answering and text summarization. However, their performance on sequence labeling tasks such as intent classification and slot filling (IC-SF), which is a central component in personal assistant systems, lags significantly behind discriminative models. Furthermore, there is a lack of substantive research on robustness of LLMs to various perturbations in the input prompts. The contributions of this paper are three-fold. First, we show that fine-tuning sufficiently large LLMs can produce IC-SF performance comparable to discriminative models. Next, we systematically analyze the performance deterioration of those fine-tuned models due to three distinct yet relevant types of input perturbations - oronyms, synonyms, and paraphrasing. Finally, we propose an efficient mitigation approach, Prompt Perturbation Consistency Learning (PPCL), which works by regularizing the divergence between losses from clean and perturbed samples. Our experiments show that PPCL can recover on an average 59% and 69% of the performance drop for IC and SF tasks, respectively. Furthermore, PPCL beats data augmentation approach while using ten times fewer augmented data samples.","PeriodicalId":508951,"journal":{"name":"Findings","volume":"54 4","pages":"1357-1370"},"PeriodicalIF":0.0000,"publicationDate":"2024-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prompt Perturbation Consistency Learning for Robust Language Models\",\"authors\":\"Yao Qiang, Subhrangshu Nandi, Ninareh Mehrabi, G. V. Steeg, Anoop Kumar, Anna Rumshisky, A. Galstyan\",\"doi\":\"10.48550/arXiv.2402.15833\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Large language models (LLMs) have demonstrated impressive performance on a number of natural language processing tasks, such as question answering and text summarization. However, their performance on sequence labeling tasks such as intent classification and slot filling (IC-SF), which is a central component in personal assistant systems, lags significantly behind discriminative models. Furthermore, there is a lack of substantive research on robustness of LLMs to various perturbations in the input prompts. The contributions of this paper are three-fold. First, we show that fine-tuning sufficiently large LLMs can produce IC-SF performance comparable to discriminative models. Next, we systematically analyze the performance deterioration of those fine-tuned models due to three distinct yet relevant types of input perturbations - oronyms, synonyms, and paraphrasing. Finally, we propose an efficient mitigation approach, Prompt Perturbation Consistency Learning (PPCL), which works by regularizing the divergence between losses from clean and perturbed samples. Our experiments show that PPCL can recover on an average 59% and 69% of the performance drop for IC and SF tasks, respectively. Furthermore, PPCL beats data augmentation approach while using ten times fewer augmented data samples.\",\"PeriodicalId\":508951,\"journal\":{\"name\":\"Findings\",\"volume\":\"54 4\",\"pages\":\"1357-1370\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Findings\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2402.15833\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Findings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2402.15833","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

大型语言模型（LLM）在许多自然语言处理任务（如问题解答和文本摘要）中都表现出了令人印象深刻的性能。然而，它们在序列标注任务（如个人助理系统的核心组件--意图分类和槽填充（IC-SF））上的表现却明显落后于判别模型。此外，关于 LLM 对输入提示中各种扰动的鲁棒性还缺乏实质性的研究。本文的贡献有三方面。首先，我们证明了微调足够大的 LLM 可以产生与判别模型相当的 IC-SF 性能。接下来，我们系统地分析了这些微调模型的性能因三种不同但相关的输入扰动类型--同义词、近义词和意译--而下降的情况。最后，我们提出了一种高效的缓解方法--即时扰动一致性学习（PPCL），该方法通过规范化来自干净样本和扰动样本的损失之间的差异来发挥作用。我们的实验表明，对于 IC 和 SF 任务，PPCL 平均可分别恢复 59% 和 69% 的性能下降。此外，PPCL 比数据增强方法少用十倍的增强数据样本。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Prompt Perturbation Consistency Learning for Robust Language Models

Large language models (LLMs) have demonstrated impressive performance on a number of natural language processing tasks, such as question answering and text summarization. However, their performance on sequence labeling tasks such as intent classification and slot filling (IC-SF), which is a central component in personal assistant systems, lags significantly behind discriminative models. Furthermore, there is a lack of substantive research on robustness of LLMs to various perturbations in the input prompts. The contributions of this paper are three-fold. First, we show that fine-tuning sufficiently large LLMs can produce IC-SF performance comparable to discriminative models. Next, we systematically analyze the performance deterioration of those fine-tuned models due to three distinct yet relevant types of input perturbations - oronyms, synonyms, and paraphrasing. Finally, we propose an efficient mitigation approach, Prompt Perturbation Consistency Learning (PPCL), which works by regularizing the divergence between losses from clean and perturbed samples. Our experiments show that PPCL can recover on an average 59% and 69% of the performance drop for IC and SF tasks, respectively. Furthermore, PPCL beats data augmentation approach while using ten times fewer augmented data samples.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Findings

自引率

0.00%

发文量