Prompt Combines Paraphrase: Teaching Pre-trained Models to Understand Rare Biomedical Words

Proceedings of COLING. International Conference on Computational Linguistics Pub Date : 2022-09-14 DOI:10.48550/arXiv.2209.06453

Hao Wang, Chi-Liang Liu, Nuwa Xi, Sendong Zhao, Meizhi Ju, Shiwei Zhang, Ziheng Zhang, Yefeng Zheng, Bing Qin, Ting Liu

引用次数: 2

Abstract

Prompt-based fine-tuning for pre-trained models has proven effective for many natural language processing tasks under few-shot settings in general domain. However, tuning with prompt in biomedical domain has not been investigated thoroughly. Biomedical words are often rare in general domain, but quite ubiquitous in biomedical contexts, which dramatically deteriorates the performance of pre-trained models on downstream biomedical applications even after fine-tuning, especially in low-resource scenarios. We propose a simple yet effective approach to helping models learn rare biomedical words during tuning with prompt. Experimental results show that our method can achieve up to 6% improvement in biomedical natural language inference task without any extra parameters or training steps using few-shot vanilla prompt settings.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

提示结合释义:教授预先训练的模型来理解罕见的生物医学词汇

预训练模型的基于提示的微调已被证明对一般领域中许多自然语言处理任务在少数镜头设置下是有效的。然而，在生物医学领域，对提示调谐的研究还不够深入。生物医学词汇通常在一般领域很少出现，但在生物医学语境中却非常普遍，这大大降低了预训练模型在下游生物医学应用中的性能，即使经过微调，特别是在低资源场景下。我们提出了一种简单而有效的方法来帮助模型在调整过程中学习罕见的生物医学词汇。实验结果表明，该方法在不需要任何额外参数或训练步骤的情况下，使用少量的提示设置，可以将生物医学自然语言推理任务提高6%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of COLING. International Conference on Computational Linguistics

自引率

0.00%

发文量