UoM&MMU at TSAR-2022 Shared Task: Prompt Learning for Lexical Simplification

Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022) Pub Date : 1900-01-01 DOI:10.18653/v1/2022.tsar-1.23

Laura Vásquez-Rodríguez, Nhung T. H. Nguyen, M. Shardlow, S. Ananiadou

引用次数: 7

Abstract

We present PromptLS, a method for fine-tuning large pre-trained Language Models (LM) to perform the task of Lexical Simplification. We use a predefined template to attain appropriate replacements for a term, and fine-tune a LM using this template on language specific datasets. We filter candidate lists in post-processing to improve accuracy. We demonstrate that our model can work in a) a zero shot setting (where we only require a pre-trained LM), b) a fine-tuned setting (where language-specific data is required), and c) a multilingual setting (where the model is pre-trained across multiple languages and fine-tuned in an specific language). Experimental results show that, although the zero-shot setting is competitive, its performance is still far from the fine-tuned setting. Also, the multilingual is unsurprisingly worse than the fine-tuned model. Among all TSAR-2022 Shared Task participants, our team was ranked second in Spanish and third in English.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

共享任务:词汇简化的提示学习

我们提出了一种用于微调大型预训练语言模型(LM)以执行词汇简化任务的方法PromptLS。我们使用预定义的模板来获得术语的适当替换，并在特定于语言的数据集上使用该模板对LM进行微调。我们在后处理中过滤候选列表以提高准确性。我们证明了我们的模型可以在a)零射击设置(我们只需要预训练的LM)， b)微调设置(需要特定语言的数据)以及c)多语言设置(其中模型跨多种语言进行预训练并在特定语言中进行微调)中工作。实验结果表明，虽然零弹设置具有竞争力，但其性能与微调设置相比仍有很大差距。此外，多语言模式比微调模式更糟糕也就不足为奇了。在所有TSAR-2022共享任务参与者中，我们的团队西班牙语排名第二，英语排名第三。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022)

自引率

0.00%

发文量