Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM

European Association for Machine Translation Conferences/Workshops Pub Date : 2023-03-03 DOI:10.48550/arXiv.2303.01911

Rachel Bawden, Franccois Yvon

引用次数: 19

Abstract

The NLP community recently saw the release of a new large open-access multilingual language model, BLOOM (BigScience et al., 2022) covering 46 languages. We focus on BLOOM’s multilingual ability by evaluating its machine translation performance across several datasets (WMT, Flores-101 and DiaBLa) and language pairs (high- and low-resourced). Our results show that 0-shot performance suffers from overgeneration and generating in the wrong language, but this is greatly improved in the few-shot setting, with very good results for a number of language pairs. We study several aspects including prompt design, model sizes, cross-lingual transfer and the use of discursive context.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

大型多语言模型的翻译性能研究:以BLOOM为例

NLP社区最近发布了一个新的大型开放获取多语言模型BLOOM (BigScience et al.， 2022)，涵盖46种语言。我们通过评估BLOOM在多个数据集(WMT, Flores-101和DiaBLa)和语言对(高资源和低资源)上的机器翻译性能来关注BLOOM的多语言能力。我们的结果表明，0次射击的性能会受到过度生成和错误语言生成的影响，但在少数射击设置中，这种情况得到了极大的改善，对于许多语言对都有很好的结果。我们研究了几个方面，包括提示设计，模型大小，跨语言迁移和语篇语境的使用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

European Association for Machine Translation Conferences/Workshops

自引率

0.00%

发文量