Adapting to Non-Centered Languages for Zero-shot Multilingual Translation

Proceedings of COLING. International Conference on Computational Linguistics Pub Date : 2022-09-09 DOI:10.48550/arXiv.2209.04138

Zhi Qu, Taro Watanabe

引用次数: 2

Abstract

Multilingual neural machine translation can translate unseen language pairs during training, i.e. zero-shot translation. However, the zero-shot translation is always unstable. Although prior works attributed the instability to the domination of central language, e.g. English, we supplement this viewpoint with the strict dependence of non-centered languages. In this work, we propose a simple, lightweight yet effective language-specific modeling method by adapting to non-centered languages and combining the shared information and the language-specific information to counteract the instability of zero-shot translation. Experiments with Transformer on IWSLT17, Europarl, TED talks, and OPUS-100 datasets show that our method not only performs better than strong baselines in centered data conditions but also can easily fit non-centered data conditions. By further investigating the layer attribution, we show that our proposed method can disentangle the coupled representation in the correct direction.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

适应非中心语言的零射击多语种翻译

多语言神经机器翻译可以在训练过程中翻译未见过的语言对，即零射击翻译。然而，零点平移总是不稳定的。虽然先前的研究将这种不稳定归因于中心语言(如英语)的统治，但我们补充了非中心语言的严格依赖。在这项工作中，我们提出了一种简单，轻量级但有效的特定语言建模方法，通过适应非中心语言，将共享信息和特定语言信息相结合来抵消零射击翻译的不稳定性。Transformer在IWSLT17、Europarl、TED talks和OPUS-100数据集上的实验表明，我们的方法不仅在中心数据条件下优于强基线，而且可以很容易地拟合非中心数据条件。通过对层属性的进一步研究，我们证明了我们的方法可以在正确的方向上解开耦合表示。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of COLING. International Conference on Computational Linguistics

自引率

0.00%

发文量