Frustratingly Easy Multilingual Grapheme-to-Phoneme Conversion

Special Interest Group on Computational Morphology and Phonology Workshop Pub Date : 2020-07-01 DOI:10.18653/v1/2020.sigmorphon-1.13

Nikhil Prabhu, Katharina Kann

引用次数: 4

Abstract

In this paper, we describe two CU-Boulder submissions to the SIGMORPHON 2020 Task 1 on multilingual grapheme-to-phoneme conversion (G2P). Inspired by the high performance of a standard transformer model (Vaswani et al., 2017) on the task, we improve over this approach by adding two modifications: (i) Instead of training exclusively on G2P, we additionally create examples for the opposite direction, phoneme-to-grapheme conversion (P2G). We then perform multi-task training on both tasks. (ii) We produce ensembles of our models via majority voting. Our approaches, though being conceptually simple, result in systems that place 6th and 8th amongst 23 submitted systems, and obtain the best results out of all systems on Lithuanian and Modern Greek, respectively.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

令人沮丧的简单多语言字母到音素转换

在本文中，我们描述了两个CU-Boulder提交给SIGMORPHON 2020任务1的多语言字素到音素转换(G2P)。受到标准转换器模型(Vaswani et al.， 2017)在该任务上的高性能的启发，我们通过添加两个修改来改进这种方法:(i)不是仅在G2P上进行训练，我们还为相反的方向创建了音素到字素转换(P2G)的示例。然后我们对这两个任务进行多任务训练。(ii)我们通过多数投票产生模型的集合。我们的方法虽然在概念上很简单，但结果系统在23个提交的系统中排名第6和第8，并分别在立陶宛语和现代希腊语的所有系统中获得最佳结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Special Interest Group on Computational Morphology and Phonology Workshop

自引率

0.00%

发文量