LISN @ SIGMORPHON 2023 Shared Task on Interlinear Glossing

Special Interest Group on Computational Morphology and Phonology Workshop Pub Date : 1900-01-01 DOI:10.18653/v1/2023.sigmorphon-1.21

Shu Okabe, François Yvon

引用次数: 1

Abstract

This paper describes LISN”’“s submission to the second track (open track) of the shared task on Interlinear Glossing for SIGMORPHON 2023. Our systems are based on Lost, a variation of linear Conditional Random Fields initially developed as a probabilistic translation model and then adapted to the glossing task. This model allows us to handle one of the main challenges posed by glossing, i.e. the fact that the list of potential labels for lexical morphemes is not fixed in advance and needs to be extended dynamically when labelling units are not seen in training. In such situations, we show how to make use of candidate lexical glosses found in the translation and discuss how such extension affects the training and inference procedures. The resulting automatic glossing systems prove to yield very competitive results, especially in low-resource settings.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

线性间光泽的共享任务

本文描述了LISN在SIGMORPHON 2023中提交的关于行间光泽的共享任务的第二轨道(开放轨道)。我们的系统基于Lost，它是线性条件随机场的一种变体，最初是作为概率翻译模型开发的，然后适应于上光任务。该模型允许我们处理由注释带来的主要挑战之一，即词汇语素的潜在标签列表不是预先固定的，并且需要在训练中没有看到标签单元时动态扩展。在这种情况下，我们展示了如何利用翻译中发现的候选词汇注释，并讨论了这种扩展如何影响训练和推理过程。由此产生的自动上光系统证明产生非常有竞争力的结果，特别是在低资源设置。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Special Interest Group on Computational Morphology and Phonology Workshop

自引率

0.00%

发文量