Translating Between Morphologically Rich Languages: An Arabic-to-Turkish Machine Translation System

WANLP@ACL 2019 Pub Date : 2019-08-01 DOI:10.18653/v1/W19-4617

Ilknur Durgar El-Kahlout, E. Bektas, N. S. Erdem, Hamza Kaya

引用次数: 7

Abstract

This paper introduces the work on building a machine translation system for Arabic-to-Turkish in the news domain. Our work includes collecting parallel datasets in several ways for a new and low-resourced language pair, building baseline systems with state-of-the-art architectures and developing language specific algorithms for better translation. Parallel datasets are mainly collected three different ways; i) translating Arabic texts into Turkish by professional translators, ii) exploiting the web for open-source Arabic-Turkish parallel texts, iii) using back-translation. We per-formed preliminary experiments for Arabic-to-Turkish machine translation with neural(Marian) machine translation tools with a novel morphologically motivated vocabulary reduction method.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

形态丰富的语言之间的翻译:一个阿拉伯语到土耳其语的机器翻译系统

本文介绍了在新闻领域建立阿拉伯语到土耳其语机器翻译系统的工作。我们的工作包括以多种方式为新的低资源语言对收集并行数据集，用最先进的架构构建基线系统，以及为更好的翻译开发特定于语言的算法。并行数据集的收集主要有三种不同的方式;i)由专业翻译人员将阿拉伯语文本翻译成土耳其语，ii)利用网络开发开源阿拉伯-土耳其语平行文本，iii)使用反向翻译。本文采用一种新颖的形态学动机词汇缩减方法，利用神经(Marian)机器翻译工具对阿拉伯语到土耳其语的机器翻译进行了初步实验。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

WANLP@ACL 2019

自引率

0.00%

发文量