Enhancing Machine Translation by Integrating Linguistic Knowledge in the Word Alignment Module

Safae Berrichi, A. Mazroui
{"title":"Enhancing Machine Translation by Integrating Linguistic Knowledge in the Word Alignment Module","authors":"Safae Berrichi, A. Mazroui","doi":"10.1109/ISCV49265.2020.9204328","DOIUrl":null,"url":null,"abstract":"The word alignment process, which is a critical step in statistical translation systems (SMT), has been suggested by several researchers as a promising track for enhancing neural translation system (NMT) performance in low-resource environments. Furthermore, given the negative impact on English/Arabic machine translation quality arising from the morphological richness and complexity of the Arabic language compared to the English language, we assessed in this study the relevance of the integration of morphosyntactic characteristics during the alignment phase. Indeed, we have enriched parallel corpora by morphosyntactic features such as stems, lemmas, roots, and POS tags; yet we have developed new SMT systems embedding one of these features in the word alignment phase. The test results proved the interest to use these features and highlighted the most relevant morphosyntactic information to the translation system.","PeriodicalId":313743,"journal":{"name":"2020 International Conference on Intelligent Systems and Computer Vision (ISCV)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Intelligent Systems and Computer Vision (ISCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCV49265.2020.9204328","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The word alignment process, which is a critical step in statistical translation systems (SMT), has been suggested by several researchers as a promising track for enhancing neural translation system (NMT) performance in low-resource environments. Furthermore, given the negative impact on English/Arabic machine translation quality arising from the morphological richness and complexity of the Arabic language compared to the English language, we assessed in this study the relevance of the integration of morphosyntactic characteristics during the alignment phase. Indeed, we have enriched parallel corpora by morphosyntactic features such as stems, lemmas, roots, and POS tags; yet we have developed new SMT systems embedding one of these features in the word alignment phase. The test results proved the interest to use these features and highlighted the most relevant morphosyntactic information to the translation system.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在词对齐模块中集成语言知识增强机器翻译
词对齐过程是统计翻译系统(SMT)的关键步骤,已被一些研究人员认为是在低资源环境下提高神经翻译系统(NMT)性能的一个有前途的途径。此外,考虑到阿拉伯文相对于英文的形态丰富性和复杂性对英语/阿拉伯文机器翻译质量的负面影响,我们在本研究中评估了在对齐阶段形态句法特征整合的相关性。事实上,我们通过词干、外稃、词根和词性标记等形态句法特征丰富了平行语料库;然而,我们已经开发了新的SMT系统,在单词对齐阶段嵌入这些特征之一。测试结果证明了使用这些特征的兴趣,并突出了与翻译系统最相关的形态句法信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Survey on how computer vision can response to urgent need to contribute in COVID-19 pandemics Toward Classification of Arabic Manuscripts Words Based on the Deep Convolutional Neural Networks Sharing Emotions in the Distance Education Experience: Attitudes and Motivation of University Students k-eNSC: k-estimation for Normalized Spectral Clustering Effective CU size decision algorithm based on depth map homogeneity for 3D-HEVC inter-coding
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1