英语-加泰罗尼亚语神经机器翻译:最先进的技术、质量和生产力

Vicent Briva-Iglesias
{"title":"英语-加泰罗尼亚语神经机器翻译:最先进的技术、质量和生产力","authors":"Vicent Briva-Iglesias","doi":"10.5565/rev/tradumatica.303","DOIUrl":null,"url":null,"abstract":"Recent major changes and technological advances have consolidated machine translation (MT) as a key player to be considered in the language services world. In numerous instances, it is even an essential player due to budget and time constraints. Much attention has been paid to MT research recently, and MT use by professional or amateur users has increased. Yet, research has focused mainly on language combinations with huge amounts of online available corpora (e.g. English-Spanish). The situation for minoritized or stateless languages like Catalan is different. This study analyses Softcatalà’s new open-source, neural machine translation engine and compares it with Google Translate and Apertium in the English-Catalan language pair. Although MT engine developers use automatic metrics for MT engine evaluation, human evaluation remains the gold standard, despite its cost. Using TAUS DQF tools, translation quality (in terms of relative ranking, adequacy and fluency) and productivity (comparing editing times and distances) have been evaluated with the participation of 11 evaluators. Results show that Softcatalà's Translator offers higher quality and productivity than the other engines analysed.","PeriodicalId":42402,"journal":{"name":"Tradumatica-Traduccio i Tecnologies de la Informacio i la Comunicacio","volume":"7 1","pages":""},"PeriodicalIF":0.8000,"publicationDate":"2022-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"English-Catalan Neural Machine Translation: state-of-the-art technology, quality, and productivity\",\"authors\":\"Vicent Briva-Iglesias\",\"doi\":\"10.5565/rev/tradumatica.303\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent major changes and technological advances have consolidated machine translation (MT) as a key player to be considered in the language services world. In numerous instances, it is even an essential player due to budget and time constraints. Much attention has been paid to MT research recently, and MT use by professional or amateur users has increased. Yet, research has focused mainly on language combinations with huge amounts of online available corpora (e.g. English-Spanish). The situation for minoritized or stateless languages like Catalan is different. This study analyses Softcatalà’s new open-source, neural machine translation engine and compares it with Google Translate and Apertium in the English-Catalan language pair. Although MT engine developers use automatic metrics for MT engine evaluation, human evaluation remains the gold standard, despite its cost. Using TAUS DQF tools, translation quality (in terms of relative ranking, adequacy and fluency) and productivity (comparing editing times and distances) have been evaluated with the participation of 11 evaluators. Results show that Softcatalà's Translator offers higher quality and productivity than the other engines analysed.\",\"PeriodicalId\":42402,\"journal\":{\"name\":\"Tradumatica-Traduccio i Tecnologies de la Informacio i la Comunicacio\",\"volume\":\"7 1\",\"pages\":\"\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2022-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tradumatica-Traduccio i Tecnologies de la Informacio i la Comunicacio\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5565/rev/tradumatica.303\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"LINGUISTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tradumatica-Traduccio i Tecnologies de la Informacio i la Comunicacio","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5565/rev/tradumatica.303","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"LINGUISTICS","Score":null,"Total":0}
引用次数: 1

摘要

最近的重大变化和技术进步已经巩固了机器翻译(MT)在语言服务领域的重要地位。在许多情况下,由于预算和时间限制,它甚至是一个必不可少的参与者。近年来,人们对机器翻译的研究越来越重视,专业和业余用户对机器翻译的使用也越来越多。然而,研究主要集中在具有大量在线可用语料库的语言组合上(例如英语-西班牙语)。像加泰罗尼亚语这样的少数民族或无国籍语言的情况就不同了。本研究分析了softcatalo的新开源神经机器翻译引擎,并将其与谷歌翻译和Apertium的英语-加泰罗尼亚语对进行了比较。尽管机器翻译引擎开发人员使用自动度量来评估机器翻译引擎,但人工评估仍然是黄金标准,尽管成本很高。使用TAUS DQF工具,在11名评估人员的参与下,评估了翻译质量(相对排名、充足性和流畅性)和生产力(比较编辑时间和距离)。结果表明,softcatalo的翻译器提供了比其他分析引擎更高的质量和生产力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
English-Catalan Neural Machine Translation: state-of-the-art technology, quality, and productivity
Recent major changes and technological advances have consolidated machine translation (MT) as a key player to be considered in the language services world. In numerous instances, it is even an essential player due to budget and time constraints. Much attention has been paid to MT research recently, and MT use by professional or amateur users has increased. Yet, research has focused mainly on language combinations with huge amounts of online available corpora (e.g. English-Spanish). The situation for minoritized or stateless languages like Catalan is different. This study analyses Softcatalà’s new open-source, neural machine translation engine and compares it with Google Translate and Apertium in the English-Catalan language pair. Although MT engine developers use automatic metrics for MT engine evaluation, human evaluation remains the gold standard, despite its cost. Using TAUS DQF tools, translation quality (in terms of relative ranking, adequacy and fluency) and productivity (comparing editing times and distances) have been evaluated with the participation of 11 evaluators. Results show that Softcatalà's Translator offers higher quality and productivity than the other engines analysed.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
1.50
自引率
50.00%
发文量
0
审稿时长
16 weeks
期刊最新文献
La documentació aplicada a la traducció especialitzada i a la traducció literària Cercadors de recursos web especialitzats en Traducció Competencia informacional para la actividad traductora El lenguaje en la comunicación y recuperación de información El documento como dato, conocimiento e información
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1