A method for automatic detection and manual localization of content-based translation errors and shifts

Éric André Poirier
{"title":"A method for automatic detection and manual localization of content-based translation errors and shifts","authors":"Éric André Poirier","doi":"10.1016/j.jides.2015.02.004","DOIUrl":null,"url":null,"abstract":"<div><p>In this paper, we describe a method for automatic detection and manual localization of four compositional translation errors and shifts between automatically aligned segments in source and target languages. The automatic detection of errors and shifts is based on a content word precision algorithm which measures the equality of information content between source and target segments. The manual localization of errors and shifts within the segments is based on the compositionality principle. The method allows for the detection and localization of two potential errors; omission and addition, as well as two translation shifts required to avoid a translation error such as over-translation (when a correct translation results in more content words than in the source segment) and under-translation (when a correct translation results in less content words than in the source segment). Because of manual localization within bilingual pairs of segments, the method is not intended for automatic error detection but for human-assisted revision of translations. The analysis, described with the method and the algorithm is applied to real translation examples culled from a state-of-the-art translation corpus sampled for various translation errors. The algorithm and the localization method have implications for the development of more content-oriented natural language processing as well as for the training of professional translators; they can also be useful for formal and systematic description of content-based translation errors.</p></div>","PeriodicalId":100792,"journal":{"name":"Journal of Innovation in Digital Ecosystems","volume":"1 1","pages":"Pages 38-46"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.jides.2015.02.004","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Innovation in Digital Ecosystems","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S235266451500005X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

In this paper, we describe a method for automatic detection and manual localization of four compositional translation errors and shifts between automatically aligned segments in source and target languages. The automatic detection of errors and shifts is based on a content word precision algorithm which measures the equality of information content between source and target segments. The manual localization of errors and shifts within the segments is based on the compositionality principle. The method allows for the detection and localization of two potential errors; omission and addition, as well as two translation shifts required to avoid a translation error such as over-translation (when a correct translation results in more content words than in the source segment) and under-translation (when a correct translation results in less content words than in the source segment). Because of manual localization within bilingual pairs of segments, the method is not intended for automatic error detection but for human-assisted revision of translations. The analysis, described with the method and the algorithm is applied to real translation examples culled from a state-of-the-art translation corpus sampled for various translation errors. The algorithm and the localization method have implications for the development of more content-oriented natural language processing as well as for the training of professional translators; they can also be useful for formal and systematic description of content-based translation errors.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种基于内容的翻译错误和移位的自动检测和手动定位方法
在本文中,我们描述了一种自动检测和手动定位源语言和目标语言中四种组合翻译错误和自动对齐片段之间偏移的方法。错误和移位的自动检测基于内容词精度算法,该算法测量源段和目标段之间信息内容的相等性。人工定位的误差和位移分段是基于组合性原则。该方法允许检测和定位两个潜在的错误;省略和添加,以及避免翻译错误所需的两次翻译转换,例如过度翻译(正确的翻译导致的内容词比源段多)和欠翻译(正确的翻译导致的内容词比源段少)。由于在双语片段对中需要手动定位,因此该方法不是用于自动错误检测,而是用于人工辅助的翻译修订。用该方法和算法描述的分析应用于从最先进的翻译语料库中挑选的真实翻译示例,以采样各种翻译错误。该算法和本地化方法对面向内容的自然语言处理的发展以及专业翻译人员的培训具有重要意义;它们还可以用于正式和系统地描述基于内容的翻译错误。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Preface Meaning-based machine learning for information assurance Wavelet decomposition of software entropy reveals symptoms of malicious code Evaluating the descriptive power of Instagram hashtags Occupancy driven building performance assessment
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1