An Analysis of Josa and Eomi in Translating Korean TV Dramas Into English With Artificial Intelligence
Sungran Koh
DOI: 10.16875/stem.2022.23.2.14
Published: 2022-05-31
Citation count: 2
Abstract
The goal of this study is to find out why the English subtitles of Korean TV dramas contain frequent errors. The findings are expected to shed light on new ways for machine translation technology to handle agglutinative languages. As a first step, the Korean and English subtitles were tagged by part of speech (POS) to find out which POS categories show the most frequent errors in each language. POS tagging produced thirty-one groups for analysis. For Korean, the Kokoma Korean morpheme analyzer was then run to tag the script by category (noun, verb, adjective, etc.), producing forty-five groups. These included nine subsets of josa (postpositions) and fourteen of eomi (endings), which are the most difficult parts of Korean to translate into English because of differences in linguistic structure. As a next step, Korean-American bilinguals scored the subtitles and graded them as most correct or least correct. The results show that among the nine josa groups the most frequent error is JX (auxiliary particle), whereas among eomi the most frequent error is EPT (tense prefinal ending).
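
The tallying step described above, counting which josa and eomi tags most often coincide with a translation error, might be sketched as follows. This is a minimal illustration, not the author's procedure. The tag inventories are an assumption modeled on the Kkma tagset (the abstract names only JX and EPT), the `count_errors` helper is hypothetical, and the sample tokens and error flags are invented for demonstration.

```python
from collections import Counter

# Assumed tag inventories (nine josa, fourteen eomi), modeled on the Kkma
# tagset. The abstract itself names only JX and EPT.
JOSA_TAGS = {"JKS", "JKC", "JKG", "JKO", "JKM", "JKI", "JKQ", "JC", "JX"}
EOMI_TAGS = {
    "EPH", "EPT", "EPP",
    "EFN", "EFQ", "EFO", "EFA", "EFI", "EFR",
    "ECE", "ECS", "ECD",
    "ETN", "ETD",
}


def count_errors(tokens):
    """Tally error counts per tag, separately for josa and eomi.

    tokens: iterable of (morpheme, tag, is_error) triples, where is_error
    marks a morpheme that a rater judged to be mistranslated.
    """
    josa_errors, eomi_errors = Counter(), Counter()
    for _morpheme, tag, is_error in tokens:
        if not is_error:
            continue
        if tag in JOSA_TAGS:
            josa_errors[tag] += 1
        elif tag in EOMI_TAGS:
            eomi_errors[tag] += 1
    return josa_errors, eomi_errors


# Invented sample data: not taken from the study.
sample = [
    ("는", "JX", True),
    ("를", "JKO", False),
    ("었", "EPT", True),
    ("다", "EFN", False),
    ("도", "JX", True),
    ("가", "JKS", True),
    ("겠", "EPT", True),
    ("고", "ECE", True),
]

josa_errors, eomi_errors = count_errors(sample)
print(josa_errors.most_common(1))  # [('JX', 2)]
print(eomi_errors.most_common(1))  # [('EPT', 2)]
```

Keeping josa and eomi in separate counters mirrors the abstract's framing, which reports the most frequent error within each family rather than across all tags.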