Emmanuel Agyei, Xiaoling Zhang, S. B. Yussif, B. L. Y. Agbley
{"title":"阿坎英语:低资源翻译的转换器","authors":"Emmanuel Agyei, Xiaoling Zhang, S. B. Yussif, B. L. Y. Agbley","doi":"10.1109/ICCWAMTIP53232.2021.9674076","DOIUrl":null,"url":null,"abstract":"Recent years, Neural Machine Translation outperforms the existing traditional-based models for language translation with the use of sequential architecture. However, low resource languages, such as Akan-Twi, lack structured language resources largely due to the limited amount of research geared towards their revival and resuscitation. In an effort to mitigate the issue of low resource language translation from Akan-Twi to English in machine learning, we utilize the low resource parallel corpus for the translation setting of Akan-English. Using our proposed training process and Transformer-based model, we generate augmented corpus which is used to improve the performance of our final translation model. We achieved best BLEU score of 12.96 and 17.57 for the Akan-English and English-Akan, respectively.","PeriodicalId":358772,"journal":{"name":"2021 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Akan-English: Transformer for Low Resource Translation\",\"authors\":\"Emmanuel Agyei, Xiaoling Zhang, S. B. Yussif, B. L. Y. Agbley\",\"doi\":\"10.1109/ICCWAMTIP53232.2021.9674076\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent years, Neural Machine Translation outperforms the existing traditional-based models for language translation with the use of sequential architecture. However, low resource languages, such as Akan-Twi, lack structured language resources largely due to the limited amount of research geared towards their revival and resuscitation. In an effort to mitigate the issue of low resource language translation from Akan-Twi to English in machine learning, we utilize the low resource parallel corpus for the translation setting of Akan-English. Using our proposed training process and Transformer-based model, we generate augmented corpus which is used to improve the performance of our final translation model. We achieved best BLEU score of 12.96 and 17.57 for the Akan-English and English-Akan, respectively.\",\"PeriodicalId\":358772,\"journal\":{\"name\":\"2021 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCWAMTIP53232.2021.9674076\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCWAMTIP53232.2021.9674076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Akan-English: Transformer for Low Resource Translation
Recent years, Neural Machine Translation outperforms the existing traditional-based models for language translation with the use of sequential architecture. However, low resource languages, such as Akan-Twi, lack structured language resources largely due to the limited amount of research geared towards their revival and resuscitation. In an effort to mitigate the issue of low resource language translation from Akan-Twi to English in machine learning, we utilize the low resource parallel corpus for the translation setting of Akan-English. Using our proposed training process and Transformer-based model, we generate augmented corpus which is used to improve the performance of our final translation model. We achieved best BLEU score of 12.96 and 17.57 for the Akan-English and English-Akan, respectively.