Abhinav Jha, H. Patil, S. Jindal, Sardar M N Islam
{"title":"使用mT5变压器的多语种印度语神经机器翻译系统","authors":"Abhinav Jha, H. Patil, S. Jindal, Sardar M N Islam","doi":"10.1109/PCEMS58491.2023.10136051","DOIUrl":null,"url":null,"abstract":"This paper demonstrates the development and evaluation of a multilingual neural machine translation system for Indian languages based on the mT5 transformer, successfully utilized to develop multiple state-of-the-art NLP models. We used the modified Asian Language Treebank multilingual dataset to train the system for developing a Machine Translation model capable of translating text in English, Hindi and Bengali amongst each other. Our system was able to achieve acceptable BLEU scores of over 20 in five of the six language pairs, with the English to Bengali system achieving a maximum BLEU score of 49.87 and the Bengali to English system achieving an average BLEU score of 42.43.","PeriodicalId":330870,"journal":{"name":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"169 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multilingual Indian Language Neural Machine Translation System Using mT5 Transformer\",\"authors\":\"Abhinav Jha, H. Patil, S. Jindal, Sardar M N Islam\",\"doi\":\"10.1109/PCEMS58491.2023.10136051\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper demonstrates the development and evaluation of a multilingual neural machine translation system for Indian languages based on the mT5 transformer, successfully utilized to develop multiple state-of-the-art NLP models. We used the modified Asian Language Treebank multilingual dataset to train the system for developing a Machine Translation model capable of translating text in English, Hindi and Bengali amongst each other. Our system was able to achieve acceptable BLEU scores of over 20 in five of the six language pairs, with the English to Bengali system achieving a maximum BLEU score of 49.87 and the Bengali to English system achieving an average BLEU score of 42.43.\",\"PeriodicalId\":330870,\"journal\":{\"name\":\"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)\",\"volume\":\"169 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PCEMS58491.2023.10136051\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PCEMS58491.2023.10136051","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multilingual Indian Language Neural Machine Translation System Using mT5 Transformer
This paper demonstrates the development and evaluation of a multilingual neural machine translation system for Indian languages based on the mT5 transformer, successfully utilized to develop multiple state-of-the-art NLP models. We used the modified Asian Language Treebank multilingual dataset to train the system for developing a Machine Translation model capable of translating text in English, Hindi and Bengali amongst each other. Our system was able to achieve acceptable BLEU scores of over 20 in five of the six language pairs, with the English to Bengali system achieving a maximum BLEU score of 49.87 and the Bengali to English system achieving an average BLEU score of 42.43.