{"title":"集成句法信息的藏汉神经机器翻译研究","authors":"Maoxian Zhou, Secha Jia, Rangjia Cai","doi":"10.1145/3503047.3503120","DOIUrl":null,"url":null,"abstract":"In recent years, Neural Networks have gradually replaced other methods in the field of Machine Translation and become the mainstream which have excellent performance in many languages. However, the performance of Neural Machine Translation mainly relies on large-scale parallel corpora, which is not ideal for low-resource languages, especially Tibetan-Chinese Machine Translation. In order to obtain the best translation performance with more external information on the basis of limited corpus, this paper introduces syntactic information, that is, adding part-of-speech(POS) tags as input features in the training process. Experiments verify the effectiveness of this method, which can improve the translation performance to a certain extent.","PeriodicalId":190604,"journal":{"name":"Proceedings of the 3rd International Conference on Advanced Information Science and System","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Research on Tibetan-Chinese Neural Machine Translation Integrating Syntactic Information\",\"authors\":\"Maoxian Zhou, Secha Jia, Rangjia Cai\",\"doi\":\"10.1145/3503047.3503120\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, Neural Networks have gradually replaced other methods in the field of Machine Translation and become the mainstream which have excellent performance in many languages. However, the performance of Neural Machine Translation mainly relies on large-scale parallel corpora, which is not ideal for low-resource languages, especially Tibetan-Chinese Machine Translation. In order to obtain the best translation performance with more external information on the basis of limited corpus, this paper introduces syntactic information, that is, adding part-of-speech(POS) tags as input features in the training process. Experiments verify the effectiveness of this method, which can improve the translation performance to a certain extent.\",\"PeriodicalId\":190604,\"journal\":{\"name\":\"Proceedings of the 3rd International Conference on Advanced Information Science and System\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 3rd International Conference on Advanced Information Science and System\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3503047.3503120\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd International Conference on Advanced Information Science and System","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3503047.3503120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research on Tibetan-Chinese Neural Machine Translation Integrating Syntactic Information
In recent years, Neural Networks have gradually replaced other methods in the field of Machine Translation and become the mainstream which have excellent performance in many languages. However, the performance of Neural Machine Translation mainly relies on large-scale parallel corpora, which is not ideal for low-resource languages, especially Tibetan-Chinese Machine Translation. In order to obtain the best translation performance with more external information on the basis of limited corpus, this paper introduces syntactic information, that is, adding part-of-speech(POS) tags as input features in the training process. Experiments verify the effectiveness of this method, which can improve the translation performance to a certain extent.