{"title":"Research on Tibetan-Chinese Neural Machine Translation Integrating Syntactic Information","authors":"Maoxian Zhou, Secha Jia, Rangjia Cai","doi":"10.1145/3503047.3503120","DOIUrl":null,"url":null,"abstract":"In recent years, Neural Networks have gradually replaced other methods in the field of Machine Translation and become the mainstream which have excellent performance in many languages. However, the performance of Neural Machine Translation mainly relies on large-scale parallel corpora, which is not ideal for low-resource languages, especially Tibetan-Chinese Machine Translation. In order to obtain the best translation performance with more external information on the basis of limited corpus, this paper introduces syntactic information, that is, adding part-of-speech(POS) tags as input features in the training process. Experiments verify the effectiveness of this method, which can improve the translation performance to a certain extent.","PeriodicalId":190604,"journal":{"name":"Proceedings of the 3rd International Conference on Advanced Information Science and System","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd International Conference on Advanced Information Science and System","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3503047.3503120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In recent years, Neural Networks have gradually replaced other methods in the field of Machine Translation and become the mainstream which have excellent performance in many languages. However, the performance of Neural Machine Translation mainly relies on large-scale parallel corpora, which is not ideal for low-resource languages, especially Tibetan-Chinese Machine Translation. In order to obtain the best translation performance with more external information on the basis of limited corpus, this paper introduces syntactic information, that is, adding part-of-speech(POS) tags as input features in the training process. Experiments verify the effectiveness of this method, which can improve the translation performance to a certain extent.