Thai Dependency Parsing with Character Embedding
Authors: Sattaya Singkul, K. Woraratpanya
Venue: 2019 11th International Conference on Information Technology and Electrical Engineering (ICITEE), pp. 1-5
Published: 2019-10-01
DOI: 10.1109/ICITEED.2019.8930002 (https://doi.org/10.1109/ICITEED.2019.8930002)
Citations: 6
Abstract
Dependency parsing (DP) has become an important part of natural language processing (NLP) applications. However, most DP methods have been developed for English rather than for Thai. In addition, existing DP methods have not yet solved the problems posed by long and complex sentences. Therefore, this paper proposes seven Thai DP algorithms: five were developed from transition-based parsing and the other two from graph-based parsing. On the Thai-PUD and English-PUD datasets, which contain both long and complex sentences, the experimental results showed that all Thai DP algorithms bundled with character embedding can outperform the baselines.
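As an illustration of the transition-based family mentioned in the abstract, the following is a minimal sketch of the arc-standard transition system in its common textbook form. It is not one of the seven algorithms proposed in the paper, and it leaves out both the character embedding and the learned model that would choose each action. The function name `arc_standard`, the action strings, and the example sentence are assumptions made for illustration only.

```python
from typing import List, Tuple


def arc_standard(n_words: int, actions: List[str]) -> List[Tuple[int, int]]:
    """Apply a sequence of arc-standard actions to a sentence of n_words tokens.

    Tokens are numbered 1..n_words; index 0 is an artificial ROOT node.
    Returns the dependency arcs as (head, dependent) pairs, in creation order.
    """
    stack = [0]                              # starts with ROOT only
    buffer = list(range(1, n_words + 1))     # tokens still to be read
    arcs: List[Tuple[int, int]] = []

    for action in actions:
        if action == "SHIFT":
            # Move the next unread token onto the stack.
            stack.append(buffer.pop(0))
        elif action == "LEFT-ARC":
            # The second-from-top item becomes a dependent of the top item.
            dependent = stack.pop(-2)
            arcs.append((stack[-1], dependent))
        elif action == "RIGHT-ARC":
            # The top item becomes a dependent of the item beneath it.
            dependent = stack.pop()
            arcs.append((stack[-1], dependent))
        else:
            raise ValueError(f"unknown action: {action}")
    return arcs


# Hypothetical example sentence: "She reads books" (1=She, 2=reads, 3=books).
actions = ["SHIFT", "SHIFT", "LEFT-ARC", "SHIFT", "RIGHT-ARC", "RIGHT-ARC"]
print(arc_standard(3, actions))
```

In a real parser, a classifier would predict each action from features of the stack and buffer; character-level embeddings of the tokens are one possible source of such features.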