{"title":"阿拉伯语几种监督词法消歧方法的实验研究","authors":"L. Merhbene, A. Zouaghi, M. Zrigui","doi":"10.1109/ICTA.2013.6815307","DOIUrl":null,"url":null,"abstract":"In this paper we propose an experimental study for some supervised algorithms to disambiguate arabic words. Due to the lack of linguistic data for the Arabic language, we work on non-annotated corpus and with the help of four annotators; we were able to annotate the different samples containing the ambiguous words. Since that, we test the naïve Bayes algorithm, the decision lists and the exemplar based algorithm. During the experimental study, we test the influence of the window size on the disambiguation quality, the derivation and the technique of smoothing for the (2n+1)-grams. We find that the exemplar based algorithm achieves the best rate of precision.","PeriodicalId":188977,"journal":{"name":"Fourth International Conference on Information and Communication Technology and Accessibility (ICTA)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"An experimental study for some supervised lexical disambiguation methods of arabic language\",\"authors\":\"L. Merhbene, A. Zouaghi, M. Zrigui\",\"doi\":\"10.1109/ICTA.2013.6815307\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we propose an experimental study for some supervised algorithms to disambiguate arabic words. Due to the lack of linguistic data for the Arabic language, we work on non-annotated corpus and with the help of four annotators; we were able to annotate the different samples containing the ambiguous words. Since that, we test the naïve Bayes algorithm, the decision lists and the exemplar based algorithm. During the experimental study, we test the influence of the window size on the disambiguation quality, the derivation and the technique of smoothing for the (2n+1)-grams. We find that the exemplar based algorithm achieves the best rate of precision.\",\"PeriodicalId\":188977,\"journal\":{\"name\":\"Fourth International Conference on Information and Communication Technology and Accessibility (ICTA)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fourth International Conference on Information and Communication Technology and Accessibility (ICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTA.2013.6815307\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fourth International Conference on Information and Communication Technology and Accessibility (ICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTA.2013.6815307","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An experimental study for some supervised lexical disambiguation methods of arabic language
In this paper we propose an experimental study for some supervised algorithms to disambiguate arabic words. Due to the lack of linguistic data for the Arabic language, we work on non-annotated corpus and with the help of four annotators; we were able to annotate the different samples containing the ambiguous words. Since that, we test the naïve Bayes algorithm, the decision lists and the exemplar based algorithm. During the experimental study, we test the influence of the window size on the disambiguation quality, the derivation and the technique of smoothing for the (2n+1)-grams. We find that the exemplar based algorithm achieves the best rate of precision.