{"title":"印尼语不同词汇量下特征向量提取的类比网格研究","authors":"Rashel Fam, Y. Lepage","doi":"10.1109/ICACSIS47736.2019.8979864","DOIUrl":null,"url":null,"abstract":"Indonesian as an agglutinating language is known for its derivative morphological richness. Word forms are constructed by combining stem and affixes. In this paper, we study the influence of surface form and morphological information in analogical grids extracted from a set of word forms with varying sizes. Each word form is represented as a feature vector. In the experiment setting, we consider three features: characters, affixes, and morphosyntactic definition. The sizes and saturation are then observed to characterize the extracted grids.","PeriodicalId":165090,"journal":{"name":"2019 International Conference on Advanced Computer Science and information Systems (ICACSIS)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A study of analogical grids extracted using feature vectors on varying vocabulary sizes in Indonesian\",\"authors\":\"Rashel Fam, Y. Lepage\",\"doi\":\"10.1109/ICACSIS47736.2019.8979864\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Indonesian as an agglutinating language is known for its derivative morphological richness. Word forms are constructed by combining stem and affixes. In this paper, we study the influence of surface form and morphological information in analogical grids extracted from a set of word forms with varying sizes. Each word form is represented as a feature vector. In the experiment setting, we consider three features: characters, affixes, and morphosyntactic definition. The sizes and saturation are then observed to characterize the extracted grids.\",\"PeriodicalId\":165090,\"journal\":{\"name\":\"2019 International Conference on Advanced Computer Science and information Systems (ICACSIS)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Advanced Computer Science and information Systems (ICACSIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICACSIS47736.2019.8979864\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Advanced Computer Science and information Systems (ICACSIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACSIS47736.2019.8979864","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A study of analogical grids extracted using feature vectors on varying vocabulary sizes in Indonesian
Indonesian as an agglutinating language is known for its derivative morphological richness. Word forms are constructed by combining stem and affixes. In this paper, we study the influence of surface form and morphological information in analogical grids extracted from a set of word forms with varying sizes. Each word form is represented as a feature vector. In the experiment setting, we consider three features: characters, affixes, and morphosyntactic definition. The sizes and saturation are then observed to characterize the extracted grids.