A study of analogical grids extracted using feature vectors on varying vocabulary sizes in Indonesian
Rashel Fam, Y. Lepage
2019 International Conference on Advanced Computer Science and Information Systems (ICACSIS), published 2019-10-01
DOI: 10.1109/ICACSIS47736.2019.8979864 (https://doi.org/10.1109/ICACSIS47736.2019.8979864)
Citation count: 0
Abstract
Indonesian, an agglutinative language, is known for its rich derivational morphology. Word forms are constructed by combining a stem with affixes. In this paper, we study the influence of surface form and morphological information on analogical grids extracted from sets of word forms of varying sizes. Each word form is represented as a feature vector. In the experimental setting, we consider three features: characters, affixes, and morphosyntactic definition. The sizes and saturation of the extracted grids are then observed to characterize them.
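To make the notions of feature vector and saturation concrete, here is a minimal Python sketch. It is not the authors' implementation. It assumes that the character feature is a vector of character counts, that two word pairs are candidates for the same analogy when their count-vector differences are equal, and that saturation is the fraction of non-empty cells in a grid. The helper names (`char_vector`, `same_ratio`, `saturation`) and the toy grid are hypothetical.

```python
from collections import Counter

def char_vector(word, alphabet):
    """Represent a word form as a vector of character counts (assumed feature)."""
    counts = Counter(word)
    return tuple(counts[ch] for ch in alphabet)

def same_ratio(a, b, c, d, alphabet):
    """Check a necessary condition for a : b :: c : d on character counts:
    the difference vector of (a, b) must equal that of (c, d)."""
    va, vb, vc, vd = (char_vector(w, alphabet) for w in (a, b, c, d))
    return all(x - y == z - t for x, y, z, t in zip(va, vb, vc, vd))

def saturation(grid):
    """Fraction of non-empty cells in a grid (assumed definition)."""
    cells = [cell for row in grid for cell in row]
    filled = sum(1 for cell in cells if cell is not None)
    return filled / len(cells)

# Toy grid: rows share a stem, columns share an affixation pattern.
# None marks an empty cell, left empty here purely for illustration.
grid = [
    ["makan", "memakan", "makanan"],
    ["minum", "meminum", None],
]

# Alphabet built from the word forms present in the toy grid.
alphabet = sorted({ch for row in grid for w in row if w for ch in w})

print(same_ratio("makan", "memakan", "minum", "meminum", alphabet))
print(saturation(grid))
```

In this toy grid, 5 of the 6 cells are filled. Affix or morphosyntactic features could be plugged in the same way by swapping `char_vector` for another function that maps a word form to a vector.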