{"title":"Testing linear separability in classification of inflection rules","authors":"Z. Tóth, L. Kovács","doi":"10.1109/SISY.2014.6923610","DOIUrl":null,"url":null,"abstract":"Agglutinative languages, such as Hungarian, use inflection to modify the meaning of words. Inflection is a string transformation which describe how can a word converted into its inflected form. The transformation can be described by a transformational string. The words can be classified by their transformational string, so inflection is considered as a classification. Linear separability of clusters is important to create an efficient and accurate classification method. This paper review a linear programming based testing method of linear separability. This method was analyzed on generated data sets, these measurements showed the time cost of the algorithm grows polynomially with the number of the points. The accusative case of Hungarian was used to create a data set of 56.000 samples. The words were represented in vector space by alphabetical and phonetic encoding and left and right adjust, thus four different representation of words were used during the tests. Our test results showed there are non linear separable cluster pairs in both of the representations.","PeriodicalId":277041,"journal":{"name":"2014 IEEE 12th International Symposium on Intelligent Systems and Informatics (SISY)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE 12th International Symposium on Intelligent Systems and Informatics (SISY)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SISY.2014.6923610","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Agglutinative languages, such as Hungarian, use inflection to modify the meaning of words. Inflection is a string transformation which describe how can a word converted into its inflected form. The transformation can be described by a transformational string. The words can be classified by their transformational string, so inflection is considered as a classification. Linear separability of clusters is important to create an efficient and accurate classification method. This paper review a linear programming based testing method of linear separability. This method was analyzed on generated data sets, these measurements showed the time cost of the algorithm grows polynomially with the number of the points. The accusative case of Hungarian was used to create a data set of 56.000 samples. The words were represented in vector space by alphabetical and phonetic encoding and left and right adjust, thus four different representation of words were used during the tests. Our test results showed there are non linear separable cluster pairs in both of the representations.