{"title":"平衡与非平衡数据集的分类性能分析","authors":"S. Padma, S. Kumar, R. Manavalan","doi":"10.1109/ICIINFS.2011.6038084","DOIUrl":null,"url":null,"abstract":"This paper focuses on performance evaluation of the classification algorithms for problems of unbalanced and balanced large data sets. Three methods such as ELM, MRAN, and SRAN have been proposed for solving the set classification problem and studied. The ELM is based on randomly chosen hidden nodes and analytically determines the output weights of SLFNs. Then the next method M-RAN is a sequential learning radial basis function neural network which combines the growth criterion of the resource allocating network (RAN) of Platt with a pruning strategy based on the relative contribution of each hidden unit to the overall network output. The last method SRAN uses of misclassification information and hinge loss error in growing/learning criterion helps in approximating the decision function accurately. The performance evaluation using balanced and imbalanced data sets shows that one of the proposed algorithms SRAN generates minimal network with higher classification performance.","PeriodicalId":353966,"journal":{"name":"2011 6th International Conference on Industrial and Information Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Performance analysis for classification in balanced and unbalanced data set\",\"authors\":\"S. Padma, S. Kumar, R. Manavalan\",\"doi\":\"10.1109/ICIINFS.2011.6038084\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper focuses on performance evaluation of the classification algorithms for problems of unbalanced and balanced large data sets. Three methods such as ELM, MRAN, and SRAN have been proposed for solving the set classification problem and studied. The ELM is based on randomly chosen hidden nodes and analytically determines the output weights of SLFNs. Then the next method M-RAN is a sequential learning radial basis function neural network which combines the growth criterion of the resource allocating network (RAN) of Platt with a pruning strategy based on the relative contribution of each hidden unit to the overall network output. The last method SRAN uses of misclassification information and hinge loss error in growing/learning criterion helps in approximating the decision function accurately. The performance evaluation using balanced and imbalanced data sets shows that one of the proposed algorithms SRAN generates minimal network with higher classification performance.\",\"PeriodicalId\":353966,\"journal\":{\"name\":\"2011 6th International Conference on Industrial and Information Systems\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 6th International Conference on Industrial and Information Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIINFS.2011.6038084\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 6th International Conference on Industrial and Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIINFS.2011.6038084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Performance analysis for classification in balanced and unbalanced data set
This paper focuses on performance evaluation of the classification algorithms for problems of unbalanced and balanced large data sets. Three methods such as ELM, MRAN, and SRAN have been proposed for solving the set classification problem and studied. The ELM is based on randomly chosen hidden nodes and analytically determines the output weights of SLFNs. Then the next method M-RAN is a sequential learning radial basis function neural network which combines the growth criterion of the resource allocating network (RAN) of Platt with a pruning strategy based on the relative contribution of each hidden unit to the overall network output. The last method SRAN uses of misclassification information and hinge loss error in growing/learning criterion helps in approximating the decision function accurately. The performance evaluation using balanced and imbalanced data sets shows that one of the proposed algorithms SRAN generates minimal network with higher classification performance.