{"title":"基于最近邻的实例选择用于分类","authors":"G. Yu, Jin Tian, Minqiang Li","doi":"10.1109/FSKD.2016.7603154","DOIUrl":null,"url":null,"abstract":"With the increasing size of big data, classifiers usually suffer from intractable computing and storage issues. Moreover, decision boundaries in complex classification problems are usually complicated and circuitous. Modeling on too many instances can sometimes cause oversensitivity to noise and degrade the learning accuracies. Instance selection offers an effective way to improve classification performance based on partial but significant data. This paper presents a novel instance selection algorithm based on nearest enemy information. The dataset is divided into several partitions corresponding to instances' nearest enemies. In every partition, representative instances are selected based on the distribution information to represent both sides of decision boundary. A support vector machine (SVM) is then adopted to conduct the classification model based on these representative instances. Experimental results illustrate that the proposed algorithm outperforms some conventional instance selection methods with higher classification accuracy and smaller size of selected instances.","PeriodicalId":373155,"journal":{"name":"2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)","volume":"128 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Nearest neighbor-based instance selection for classification\",\"authors\":\"G. Yu, Jin Tian, Minqiang Li\",\"doi\":\"10.1109/FSKD.2016.7603154\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the increasing size of big data, classifiers usually suffer from intractable computing and storage issues. Moreover, decision boundaries in complex classification problems are usually complicated and circuitous. Modeling on too many instances can sometimes cause oversensitivity to noise and degrade the learning accuracies. Instance selection offers an effective way to improve classification performance based on partial but significant data. This paper presents a novel instance selection algorithm based on nearest enemy information. The dataset is divided into several partitions corresponding to instances' nearest enemies. In every partition, representative instances are selected based on the distribution information to represent both sides of decision boundary. A support vector machine (SVM) is then adopted to conduct the classification model based on these representative instances. Experimental results illustrate that the proposed algorithm outperforms some conventional instance selection methods with higher classification accuracy and smaller size of selected instances.\",\"PeriodicalId\":373155,\"journal\":{\"name\":\"2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)\",\"volume\":\"128 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FSKD.2016.7603154\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FSKD.2016.7603154","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Nearest neighbor-based instance selection for classification
With the increasing size of big data, classifiers usually suffer from intractable computing and storage issues. Moreover, decision boundaries in complex classification problems are usually complicated and circuitous. Modeling on too many instances can sometimes cause oversensitivity to noise and degrade the learning accuracies. Instance selection offers an effective way to improve classification performance based on partial but significant data. This paper presents a novel instance selection algorithm based on nearest enemy information. The dataset is divided into several partitions corresponding to instances' nearest enemies. In every partition, representative instances are selected based on the distribution information to represent both sides of decision boundary. A support vector machine (SVM) is then adopted to conduct the classification model based on these representative instances. Experimental results illustrate that the proposed algorithm outperforms some conventional instance selection methods with higher classification accuracy and smaller size of selected instances.