{"title":"考虑类分布模型的大型数据集SVM分类","authors":"Jair Cervantes, Xiaoou Li, Wen Yu","doi":"10.1109/MICAI.2007.27","DOIUrl":null,"url":null,"abstract":"Despite of good theoretic foundations and high classification accuracy of support vector machines (SVM), normal SVM is not suitable for classification of large data sets, because the training complexity of SVM is very high. This paper presents a novel SVM classification approach for large data sets by considering models of classes distribution (MCD). A first stage uses SVM classification in order to gets a sketch of classes distribution. Then the algorithm obtain the support vectors (SVs) most close between each class and construct a ball using minimum enclosing ball from each pair of SVs with different label. The data points included in the balls constitute the MCD, which is the framework in the boundary of each class and represents the most important data points, these data points are used as training data for a posterior SVM classification. Experimental results show that our approach has good classification accuracy while the training is significantly faster than other SVM classifiers.","PeriodicalId":296192,"journal":{"name":"2007 Sixth Mexican International Conference on Artificial Intelligence, Special Session (MICAI)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"35","resultStr":"{\"title\":\"SVM Classification for Large Data Sets by Considering Models of Classes Distribution\",\"authors\":\"Jair Cervantes, Xiaoou Li, Wen Yu\",\"doi\":\"10.1109/MICAI.2007.27\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Despite of good theoretic foundations and high classification accuracy of support vector machines (SVM), normal SVM is not suitable for classification of large data sets, because the training complexity of SVM is very high. This paper presents a novel SVM classification approach for large data sets by considering models of classes distribution (MCD). A first stage uses SVM classification in order to gets a sketch of classes distribution. Then the algorithm obtain the support vectors (SVs) most close between each class and construct a ball using minimum enclosing ball from each pair of SVs with different label. The data points included in the balls constitute the MCD, which is the framework in the boundary of each class and represents the most important data points, these data points are used as training data for a posterior SVM classification. Experimental results show that our approach has good classification accuracy while the training is significantly faster than other SVM classifiers.\",\"PeriodicalId\":296192,\"journal\":{\"name\":\"2007 Sixth Mexican International Conference on Artificial Intelligence, Special Session (MICAI)\",\"volume\":\"74 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-11-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"35\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2007 Sixth Mexican International Conference on Artificial Intelligence, Special Session (MICAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MICAI.2007.27\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 Sixth Mexican International Conference on Artificial Intelligence, Special Session (MICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MICAI.2007.27","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
SVM Classification for Large Data Sets by Considering Models of Classes Distribution
Despite of good theoretic foundations and high classification accuracy of support vector machines (SVM), normal SVM is not suitable for classification of large data sets, because the training complexity of SVM is very high. This paper presents a novel SVM classification approach for large data sets by considering models of classes distribution (MCD). A first stage uses SVM classification in order to gets a sketch of classes distribution. Then the algorithm obtain the support vectors (SVs) most close between each class and construct a ball using minimum enclosing ball from each pair of SVs with different label. The data points included in the balls constitute the MCD, which is the framework in the boundary of each class and represents the most important data points, these data points are used as training data for a posterior SVM classification. Experimental results show that our approach has good classification accuracy while the training is significantly faster than other SVM classifiers.