An under-sampling imbalanced learning of data gravitation based classification
Authors: Lizhi Peng, Bo Yang, Yuehui Chen, Xiaoqing Zhou
Venue: 2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)
Published: 2016-08-01
DOI: https://doi.org/10.1109/FSKD.2016.7603210
Many real-world classification tasks have imbalanced class distributions, with one class outnumbering another. This causes serious trouble for standard classification models, which tend to misclassify minority instances as majority ones. The data gravitation based classification (DGC) model, a recently developed physics-inspired supervised learning model, has been shown to be effective for standard supervised learning tasks. However, like most other standard learning algorithms, DGC does not perform well on imbalanced data sets. To address this problem, an under-sampling technique is combined with an ensemble technique to adapt the standard DGC model to imbalanced learning tasks. The adapted model is called UI-DGC. The experimental study uses 22 data sets with low imbalance and 22 with high imbalance, and compares UI-DGC with both standard and imbalanced learning algorithms. The empirical results suggest that UI-DGC achieves high classification performance on imbalanced data, especially on highly imbalanced tasks.
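The general idea of pairing under-sampling with an ensemble can be sketched as follows. This is a minimal illustration, not the UI-DGC algorithm itself: the abstract does not give the sampling scheme, the ensemble size, or the combination rule, so random under-sampling to the minority size, 11 members, and majority voting are all assumptions. The base learner `gravitation_predict` is a hypothetical, heavily simplified stand-in (unit mass per point, attraction 1/d²) rather than the published DGC model, and all function names are invented for this example.

```python
import random
from collections import Counter


def gravitation_predict(train, x, eps=1e-9):
    """Stand-in base learner: pick the class whose points pull hardest on x.

    Assumption: every training point has unit mass and attracts x with
    force 1 / d^2. This is a simplification, not the published DGC model.
    """
    pull = Counter()
    for features, label in train:
        d2 = sum((a - b) ** 2 for a, b in zip(features, x))
        pull[label] += 1.0 / (d2 + eps)
    return max(pull, key=pull.get)


def undersample(data, rng):
    """Return a balanced subset: every class is randomly sampled (without
    replacement) down to the size of the smallest class."""
    by_class = {}
    for features, label in data:
        by_class.setdefault(label, []).append((features, label))
    n_min = min(len(points) for points in by_class.values())
    subset = []
    for points in by_class.values():
        subset.extend(rng.sample(points, n_min))
    return subset


def build_ensemble(data, n_members=11, seed=0):
    """Draw one balanced subset per ensemble member (member count assumed)."""
    rng = random.Random(seed)
    return [undersample(data, rng) for _ in range(n_members)]


def ensemble_predict(ensemble, x):
    """Combine the members by majority vote (combination rule assumed)."""
    votes = Counter(gravitation_predict(subset, x) for subset in ensemble)
    return votes.most_common(1)[0][0]


# Toy data: 100 majority points (label 0) on a grid, 5 minority points (label 1).
data = [((0.1 * i, 0.1 * j), 0) for i in range(10) for j in range(10)]
data += [((2.0, 2.0), 1), ((2.1, 2.0), 1), ((2.0, 2.1), 1),
         ((2.1, 2.1), 1), ((2.05, 2.05), 1)]

query = (1.5, 1.5)  # slightly closer to the minority cluster
ensemble = build_ensemble(data)
print(gravitation_predict(data, query))   # prints 0: the majority's total pull dominates
print(ensemble_predict(ensemble, query))  # prints 1: balanced subsets remove that bias
```

In this toy setting the single learner trained on all data assigns the query to the majority class purely because that class contributes twenty times as many points, while every balanced subset lets the nearer minority cluster win. Drawing several subsets rather than one is a common way to limit the information lost when majority instances are discarded.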