{"title":"噪声不平衡数据下重采样技术的比较分析","authors":"Arjun Puri, M. Gupta","doi":"10.1109/ICICT46931.2019.8977650","DOIUrl":null,"url":null,"abstract":"Classification is mainly challenged by the problems in the dataset. When dataset have uneven distribution of data among classes, then class imbalance problem arise. Class imbalance with noise creates immense effect on classification of instances of classes. The main focus of this article is to provide the detail comparative analysis of seven Resampling techniques under 16 noisy imbalanced datasets using C4.5 classifier. The performance evaluation is done by using AUC, F1 score, G-mean. Based on the evaluation, article inferred that SMOTE-ENN perform better than rest of resampling techniques.","PeriodicalId":412668,"journal":{"name":"2019 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Comparative Analysis of Resampling Techniques under Noisy Imbalanced Datasets\",\"authors\":\"Arjun Puri, M. Gupta\",\"doi\":\"10.1109/ICICT46931.2019.8977650\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Classification is mainly challenged by the problems in the dataset. When dataset have uneven distribution of data among classes, then class imbalance problem arise. Class imbalance with noise creates immense effect on classification of instances of classes. The main focus of this article is to provide the detail comparative analysis of seven Resampling techniques under 16 noisy imbalanced datasets using C4.5 classifier. The performance evaluation is done by using AUC, F1 score, G-mean. Based on the evaluation, article inferred that SMOTE-ENN perform better than rest of resampling techniques.\",\"PeriodicalId\":412668,\"journal\":{\"name\":\"2019 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICICT46931.2019.8977650\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICT46931.2019.8977650","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparative Analysis of Resampling Techniques under Noisy Imbalanced Datasets
Classification is mainly challenged by the problems in the dataset. When dataset have uneven distribution of data among classes, then class imbalance problem arise. Class imbalance with noise creates immense effect on classification of instances of classes. The main focus of this article is to provide the detail comparative analysis of seven Resampling techniques under 16 noisy imbalanced datasets using C4.5 classifier. The performance evaluation is done by using AUC, F1 score, G-mean. Based on the evaluation, article inferred that SMOTE-ENN perform better than rest of resampling techniques.