{"title":"Oversampling Algorithm Based on Spatial Distribution of Data Sets for Imbalance Learning","authors":"Yiran Liu, Wanjiang Han, Xiaoxiang Wang, Qi Li","doi":"10.1109/ICCCS49078.2020.9118573","DOIUrl":null,"url":null,"abstract":"Imbalance problem is widespread in machine learning. Most learning algorithms can’t get satisfied performance when they are applied on imbalance data sets, because they can be deteriorated by this problem easily. This paper proposed SDSMOTE method which captures the spatial distribution of imbalance data sets, and changes the tendency of learning algorithm by over sampling by oversampling according to the recognition difficulty. Experiments on 5 UCI data sets validate the effectiveness of this oversampling algorithm.","PeriodicalId":105556,"journal":{"name":"2020 5th International Conference on Computer and Communication Systems (ICCCS)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Computer and Communication Systems (ICCCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCS49078.2020.9118573","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Imbalance problem is widespread in machine learning. Most learning algorithms can’t get satisfied performance when they are applied on imbalance data sets, because they can be deteriorated by this problem easily. This paper proposed SDSMOTE method which captures the spatial distribution of imbalance data sets, and changes the tendency of learning algorithm by over sampling by oversampling according to the recognition difficulty. Experiments on 5 UCI data sets validate the effectiveness of this oversampling algorithm.