Oversampling Algorithm based on Reinforcement Learning in Imbalanced Problems
Ying Zhou, Jiangang Shu, Xiaoxiong Zhong, Xingsen Huang, Chenguang Luo, Jianwen Ai
GLOBECOM 2020 - 2020 IEEE Global Communications Conference, pp. 1-6, 2020-12-01
DOI: 10.1109/GLOBECOM42002.2020.9322179 (https://doi.org/10.1109/GLOBECOM42002.2020.9322179)
Citations: 0
Abstract
Class imbalance means that the classes in a data set are unevenly distributed, which leads to classifiers that recognize the minority class sub-optimally. Traditional solutions either design new classifiers or rebalance the skewed data set; the former is too costly, while the latter has an uncertain effect across different combinations of classifiers and performance metrics. In this paper, we propose a reinforcement learning-based oversampling method that directly produces targeted samples according to the downstream classifier and performance metric. During training, our learning procedure feeds classification information into the generation process. Moreover, unlike existing oversampling approaches, our method makes no assumptions about the downstream classifier or performance metric, so it is more widely applicable. We carry out experiments on 17 UCI and KEEL data sets, and the results demonstrate the superior performance of the proposed method.
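The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea: letting feedback from the downstream classifier and metric steer sample generation. It is not the authors' method. It assumes an epsilon-greedy bandit over minority seed points, SMOTE-style interpolation to create candidates, and a reward equal to the change in the validation metric. All names here (`reward_guided_oversample`, `fit_centroids`, `f1_score`, the toy data) are hypothetical and chosen for illustration.

```python
import random

def f1_score(y_true, y_pred, positive=1):
    """F1 of the positive (minority) class; stands in for any metric."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    return 2 * tp / (2 * tp + fp + fn) if tp else 0.0

def fit_centroids(X, y):
    """Toy downstream classifier: predict the class of the nearest centroid."""
    cents = {}
    for label in set(y):
        pts = [x for x, l in zip(X, y) if l == label]
        cents[label] = [sum(col) / len(pts) for col in zip(*pts)]
    def predict(Xq):
        return [min(cents, key=lambda l: sum((a - b) ** 2
                    for a, b in zip(x, cents[l]))) for x in Xq]
    return predict

def reward_guided_oversample(X, y, X_val, y_val, fit, metric,
                             steps=30, minority=1, eps=0.3, seed=0):
    """Assumed simplification: bandit over seed points, reward = metric gain."""
    rng = random.Random(seed)
    X, y = list(X), list(y)              # copy so the caller's data is untouched
    seeds = [x for x, l in zip(X, y) if l == minority]
    value = [0.0] * len(seeds)           # running mean reward per seed (arm)
    pulls = [0] * len(seeds)
    score = metric(y_val, fit(X, y)(X_val))
    for _ in range(steps):
        # Epsilon-greedy choice of the seed point to generate from.
        if rng.random() < eps:
            i = rng.randrange(len(seeds))
        else:
            i = max(range(len(seeds)), key=value.__getitem__)
        # SMOTE-style candidate: interpolate toward another minority point.
        partner = rng.choice(seeds)
        lam = rng.random()
        cand = [a + lam * (b - a) for a, b in zip(seeds[i], partner)]
        # Reward comes from the downstream classifier and metric only.
        new_score = metric(y_val, fit(X + [cand], y + [minority])(X_val))
        reward = new_score - score
        pulls[i] += 1
        value[i] += (reward - value[i]) / pulls[i]
        if reward >= 0:                  # keep samples that do not hurt the metric
            X.append(cand)
            y.append(minority)
            score = new_score
    return X, y, score

# Toy imbalanced data: 40 majority points near (0, 0), 5 minority near (2, 2).
data_rng = random.Random(1)
def blob(n, mu):
    return [[data_rng.gauss(mu, 1.0), data_rng.gauss(mu, 1.0)] for _ in range(n)]

X_train, y_train = blob(40, 0.0) + blob(5, 2.0), [0] * 40 + [1] * 5
X_val, y_val = blob(20, 0.0) + blob(10, 2.0), [0] * 20 + [1] * 10

base_score = f1_score(y_val, fit_centroids(X_train, y_train)(X_val))
X_new, y_new, final_score = reward_guided_oversample(
    X_train, y_train, X_val, y_val, fit_centroids, f1_score)
print(base_score, final_score)
```

Because `fit` and `metric` are plain callables, the generator makes no assumption about which classifier or metric is used, which mirrors the claim in the abstract. In this sketch the same validation split both drives the reward and reports the score, so the final number is optimistic; a separate held-out test set would be needed for a fair comparison.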