{"title":"Rectified Multi-class AdaBoost for Noisy Dataset Based on Weight Adjustment Standard","authors":"Keke Hu, Wanwei Liu, Tun Li","doi":"10.1145/3456126.3456143","DOIUrl":null,"url":null,"abstract":"Boosting, as a meta-algorithm for ensemble learning, have been widely applied to variety popular machine learning algorithms. However, noises in training and testing datasets could significantly affect the performance of boosting algorithm. SAMME pays too much attention to samples that are not correctly classified in multiple iterations. These samples could be mislabeled samples that cannot be correctly classified, so the classifier cannot learn the actual distribution of the original data. To solve this problem, in this paper, we proposed a rectified algorithm R.SAMME based on multi-class classification algorithm SAMME by limiting the weight of each sample based on current accuracy. We evaluate our approach on UCI benchmark datasets, experiments show that R.SAMME has better performance in noisy datasets.","PeriodicalId":431685,"journal":{"name":"2021 2nd Asia Service Sciences and Software Engineering Conference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd Asia Service Sciences and Software Engineering Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3456126.3456143","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Boosting, as a meta-algorithm for ensemble learning, have been widely applied to variety popular machine learning algorithms. However, noises in training and testing datasets could significantly affect the performance of boosting algorithm. SAMME pays too much attention to samples that are not correctly classified in multiple iterations. These samples could be mislabeled samples that cannot be correctly classified, so the classifier cannot learn the actual distribution of the original data. To solve this problem, in this paper, we proposed a rectified algorithm R.SAMME based on multi-class classification algorithm SAMME by limiting the weight of each sample based on current accuracy. We evaluate our approach on UCI benchmark datasets, experiments show that R.SAMME has better performance in noisy datasets.