{"title":"一类偏斜数据的抗噪声增强算法","authors":"J. V. Hulse, T. Khoshgoftaar, Amri Napolitano","doi":"10.1109/ICMLA.2012.153","DOIUrl":null,"url":null,"abstract":"Boosting methods have been successfully applied in a wide variety of machine learning applications. In the context of data quality issues, a number of variants of the standard boosting method have been proposed and evaluated. To address the problem of mislabeled examples, ORBoost was developed to prevent over fitting to noisy examples. Our research group has recently proposed RUSBoost as an enhancement to the AdaBoost algorithm for dealing with skewed class distributions. This work proposes a modification to the RUSBoost algorithm, incorporating the noise-handling ability of ORBoost, to improve its handling of noisy data. The new method is compared with both ORBoost and RUSBoost in an extensive set of experiments using five real-world datasets with various levels of simulated noise.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Novel Noise-Resistant Boosting Algorithm for Class-Skewed Data\",\"authors\":\"J. V. Hulse, T. Khoshgoftaar, Amri Napolitano\",\"doi\":\"10.1109/ICMLA.2012.153\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Boosting methods have been successfully applied in a wide variety of machine learning applications. In the context of data quality issues, a number of variants of the standard boosting method have been proposed and evaluated. To address the problem of mislabeled examples, ORBoost was developed to prevent over fitting to noisy examples. Our research group has recently proposed RUSBoost as an enhancement to the AdaBoost algorithm for dealing with skewed class distributions. This work proposes a modification to the RUSBoost algorithm, incorporating the noise-handling ability of ORBoost, to improve its handling of noisy data. The new method is compared with both ORBoost and RUSBoost in an extensive set of experiments using five real-world datasets with various levels of simulated noise.\",\"PeriodicalId\":157399,\"journal\":{\"name\":\"2012 11th International Conference on Machine Learning and Applications\",\"volume\":\"62 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-12-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 11th International Conference on Machine Learning and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLA.2012.153\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 11th International Conference on Machine Learning and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2012.153","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Novel Noise-Resistant Boosting Algorithm for Class-Skewed Data
Boosting methods have been successfully applied in a wide variety of machine learning applications. In the context of data quality issues, a number of variants of the standard boosting method have been proposed and evaluated. To address the problem of mislabeled examples, ORBoost was developed to prevent over fitting to noisy examples. Our research group has recently proposed RUSBoost as an enhancement to the AdaBoost algorithm for dealing with skewed class distributions. This work proposes a modification to the RUSBoost algorithm, incorporating the noise-handling ability of ORBoost, to improve its handling of noisy data. The new method is compared with both ORBoost and RUSBoost in an extensive set of experiments using five real-world datasets with various levels of simulated noise.