{"title":"An imbalanced data classification algorithm of improved autoencoder neural network","authors":"Chenggang Zhang, Wei Gao, Jiazhi Song, Jinqing Jiang","doi":"10.1109/ICACI.2016.7449810","DOIUrl":null,"url":null,"abstract":"Imbalanced data classification problem has always been a hotspot in the field of machine learning research. Pointing to the overfitting and noise problems of oversampling algorithm when synthesizing new minority class samples, the current study proposed a stacked denoising autoencoder neural network (SDAE) algorithm based on cost-sensitive oversampling, combining the cost-sensitive learning with denoising autoencoder neural network. The proposed algorithm can not only oversample minority class sample through misclassification cost, but it can denoise and classify the sampled dataset. Experiment shows that, compared with the traditional stacked autoencoder neural network (SAE) and oversampling autoencoder neural network without denoising process (OS-SAE), the proposed algorithm improves the classification accuracy of minority class of imbalanced datasets.","PeriodicalId":211040,"journal":{"name":"2016 Eighth International Conference on Advanced Computational Intelligence (ICACI)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"33","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Eighth International Conference on Advanced Computational Intelligence (ICACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACI.2016.7449810","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 33
Abstract
Imbalanced data classification problem has always been a hotspot in the field of machine learning research. Pointing to the overfitting and noise problems of oversampling algorithm when synthesizing new minority class samples, the current study proposed a stacked denoising autoencoder neural network (SDAE) algorithm based on cost-sensitive oversampling, combining the cost-sensitive learning with denoising autoencoder neural network. The proposed algorithm can not only oversample minority class sample through misclassification cost, but it can denoise and classify the sampled dataset. Experiment shows that, compared with the traditional stacked autoencoder neural network (SAE) and oversampling autoencoder neural network without denoising process (OS-SAE), the proposed algorithm improves the classification accuracy of minority class of imbalanced datasets.