Multiple Imputation by Generative Adversarial Networks for Classification with Incomplete Data
Bao Ngoc Vi, Dinh Tan Nguyen, Cao Truong Tran, Huu Phuc Ngo, Chi Cong Nguyen, Hai-Hong Phan
2021 RIVF International Conference on Computing and Communication Technologies (RIVF), pp. 1-6, 2021-08-19
DOI: https://doi.org/10.1109/RIVF51545.2021.9642138
Citations: 1
Abstract
Missing values are one of the most common problems in real-world data science, and inadequate treatment of them can lead to large errors. Missing values should therefore be handled carefully in classification. In recent years, Generative Adversarial Networks (GANs) have been applied to imputing missing values. This paper proposes a multiple imputation method that estimates missing values for classification by integrating a GAN with ensemble learning. The proposed method, MIGAN, uses a GAN to generate several different imputed versions of the training observations, which are then used to build an ensemble of classifiers for classification with missing data. We evaluate MIGAN on various data sets and compare it with state-of-the-art imputation methods. The experimental results highlight the accuracy of MIGAN in classifying incomplete data.
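
The pipeline described in the abstract (several imputed copies of the training data, one classifier per copy, combined prediction) can be illustrated with a minimal sketch. This is not the authors' MIGAN implementation: the GAN generator is replaced here by a simple random draw from the observed values of each column, the base classifier is a toy nearest-centroid model, and predictions are combined by majority vote. All function names, the `MISSING` marker, the toy data, and the handling of missing values at prediction time are assumptions made for illustration only.

```python
import random
from collections import Counter

MISSING = None  # assumed marker for a missing feature value


def stochastic_impute(rows, rng):
    """Stand-in for the GAN generator (assumption): fill each missing entry
    with a value drawn at random from the observed values of its column."""
    n_cols = len(rows[0])
    observed = [[r[j] for r in rows if r[j] is not MISSING] for j in range(n_cols)]
    return [
        [rng.choice(observed[j]) if v is MISSING else v for j, v in enumerate(r)]
        for r in rows
    ]


def fit_centroids(rows, labels):
    """Toy base classifier (assumption): one mean vector per class."""
    counts = Counter(labels)
    sums = {}
    for r, y in zip(rows, labels):
        acc = sums.setdefault(y, [0.0] * len(r))
        for j, v in enumerate(r):
            acc[j] += v
    return {y: [s / counts[y] for s in acc] for y, acc in sums.items()}


def predict_centroid(centroids, x):
    """Return the class with the nearest centroid, comparing only the
    features that are observed in x (assumed test-time handling)."""
    def dist(c):
        return sum((xi - ci) ** 2 for xi, ci in zip(x, c) if xi is not MISSING)
    return min(centroids, key=lambda y: dist(centroids[y]))


def fit_ensemble(rows, labels, n_imputations=5, seed=0):
    """Train one base classifier per independently imputed training set."""
    rng = random.Random(seed)
    return [
        fit_centroids(stochastic_impute(rows, rng), labels)
        for _ in range(n_imputations)
    ]


def predict_ensemble(ensemble, x):
    """Combine the base classifiers by majority vote."""
    votes = Counter(predict_centroid(model, x) for model in ensemble)
    return votes.most_common(1)[0][0]


# Toy incomplete training data: two well-separated classes.
X = [[1.0, 1.2], [0.9, MISSING], [MISSING, 1.0],
     [5.0, 5.1], [5.2, MISSING], [MISSING, 4.9]]
y = [0, 0, 0, 1, 1, 1]

ensemble = fit_ensemble(X, y, n_imputations=5, seed=0)
print(predict_ensemble(ensemble, [1.0, 1.1]))
print(predict_ensemble(ensemble, [5.0, MISSING]))
```

Because each imputed copy fills the gaps differently, the base classifiers differ slightly, and the vote averages out the uncertainty introduced by any single imputation. Swapping `stochastic_impute` for a trained generative model would bring the sketch closer to the approach the abstract describes.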