Missing Data Processing Based on Deep Neural Network Enhanced by K-Means
Bin Yu, Chen Zhang, Z. Tang
International Conference on Machine Learning and Computing, 2019-02-22
DOI: https://doi.org/10.1145/3318299.3318391
Citations: 1
Abstract
This paper proposes a K-means-based neural network model for handling missing data. The method first clusters the samples on the attributes that have no missing values, and then feeds each resulting cluster to its own neural network to predict the missing values. The data are divided into two types, continuous numerical and discrete numerical, and a corresponding neural network model is built for each type. Experiments on the Human Development Index and Its Components dataset show the method to be feasible and superior.