An Accurate and Robust Missing Value Estimation for Microarray Data: Least Absolute Deviation Imputation
Yi Cao, K. Poh
2006 5th International Conference on Machine Learning and Applications (ICMLA'06), 2006-12-14
DOI: 10.1109/ICMLA.2006.11 (https://doi.org/10.1109/ICMLA.2006.11)
Citations: 4
Abstract
Microarray experiments often produce missing expression values for a variety of reasons. Accurate and robust methods for estimating these values are needed because many algorithms and statistical analyses require a complete data set. In this paper, we propose novel imputation methods based on the least absolute deviation (LAD) estimate, referred to as LADimpute, to estimate missing entries in microarray data. In addition to using the LAD estimate, LADimpute takes the local similarity structure of the data into account. Once the genes most similar to the target gene (the gene with missing values) have been selected according to some metric, all missing values in the target gene are estimated simultaneously as a linear combination of those similar genes. In our experiments, LADimpute is accurate and robust compared with other methods across different datasets, missing rates, and noise levels.
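
The procedure outlined in the abstract (select similar genes, fit a linear combination under a least absolute deviation criterion, fill in all missing entries of the target gene at once) might be sketched as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation: the abstract does not name the similarity metric, so Euclidean distance over the target's observed samples is assumed here; only genes without missing values are used as candidates; and the LAD fit is approximated with iteratively reweighted least squares rather than whatever solver the paper uses. The names `lad_fit`, `lad_impute_gene`, and the parameter `k` are hypothetical.

```python
import numpy as np


def lad_fit(A, b, n_iter=100, eps=1e-8):
    """Approximate argmin_x sum(|A @ x - b|) by iteratively reweighted
    least squares (IRLS). Assumption: IRLS stands in for an exact LAD solver."""
    x = np.linalg.lstsq(A, b, rcond=None)[0]  # ordinary least-squares start
    for _ in range(n_iter):
        r = np.abs(A @ x - b)
        # Row scaling 1/sqrt(|r|) makes each squared weighted residual ~ |r|.
        w = 1.0 / np.sqrt(np.maximum(r, eps))
        x_new = np.linalg.lstsq(A * w[:, None], b * w, rcond=None)[0]
        if np.allclose(x_new, x, atol=1e-10):
            return x_new
        x = x_new
    return x


def lad_impute_gene(X, target, k=3):
    """Fill the missing entries (np.nan) in row `target` of X (genes x samples).

    Assumptions: Euclidean distance as the similarity metric, and only
    complete rows are eligible as similar genes.
    """
    miss = np.isnan(X[target])
    obs = ~miss
    # Candidate genes: every other row that has no missing values.
    candidates = [g for g in range(X.shape[0])
                  if g != target and not np.isnan(X[g]).any()]
    # Rank candidates by distance to the target over its observed samples.
    dists = [np.linalg.norm(X[g, obs] - X[target, obs]) for g in candidates]
    nearest = [candidates[i] for i in np.argsort(dists)[:k]]
    # Fit target ~ linear combination of the similar genes on observed samples.
    A = X[nearest][:, obs].T          # shape (n_observed, n_similar)
    b = X[target, obs]
    coef = lad_fit(A, b)
    # Estimate all missing samples of the target gene simultaneously.
    filled = X[target].copy()
    filled[miss] = X[nearest][:, miss].T @ coef
    return filled
```

Calling `lad_impute_gene(X, target=i, k=10)` on an expression matrix `X` returns a copy of row `i` with its missing entries estimated. The appeal of the absolute-deviation criterion over a squared-error one is that a single outlying sample pulls the fitted coefficients less, which is consistent with the robustness to noise that the abstract emphasises.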