{"title":"Microarray missing data imputation using regression","authors":"T. Bayrak, H. Oğul","doi":"10.2316/P.2017.852-033","DOIUrl":null,"url":null,"abstract":"Having missing values due to several experimental conditions is a common problem in analyzing the results of microarray experiments. Although many imputation methods exist, comparative studies based on regression based models are very limited. Particularly, Relevance Vector Machine (RVM), a recent regression method shown to be effective in various domains, has not been considered so far for missing value imputation in microarray data. In this study, we present a comparative study between regression based models, including linear regression, k-nearest neighbor regression and RVM that uses data obtained from breast, colon and prostate cancer tissues through the microarray technology. The leave-one-out (or Jackknife) procedure is applied for the validation. To measure the performance of the model we used Spearman correlation coefficient (CC). The results reveal that RVM with a Gaussian kernel outperforms other regression models in some cases.","PeriodicalId":6635,"journal":{"name":"2017 13th IASTED International Conference on Biomedical Engineering (BioMed)","volume":"54 1","pages":"68-73"},"PeriodicalIF":0.0000,"publicationDate":"2017-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 13th IASTED International Conference on Biomedical Engineering (BioMed)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2316/P.2017.852-033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Having missing values due to several experimental conditions is a common problem in analyzing the results of microarray experiments. Although many imputation methods exist, comparative studies based on regression based models are very limited. Particularly, Relevance Vector Machine (RVM), a recent regression method shown to be effective in various domains, has not been considered so far for missing value imputation in microarray data. In this study, we present a comparative study between regression based models, including linear regression, k-nearest neighbor regression and RVM that uses data obtained from breast, colon and prostate cancer tissues through the microarray technology. The leave-one-out (or Jackknife) procedure is applied for the validation. To measure the performance of the model we used Spearman correlation coefficient (CC). The results reveal that RVM with a Gaussian kernel outperforms other regression models in some cases.