Xiubao Jiang, Xinge You, Yi Mou, Shujian Yu, W. Zeng
{"title":"Gaussian latent variable models for variable selection","authors":"Xiubao Jiang, Xinge You, Yi Mou, Shujian Yu, W. Zeng","doi":"10.1109/SPAC.2014.6982714","DOIUrl":null,"url":null,"abstract":"Variable selection has been extensively studied in linear regression and classification models. Most of these models assume that the input variables are noise free, the response variables are corrupted by Gaussian noise. In this paper, we discuss the variable selection problem assuming that both input variables and response variables are corrupted by Gaussian noise. We analyze the prediction error when augment one related noise variable. We show that the prediction error always decrease when more variable were employed for prediction when the joint distribution of variables are known. Based on this analysis, in sense of mean square error, the optimal variable selection can be obtained. We found that the results is very different from the matching pursuit algorithm(MP), which is widely used in variable selection problems.","PeriodicalId":326246,"journal":{"name":"Proceedings 2014 IEEE International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 2014 IEEE International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPAC.2014.6982714","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Variable selection has been extensively studied in linear regression and classification models. Most of these models assume that the input variables are noise free, the response variables are corrupted by Gaussian noise. In this paper, we discuss the variable selection problem assuming that both input variables and response variables are corrupted by Gaussian noise. We analyze the prediction error when augment one related noise variable. We show that the prediction error always decrease when more variable were employed for prediction when the joint distribution of variables are known. Based on this analysis, in sense of mean square error, the optimal variable selection can be obtained. We found that the results is very different from the matching pursuit algorithm(MP), which is widely used in variable selection problems.