{"title":"Variable selection in multivariate regression models with measurement error in covariates","authors":"Jingyu Cui , Grace Y. Yi","doi":"10.1016/j.jmva.2024.105299","DOIUrl":null,"url":null,"abstract":"<div><p>Multivariate regression models have been broadly used in analyzing data having multi-dimensional response variables. The use of such models is, however, impeded by the presence of measurement error and spurious variables. While data with such features are common in applications, there has been little work available concerning these features jointly. In this article, we consider variable selection under multivariate regression models with covariates subject to measurement error. To gain flexibility, we allow the dimensions of the covariate and response variables to be either fixed or diverging as the sample size increases. A new regularized method is proposed to handle both variable selection and measurement error effects for error-contaminated data. Our proposed penalized bias-corrected least squares method offers flexibility in selecting the penalty function from a class of functions with different features. Importantly, our method does not require full distributional assumptions for the associated variables, thereby broadening its applicability. We rigorously establish theoretical results and describe a computationally efficient procedure for the proposed method. Numerical studies confirm the satisfactory performance of the proposed method under finite settings, and also demonstrate deleterious effects of ignoring measurement error in inferential procedures.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"202 ","pages":"Article 105299"},"PeriodicalIF":1.4000,"publicationDate":"2024-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Multivariate Analysis","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0047259X2400006X","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
Multivariate regression models have been broadly used in analyzing data having multi-dimensional response variables. The use of such models is, however, impeded by the presence of measurement error and spurious variables. While data with such features are common in applications, there has been little work available concerning these features jointly. In this article, we consider variable selection under multivariate regression models with covariates subject to measurement error. To gain flexibility, we allow the dimensions of the covariate and response variables to be either fixed or diverging as the sample size increases. A new regularized method is proposed to handle both variable selection and measurement error effects for error-contaminated data. Our proposed penalized bias-corrected least squares method offers flexibility in selecting the penalty function from a class of functions with different features. Importantly, our method does not require full distributional assumptions for the associated variables, thereby broadening its applicability. We rigorously establish theoretical results and describe a computationally efficient procedure for the proposed method. Numerical studies confirm the satisfactory performance of the proposed method under finite settings, and also demonstrate deleterious effects of ignoring measurement error in inferential procedures.
期刊介绍:
Founded in 1971, the Journal of Multivariate Analysis (JMVA) is the central venue for the publication of new, relevant methodology and particularly innovative applications pertaining to the analysis and interpretation of multidimensional data.
The journal welcomes contributions to all aspects of multivariate data analysis and modeling, including cluster analysis, discriminant analysis, factor analysis, and multidimensional continuous or discrete distribution theory. Topics of current interest include, but are not limited to, inferential aspects of
Copula modeling
Functional data analysis
Graphical modeling
High-dimensional data analysis
Image analysis
Multivariate extreme-value theory
Sparse modeling
Spatial statistics.