{"title":"Orthogonality-projection-based penalized variable selection for high-dimensional partially linear models","authors":"Yiping Yang , Peixin Zhao , Jun Zhang","doi":"10.1016/j.apm.2024.115785","DOIUrl":null,"url":null,"abstract":"<div><div>The Smoothly Clipped Absolute Deviation Net variable selection process is introduced for high-dimensional partially linear models. To mitigate the influence of nonparametric components on variable selection and estimation of parametric components, B-spline and orthogonality-projection techniques are employed, thereby constructing the Smoothly Clipped Absolute Deviation Net penalized least squares objective function. The proposed variable selection process not only identifies relevant variables but also simultaneously provides estimates for the selected variables. Additionally, we present estimators for the nonparametric components. Under certain regularization conditions, the variable selection procedure for the proposed parametric components has been proven to exhibit the grouping effect and oracle property, ensuring accurate variable selection. Concurrently, the estimation of nonparametric components achieves the optimal convergence rate for nonparametric estimation. To efficiently implement this process, we propose an algorithm based on local linear approximation and utilize <em>K</em>-fold cross-validation to select the penalty parameter. Our simulation studies and analyses of real data comprehensively cover scenarios where the number of variables <em>p</em> is either less than or far greater than the sample size <em>n</em>.</div></div>","PeriodicalId":50980,"journal":{"name":"Applied Mathematical Modelling","volume":"138 ","pages":"Article 115785"},"PeriodicalIF":4.4000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Mathematical Modelling","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0307904X24005389","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The Smoothly Clipped Absolute Deviation Net variable selection process is introduced for high-dimensional partially linear models. To mitigate the influence of nonparametric components on variable selection and estimation of parametric components, B-spline and orthogonality-projection techniques are employed, thereby constructing the Smoothly Clipped Absolute Deviation Net penalized least squares objective function. The proposed variable selection process not only identifies relevant variables but also simultaneously provides estimates for the selected variables. Additionally, we present estimators for the nonparametric components. Under certain regularization conditions, the variable selection procedure for the proposed parametric components has been proven to exhibit the grouping effect and oracle property, ensuring accurate variable selection. Concurrently, the estimation of nonparametric components achieves the optimal convergence rate for nonparametric estimation. To efficiently implement this process, we propose an algorithm based on local linear approximation and utilize K-fold cross-validation to select the penalty parameter. Our simulation studies and analyses of real data comprehensively cover scenarios where the number of variables p is either less than or far greater than the sample size n.
针对高维部分线性模型引入了平滑截断绝对偏差网变量选择过程。为了减轻非参数成分对变量选择和参数成分估算的影响,采用了 B-样条和正交投影技术,从而构建了平滑截断绝对偏差网惩罚最小二乘法目标函数。拟议的变量选择过程不仅能识别相关变量,还能同时提供所选变量的估计值。此外,我们还提出了非参数成分的估计值。在一定的正则化条件下,所提出的参数成分的变量选择过程已被证明具有分组效应和甲骨文特性,从而确保了变量选择的准确性。同时,非参数成分的估计达到了非参数估计的最佳收敛速率。为了有效地实现这一过程,我们提出了一种基于局部线性近似的算法,并利用 K 倍交叉验证来选择惩罚参数。我们的模拟研究和真实数据分析全面涵盖了变量数量 p 小于或远大于样本量 n 的情况。
期刊介绍:
Applied Mathematical Modelling focuses on research related to the mathematical modelling of engineering and environmental processes, manufacturing, and industrial systems. A significant emerging area of research activity involves multiphysics processes, and contributions in this area are particularly encouraged.
This influential publication covers a wide spectrum of subjects including heat transfer, fluid mechanics, CFD, and transport phenomena; solid mechanics and mechanics of metals; electromagnets and MHD; reliability modelling and system optimization; finite volume, finite element, and boundary element procedures; modelling of inventory, industrial, manufacturing and logistics systems for viable decision making; civil engineering systems and structures; mineral and energy resources; relevant software engineering issues associated with CAD and CAE; and materials and metallurgical engineering.
Applied Mathematical Modelling is primarily interested in papers developing increased insights into real-world problems through novel mathematical modelling, novel applications or a combination of these. Papers employing existing numerical techniques must demonstrate sufficient novelty in the solution of practical problems. Papers on fuzzy logic in decision-making or purely financial mathematics are normally not considered. Research on fractional differential equations, bifurcation, and numerical methods needs to include practical examples. Population dynamics must solve realistic scenarios. Papers in the area of logistics and business modelling should demonstrate meaningful managerial insight. Submissions with no real-world application will not be considered.