{"title":"Oracle-efficient M-estimation for single-index models with a smooth simultaneous confidence band","authors":"Li Cai, Lei Jin, Jiuzhou Miao, Suojin Wang","doi":"10.1007/s11749-024-00935-9","DOIUrl":null,"url":null,"abstract":"<p>Single-index models are important and popular semiparametric models, as they can handle the problem of the “curse of dimensionality” and enjoy the flexibility of nonparametric modeling and the interpretability of parametric modeling. Most existing methods for single-index models are sensitive to outliers or heavy-tailed distributions because they use the least squares criterion. An oracle-efficient M-estimator is proposed for single-index models, and a smooth simultaneous confidence band is constructed by treating the index coefficients as nuisance parameters. Under general assumptions it is shown that the M-estimator for the nonparametric link function, based on any <span>\\(\\sqrt{n}\\)</span>-consistent coefficient index parameter estimators, is oracle-efficient. This means that it is uniformly as efficient as the infeasible one obtained by M-regression using the true single-index coefficient parameters. As a result, the asymptotic distribution of the maximal deviation between the M-type kernel estimator and the true link function is derived, and an asymptotically accurate simultaneous confidence band is established as a global inference tool for the link function. The proposed method generalizes the desirable uniform convergence property of ordinary least squares to the M-estimation. Meanwhile, it is a general approach that allows any <span>\\(\\sqrt{n}\\)</span>-consistent coefficient parameter estimators to be applied in the procedure to make global inferences for the link function. Simulation studies with commonly encountered sample sizes are reported to support the theoretical findings. These numerical results show certain desirable robustness properties against heavy-tailed errors and outliers. As an illustration, the proposed method is applied to the analysis of a car purchasing dataset.</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"32 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Test","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s11749-024-00935-9","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
Single-index models are important and popular semiparametric models, as they can handle the problem of the “curse of dimensionality” and enjoy the flexibility of nonparametric modeling and the interpretability of parametric modeling. Most existing methods for single-index models are sensitive to outliers or heavy-tailed distributions because they use the least squares criterion. An oracle-efficient M-estimator is proposed for single-index models, and a smooth simultaneous confidence band is constructed by treating the index coefficients as nuisance parameters. Under general assumptions it is shown that the M-estimator for the nonparametric link function, based on any \(\sqrt{n}\)-consistent coefficient index parameter estimators, is oracle-efficient. This means that it is uniformly as efficient as the infeasible one obtained by M-regression using the true single-index coefficient parameters. As a result, the asymptotic distribution of the maximal deviation between the M-type kernel estimator and the true link function is derived, and an asymptotically accurate simultaneous confidence band is established as a global inference tool for the link function. The proposed method generalizes the desirable uniform convergence property of ordinary least squares to the M-estimation. Meanwhile, it is a general approach that allows any \(\sqrt{n}\)-consistent coefficient parameter estimators to be applied in the procedure to make global inferences for the link function. Simulation studies with commonly encountered sample sizes are reported to support the theoretical findings. These numerical results show certain desirable robustness properties against heavy-tailed errors and outliers. As an illustration, the proposed method is applied to the analysis of a car purchasing dataset.
单指数模型是重要而流行的半参数模型,因为它们可以处理 "维度诅咒 "问题,并享有非参数建模的灵活性和参数建模的可解释性。大多数现有的单指数模型方法都对异常值或重尾分布很敏感,因为它们使用的是最小二乘法准则。本文为单指数模型提出了一种具有甲骨文效率的 M 估计器,并通过将指数系数视为干扰参数,构建了一个平滑的同步置信带。在一般假设下,基于任何 \(\sqrt{n}\)-一致的系数指数参数估计器的非参数链接函数的 M-估计器是具有甲骨文效率的。这意味着它的效率与使用真实的单指数系数参数通过 M 回归得到的不可行估计一样高。因此,推导出了 M 型核估计器与真实链接函数之间最大偏差的渐近分布,并建立了渐近精确的同步置信带,作为链接函数的全局推断工具。所提出的方法将普通最小二乘法理想的均匀收敛特性推广到了 M 型估计。同时,它是一种通用方法,允许在程序中应用任何(\sqrt{n}\)一致的系数参数估计器来对联系函数进行全局推断。为支持理论研究结果,报告还对常见样本量进行了模拟研究。这些数值结果表明,对重尾误差和异常值具有某些理想的稳健性。作为示例,我们将所提出的方法应用于汽车购买数据集的分析。
期刊介绍:
TEST is an international journal of Statistics and Probability, sponsored by the Spanish Society of Statistics and Operations Research. English is the official language of the journal.
The emphasis of TEST is placed on papers containing original theoretical contributions of direct or potential value in applications. In this respect, the methodological contents are considered to be crucial for the papers published in TEST, but the practical implications of the methodological aspects are also relevant. Original sound manuscripts on either well-established or emerging areas in the scope of the journal are welcome.
One volume is published annually in four issues. In addition to the regular contributions, each issue of TEST contains an invited paper from a world-wide recognized outstanding statistician on an up-to-date challenging topic, including discussions.