首页 > 最新文献

Journal of Nonparametric Statistics最新文献

英文 中文
Nonparametric estimation of isotropic covariance function 各向同性协方差函数的非参数估计
IF 1.2 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-11-22 DOI: 10.1080/10485252.2022.2146111
Yiming Wang, Sujit K. Ghosh
A nonparametric model using a sequence of Bernstein polynomials is constructed to approximate arbitrary isotropic covariance functions valid in and related approximation properties are investigated using the popular norm and norms. A computationally efficient sieve maximum likelihood (sML) estimation is then developed to nonparametrically estimate the unknown isotropic covariance function valid in . Consistency of the proposed sieve ML estimator is established under increasing domain regime. The proposed methodology is compared numerically with couple of existing nonparametric as well as with commonly used parametric methods. Numerical results based on simulated data show that our approach outperforms the parametric methods in reducing bias due to model misspecification and also the nonparametric methods in terms of having significantly lower values of expected and norms. Application to precipitation data is illustrated to showcase a real case study. Additional technical details and numerical illustrations are also made available.
利用Bernstein多项式序列构造了一个非参数模型来逼近任意有效的各向同性协方差函数,并利用常用范数和范数研究了相关的逼近性质。然后,提出了一种计算效率高的筛极大似然(sML)估计方法,用于非参数估计中有效的未知各向同性协方差函数。在递增域下,证明了所提筛ML估计的一致性。将该方法与现有的几种非参数方法以及常用的参数方法进行了数值比较。基于模拟数据的数值结果表明,我们的方法在减少由于模型错误规范引起的偏差方面优于参数方法,并且在期望值和规范值显着降低方面优于非参数方法。并举例说明了在降水数据中的应用,以展示一个实际案例研究。还提供了额外的技术细节和数字插图。
{"title":"Nonparametric estimation of isotropic covariance function","authors":"Yiming Wang, Sujit K. Ghosh","doi":"10.1080/10485252.2022.2146111","DOIUrl":"https://doi.org/10.1080/10485252.2022.2146111","url":null,"abstract":"A nonparametric model using a sequence of Bernstein polynomials is constructed to approximate arbitrary isotropic covariance functions valid in and related approximation properties are investigated using the popular norm and norms. A computationally efficient sieve maximum likelihood (sML) estimation is then developed to nonparametrically estimate the unknown isotropic covariance function valid in . Consistency of the proposed sieve ML estimator is established under increasing domain regime. The proposed methodology is compared numerically with couple of existing nonparametric as well as with commonly used parametric methods. Numerical results based on simulated data show that our approach outperforms the parametric methods in reducing bias due to model misspecification and also the nonparametric methods in terms of having significantly lower values of expected and norms. Application to precipitation data is illustrated to showcase a real case study. Additional technical details and numerical illustrations are also made available.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"19 1","pages":"198 - 237"},"PeriodicalIF":1.2,"publicationDate":"2022-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76341515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A high-dimensional test for the k-sample Behrens–Fisher problem k-样本Behrens-Fisher问题的高维检验
IF 1.2 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-11-21 DOI: 10.1080/10485252.2022.2147172
Daojiang He, Huijun Shi, Kai Xu, M. Cao
In this paper, the problem of testing the equality of the mean vectors of k populations with possibly unknown and unequal covariance matrices is investigated in high-dimensional settings. The null distributions of most existing tests are asymptotically normal which inevitably imposes strong conditions on covariance matrices. However, we assume here only mild additional conditions on the proposed test, which offers much flexibility in practical applications. Additionally, the Welch–Satterthwaite -approximation we adopted can automatically mimic the shape of the null distribution of the proposed test statistic, while the normal approximation cannot achieve the adaptivity. Finally, an extensive simulation study shows that the proposed test has better performance on both size and power compared with existing methods.
本文研究了在高维环境下,k个具有可能未知的不相等协方差矩阵的总体的均值向量是否相等的检验问题。大多数现有检验的零分布是渐近正态的,这不可避免地对协方差矩阵施加了很强的条件。然而,我们在这里假设只有轻微的附加条件,建议的测试,这在实际应用中提供了很大的灵活性。此外,我们采用的Welch-Satterthwaite近似可以自动模拟所提出的检验统计量的零分布形状,而正态近似不能实现自适应。最后,大量的仿真研究表明,与现有的测试方法相比,所提出的测试方法在尺寸和功耗方面都具有更好的性能。
{"title":"A high-dimensional test for the k-sample Behrens–Fisher problem","authors":"Daojiang He, Huijun Shi, Kai Xu, M. Cao","doi":"10.1080/10485252.2022.2147172","DOIUrl":"https://doi.org/10.1080/10485252.2022.2147172","url":null,"abstract":"In this paper, the problem of testing the equality of the mean vectors of k populations with possibly unknown and unequal covariance matrices is investigated in high-dimensional settings. The null distributions of most existing tests are asymptotically normal which inevitably imposes strong conditions on covariance matrices. However, we assume here only mild additional conditions on the proposed test, which offers much flexibility in practical applications. Additionally, the Welch–Satterthwaite -approximation we adopted can automatically mimic the shape of the null distribution of the proposed test statistic, while the normal approximation cannot achieve the adaptivity. Finally, an extensive simulation study shows that the proposed test has better performance on both size and power compared with existing methods.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"29 1","pages":"239 - 265"},"PeriodicalIF":1.2,"publicationDate":"2022-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75115241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust oracle estimation and uncertainty quantification for possibly sparse quantiles 可能稀疏分位数的稳健oracle估计和不确定性量化
IF 1.2 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-11-18 DOI: 10.1080/10485252.2023.2226779
E. Belitser, P. Serra, Alexandra G. J. Vegelien
A general many quantiles + noise model is studied in the robust formulation (allowing non-normal, non-independent observations), where the identifiability requirement for the noise is formulated in terms of quantiles rather than the traditional zero expectation assumption. We propose a penalization method based on the quantile loss function with appropriately chosen penalty function making inference on possibly sparse high-dimensional quantile vector. We apply a local approach to address the optimality by comparing procedures to the oracle sparsity structure. We establish that the proposed procedure mimics the oracle in the problems of estimation and uncertainty quantification (under the so called EBR condition). Adaptive minimax results over sparsity scale follow from our local results.
在鲁棒公式中研究了一般的多分位数+噪声模型(允许非正态、非独立的观测),其中噪声的可识别性要求是根据分位数而不是传统的零期望假设来制定的。提出了一种基于分位数损失函数的惩罚方法,选择适当的惩罚函数对可能稀疏的高维分位数向量进行推理。我们通过将过程与oracle稀疏性结构进行比较,采用局部方法来解决最优性问题。我们证明了所提出的方法在估计和不确定性量化问题中(在所谓的EBR条件下)是模拟oracle的。在稀疏度尺度上的自适应极大极小结果遵循我们的局部结果。
{"title":"Robust oracle estimation and uncertainty quantification for possibly sparse quantiles","authors":"E. Belitser, P. Serra, Alexandra G. J. Vegelien","doi":"10.1080/10485252.2023.2226779","DOIUrl":"https://doi.org/10.1080/10485252.2023.2226779","url":null,"abstract":"A general many quantiles + noise model is studied in the robust formulation (allowing non-normal, non-independent observations), where the identifiability requirement for the noise is formulated in terms of quantiles rather than the traditional zero expectation assumption. We propose a penalization method based on the quantile loss function with appropriately chosen penalty function making inference on possibly sparse high-dimensional quantile vector. We apply a local approach to address the optimality by comparing procedures to the oracle sparsity structure. We establish that the proposed procedure mimics the oracle in the problems of estimation and uncertainty quantification (under the so called EBR condition). Adaptive minimax results over sparsity scale follow from our local results.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"458 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78296795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Minimax estimation in multi-task regression under low-rank structures 低秩结构下多任务回归的极大极小估计
IF 1.2 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-11-17 DOI: 10.1080/10485252.2022.2146110
Kwan-Young Bak, J. Koo
This study investigates the minimaxity of a multi-task nonparametric regression problem. We formulate a simultaneous function estimation problem based on information pooling across multiple experiments under a low-dimensional structure. A nonparametric reduced rank regression estimator based on the nuclear norm penalisation scheme is proposed to incorporate the low-dimensional structure in the estimation process. A rank of a set of functions is defined in terms of their Fourier coefficients to formally characterise the dependence structure among functions. Minimax upper and lower bounds are established under various asymptotic scenarios to examine the role of the low-rank structure in determining optimal rates of convergence. The results confirm that exploiting the low-rank structure can significantly improve the convergence rate for the simultaneous estimation of multiple functions. The results also imply that the proposed estimator is rate optimal in the minimax sense for the rank-constraint Sobolev class of vector-valued functions.
本文研究了多任务非参数回归问题的极小性问题。在低维结构下,提出了一个基于多实验信息池的同时函数估计问题。提出了一种基于核范数惩罚方案的非参数降阶回归估计器,将低维结构纳入估计过程。一组函数的秩是用它们的傅里叶系数来定义的,以形式化地表征函数之间的依赖结构。在各种渐近情形下建立了极大极小上界和下界,以检验低秩结构在确定最优收敛速率中的作用。结果表明,利用低秩结构可以显著提高多函数同时估计的收敛速度。结果还表明,对于秩约束Sobolev类向量值函数,所提估计量在极小极大意义上是速率最优的。
{"title":"Minimax estimation in multi-task regression under low-rank structures","authors":"Kwan-Young Bak, J. Koo","doi":"10.1080/10485252.2022.2146110","DOIUrl":"https://doi.org/10.1080/10485252.2022.2146110","url":null,"abstract":"This study investigates the minimaxity of a multi-task nonparametric regression problem. We formulate a simultaneous function estimation problem based on information pooling across multiple experiments under a low-dimensional structure. A nonparametric reduced rank regression estimator based on the nuclear norm penalisation scheme is proposed to incorporate the low-dimensional structure in the estimation process. A rank of a set of functions is defined in terms of their Fourier coefficients to formally characterise the dependence structure among functions. Minimax upper and lower bounds are established under various asymptotic scenarios to examine the role of the low-rank structure in determining optimal rates of convergence. The results confirm that exploiting the low-rank structure can significantly improve the convergence rate for the simultaneous estimation of multiple functions. The results also imply that the proposed estimator is rate optimal in the minimax sense for the rank-constraint Sobolev class of vector-valued functions.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"57 1","pages":"122 - 144"},"PeriodicalIF":1.2,"publicationDate":"2022-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76856131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Functional linear model with partially observed covariate and missing values in the response 具有部分观测到的协变量和响应中缺失值的泛函线性模型
IF 1.2 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-11-11 DOI: 10.1080/10485252.2022.2142222
Christophe Crambes, Chayma Daayeb, A. Gannoun, Yousri Henchiri
Dealing with missing values is an important issue in data observation or data recording process. In this paper, we consider a functional linear regression model with partially observed covariate and missing values in the response. We use a reconstruction operator that aims at recovering the missing parts of the explanatory curves, then we are interested in regression imputation method of missing data on the response variable, using functional principal component regression to estimate the functional coefficient of the model. We study the asymptotic behaviour of the prediction error when missing data are replaced by the imputed values in the original dataset. The practical behaviour of the method is also studied on simulated data and a real dataset.
缺失值的处理是数据观测或记录过程中的一个重要问题。在本文中,我们考虑了一个响应中有部分观测协变量和缺失值的函数线性回归模型。我们使用了一个旨在恢复解释曲线缺失部分的重构算子,然后我们感兴趣的是缺失数据在响应变量上的回归imputation方法,使用函数主成分回归来估计模型的功能系数。我们研究了当缺失数据被原始数据集中的输入值所取代时预测误差的渐近行为。在模拟数据和实际数据集上研究了该方法的实际性能。
{"title":"Functional linear model with partially observed covariate and missing values in the response","authors":"Christophe Crambes, Chayma Daayeb, A. Gannoun, Yousri Henchiri","doi":"10.1080/10485252.2022.2142222","DOIUrl":"https://doi.org/10.1080/10485252.2022.2142222","url":null,"abstract":"Dealing with missing values is an important issue in data observation or data recording process. In this paper, we consider a functional linear regression model with partially observed covariate and missing values in the response. We use a reconstruction operator that aims at recovering the missing parts of the explanatory curves, then we are interested in regression imputation method of missing data on the response variable, using functional principal component regression to estimate the functional coefficient of the model. We study the asymptotic behaviour of the prediction error when missing data are replaced by the imputed values in the original dataset. The practical behaviour of the method is also studied on simulated data and a real dataset.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"36 1","pages":"172 - 197"},"PeriodicalIF":1.2,"publicationDate":"2022-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79794673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inference on semi-parametric transformation model with a pairwise likelihood based on left-truncated and interval-censored data 基于左截断和区间截除数据的两两似然半参数变换模型的推理
IF 1.2 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-10-26 DOI: 10.1080/10485252.2022.2138383
Yichen Lou, Peijie Wang, Jianguo Sun
Semi-parametric transformation models provide a general and flexible class of models for regression analysis of failure time data and many methods have been developed for their estimation. In particular, they include the proportional hazards and proportional odds models as special cases. In this paper, we discuss the situation where one observes left-truncated and interval-censored data, for which it does not seem to exist an established method. For the problem, in contrast to the commonly used conditional approach that may not be efficient, a pairwise pseudo-likelihood method is proposed to recover some missing information in the conditional method. The proposed estimators are proved to be consistent and asymptotically efficient and normal. A simulation study is conducted to assess the empirical performance of the method and suggests that it works well in practical situations. This method is illustrated by using a set of real data arising from an HIV/AIDS cohort study.
半参数转换模型为失效时间数据的回归分析提供了一种通用的、灵活的模型,并发展了许多方法对其进行估计。特别是,它们将比例风险模型和比例几率模型作为特殊情况。在本文中,我们讨论了观察左截断和区间截尾数据的情况,对于这种情况似乎不存在既定的方法。针对该问题,针对常用的条件方法可能效率不高的问题,提出了一种成对伪似然方法来恢复条件方法中缺失的部分信息。证明了所提估计量是一致的、渐近有效的和正态的。通过仿真研究对该方法的经验性能进行了评估,并表明该方法在实际情况下效果良好。该方法通过使用一组来自艾滋病毒/艾滋病队列研究的真实数据来说明。
{"title":"Inference on semi-parametric transformation model with a pairwise likelihood based on left-truncated and interval-censored data","authors":"Yichen Lou, Peijie Wang, Jianguo Sun","doi":"10.1080/10485252.2022.2138383","DOIUrl":"https://doi.org/10.1080/10485252.2022.2138383","url":null,"abstract":"Semi-parametric transformation models provide a general and flexible class of models for regression analysis of failure time data and many methods have been developed for their estimation. In particular, they include the proportional hazards and proportional odds models as special cases. In this paper, we discuss the situation where one observes left-truncated and interval-censored data, for which it does not seem to exist an established method. For the problem, in contrast to the commonly used conditional approach that may not be efficient, a pairwise pseudo-likelihood method is proposed to recover some missing information in the conditional method. The proposed estimators are proved to be consistent and asymptotically efficient and normal. A simulation study is conducted to assess the empirical performance of the method and suggests that it works well in practical situations. This method is illustrated by using a set of real data arising from an HIV/AIDS cohort study.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"122 1","pages":"38 - 55"},"PeriodicalIF":1.2,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79082730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Asymptotic and finite sample properties of Hill-type estimators in the presence of errors in observations 观测值存在误差时hill型估计量的渐近和有限样本性质
IF 1.2 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-10-24 DOI: 10.1080/10485252.2022.2136662
Mihyun Kim, P. Kokoszka
We establish asymptotic and finite sample properties of the Hill and Harmonic Moment estimators applied to heavy-tailed data contaminated by errors. We formulate conditions on the errors and the number of upper order statistics under which these estimators continue to be asymptotically normal. We specify analogous conditions which must hold in finite samples for the confidence intervals derived from the asymptotic normal distribution to be reliable. In the large sample analysis, we specify conditions related to second-order regular variation and divergence rates for the number of upper order statistics, k, used to compute the estimators. In the finite sample analysis, we examine several data-driven methods of selecting k, and determine which of them are most suitable for confidence interval inference. The results of these investigations are applied to interarrival times of internet traffic anomalies, which are available only with a round-off error.
我们建立了应用于被误差污染的重尾数据的希尔矩估计和调和矩估计的渐近和有限样本性质。我们给出了这些估计量继续渐近正态的误差和上阶统计量的个数的条件。我们指定了类似的条件,这些条件必须在有限样本中成立,由渐近正态分布导出的置信区间才可靠。在大样本分析中,我们为用于计算估计量的上阶统计量k指定了与二阶正则变化和散度率相关的条件。在有限样本分析中,我们研究了几种选择k的数据驱动方法,并确定其中哪一种最适合置信区间推断。这些调查结果应用于互联网流量异常的到达间隔时间,该时间只有舍入误差。
{"title":"Asymptotic and finite sample properties of Hill-type estimators in the presence of errors in observations","authors":"Mihyun Kim, P. Kokoszka","doi":"10.1080/10485252.2022.2136662","DOIUrl":"https://doi.org/10.1080/10485252.2022.2136662","url":null,"abstract":"We establish asymptotic and finite sample properties of the Hill and Harmonic Moment estimators applied to heavy-tailed data contaminated by errors. We formulate conditions on the errors and the number of upper order statistics under which these estimators continue to be asymptotically normal. We specify analogous conditions which must hold in finite samples for the confidence intervals derived from the asymptotic normal distribution to be reliable. In the large sample analysis, we specify conditions related to second-order regular variation and divergence rates for the number of upper order statistics, k, used to compute the estimators. In the finite sample analysis, we examine several data-driven methods of selecting k, and determine which of them are most suitable for confidence interval inference. The results of these investigations are applied to interarrival times of internet traffic anomalies, which are available only with a round-off error.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"76 1","pages":"1 - 18"},"PeriodicalIF":1.2,"publicationDate":"2022-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91178412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recursive kernel estimator in a semiparametric regression model 半参数回归模型中的递归核估计
IF 1.2 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-10-10 DOI: 10.1080/10485252.2022.2130308
Emmanuel De Dieu Nkou
Sliced inverse regression (SIR) is a recommended method to identify and estimate the central dimension reduction (CDR) subspace. CDR subspace is at the base to describe the conditional distribution of the response Y given a d-dimensional predictor vector X. To estimate this space, two versions are very popular: the slice version and the kernel version. A recursive method of the slice version has already been the subject of a systematic study. In this paper, we propose to study the kernel version. It's a recursive method based on a stochastic approximation algorithm of the kernel version. The asymptotic normality of the proposed estimator is also proved. A simulation study that not only shows the good numerical performance of the proposed estimate and which also allows to evaluate its performance with respect to existing methods is presented. A real dataset is also used to illustrate the approach.
切片逆回归(SIR)是中心降维(CDR)子空间识别和估计的一种推荐方法。CDR子空间是描述给定d维预测向量x的响应Y的条件分布的基础。为了估计这个空间,有两个非常流行的版本:切片版本和内核版本。切片版本的递归方法已经得到了系统的研究。在本文中,我们建议研究内核版本。这是一种基于核版本随机逼近算法的递归方法。并证明了所提估计量的渐近正态性。仿真研究不仅表明所提出的估计具有良好的数值性能,而且还可以根据现有方法对其性能进行评估。一个真实的数据集也被用来说明这种方法。
{"title":"Recursive kernel estimator in a semiparametric regression model","authors":"Emmanuel De Dieu Nkou","doi":"10.1080/10485252.2022.2130308","DOIUrl":"https://doi.org/10.1080/10485252.2022.2130308","url":null,"abstract":"Sliced inverse regression (SIR) is a recommended method to identify and estimate the central dimension reduction (CDR) subspace. CDR subspace is at the base to describe the conditional distribution of the response Y given a d-dimensional predictor vector X. To estimate this space, two versions are very popular: the slice version and the kernel version. A recursive method of the slice version has already been the subject of a systematic study. In this paper, we propose to study the kernel version. It's a recursive method based on a stochastic approximation algorithm of the kernel version. The asymptotic normality of the proposed estimator is also proved. A simulation study that not only shows the good numerical performance of the proposed estimate and which also allows to evaluate its performance with respect to existing methods is presented. A real dataset is also used to illustrate the approach.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"187 1","pages":"145 - 171"},"PeriodicalIF":1.2,"publicationDate":"2022-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75406537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uniform convergence rates and automatic variable selection in nonparametric regression with functional and categorical covariates 函数协变量和分类协变量非参数回归的一致收敛率和自动变量选择
IF 1.2 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-09-29 DOI: 10.1080/10485252.2023.2207673
Leonie Selk
In Selk and Gertheiss (2022) a nonparametric prediction method for models with multiple functional and categorical covariates is introduced. The dependent variable can be categorical (binary or multi-class) or continuous, thus both classification and regression problems are considered. In the paper at hand the asymptotic properties of this method are developed. A uniform rate of convergence for the regression / classification estimator is given. Further it is shown that, asymptotically, a data-driven least squares cross-validation method can automatically remove irrelevant, noise variables.
在Selk和Gertheiss(2022)中,介绍了具有多个函数和分类协变量的模型的非参数预测方法。因变量可以是分类的(二元或多类)或连续的,因此分类和回归问题都被考虑。本文给出了该方法的渐近性质。给出了回归/分类估计器的统一收敛速率。进一步表明,渐近地,数据驱动的最小二乘交叉验证方法可以自动去除不相关的噪声变量。
{"title":"Uniform convergence rates and automatic variable selection in nonparametric regression with functional and categorical covariates","authors":"Leonie Selk","doi":"10.1080/10485252.2023.2207673","DOIUrl":"https://doi.org/10.1080/10485252.2023.2207673","url":null,"abstract":"In Selk and Gertheiss (2022) a nonparametric prediction method for models with multiple functional and categorical covariates is introduced. The dependent variable can be categorical (binary or multi-class) or continuous, thus both classification and regression problems are considered. In the paper at hand the asymptotic properties of this method are developed. A uniform rate of convergence for the regression / classification estimator is given. Further it is shown that, asymptotically, a data-driven least squares cross-validation method can automatically remove irrelevant, noise variables.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"1 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2022-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88658073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Another look at halfspace depth: flag halfspaces with applications 另一个关于半空间深度的研究:在应用程序中标记半空间
IF 1.2 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-09-23 DOI: 10.1080/10485252.2023.2236721
Duvsan Pokorn'y, P. Laketa, Stanislav Nagy
The halfspace depth is a well studied tool of nonparametric statistics in multivariate spaces, naturally inducing a multivariate generalisation of quantiles. The halfspace depth of a point with respect to a measure is defined as the infimum mass of closed halfspaces that contain the given point. In general, a closed halfspace that attains that infimum does not have to exist. We introduce a flag halfspace - an intermediary between a closed halfspace and its interior. We demonstrate that the halfspace depth can be equivalently formulated also in terms of flag halfspaces, and that there always exists a flag halfspace whose boundary passes through any given point $x$, and has mass exactly equal to the halfspace depth of $x$. Flag halfspaces allow us to derive theoretical results regarding the halfspace depth without the need to differentiate absolutely continuous measures from measures containing atoms, as was frequently done previously. The notion of flag halfspaces is used to state results on the dimensionality of the halfspace median set for random samples. We prove that under mild conditions, the dimension of the sample halfspace median set of $d$-variate data cannot be $d-1$, and that for $d=2$ the sample halfspace median set must be either a two-dimensional convex polygon, or a data point. The latter result guarantees that the computational algorithm for the sample halfspace median form the R package TukeyRegion is exact also in the case when the median set is less-than-full-dimensional in dimension $d=2$.
半空间深度是多元空间中非参数统计的一个很好的研究工具,自然会引起分位数的多元推广。点相对于测量的半空间深度定义为包含给定点的封闭半空间的最小质量。一般来说,达到这个极限的封闭半空间并不一定存在。我们引入了一个标志半空间——一个封闭半空间与其内部之间的中介。我们证明了半空间深度也可以等价地用标志半空间表示,并且总是存在一个标志半空间,其边界经过任意给定的点$x$,其质量正好等于$x$的半空间深度。Flag半空间允许我们得出关于半空间深度的理论结果,而不需要区分绝对连续的测量和包含原子的测量,就像以前经常做的那样。标志半空间的概念用于描述随机样本的半空间中位数集维数的结果。我们证明了在温和条件下,$d$变量数据的样本半空间中位数集的维数不能为$d-1$,并且对于$d=2$,样本半空间中位数集必须是一个二维凸多边形,或者是一个数据点。后一个结果保证了R包TukeyRegion的样本半空间中位数的计算算法在维数$d=2$中位数集小于全维的情况下也是精确的。
{"title":"Another look at halfspace depth: flag halfspaces with applications","authors":"Duvsan Pokorn'y, P. Laketa, Stanislav Nagy","doi":"10.1080/10485252.2023.2236721","DOIUrl":"https://doi.org/10.1080/10485252.2023.2236721","url":null,"abstract":"The halfspace depth is a well studied tool of nonparametric statistics in multivariate spaces, naturally inducing a multivariate generalisation of quantiles. The halfspace depth of a point with respect to a measure is defined as the infimum mass of closed halfspaces that contain the given point. In general, a closed halfspace that attains that infimum does not have to exist. We introduce a flag halfspace - an intermediary between a closed halfspace and its interior. We demonstrate that the halfspace depth can be equivalently formulated also in terms of flag halfspaces, and that there always exists a flag halfspace whose boundary passes through any given point $x$, and has mass exactly equal to the halfspace depth of $x$. Flag halfspaces allow us to derive theoretical results regarding the halfspace depth without the need to differentiate absolutely continuous measures from measures containing atoms, as was frequently done previously. The notion of flag halfspaces is used to state results on the dimensionality of the halfspace median set for random samples. We prove that under mild conditions, the dimension of the sample halfspace median set of $d$-variate data cannot be $d-1$, and that for $d=2$ the sample halfspace median set must be either a two-dimensional convex polygon, or a data point. The latter result guarantees that the computational algorithm for the sample halfspace median form the R package TukeyRegion is exact also in the case when the median set is less-than-full-dimensional in dimension $d=2$.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"31 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73149443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Journal of Nonparametric Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1