首页 > 最新文献

Annals of Statistics最新文献

英文 中文
FUNCTIONAL SUFFICIENT DIMENSION REDUCTION THROUGH AVERAGE FRÉCHET DERIVATIVES. 通过平均弗雷谢特导数实现函数充分降维。
IF 4.5 1区 数学 Q1 Mathematics Pub Date : 2022-04-01 Epub Date: 2022-04-07 DOI: 10.1214/21-aos2131
Kuang-Yao Lee, Lexin Li

Sufficient dimension reduction (SDR) embodies a family of methods that aim for reduction of dimensionality without loss of information in a regression setting. In this article, we propose a new method for nonparametric function-on-function SDR, where both the response and the predictor are a function. We first develop the notions of functional central mean subspace and functional central subspace, which form the population targets of our functional SDR. We then introduce an average Fréchet derivative estimator, which extends the gradient of the regression function to the operator level and enables us to develop estimators for our functional dimension reduction spaces. We show the resulting functional SDR estimators are unbiased and exhaustive, and more importantly, without imposing any distributional assumptions such as the linearity or the constant variance conditions that are commonly imposed by all existing functional SDR methods. We establish the uniform convergence of the estimators for the functional dimension reduction spaces, while allowing both the number of Karhunen-Loève expansions and the intrinsic dimension to diverge with the sample size. We demonstrate the efficacy of the proposed methods through both simulations and two real data examples.

充分降维法(SDR)是一系列在回归环境中减少维度而不损失信息的方法。在本文中,我们提出了一种非参数函数对函数 SDR 的新方法,其中响应和预测都是一个函数。我们首先提出了函数中心均值子空间和函数中心子空间的概念,它们构成了函数 SDR 的群体目标。然后,我们引入平均弗雷谢特导数估计器,它将回归函数的梯度扩展到算子层面,使我们能够为函数降维空间开发估计器。我们证明了由此产生的函数 SDR 估计器是无偏的、详尽的,更重要的是,它不需要施加任何分布假设,如线性或恒定方差条件,而这些假设是所有现有函数 SDR 方法普遍施加的。我们建立了函数降维空间估计器的均匀收敛性,同时允许卡尔胡宁-洛埃夫展开数和本征维度随样本大小发散。我们通过模拟和两个真实数据实例证明了所提方法的有效性。
{"title":"FUNCTIONAL SUFFICIENT DIMENSION REDUCTION THROUGH AVERAGE FRÉCHET DERIVATIVES.","authors":"Kuang-Yao Lee, Lexin Li","doi":"10.1214/21-aos2131","DOIUrl":"10.1214/21-aos2131","url":null,"abstract":"<p><p>Sufficient dimension reduction (SDR) embodies a family of methods that aim for reduction of dimensionality without loss of information in a regression setting. In this article, we propose a new method for nonparametric function-on-function SDR, where both the response and the predictor are a function. We first develop the notions of functional central mean subspace and functional central subspace, which form the population targets of our functional SDR. We then introduce an average Fréchet derivative estimator, which extends the gradient of the regression function to the operator level and enables us to develop estimators for our functional dimension reduction spaces. We show the resulting functional SDR estimators are unbiased and exhaustive, and more importantly, without imposing any distributional assumptions such as the linearity or the constant variance conditions that are commonly imposed by all existing functional SDR methods. We establish the uniform convergence of the estimators for the functional dimension reduction spaces, while allowing both the number of Karhunen-Loève expansions and the intrinsic dimension to diverge with the sample size. We demonstrate the efficacy of the proposed methods through both simulations and two real data examples.</p>","PeriodicalId":8032,"journal":{"name":"Annals of Statistics","volume":null,"pages":null},"PeriodicalIF":4.5,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10085580/pdf/nihms-1746366.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9320340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial dependence and space–time trend in extreme events 极端事件的空间依赖性和时空趋势
IF 4.5 1区 数学 Q1 Mathematics Pub Date : 2022-02-01 DOI: 10.1214/21-aos2067
John H. J. Einmahl,Ana Ferreira,Laurens de Haan,Cláudia Neves,Chen Zhou
The statistical theory of extremes is extended to observations that are non-stationary and not independent. The non-stationarity over time and space is controlled via the scedasis (tail scale) in the marginal distributions. Spatial dependence stems from multivariate extreme value theory. We establish asymptotic theory for both the weighted sequential tail empirical process and the weighted tail quantile process based on all observations, taken over time and space. The results yield two statistical tests for homoscedasticity in the tail, one in space and one in time. Further, we show that the common extreme value index can be estimated via a pseudo-maximum likelihood procedure based on pooling all (non-stationary and dependent) observations. Our leading example and application is rainfall in Northern Germany.
极值的统计理论被扩展到非平稳和非独立的观测。随时间和空间的非平稳性是通过边际分布的尾标度来控制的。空间依赖源于多元极值理论。我们建立了加权顺序尾经验过程和加权尾分位数过程的渐近理论,基于所有的观察,采取了时间和空间。结果对尾部的均方差进行了两个统计检验,一个是空间检验,一个是时间检验。此外,我们表明,公共极值指数可以通过基于池化所有(非平稳和依赖)观测值的伪极大似然过程来估计。我们的主要例子和应用是德国北部的降雨。
{"title":"Spatial dependence and space–time trend in extreme events","authors":"John H. J. Einmahl,Ana Ferreira,Laurens de Haan,Cláudia Neves,Chen Zhou","doi":"10.1214/21-aos2067","DOIUrl":"https://doi.org/10.1214/21-aos2067","url":null,"abstract":"The statistical theory of extremes is extended to observations that are non-stationary and not independent. The non-stationarity over time and space is controlled via the scedasis (tail scale) in the marginal distributions. Spatial dependence stems from multivariate extreme value theory. We establish asymptotic theory for both the weighted sequential tail empirical process and the weighted tail quantile process based on all observations, taken over time and space. The results yield two statistical tests for homoscedasticity in the tail, one in space and one in time. Further, we show that the common extreme value index can be estimated via a pseudo-maximum likelihood procedure based on pooling all (non-stationary and dependent) observations. Our leading example and application is rainfall in Northern Germany.","PeriodicalId":8032,"journal":{"name":"Annals of Statistics","volume":null,"pages":null},"PeriodicalIF":4.5,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138531661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SEMIPARAMETRIC LATENT-CLASS MODELS FOR MULTIVARIATE LONGITUDINAL AND SURVIVAL DATA. 多变量纵向和生存数据的半参数潜伏类模型。
IF 4.5 1区 数学 Q1 Mathematics Pub Date : 2022-02-01 Epub Date: 2022-02-16 DOI: 10.1214/21-aos2117
Kin Yau Wong, Donglin Zeng, D Y Lin

In long-term follow-up studies, data are often collected on repeated measures of multivariate response variables as well as on time to the occurrence of a certain event. To jointly analyze such longitudinal data and survival time, we propose a general class of semiparametric latent-class models that accommodates a heterogeneous study population with flexible dependence structures between the longitudinal and survival outcomes. We combine nonparametric maximum likelihood estimation with sieve estimation and devise an efficient EM algorithm to implement the proposed approach. We establish the asymptotic properties of the proposed estimators through novel use of modern empirical process theory, sieve estimation theory, and semiparametric efficiency theory. Finally, we demonstrate the advantages of the proposed methods through extensive simulation studies and provide an application to the Atherosclerosis Risk in Communities study.

在长期随访研究中,经常收集多变量反应变量的重复测量数据以及某一事件发生的时间数据。为了联合分析这些纵向数据和生存时间,我们提出了一类一般的半参数潜在类模型,该模型适应了纵向结果和生存结果之间具有灵活依赖结构的异质性研究人群。我们将非参数最大似然估计与筛估计相结合,并设计了一种有效的EM算法来实现所提出的方法。我们通过新颖地使用现代经验过程理论、筛估计理论和半参数效率理论,建立了所提出的估计量的渐近性质。最后,我们通过广泛的模拟研究证明了所提出方法的优势,并为社区动脉粥样硬化风险研究提供了应用。
{"title":"SEMIPARAMETRIC LATENT-CLASS MODELS FOR MULTIVARIATE LONGITUDINAL AND SURVIVAL DATA.","authors":"Kin Yau Wong, Donglin Zeng, D Y Lin","doi":"10.1214/21-aos2117","DOIUrl":"10.1214/21-aos2117","url":null,"abstract":"<p><p>In long-term follow-up studies, data are often collected on repeated measures of multivariate response variables as well as on time to the occurrence of a certain event. To jointly analyze such longitudinal data and survival time, we propose a general class of semiparametric latent-class models that accommodates a heterogeneous study population with flexible dependence structures between the longitudinal and survival outcomes. We combine nonparametric maximum likelihood estimation with sieve estimation and devise an efficient EM algorithm to implement the proposed approach. We establish the asymptotic properties of the proposed estimators through novel use of modern empirical process theory, sieve estimation theory, and semiparametric efficiency theory. Finally, we demonstrate the advantages of the proposed methods through extensive simulation studies and provide an application to the Atherosclerosis Risk in Communities study.</p>","PeriodicalId":8032,"journal":{"name":"Annals of Statistics","volume":null,"pages":null},"PeriodicalIF":4.5,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9269993/pdf/nihms-1764505.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10155118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CANONICAL THRESHOLDING FOR NON-SPARSE HIGH-DIMENSIONAL LINEAR REGRESSION. 非稀疏高维线性回归的典型阈值。
IF 4.5 1区 数学 Q1 Mathematics Pub Date : 2022-02-01 Epub Date: 2022-02-16 DOI: 10.1214/21-aos2116
Igor Silin, Jianqing Fan

We consider a high-dimensional linear regression problem. Unlike many papers on the topic, we do not require sparsity of the regression coefficients; instead, our main structural assumption is a decay of eigenvalues of the covariance matrix of the data. We propose a new family of estimators, called the canonical thresholding estimators, which pick largest regression coefficients in the canonical form. The estimators admit an explicit form and can be linked to LASSO and Principal Component Regression (PCR). A theoretical analysis for both fixed design and random design settings is provided. Obtained bounds on the mean squared error and the prediction error of a specific estimator from the family allow to clearly state sufficient conditions on the decay of eigenvalues to ensure convergence. In addition, we promote the use of the relative errors, strongly linked with the out-of-sample R 2. The study of these relative errors leads to a new concept of joint effective dimension, which incorporates the covariance of the data and the regression coefficients simultaneously, and describes the complexity of a linear regression problem. Some minimax lower bounds are established to showcase the optimality of our procedure. Numerical simulations confirm good performance of the proposed estimators compared to the previously developed methods.

我们考虑一个高维线性回归问题。与许多关于该主题的论文不同,我们不需要回归系数的稀疏性;相反,我们的主要结构假设是数据协方差矩阵的特征值的衰减。我们提出了一种新的估计量,称为规范阈值估计量,它以规范形式选择最大的回归系数。估计量采用显式形式,可以与LASSO和主成分回归(PCR)联系起来。对固定设计和随机设计进行了理论分析。得到了该族中特定估计量的均方误差和预测误差的界,从而清楚地说明了特征值衰减的充分条件以保证收敛。此外,我们提倡使用与样本外r2密切相关的相对误差。通过对这些相对误差的研究,提出了联合有效维数的概念,该概念将数据的协方差和回归系数同时考虑在内,描述了线性回归问题的复杂性。建立了一些极大极小下界来展示我们的方法的最优性。数值模拟结果表明,所提出的估计器与以前开发的方法相比具有良好的性能。
{"title":"CANONICAL THRESHOLDING FOR NON-SPARSE HIGH-DIMENSIONAL LINEAR REGRESSION.","authors":"Igor Silin,&nbsp;Jianqing Fan","doi":"10.1214/21-aos2116","DOIUrl":"https://doi.org/10.1214/21-aos2116","url":null,"abstract":"<p><p>We consider a high-dimensional linear regression problem. Unlike many papers on the topic, we do not require sparsity of the regression coefficients; instead, our main structural assumption is a decay of eigenvalues of the covariance matrix of the data. We propose a new family of estimators, called the canonical thresholding estimators, which pick largest regression coefficients in the canonical form. The estimators admit an explicit form and can be linked to LASSO and Principal Component Regression (PCR). A theoretical analysis for both fixed design and random design settings is provided. Obtained bounds on the mean squared error and the prediction error of a specific estimator from the family allow to clearly state sufficient conditions on the decay of eigenvalues to ensure convergence. In addition, we promote the use of the relative errors, strongly linked with the out-of-sample <i>R</i> <sup>2</sup>. The study of these relative errors leads to a new concept of joint effective dimension, which incorporates the covariance of the data and the regression coefficients simultaneously, and describes the complexity of a linear regression problem. Some minimax lower bounds are established to showcase the optimality of our procedure. Numerical simulations confirm good performance of the proposed estimators compared to the previously developed methods.</p>","PeriodicalId":8032,"journal":{"name":"Annals of Statistics","volume":null,"pages":null},"PeriodicalIF":4.5,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9491498/pdf/nihms-1782574.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33478241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Total variation regularized Fréchet regression for metric-space valued data 度量空间值数据的总变分正则化fr<s:1>回归
IF 4.5 1区 数学 Q1 Mathematics Pub Date : 2021-12-01 DOI: 10.1214/21-aos2095
Zhenhua Lin,Hans-Georg Müller
Non-Euclidean data that are indexed with a scalar predictor such as time are increasingly encountered in data applications, while statistical methodology and theory for such random objects are not well developed yet. To address the need for new methodology in this area, we develop a total variation regularization technique for nonparametric Frechet regression, which refers to a regression setting where a response residing in a generic metric space is paired with a scalar predictor and the target is a conditional Frechet mean. Specifically, we seek to approximate an unknown metric-space valued function by an estimator that minimizes the Frechet version of least squares and at the same time has small total variation, appropriately defined for metric-space valued objects. We show that the resulting estimator is representable by a piece-wise constant function and establish the minimax convergence rate of the proposed estimator for metric data objects that reside in Hadamard spaces. We illustrate the numerical performance of the proposed method for both simulated and real data, including metric spaces of symmetric positive-definite matrices with the affine-invariant distance, of probability distributions on the real line with the Wasserstein distance, and of phylogenetic trees with the Billera--Holmes--Vogtmann metric.
{"title":"Total variation regularized Fréchet regression for metric-space valued data","authors":"Zhenhua Lin,Hans-Georg Müller","doi":"10.1214/21-aos2095","DOIUrl":"https://doi.org/10.1214/21-aos2095","url":null,"abstract":"Non-Euclidean data that are indexed with a scalar predictor such as time are increasingly encountered in data applications, while statistical methodology and theory for such random objects are not well developed yet. To address the need for new methodology in this area, we develop a total variation regularization technique for nonparametric Frechet regression, which refers to a regression setting where a response residing in a generic metric space is paired with a scalar predictor and the target is a conditional Frechet mean. Specifically, we seek to approximate an unknown metric-space valued function by an estimator that minimizes the Frechet version of least squares and at the same time has small total variation, appropriately defined for metric-space valued objects. We show that the resulting estimator is representable by a piece-wise constant function and establish the minimax convergence rate of the proposed estimator for metric data objects that reside in Hadamard spaces. We illustrate the numerical performance of the proposed method for both simulated and real data, including metric spaces of symmetric positive-definite matrices with the affine-invariant distance, of probability distributions on the real line with the Wasserstein distance, and of phylogenetic trees with the Billera--Holmes--Vogtmann metric.","PeriodicalId":8032,"journal":{"name":"Annals of Statistics","volume":null,"pages":null},"PeriodicalIF":4.5,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138508224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
BRIDGING CONVEX AND NONCONVEX OPTIMIZATION IN ROBUST PCA: NOISE, OUTLIERS, AND MISSING DATA. 在鲁棒 PCA 中连接凸优化和非凸优化:噪声、异常值和缺失数据。
IF 3.2 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2021-10-01 Epub Date: 2021-11-12 DOI: 10.1214/21-aos2066
Yuxin Chen, Jianqing Fan, Cong Ma, Yuling Yan

This paper delivers improved theoretical guarantees for the convex programming approach in low-rank matrix estimation, in the presence of (1) random noise, (2) gross sparse outliers, and (3) missing data. This problem, often dubbed as robust principal component analysis (robust PCA), finds applications in various domains. Despite the wide applicability of convex relaxation, the available statistical support (particularly the stability analysis vis-à-vis random noise) remains highly suboptimal, which we strengthen in this paper. When the unknown matrix is well-conditioned, incoherent, and of constant rank, we demonstrate that a principled convex program achieves near-optimal statistical accuracy, in terms of both the Euclidean loss and the loss. All of this happens even when nearly a constant fraction of observations are corrupted by outliers with arbitrary magnitudes. The key analysis idea lies in bridging the convex program in use and an auxiliary nonconvex optimization algorithm, and hence the title of this paper.

本文为低秩矩阵估计中的凸编程方法提供了改进的理论保证,这种方法适用于 (1) 随机噪声、(2) 严重稀疏异常值和 (3) 数据缺失的情况。这个问题通常被称为鲁棒主成分分析(鲁棒 PCA),在各个领域都有应用。尽管凸松弛具有广泛的适用性,但现有的统计支持(尤其是针对随机噪声的稳定性分析)仍然非常不理想,我们在本文中将对此进行强化。当未知矩阵条件良好、不连贯且秩恒定时,我们证明了原则性凸程序在欧氏损失和 ℓ ∞ 损失方面都能达到近乎最优的统计精度。即使有近乎恒定的部分观测数据被任意大小的异常值所干扰,所有这一切也会发生。关键的分析思路在于将使用中的凸程序与辅助的非凸优化算法连接起来,这也是本文标题的由来。
{"title":"BRIDGING CONVEX AND NONCONVEX OPTIMIZATION IN ROBUST PCA: NOISE, OUTLIERS, AND MISSING DATA.","authors":"Yuxin Chen, Jianqing Fan, Cong Ma, Yuling Yan","doi":"10.1214/21-aos2066","DOIUrl":"10.1214/21-aos2066","url":null,"abstract":"<p><p>This paper delivers improved theoretical guarantees for the convex programming approach in low-rank matrix estimation, in the presence of (1) random noise, (2) gross sparse outliers, and (3) missing data. This problem, often dubbed as <i>robust principal component analysis (robust PCA)</i>, finds applications in various domains. Despite the wide applicability of convex relaxation, the available statistical support (particularly the stability analysis vis-à-vis random noise) remains highly suboptimal, which we strengthen in this paper. When the unknown matrix is well-conditioned, incoherent, and of constant rank, we demonstrate that a principled convex program achieves near-optimal statistical accuracy, in terms of both the Euclidean loss and the <i>ℓ</i> <sub>∞</sub> loss. All of this happens even when nearly a constant fraction of observations are corrupted by outliers with arbitrary magnitudes. The key analysis idea lies in bridging the convex program in use and an auxiliary nonconvex optimization algorithm, and hence the title of this paper.</p>","PeriodicalId":8032,"journal":{"name":"Annals of Statistics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9491514/pdf/nihms-1782570.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33479290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BOOSTED NONPARAMETRIC HAZARDS WITH TIME-DEPENDENT COVARIATES. 具有时间相关协变量的增强非参数风险。
IF 4.5 1区 数学 Q1 Mathematics Pub Date : 2021-08-01 Epub Date: 2021-09-29 DOI: 10.1214/20-aos2028
Donald K K Lee, Ningyuan Chen, Hemant Ishwaran

Given functional data from a survival process with time-dependent covariates, we derive a smooth convex representation for its nonparametric log-likelihood functional and obtain its functional gradient. From this we devise a generic gradient boosting procedure for estimating the hazard function nonparametrically. An illustrative implementation of the procedure using regression trees is described to show how to recover the unknown hazard. The generic estimator is consistent if the model is correctly specified; alternatively an oracle inequality can be demonstrated for tree-based models. To avoid overfitting, boosting employs several regularization devices. One of them is step-size restriction, but the rationale for this is somewhat mysterious from the viewpoint of consistency. Our work brings some clarity to this issue by revealing that step-size restriction is a mechanism for preventing the curvature of the risk from derailing convergence.

给定具有时变协变量的生存过程的函数数据,导出了其非参数对数似然泛函的光滑凸表示,并得到了其泛函梯度。在此基础上,我们设计了一种非参数估计危险函数的一般梯度增强方法。描述了使用回归树的过程的说明性实现,以显示如何恢复未知的危险。如果正确指定了模型,则通用估计量是一致的;另外,可以为基于树的模型演示oracle不等式。为了避免过拟合,增强采用了几个正则化装置。其中之一是步长限制,但从一致性的角度来看,其基本原理有些神秘。我们的工作通过揭示步长限制是防止偏离收敛的风险曲率的机制,为这个问题提供了一些清晰度。
{"title":"BOOSTED NONPARAMETRIC HAZARDS WITH TIME-DEPENDENT COVARIATES.","authors":"Donald K K Lee,&nbsp;Ningyuan Chen,&nbsp;Hemant Ishwaran","doi":"10.1214/20-aos2028","DOIUrl":"https://doi.org/10.1214/20-aos2028","url":null,"abstract":"<p><p>Given functional data from a survival process with time-dependent covariates, we derive a smooth convex representation for its nonparametric log-likelihood functional and obtain its functional gradient. From this we devise a generic gradient boosting procedure for estimating the hazard function nonparametrically. An illustrative implementation of the procedure using regression trees is described to show how to recover the unknown hazard. The generic estimator is consistent if the model is correctly specified; alternatively an oracle inequality can be demonstrated for tree-based models. To avoid overfitting, boosting employs several regularization devices. One of them is step-size restriction, but the rationale for this is somewhat mysterious from the viewpoint of consistency. Our work brings some clarity to this issue by revealing that step-size restriction is a mechanism for preventing the curvature of the risk from derailing convergence.</p>","PeriodicalId":8032,"journal":{"name":"Annals of Statistics","volume":null,"pages":null},"PeriodicalIF":4.5,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8691747/pdf/nihms-1683276.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39748775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ASYMPTOTIC DISTRIBUTIONS OF HIGH-DIMENSIONAL DISTANCE CORRELATION INFERENCE. 高维距离相关推断的渐近分布。
IF 4.5 1区 数学 Q1 Mathematics Pub Date : 2021-08-01 Epub Date: 2021-09-29 DOI: 10.1214/20-aos2024
Lan Gao, Yingying Fan, Jinchi Lv, Qi-Man Shao

Distance correlation has become an increasingly popular tool for detecting the nonlinear dependence between a pair of potentially high-dimensional random vectors. Most existing works have explored its asymptotic distributions under the null hypothesis of independence between the two random vectors when only the sample size or the dimensionality diverges. Yet its asymptotic null distribution for the more realistic setting when both sample size and dimensionality diverge in the full range remains largely underdeveloped. In this paper, we fill such a gap and develop central limit theorems and associated rates of convergence for a rescaled test statistic based on the bias-corrected distance correlation in high dimensions under some mild regularity conditions and the null hypothesis. Our new theoretical results reveal an interesting phenomenon of blessing of dimensionality for high-dimensional distance correlation inference in the sense that the accuracy of normal approximation can increase with dimensionality. Moreover, we provide a general theory on the power analysis under the alternative hypothesis of dependence, and further justify the capability of the rescaled distance correlation in capturing the pure nonlinear dependency under moderately high dimensionality for a certain type of alternative hypothesis. The theoretical results and finite-sample performance of the rescaled statistic are illustrated with several simulation examples and a blockchain application.

距离相关性已成为检测一对潜在高维随机向量之间非线性依赖关系的一种日益流行的工具。大多数现有研究都探讨了距离相关在两个随机向量之间独立性的零假设下的渐近分布,即只有样本大小或维度发生偏离时的渐近分布。然而,在更现实的情况下,当样本大小和维度都在全范围内发散时,其渐近零分布在很大程度上仍未得到充分发展。在本文中,我们填补了这一空白,并在一些温和的正则条件和零假设下,建立了基于高维度偏差校正距离相关性的重标检验统计量的中心极限定理和相关收敛率。我们的新理论结果揭示了高维距离相关推断的一个有趣的维度祝福现象,即正态逼近的准确性会随着维度的增加而增加。此外,我们还提供了关于依赖性替代假设下幂次分析的一般理论,并进一步证明了在某类替代假设下,重标度距离相关在中等高维条件下捕捉纯非线性依赖性的能力。通过几个模拟实例和一个区块链应用,说明了重标度统计量的理论结果和有限样本性能。
{"title":"ASYMPTOTIC DISTRIBUTIONS OF HIGH-DIMENSIONAL DISTANCE CORRELATION INFERENCE.","authors":"Lan Gao, Yingying Fan, Jinchi Lv, Qi-Man Shao","doi":"10.1214/20-aos2024","DOIUrl":"10.1214/20-aos2024","url":null,"abstract":"<p><p>Distance correlation has become an increasingly popular tool for detecting the nonlinear dependence between a pair of potentially high-dimensional random vectors. Most existing works have explored its asymptotic distributions under the null hypothesis of independence between the two random vectors when only the sample size or the dimensionality diverges. Yet its asymptotic null distribution for the more realistic setting when both sample size and dimensionality diverge in the full range remains largely underdeveloped. In this paper, we fill such a gap and develop central limit theorems and associated rates of convergence for a rescaled test statistic based on the bias-corrected distance correlation in high dimensions under some mild regularity conditions and the null hypothesis. Our new theoretical results reveal an interesting phenomenon of blessing of dimensionality for high-dimensional distance correlation inference in the sense that the accuracy of normal approximation can increase with dimensionality. Moreover, we provide a general theory on the power analysis under the alternative hypothesis of dependence, and further justify the capability of the rescaled distance correlation in capturing the pure nonlinear dependency under moderately high dimensionality for a certain type of alternative hypothesis. The theoretical results and finite-sample performance of the rescaled statistic are illustrated with several simulation examples and a blockchain application.</p>","PeriodicalId":8032,"journal":{"name":"Annals of Statistics","volume":null,"pages":null},"PeriodicalIF":4.5,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8491772/pdf/nihms-1684707.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39495519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A SHRINKAGE PRINCIPLE FOR HEAVY-TAILED DATA: HIGH-DIMENSIONAL ROBUST LOW-RANK MATRIX RECOVERY. 重尾数据的收缩原理:高维稳健低秩矩阵恢复。
IF 4.5 1区 数学 Q1 Mathematics Pub Date : 2021-06-01 Epub Date: 2021-08-09 DOI: 10.1214/20-aos1980
Jianqing Fan, Weichen Wang, Ziwei Zhu

This paper introduces a simple principle for robust statistical inference via appropriate shrinkage on the data. This widens the scope of high-dimensional techniques, reducing the distributional conditions from sub-exponential or sub-Gaussian to more relaxed bounded second or fourth moment. As an illustration of this principle, we focus on robust estimation of the low-rank matrix Θ* from the trace regression model Y = Tr(Θ* X) + ϵ. It encompasses four popular problems: sparse linear model, compressed sensing, matrix completion and multi-task learning. We propose to apply the penalized least-squares approach to the appropriately truncated or shrunk data. Under only bounded 2+δ moment condition on the response, the proposed robust methodology yields an estimator that possesses the same statistical error rates as previous literature with sub-Gaussian errors. For sparse linear model and multi-task regression, we further allow the design to have only bounded fourth moment and obtain the same statistical rates. As a byproduct, we give a robust covariance estimator with concentration inequality and optimal rate of convergence in terms of the spectral norm, when the samples only bear bounded fourth moment. This result is of its own interest and importance. We reveal that under high dimensions, the sample covariance matrix is not optimal whereas our proposed robust covariance can achieve optimality. Extensive simulations are carried out to support the theories.

本文介绍了一种通过对数据进行适当收缩来进行稳健统计推断的简单原理。这拓宽了高维技术的范围,将分布条件从亚指数或亚高斯减少到更宽松的有界二阶或四阶矩。作为这一原理的说明,我们专注于从迹回归模型Y=Tr(Θ*⊤X)+Γ对低秩矩阵Θ*的鲁棒估计。它包括四个常见的问题:稀疏线性模型、压缩感知、矩阵完成和多任务学习。我们建议将惩罚最小二乘法应用于适当截断或收缩的数据。在只有响应的有界2+δ矩条件下,所提出的鲁棒方法产生了一个估计器,该估计器具有与先前文献相同的统计误差率,具有亚高斯误差。对于稀疏线性模型和多任务回归,我们进一步允许设计只有有界的四阶矩,并获得相同的统计率。作为副产品,当样本仅具有有界四阶矩时,我们根据谱范数给出了一个具有浓度不等式和最优收敛率的鲁棒协方差估计器。这一结果有其自身的利益和重要性。我们揭示了在高维下,样本协方差矩阵不是最优的,而我们提出的鲁棒协方差可以实现最优性。进行了大量的模拟来支持这些理论。
{"title":"A SHRINKAGE PRINCIPLE FOR HEAVY-TAILED DATA: HIGH-DIMENSIONAL ROBUST LOW-RANK MATRIX RECOVERY.","authors":"Jianqing Fan,&nbsp;Weichen Wang,&nbsp;Ziwei Zhu","doi":"10.1214/20-aos1980","DOIUrl":"10.1214/20-aos1980","url":null,"abstract":"<p><p>This paper introduces a simple principle for robust statistical inference via appropriate shrinkage on the data. This widens the scope of high-dimensional techniques, reducing the distributional conditions from sub-exponential or sub-Gaussian to more relaxed bounded second or fourth moment. As an illustration of this principle, we focus on robust estimation of the low-rank matrix <b>Θ</b>* from the trace regression model <i>Y</i> = Tr(<b>Θ</b>*<sup>⊤</sup> <b>X</b>) + <i>ϵ</i>. It encompasses four popular problems: sparse linear model, compressed sensing, matrix completion and multi-task learning. We propose to apply the penalized least-squares approach to the appropriately truncated or shrunk data. Under only bounded 2+<i>δ</i> moment condition on the response, the proposed robust methodology yields an estimator that possesses the same statistical error rates as previous literature with sub-Gaussian errors. For sparse linear model and multi-task regression, we further allow the design to have only bounded fourth moment and obtain the same statistical rates. As a byproduct, we give a robust covariance estimator with concentration inequality and optimal rate of convergence in terms of the spectral norm, when the samples only bear bounded fourth moment. This result is of its own interest and importance. We reveal that under high dimensions, the sample covariance matrix is not optimal whereas our proposed robust covariance can achieve optimality. Extensive simulations are carried out to support the theories.</p>","PeriodicalId":8032,"journal":{"name":"Annals of Statistics","volume":null,"pages":null},"PeriodicalIF":4.5,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8457508/pdf/nihms-1639579.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39443652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 69
Survival analysis via hierarchically dependent mixture hazards 通过分级依赖混合物危害进行生存分析
IF 4.5 1区 数学 Q1 Mathematics Pub Date : 2021-04-01 DOI: 10.1214/20-AOS1982
F. Camerlenghi, A. Lijoi, I. Pruenster
Hierarchical nonparametric processes are popular tools for defining priors on collections of probability distributions, which induce dependence across multiple samples. In survival analysis problems one is typically interested in modeling the hazard rates, rather than the probability distributions themselves, and the currently available methodologies are not applicable. Here we fill this gap by introducing a novel, and analytically tractable, class of multivariate mixtures whose distribution acts as a prior for the vector of sample–specific baseline hazard rates. The dependence is induced through a hierarchical specification for the mixing random measures that ultimately corresponds to a composition of random discrete combinatorial structures. Our theoretical results allow to develop a full Bayesian analysis for this class of models, which can also account for right–censored survival data and covariates, and we also show posterior consistency. In particular, we emphasize that the posterior characterization we achieve is the key for devising both marginal and conditional algorithms for evaluating Bayesian inferences of interest. The effectiveness of our proposal is illustrated through some synthetic and real data examples.
分层非参数过程是在概率分布集合上定义先验的流行工具,它会引起多个样本之间的依赖性。在生存分析问题中,人们通常对危险率建模感兴趣,而不是对概率分布本身感兴趣,并且目前可用的方法不适用。在这里,我们通过引入一类新的、可分析处理的多元混合物来填补这一空白,其分布充当样本特异性基线危险率向量的先验。这种依赖性是通过混合随机度量的分层规范来诱导的,该混合随机度量最终对应于随机离散组合结构的组成。我们的理论结果允许为这类模型开发一个完整的贝叶斯分析,它也可以解释右删失生存数据和协变量,我们还显示了后验一致性。特别是,我们强调,我们实现的后验特征是设计用于评估感兴趣的贝叶斯推断的边际和条件算法的关键。通过一些综合和真实的数据实例说明了我们的建议的有效性。
{"title":"Survival analysis via hierarchically dependent mixture hazards","authors":"F. Camerlenghi, A. Lijoi, I. Pruenster","doi":"10.1214/20-AOS1982","DOIUrl":"https://doi.org/10.1214/20-AOS1982","url":null,"abstract":"Hierarchical nonparametric processes are popular tools for defining priors on collections of probability distributions, which induce dependence across multiple samples. In survival analysis problems one is typically interested in modeling the hazard rates, rather than the probability distributions themselves, and the currently available methodologies are not applicable. Here we fill this gap by introducing a novel, and analytically tractable, class of multivariate mixtures whose distribution acts as a prior for the vector of sample–specific baseline hazard rates. The dependence is induced through a hierarchical specification for the mixing random measures that ultimately corresponds to a composition of random discrete combinatorial structures. Our theoretical results allow to develop a full Bayesian analysis for this class of models, which can also account for right–censored survival data and covariates, and we also show posterior consistency. In particular, we emphasize that the posterior characterization we achieve is the key for devising both marginal and conditional algorithms for evaluating Bayesian inferences of interest. The effectiveness of our proposal is illustrated through some synthetic and real data examples.","PeriodicalId":8032,"journal":{"name":"Annals of Statistics","volume":null,"pages":null},"PeriodicalIF":4.5,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49201957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
期刊
Annals of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1