BOOSTED NONPARAMETRIC HAZARDS WITH TIME-DEPENDENT COVARIATES.

IF 3.7 1区数学 Q1 STATISTICS & PROBABILITY Annals of Statistics Pub Date : 2021-08-01 Epub Date: 2021-09-29 DOI:10.1214/20-aos2028

Donald K K Lee, Ningyuan Chen, Hemant Ishwaran

引用次数: 0

Abstract

Given functional data from a survival process with time-dependent covariates, we derive a smooth convex representation for its nonparametric log-likelihood functional and obtain its functional gradient. From this we devise a generic gradient boosting procedure for estimating the hazard function nonparametrically. An illustrative implementation of the procedure using regression trees is described to show how to recover the unknown hazard. The generic estimator is consistent if the model is correctly specified; alternatively an oracle inequality can be demonstrated for tree-based models. To avoid overfitting, boosting employs several regularization devices. One of them is step-size restriction, but the rationale for this is somewhat mysterious from the viewpoint of consistency. Our work brings some clarity to this issue by revealing that step-size restriction is a mechanism for preventing the curvature of the risk from derailing convergence.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

具有时间相关协变量的增强非参数风险。

给定具有时变协变量的生存过程的函数数据，导出了其非参数对数似然泛函的光滑凸表示，并得到了其泛函梯度。在此基础上，我们设计了一种非参数估计危险函数的一般梯度增强方法。描述了使用回归树的过程的说明性实现，以显示如何恢复未知的危险。如果正确指定了模型，则通用估计量是一致的;另外，可以为基于树的模型演示oracle不等式。为了避免过拟合，增强采用了几个正则化装置。其中之一是步长限制，但从一致性的角度来看，其基本原理有些神秘。我们的工作通过揭示步长限制是防止偏离收敛的风险曲率的机制，为这个问题提供了一些清晰度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Annals of Statistics 数学-统计学与概率论

CiteScore

9.30

自引率

8.90%

发文量

119

审稿时长

6-12 weeks

期刊介绍： The Annals of Statistics aim to publish research papers of highest quality reflecting the many facets of contemporary statistics. Primary emphasis is placed on importance and originality, not on formalism. The journal aims to cover all areas of statistics, especially mathematical statistics and applied & interdisciplinary statistics. Of course many of the best papers will touch on more than one of these general areas, because the discipline of statistics has deep roots in mathematics, and in substantive scientific fields.