Youngjoo Cho, Annette M Molinaro, Chen Hu, Robert L Strawderman
{"title":"Regression trees and ensembles for cumulative incidence functions.","authors":"Youngjoo Cho, Annette M Molinaro, Chen Hu, Robert L Strawderman","doi":"10.1515/ijb-2021-0014","DOIUrl":null,"url":null,"abstract":"<p><p>The use of cumulative incidence functions for characterizing the risk of one type of event in the presence of others has become increasingly popular over the past two decades. The problems of modeling, estimation and inference have been treated using parametric, nonparametric and semi-parametric methods. Efforts to develop suitable extensions of machine learning methods, such as regression trees and ensemble methods, have begun comparatively recently. In this paper, we propose a novel approach to estimating cumulative incidence curves in a competing risks setting using regression trees and associated ensemble estimators. The proposed methods use augmented estimators of the Brier score risk as the primary basis for building and pruning trees, and lead to methods that are easily implemented using existing R packages. Data from the Radiation Therapy Oncology Group (trial 9410) is used to illustrate these new methods.</p>","PeriodicalId":75022,"journal":{"name":"","volume":"18 2","pages":"397-419"},"PeriodicalIF":0.0,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9509494/pdf/ijb-18-2-ijb-2021-0014.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1515/ijb-2021-0014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/11/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The use of cumulative incidence functions for characterizing the risk of one type of event in the presence of others has become increasingly popular over the past two decades. The problems of modeling, estimation and inference have been treated using parametric, nonparametric and semi-parametric methods. Efforts to develop suitable extensions of machine learning methods, such as regression trees and ensemble methods, have begun comparatively recently. In this paper, we propose a novel approach to estimating cumulative incidence curves in a competing risks setting using regression trees and associated ensemble estimators. The proposed methods use augmented estimators of the Brier score risk as the primary basis for building and pruning trees, and lead to methods that are easily implemented using existing R packages. Data from the Radiation Therapy Oncology Group (trial 9410) is used to illustrate these new methods.
在过去二十年里,使用累积发生率函数来描述一类事件在其他事件存在的情况下的风险变得越来越流行。建模、估算和推断等问题一直使用参数、非参数和半参数方法进行处理。开发机器学习方法的适当扩展(如回归树和集合方法)的工作是最近才开始的。在本文中,我们提出了一种新方法,利用回归树和相关的集合估计器来估计竞争风险环境下的累积发病率曲线。所提出的方法使用布赖尔评分风险的增强估计器作为构建和修剪树的主要基础,并可使用现有的 R 软件包轻松实现。放疗肿瘤学组(9410 试验)的数据被用来说明这些新方法。