Pub Date : 2024-01-01Epub Date: 2023-09-02DOI: 10.1007/s10985-023-09605-8
Xiao Li, Brent R Logan, S M Ferdous Hossain, Erica E M Moodie
To achieve the goal of providing the best possible care to each individual under their care, physicians need to customize treatments for individuals with the same health state, especially when treating diseases that can progress further and require additional treatments, such as cancer. Making decisions at multiple stages as a disease progresses can be formalized as a dynamic treatment regime (DTR). Most of the existing optimization approaches for estimating dynamic treatment regimes including the popular method of Q-learning were developed in a frequentist context. Recently, a general Bayesian machine learning framework that facilitates using Bayesian regression modeling to optimize DTRs has been proposed. In this article, we adapt this approach to censored outcomes using Bayesian additive regression trees (BART) for each stage under the accelerated failure time modeling framework, along with simulation studies and a real data example that compare the proposed approach with Q-learning. We also develop an R wrapper function that utilizes a standard BART survival model to optimize DTRs for censored outcomes. The wrapper function can easily be extended to accommodate any type of Bayesian machine learning model.
{"title":"Dynamic Treatment Regimes Using Bayesian Additive Regression Trees for Censored Outcomes.","authors":"Xiao Li, Brent R Logan, S M Ferdous Hossain, Erica E M Moodie","doi":"10.1007/s10985-023-09605-8","DOIUrl":"10.1007/s10985-023-09605-8","url":null,"abstract":"<p><p>To achieve the goal of providing the best possible care to each individual under their care, physicians need to customize treatments for individuals with the same health state, especially when treating diseases that can progress further and require additional treatments, such as cancer. Making decisions at multiple stages as a disease progresses can be formalized as a dynamic treatment regime (DTR). Most of the existing optimization approaches for estimating dynamic treatment regimes including the popular method of Q-learning were developed in a frequentist context. Recently, a general Bayesian machine learning framework that facilitates using Bayesian regression modeling to optimize DTRs has been proposed. In this article, we adapt this approach to censored outcomes using Bayesian additive regression trees (BART) for each stage under the accelerated failure time modeling framework, along with simulation studies and a real data example that compare the proposed approach with Q-learning. We also develop an R wrapper function that utilizes a standard BART survival model to optimize DTRs for censored outcomes. The wrapper function can easily be extended to accommodate any type of Bayesian machine learning model.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10764602/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10513626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-01Epub Date: 2023-02-23DOI: 10.1007/s10985-023-09590-y
Wu Xue, Xiaoke Zhang, Kwun Chuen Gary Chan, Raymond K W Wong
Survival causal effect estimation based on right-censored data is of key interest in both survival analysis and causal inference. Propensity score weighting is one of the most popular methods in the literature. However, since it involves the inverse of propensity score estimates, its practical performance may be very unstable, especially when the covariate overlap is limited between treatment and control groups. To address this problem, a covariate balancing method is developed in this paper to estimate the counterfactual survival function. The proposed method is nonparametric and balances covariates in a reproducing kernel Hilbert space (RKHS) via weights that are counterparts of inverse propensity scores. The uniform rate of convergence for the proposed estimator is shown to be the same as that for the classical Kaplan-Meier estimator. The appealing practical performance of the proposed method is demonstrated by a simulation study as well as two real data applications to study the causal effect of smoking on survival time of stroke patients and that of endotoxin on survival time for female patients with lung cancer respectively.
{"title":"RKHS-based covariate balancing for survival causal effect estimation.","authors":"Wu Xue, Xiaoke Zhang, Kwun Chuen Gary Chan, Raymond K W Wong","doi":"10.1007/s10985-023-09590-y","DOIUrl":"10.1007/s10985-023-09590-y","url":null,"abstract":"<p><p>Survival causal effect estimation based on right-censored data is of key interest in both survival analysis and causal inference. Propensity score weighting is one of the most popular methods in the literature. However, since it involves the inverse of propensity score estimates, its practical performance may be very unstable, especially when the covariate overlap is limited between treatment and control groups. To address this problem, a covariate balancing method is developed in this paper to estimate the counterfactual survival function. The proposed method is nonparametric and balances covariates in a reproducing kernel Hilbert space (RKHS) via weights that are counterparts of inverse propensity scores. The uniform rate of convergence for the proposed estimator is shown to be the same as that for the classical Kaplan-Meier estimator. The appealing practical performance of the proposed method is demonstrated by a simulation study as well as two real data applications to study the causal effect of smoking on survival time of stroke patients and that of endotoxin on survival time for female patients with lung cancer respectively.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9321621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-01Epub Date: 2023-03-22DOI: 10.1007/s10985-023-09595-7
Yun-Lin Ho, Ju-Sheng Hong, Yen-Tsung Huang
Analyzing the causal mediation of semi-competing risks has become important in medical research. Semi-competing risks refers to a scenario wherein an intermediate event may be censored by a primary event but not vice versa. Causal mediation analyses decompose the effect of an exposure on the primary outcome into an indirect (mediation) effect: an effect mediated through a mediator, and a direct effect: an effect not through the mediator. Here we proposed a model-based testing procedure to examine the indirect effect of the exposure on the primary event through the intermediate event. Under the counterfactual outcome framework, we defined a causal mediation effect using counting process. To assess statistical evidence for the mediation effect, we proposed two tests: an intersection-union test (IUT) and a weighted log-rank test (WLR). The test statistic was developed from a semi-parametric estimator of the mediation effect using a Cox proportional hazards model for the primary event and a series of logistic regression models for the intermediate event. We built a connection between the IUT and WLR. Asymptotic properties of the two tests were derived, and the IUT was determined to be a size [Formula: see text] test and statistically more powerful than the WLR. In numerical simulations, both the model-based IUT and WLR can properly adjust for confounding covariates, and the Type I error rates of the proposed methods are well protected, with the IUT being more powerful than the WLR. Our methods demonstrate the strongly significant effects of hepatitis B or C on the risk of liver cancer mediated through liver cirrhosis incidence in a prospective cohort study. The proposed method is also applicable to surrogate endpoint analyses in clinical trials.
{"title":"Model-based hypothesis tests for the causal mediation of semi-competing risks.","authors":"Yun-Lin Ho, Ju-Sheng Hong, Yen-Tsung Huang","doi":"10.1007/s10985-023-09595-7","DOIUrl":"10.1007/s10985-023-09595-7","url":null,"abstract":"<p><p>Analyzing the causal mediation of semi-competing risks has become important in medical research. Semi-competing risks refers to a scenario wherein an intermediate event may be censored by a primary event but not vice versa. Causal mediation analyses decompose the effect of an exposure on the primary outcome into an indirect (mediation) effect: an effect mediated through a mediator, and a direct effect: an effect not through the mediator. Here we proposed a model-based testing procedure to examine the indirect effect of the exposure on the primary event through the intermediate event. Under the counterfactual outcome framework, we defined a causal mediation effect using counting process. To assess statistical evidence for the mediation effect, we proposed two tests: an intersection-union test (IUT) and a weighted log-rank test (WLR). The test statistic was developed from a semi-parametric estimator of the mediation effect using a Cox proportional hazards model for the primary event and a series of logistic regression models for the intermediate event. We built a connection between the IUT and WLR. Asymptotic properties of the two tests were derived, and the IUT was determined to be a size [Formula: see text] test and statistically more powerful than the WLR. In numerical simulations, both the model-based IUT and WLR can properly adjust for confounding covariates, and the Type I error rates of the proposed methods are well protected, with the IUT being more powerful than the WLR. Our methods demonstrate the strongly significant effects of hepatitis B or C on the risk of liver cancer mediated through liver cirrhosis incidence in a prospective cohort study. The proposed method is also applicable to surrogate endpoint analyses in clinical trials.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9529993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-01Epub Date: 2023-08-24DOI: 10.1007/s10985-023-09606-7
Iván Díaz, Katherine L Hoffman, Nima S Hejazi
Longitudinal modified treatment policies (LMTP) have been recently developed as a novel method to define and estimate causal parameters that depend on the natural value of treatment. LMTPs represent an important advancement in causal inference for longitudinal studies as they allow the non-parametric definition and estimation of the joint effect of multiple categorical, ordinal, or continuous treatments measured at several time points. We extend the LMTP methodology to problems in which the outcome is a time-to-event variable subject to a competing event that precludes observation of the event of interest. We present identification results and non-parametric locally efficient estimators that use flexible data-adaptive regression techniques to alleviate model misspecification bias, while retaining important asymptotic properties such as [Formula: see text]-consistency. We present an application to the estimation of the effect of the time-to-intubation on acute kidney injury amongst COVID-19 hospitalized patients, where death by other causes is taken to be the competing event.
{"title":"Causal survival analysis under competing risks using longitudinal modified treatment policies.","authors":"Iván Díaz, Katherine L Hoffman, Nima S Hejazi","doi":"10.1007/s10985-023-09606-7","DOIUrl":"10.1007/s10985-023-09606-7","url":null,"abstract":"<p><p>Longitudinal modified treatment policies (LMTP) have been recently developed as a novel method to define and estimate causal parameters that depend on the natural value of treatment. LMTPs represent an important advancement in causal inference for longitudinal studies as they allow the non-parametric definition and estimation of the joint effect of multiple categorical, ordinal, or continuous treatments measured at several time points. We extend the LMTP methodology to problems in which the outcome is a time-to-event variable subject to a competing event that precludes observation of the event of interest. We present identification results and non-parametric locally efficient estimators that use flexible data-adaptive regression techniques to alleviate model misspecification bias, while retaining important asymptotic properties such as [Formula: see text]-consistency. We present an application to the estimation of the effect of the time-to-intubation on acute kidney injury amongst COVID-19 hospitalized patients, where death by other causes is taken to be the competing event.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10423108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-01Epub Date: 2022-11-07DOI: 10.1007/s10985-022-09576-2
Helene C W Rytgaard, Mark J van der Laan
Targeted maximum likelihood estimation (TMLE) provides a general methodology for estimation of causal parameters in presence of high-dimensional nuisance parameters. Generally, TMLE consists of a two-step procedure that combines data-adaptive nuisance parameter estimation with semiparametric efficiency and rigorous statistical inference obtained via a targeted update step. In this paper, we demonstrate the practical applicability of TMLE based causal inference in survival and competing risks settings where event times are not confined to take place on a discrete and finite grid. We focus on estimation of causal effects of time-fixed treatment decisions on survival and absolute risk probabilities, considering different univariate and multidimensional parameters. Besides providing a general guidance to using TMLE for survival and competing risks analysis, we further describe how the previous work can be extended with the use of loss-based cross-validated estimation, also known as super learning, of the conditional hazards. We illustrate the usage of the considered methods using publicly available data from a trial on adjuvant chemotherapy for colon cancer. R software code to implement all considered algorithms and to reproduce all analyses is available in an accompanying online appendix on Github.
目标最大似然估计法(TMLE)为在存在高维滋扰参数的情况下估计因果参数提供了一种通用方法。一般来说,TMLE 由两步程序组成,将数据适应性滋扰参数估计与半参数效率和通过有针对性的更新步骤获得的严格统计推断相结合。在本文中,我们展示了基于 TMLE 的因果推断在生存和竞争风险环境中的实际应用性,在这些环境中,事件发生时间并不局限于离散和有限的网格上。考虑到不同的单变量和多维参数,我们重点研究了时间固定的治疗决策对生存和绝对风险概率的因果效应估计。除了为使用 TMLE 进行生存和竞争风险分析提供一般指导外,我们还进一步介绍了如何利用基于损失的交叉验证估计(也称为超级学习)来扩展之前的工作。我们使用结肠癌辅助化疗试验的公开数据来说明所考虑的方法的用法。Github 上随附的在线附录中提供了 R 软件代码,用于实现所有考虑的算法和重现所有分析。
{"title":"Targeted maximum likelihood estimation for causal inference in survival and competing risks analysis.","authors":"Helene C W Rytgaard, Mark J van der Laan","doi":"10.1007/s10985-022-09576-2","DOIUrl":"10.1007/s10985-022-09576-2","url":null,"abstract":"<p><p>Targeted maximum likelihood estimation (TMLE) provides a general methodology for estimation of causal parameters in presence of high-dimensional nuisance parameters. Generally, TMLE consists of a two-step procedure that combines data-adaptive nuisance parameter estimation with semiparametric efficiency and rigorous statistical inference obtained via a targeted update step. In this paper, we demonstrate the practical applicability of TMLE based causal inference in survival and competing risks settings where event times are not confined to take place on a discrete and finite grid. We focus on estimation of causal effects of time-fixed treatment decisions on survival and absolute risk probabilities, considering different univariate and multidimensional parameters. Besides providing a general guidance to using TMLE for survival and competing risks analysis, we further describe how the previous work can be extended with the use of loss-based cross-validated estimation, also known as super learning, of the conditional hazards. We illustrate the usage of the considered methods using publicly available data from a trial on adjuvant chemotherapy for colon cancer. R software code to implement all considered algorithms and to reproduce all analyses is available in an accompanying online appendix on Github.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40682368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-01Epub Date: 2023-08-12DOI: 10.1007/s10985-023-09607-6
Jeffrey Zhang, Dylan S Small
We conduct an observational study of the effect of sickle cell trait Haemoglobin AS (HbAS) on the hazard rate of malaria fevers in children. Assuming no unmeasured confounding, there is strong evidence that HbAS reduces the rate of malarial fevers. Since this is an observational study, however, the no unmeasured confounding assumption is strong. A sensitivity analysis considers how robust a conclusion is to a potential unmeasured confounder. We propose a new sensitivity analysis method for recurrent event data and apply it to the malaria study. We find that for the causal conclusion that HbAS is protective against malarial fevers to be overturned, the hypothesized unmeasured confounder must be as influential as all but one of the measured confounders.
{"title":"Sensitivity Analysis for Observational Studies with Recurrent Events.","authors":"Jeffrey Zhang, Dylan S Small","doi":"10.1007/s10985-023-09607-6","DOIUrl":"10.1007/s10985-023-09607-6","url":null,"abstract":"<p><p>We conduct an observational study of the effect of sickle cell trait Haemoglobin AS (HbAS) on the hazard rate of malaria fevers in children. Assuming no unmeasured confounding, there is strong evidence that HbAS reduces the rate of malarial fevers. Since this is an observational study, however, the no unmeasured confounding assumption is strong. A sensitivity analysis considers how robust a conclusion is to a potential unmeasured confounder. We propose a new sensitivity analysis method for recurrent event data and apply it to the malaria study. We find that for the causal conclusion that HbAS is protective against malarial fevers to be overturned, the hypothesized unmeasured confounder must be as influential as all but one of the measured confounders.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10353741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-01Epub Date: 2023-06-04DOI: 10.1007/s10985-023-09601-y
Marie Skov Breum, Anders Munch, Thomas A Gerds, Torben Martinussen
In this article we study the effect of a baseline exposure on a terminal time-to-event outcome either directly or mediated by the illness state of a continuous-time illness-death process with baseline covariates. We propose a definition of the corresponding direct and indirect effects using the concept of separable (interventionist) effects (Robins and Richardson in Causality and psychopathology: finding the determinants of disorders and their cures, Oxford University Press, 2011; Robins et al. in arXiv:2008.06019 , 2021; Stensrud et al. in J Am Stat Assoc 117:175-183, 2022). Our proposal generalizes Martinussen and Stensrud (Biometrics 79:127-139, 2023) who consider similar causal estimands for disentangling the causal treatment effects on the event of interest and competing events in the standard continuous-time competing risk model. Unlike natural direct and indirect effects (Robins and Greenland in Epidemiology 3:143-155, 1992; Pearl in Proceedings of the seventeenth conference on uncertainty in artificial intelligence, Morgan Kaufmann, 2001) which are usually defined through manipulations of the mediator independently of the exposure (so-called cross-world interventions), separable direct and indirect effects are defined through interventions on different components of the exposure that exert their effects through distinct causal mechanisms. This approach allows us to define meaningful mediation targets even though the mediating event is truncated by the terminal event. We present the conditions for identifiability, which include some arguably restrictive structural assumptions on the treatment mechanism, and discuss when such assumptions are valid. The identifying functionals are used to construct plug-in estimators for the separable direct and indirect effects. We also present multiply robust and asymptotically efficient estimators based on the efficient influence functions. We verify the theoretical properties of the estimators in a simulation study, and we demonstrate the use of the estimators using data from a Danish registry study.
在这篇文章中,我们研究了基线暴露对最终时间到事件结果的影响,这种影响可以是直接影响,也可以是由带有基线协变量的连续时间疾病-死亡过程中的疾病状态所介导的影响。我们利用可分离(干预)效应的概念,提出了相应的直接效应和间接效应的定义(罗宾斯和理查德森在《因果关系与精神病理学:寻找失调症及其治疗的决定因素》(Causality and psychopathology: Find the determinants of disorders and their cures)一书中提出,牛津大学出版社,2011 年;罗宾斯等人在 arXiv:2008.06019 中提出,2021 年;斯坦斯鲁德等人在《美国统计协会杂志》(J Am Stat Assoc 117:175-183, 2022 年)中提出)。我们的建议概括了 Martinussen 和 Stensrud(Biometrics 79:127-139,2023 年)的观点,他们考虑了类似的因果关系估计值,以在标准连续时间竞争风险模型中分离对相关事件和竞争事件的因果处理效应。自然直接效应和间接效应(Robins 和 Greenland,发表于《流行病学》3:143-155,1992 年;Pearl,发表于《第十七届人工智能不确定性会议论文集》,Morgan Kaufmann,2001 年)通常是通过独立于暴露的中介操作(所谓的跨世界干预)来定义的,而可分离的直接效应和间接效应则是通过对暴露的不同成分进行干预来定义的,这些成分通过不同的因果机制来产生效应。这种方法允许我们定义有意义的中介目标,即使中介事件被终端事件截断。我们提出了可识别性的条件,其中包括对治疗机制的一些可以说是限制性的结构假设,并讨论了这些假设何时有效。识别函数用于构建可分离的直接效应和间接效应的插件估计器。我们还提出了基于有效影响函数的多稳健渐进有效估计器。我们在模拟研究中验证了估计器的理论特性,并使用丹麦登记研究的数据演示了估计器的使用。
{"title":"Estimation of separable direct and indirect effects in a continuous-time illness-death model.","authors":"Marie Skov Breum, Anders Munch, Thomas A Gerds, Torben Martinussen","doi":"10.1007/s10985-023-09601-y","DOIUrl":"10.1007/s10985-023-09601-y","url":null,"abstract":"<p><p>In this article we study the effect of a baseline exposure on a terminal time-to-event outcome either directly or mediated by the illness state of a continuous-time illness-death process with baseline covariates. We propose a definition of the corresponding direct and indirect effects using the concept of separable (interventionist) effects (Robins and Richardson in Causality and psychopathology: finding the determinants of disorders and their cures, Oxford University Press, 2011; Robins et al. in arXiv:2008.06019 , 2021; Stensrud et al. in J Am Stat Assoc 117:175-183, 2022). Our proposal generalizes Martinussen and Stensrud (Biometrics 79:127-139, 2023) who consider similar causal estimands for disentangling the causal treatment effects on the event of interest and competing events in the standard continuous-time competing risk model. Unlike natural direct and indirect effects (Robins and Greenland in Epidemiology 3:143-155, 1992; Pearl in Proceedings of the seventeenth conference on uncertainty in artificial intelligence, Morgan Kaufmann, 2001) which are usually defined through manipulations of the mediator independently of the exposure (so-called cross-world interventions), separable direct and indirect effects are defined through interventions on different components of the exposure that exert their effects through distinct causal mechanisms. This approach allows us to define meaningful mediation targets even though the mediating event is truncated by the terminal event. We present the conditions for identifiability, which include some arguably restrictive structural assumptions on the treatment mechanism, and discuss when such assumptions are valid. The identifying functionals are used to construct plug-in estimators for the separable direct and indirect effects. We also present multiply robust and asymptotically efficient estimators based on the efficient influence functions. We verify the theoretical properties of the estimators in a simulation study, and we demonstrate the use of the estimators using data from a Danish registry study.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10764601/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9924229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-01Epub Date: 2023-11-17DOI: 10.1007/s10985-023-09610-x
Olivier Bouaziz
In a recurrent event setting, we introduce a new score designed to evaluate the prediction ability, for a given model, of the expected cumulative number of recurrent events. This score can be seen as an extension of the Brier Score for single time to event data but works for recurrent events with or without a terminal event. Theoretical results are provided that show that under standard assumptions in a recurrent event context, our score can be asymptotically decomposed as the sum of the theoretical mean squared error between the model and the true expected cumulative number of recurrent events and an inseparability term that does not depend on the model. This decomposition is further illustrated on simulations studies. It is also shown that this score should be used in comparison with a reference model, such as a nonparametric estimator that does not include the covariates. Finally, the score is applied for the prediction of hospitalisations on a dataset of patients suffering from atrial fibrillation and a comparison of the prediction performances of different models, such as the Cox model, the Aalen Model or the Ghosh and Lin model, is investigated.
{"title":"Assessing model prediction performance for the expected cumulative number of recurrent events.","authors":"Olivier Bouaziz","doi":"10.1007/s10985-023-09610-x","DOIUrl":"10.1007/s10985-023-09610-x","url":null,"abstract":"<p><p>In a recurrent event setting, we introduce a new score designed to evaluate the prediction ability, for a given model, of the expected cumulative number of recurrent events. This score can be seen as an extension of the Brier Score for single time to event data but works for recurrent events with or without a terminal event. Theoretical results are provided that show that under standard assumptions in a recurrent event context, our score can be asymptotically decomposed as the sum of the theoretical mean squared error between the model and the true expected cumulative number of recurrent events and an inseparability term that does not depend on the model. This decomposition is further illustrated on simulations studies. It is also shown that this score should be used in comparison with a reference model, such as a nonparametric estimator that does not include the covariates. Finally, the score is applied for the prediction of hospitalisations on a dataset of patients suffering from atrial fibrillation and a comparison of the prediction performances of different models, such as the Cox model, the Aalen Model or the Ghosh and Lin model, is investigated.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136400054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-10-01Epub Date: 2023-05-09DOI: 10.1007/s10985-023-09600-z
Yang Qu, Yu Cheng
We propose a screening method for high-dimensional data with ordinal competing risk outcomes, which is time-dependent and model-free. Existing methods are designed for cause-specific variable screening and fail to evaluate how a biomarker is associated with multiple competing events simultaneously. The proposed method utilizes the Volume under the ROC surface (VUS), which measures the concordance between values of a biomarker and event status at certain time points and provides an overall evaluation of the discrimination capacity of a biomarker. We show that the VUS possesses the sure screening property, i.e., true important covariates can be retained with probability tending to one, and the size of the selected set can be bounded with high probability. The VUS appears to be a viable model-free screening metric as compared to some existing methods in simulation studies, and it is especially robust to data contamination. Through an analysis of breast-cancer gene-expression data, we illustrate the unique insights into the overall discriminatory capability provided by the VUS.
{"title":"Volume under the ROC surface for high-dimensional independent screening with ordinal competing risk outcomes.","authors":"Yang Qu, Yu Cheng","doi":"10.1007/s10985-023-09600-z","DOIUrl":"10.1007/s10985-023-09600-z","url":null,"abstract":"<p><p>We propose a screening method for high-dimensional data with ordinal competing risk outcomes, which is time-dependent and model-free. Existing methods are designed for cause-specific variable screening and fail to evaluate how a biomarker is associated with multiple competing events simultaneously. The proposed method utilizes the Volume under the ROC surface (VUS), which measures the concordance between values of a biomarker and event status at certain time points and provides an overall evaluation of the discrimination capacity of a biomarker. We show that the VUS possesses the sure screening property, i.e., true important covariates can be retained with probability tending to one, and the size of the selected set can be bounded with high probability. The VUS appears to be a viable model-free screening metric as compared to some existing methods in simulation studies, and it is especially robust to data contamination. Through an analysis of breast-cancer gene-expression data, we illustrate the unique insights into the overall discriminatory capability provided by the VUS.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9432326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}