Multiple imputation with competing risk outcomes

IF 1 4区数学 Q3 STATISTICS & PROBABILITY Computational Statistics Pub Date : 2024-06-26 DOI:10.1007/s00180-024-01518-w

Peter C. Austin

{"title":"Multiple imputation with competing risk outcomes","authors":"Peter C. Austin","doi":"10.1007/s00180-024-01518-w","DOIUrl":null,"url":null,"abstract":"<p>In time-to-event analyses, a competing risk is an event whose occurrence precludes the occurrence of the event of interest. Settings with competing risks occur frequently in clinical research. Missing data, which is a common problem in research, occurs when the value of a variable is recorded for some, but not all, records in the dataset. Multiple Imputation (MI) is a popular method to address the presence of missing data. MI uses an imputation model to generate M (M > 1) values for each variable that is missing, resulting in the creation of M complete datasets. A popular algorithm for imputing missing data is multivariate imputation using chained equations (MICE). We used a complex simulation design with covariates and missing data patterns reflective of patients hospitalized with acute myocardial infarction (AMI) to compare three strategies for imputing missing predictor variables when the analysis model is a cause-specific hazard when there were three different event types. We compared two MICE-based strategies that differed according to which cause-specific cumulative hazard functions were included in the imputation models (the three cause-specific cumulative hazard functions vs. only the cause-specific cumulative hazard function for the primary outcome) with the use of the substantive model compatible fully conditional specification (SMCFCS) algorithm. While no strategy had consistently superior performance compared to the other strategies, SMCFCS may be the preferred strategy. We illustrated the application of the strategies using a case study of patients hospitalized with AMI.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"176 1","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s00180-024-01518-w","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}

引用次数: 0

Abstract

In time-to-event analyses, a competing risk is an event whose occurrence precludes the occurrence of the event of interest. Settings with competing risks occur frequently in clinical research. Missing data, which is a common problem in research, occurs when the value of a variable is recorded for some, but not all, records in the dataset. Multiple Imputation (MI) is a popular method to address the presence of missing data. MI uses an imputation model to generate M (M > 1) values for each variable that is missing, resulting in the creation of M complete datasets. A popular algorithm for imputing missing data is multivariate imputation using chained equations (MICE). We used a complex simulation design with covariates and missing data patterns reflective of patients hospitalized with acute myocardial infarction (AMI) to compare three strategies for imputing missing predictor variables when the analysis model is a cause-specific hazard when there were three different event types. We compared two MICE-based strategies that differed according to which cause-specific cumulative hazard functions were included in the imputation models (the three cause-specific cumulative hazard functions vs. only the cause-specific cumulative hazard function for the primary outcome) with the use of the substantive model compatible fully conditional specification (SMCFCS) algorithm. While no strategy had consistently superior performance compared to the other strategies, SMCFCS may be the preferred strategy. We illustrated the application of the strategies using a case study of patients hospitalized with AMI.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

具有竞争风险结果的多重估算

在时间到事件分析中，竞争风险是指其发生排除了相关事件发生的事件。临床研究中经常出现有竞争风险的情况。缺失数据是研究中的一个常见问题，当数据集中的某些记录记录了变量值，但并非所有记录都记录了变量值时，就会出现缺失数据。多重估算（MI）是解决数据缺失问题的常用方法。多重估算使用估算模型为每个缺失变量生成 M（M > 1）个值，从而创建 M 个完整的数据集。一种流行的缺失数据归因算法是使用链式方程的多变量归因（MICE）。我们使用了一个复杂的模拟设计，其中的协变量和缺失数据模式反映了急性心肌梗死（AMI）住院患者的情况，比较了在分析模型为特定病因危险时，当有三种不同的事件类型时，对缺失的预测变量进行归因的三种策略。我们比较了两种基于 MICE 的策略，这两种策略的不同之处在于，在使用实质性模型兼容全条件规范 (SMCFCS) 算法的情况下，归因模型中包含了哪些特定病因累积危险函数（三个特定病因累积危险函数与仅包含主要结果的特定病因累积危险函数）。虽然与其他策略相比，没有一种策略具有持续的优越性，但 SMCFCS 可能是首选策略。我们通过对急性心肌梗死住院患者的病例研究来说明这些策略的应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Computational Statistics 数学-统计学与概率论

CiteScore

2.90

自引率

0.00%

发文量

122

审稿时长

>12 weeks

期刊介绍： Computational Statistics (CompStat) is an international journal which promotes the publication of applications and methodological research in the field of Computational Statistics. The focus of papers in CompStat is on the contribution to and influence of computing on statistics and vice versa. The journal provides a forum for computer scientists, mathematicians, and statisticians in a variety of fields of statistics such as biometrics, econometrics, data analysis, graphics, simulation, algorithms, knowledge based systems, and Bayesian computing. CompStat publishes hardware, software plus package reports.