Statistical Methods in Medical Research最新文献_第2页

A Bayesian method for adverse effects estimation in observational studies with truncation by death. 用贝叶斯方法估算死亡截断的观察性研究中的不良反应。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-11-01 Epub Date: 2024-11-05 DOI: 10.1177/09622802241283170

Anthony Sisti, Andrew Zullo, Roee Gutman

Death among subjects is common in observational studies evaluating the causal effects of interventions among geriatric or severely ill patients. High mortality rates complicate the comparison of the prevalence of adverse events between interventions. This problem is often referred to as outcome "truncation" by death. A possible solution is to estimate the survivor average causal effect, an estimand that evaluates the effects of interventions among those who would have survived under both treatment assignments. However, because the survivor average causal effect does not include subjects who would have died under one or both arms, it does not consider the relationship between adverse events and death. We propose a Bayesian method which imputes the unobserved mortality and adverse event outcomes for each participant under the intervention they did not receive. Using the imputed outcomes we define a composite ordinal outcome for each patient, combining the occurrence of death and the adverse event in an increasing scale of severity. This allows for the comparison of the effects of the interventions on death and the adverse event simultaneously among the entire sample. We implement the procedure to analyze the incidence of heart failure among geriatric patients being treated for Type II diabetes with sulfonylureas or dipeptidyl peptidase-4 inhibitors.

在评估干预措施对老年病人或重病患者的因果影响的观察性研究中，受试者死亡的情况很常见。高死亡率使比较不同干预措施的不良事件发生率变得更加复杂。这个问题通常被称为死亡导致的结果 "截断"。一种可能的解决方案是估算幸存者平均因果效应，这种估算方法可以评估干预措施对两种治疗方案下存活者的影响。然而，由于幸存者平均因果效应不包括在一种或两种治疗方法下都会死亡的受试者，因此它没有考虑不良事件与死亡之间的关系。我们提出了一种贝叶斯方法，该方法可估算出每位受试者在未接受干预的情况下未观察到的死亡率和不良事件结果。利用估算的结果，我们为每位患者定义了一个综合的序数结果，将死亡和不良事件的发生按严重程度递增结合起来。这样就可以在整个样本中同时比较干预措施对死亡和不良事件的影响。我们采用该方法分析了接受磺脲类药物或二肽基肽酶-4 抑制剂治疗的 II 型糖尿病老年患者的心力衰竭发生率。

{"title":"A Bayesian method for adverse effects estimation in observational studies with truncation by death.","authors":"Anthony Sisti, Andrew Zullo, Roee Gutman","doi":"10.1177/09622802241283170","DOIUrl":"10.1177/09622802241283170","url":null,"abstract":"Death among subjects is common in observational studies evaluating the causal effects of interventions among geriatric or severely ill patients. High mortality rates complicate the comparison of the prevalence of adverse events between interventions. This problem is often referred to as outcome \"truncation\" by death. A possible solution is to estimate the survivor average causal effect, an estimand that evaluates the effects of interventions among those who would have survived under both treatment assignments. However, because the survivor average causal effect does not include subjects who would have died under one or both arms, it does not consider the relationship between adverse events and death. We propose a Bayesian method which imputes the unobserved mortality and adverse event outcomes for each participant under the intervention they did not receive. Using the imputed outcomes we define a composite ordinal outcome for each patient, combining the occurrence of death and the adverse event in an increasing scale of severity. This allows for the comparison of the effects of the interventions on death and the adverse event simultaneously among the entire sample. We implement the procedure to analyze the incidence of heart failure among geriatric patients being treated for Type II diabetes with sulfonylureas or dipeptidyl peptidase-4 inhibitors.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"2079-2097"},"PeriodicalIF":1.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142584354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhancing DHA supplementation adherence: A Bayesian approach with finite mixture models and irregular interim schedules in adaptive trial designs. 提高 DHA 补充剂的依从性：在适应性试验设计中使用有限混合模型和不规则临时时间表的贝叶斯方法。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-11-01 Epub Date: 2024-10-04 DOI: 10.1177/09622802241283165

Sreejata Dutta, Samuel Boyd, Susan E Carlson, Danielle N Christifano, Gene T Lee, Sharla A Smith, Byron J Gajewski

Docosahexaenoic acid (DHA) supplementation has proven beneficial in reducing preterm births. However, the challenge lies in addressing nonadherence to prescribed supplementation regimens-a hurdle that significantly impacts clinical trial outcomes. Conventional methods of adherence estimation, such as pill counts and questionnaires, usually fall short when estimating adherence within a specific dosage group. Thus, we propose a Bayesian finite mixture model to estimate adherence among women with low baseline red blood cell phospholipid DHA levels (<6%) receiving higher DHA doses. In our model, adherence is defined as the proportion of participants classified into one of the two distinct components in a normal mixture distribution. Subsequently, based on the estimands from the adherence model, we introduce a novel Bayesian adaptive trial design. Unlike conventional adaptive trials that employ regularly spaced interim schedules, the novelty of our proposed trial design lies in its adaptability to adherence percentages across the treatment arm through irregular interims. The irregular interims in the proposed trial are based on the effect size estimation informed by the finite mixture model. In summary, this study presents innovative methods for leveraging the capabilities of Bayesian finite mixture models in adherence analysis and the design of adaptive clinical trials.

事实证明，补充二十二碳六烯酸 (DHA) 有利于减少早产。然而，挑战在于如何解决不遵从处方补充方案的问题--这是严重影响临床试验结果的障碍。在估算特定剂量组的依从性时，传统的依从性估算方法（如药片计数和问卷调查）通常存在不足。因此，我们提出了一种贝叶斯有限混合物模型，用于估算基线红细胞磷脂 DHA 水平较低的妇女的依从性。

{"title":"Enhancing DHA supplementation adherence: A Bayesian approach with finite mixture models and irregular interim schedules in adaptive trial designs.","authors":"Sreejata Dutta, Samuel Boyd, Susan E Carlson, Danielle N Christifano, Gene T Lee, Sharla A Smith, Byron J Gajewski","doi":"10.1177/09622802241283165","DOIUrl":"10.1177/09622802241283165","url":null,"abstract":"Docosahexaenoic acid (DHA) supplementation has proven beneficial in reducing preterm births. However, the challenge lies in addressing nonadherence to prescribed supplementation regimens-a hurdle that significantly impacts clinical trial outcomes. Conventional methods of adherence estimation, such as pill counts and questionnaires, usually fall short when estimating adherence within a specific dosage group. Thus, we propose a Bayesian finite mixture model to estimate adherence among women with low baseline red blood cell phospholipid DHA levels (<6%) receiving higher DHA doses. In our model, adherence is defined as the proportion of participants classified into one of the two distinct components in a normal mixture distribution. Subsequently, based on the estimands from the adherence model, we introduce a novel Bayesian adaptive trial design. Unlike conventional adaptive trials that employ regularly spaced interim schedules, the novelty of our proposed trial design lies in its adaptability to adherence percentages across the treatment arm through irregular interims. The irregular interims in the proposed trial are based on the effect size estimation informed by the finite mixture model. In summary, this study presents innovative methods for leveraging the capabilities of Bayesian finite mixture models in adherence analysis and the design of adaptive clinical trials.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"2062-2078"},"PeriodicalIF":1.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11576245/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Adaptive enrichment trial designs using joint modelling of longitudinal and time-to-event data. 利用纵向数据和事件时间数据联合建模的适应性强化试验设计。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-11-01 Epub Date: 2024-10-16 DOI: 10.1177/09622802241287711

Abigail J Burdon, Richard D Baird, Thomas Jaki

Adaptive enrichment allows for pre-defined patient subgroups of interest to be investigated throughout the course of a clinical trial. These designs have gained attention in recent years because of their potential to shorten the trial's duration and identify effective therapies tailored to specific patient groups. We describe enrichment trials which consider long-term time-to-event outcomes but also incorporate additional short-term information from routinely collected longitudinal biomarkers. These methods are suitable for use in the setting where the trajectory of the biomarker may differ between subgroups and it is believed that the long-term endpoint is influenced by treatment, subgroup and biomarker. Methods are most promising when the majority of patients have biomarker measurements for at least two time points. We implement joint modelling of longitudinal and time-to-event data to define subgroup selection and stopping criteria and we show that the familywise error rate is protected in the strong sense. To assess the results, we perform a simulation study and find that, compared to the study where longitudinal biomarker observations are ignored, incorporating biomarker information leads to increases in power and the (sub)population which truly benefits from the experimental treatment being enriched with higher probability at the interim analysis. The investigations are motivated by a trial for the treatment of metastatic breast cancer and the parameter values for the simulation study are informed using real-world data where repeated circulating tumour DNA measurements and HER2 statuses are available for each patient and are used as our longitudinal data and subgroup identifiers, respectively.

在临床试验的整个过程中，可以对预先确定的感兴趣的患者亚组进行研究。这些设计近年来备受关注，因为它们有可能缩短试验时间，并找出针对特定患者群体的有效疗法。我们介绍的富集试验既考虑了从时间到事件的长期结果，又结合了从常规收集的纵向生物标记物中获得的额外短期信息。这些方法适用于生物标志物的轨迹在不同亚组之间可能存在差异的情况，并且相信长期终点会受到治疗、亚组和生物标志物的影响。当大多数患者至少有两个时间点的生物标志物测量结果时，这种方法最有前途。我们对纵向数据和时间到事件数据进行了联合建模，以确定亚组选择和停止标准，并证明在强意义上保护了家族误差率。为了评估结果，我们进行了一项模拟研究，结果发现，与忽略纵向生物标志物观察结果的研究相比，纳入生物标志物信息会提高研究的有效性，而且真正受益于实验治疗的（亚）群体在中期分析时会以更高的概率得到扩充。这项研究是由一项治疗转移性乳腺癌的试验激发的，模拟研究的参数值是根据真实世界的数据确定的，在真实世界中，每个患者都有重复的循环肿瘤 DNA 测量数据和 HER2 状态，这些数据和状态分别作为我们的纵向数据和亚群标识符。

{"title":"Adaptive enrichment trial designs using joint modelling of longitudinal and time-to-event data.","authors":"Abigail J Burdon, Richard D Baird, Thomas Jaki","doi":"10.1177/09622802241287711","DOIUrl":"10.1177/09622802241287711","url":null,"abstract":"Adaptive enrichment allows for pre-defined patient subgroups of interest to be investigated throughout the course of a clinical trial. These designs have gained attention in recent years because of their potential to shorten the trial's duration and identify effective therapies tailored to specific patient groups. We describe enrichment trials which consider long-term time-to-event outcomes but also incorporate additional short-term information from routinely collected longitudinal biomarkers. These methods are suitable for use in the setting where the trajectory of the biomarker may differ between subgroups and it is believed that the long-term endpoint is influenced by treatment, subgroup and biomarker. Methods are most promising when the majority of patients have biomarker measurements for at least two time points. We implement joint modelling of longitudinal and time-to-event data to define subgroup selection and stopping criteria and we show that the familywise error rate is protected in the strong sense. To assess the results, we perform a simulation study and find that, compared to the study where longitudinal biomarker observations are ignored, incorporating biomarker information leads to increases in power and the (sub)population which truly benefits from the experimental treatment being enriched with higher probability at the interim analysis. The investigations are motivated by a trial for the treatment of metastatic breast cancer and the parameter values for the simulation study are informed using real-world data where repeated circulating tumour DNA measurements and HER2 statuses are available for each patient and are used as our longitudinal data and subgroup identifiers, respectively.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"2098-2114"},"PeriodicalIF":1.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11577695/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142475112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Applying survey weights to ordinal regression models for improved inference in outcome-dependent samples with ordinal outcomes. 将调查权重应用于序数回归模型，以改进具有序数结果的结果依赖性样本的推断。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-11-01 Epub Date: 2024-10-23 DOI: 10.1177/09622802241282091

Aya A Mitani, Osvaldo Espin-Garcia, Daniel Fernández, Victoria Landsman

Researchers often use outcome-dependent sampling to study the exposure-outcome association. The case-control study is a widely used example of outcome-dependent sampling when the outcome is binary. When the outcome is ordinal, standard ordinal regression models generally produce biased coefficients when the sampling fractions depend on the values of the outcome variable. To address this problem, we studied the performance of survey-weighted ordinal regression models with weights inversely proportional to the sampling fractions. Through an extensive simulation study, we compared the performance of four ordinal regression models (SM: stereotype model; AC: adjacent-category logit model; CR: continuation-ratio logit model; and CM: cumulative logit model), with and without sampling weights under outcome-dependent sampling. We observed that when using weights, all four models produced estimates with negligible bias of all regression coefficients. Without weights, only stereotype model and adjacent-category logit model produced estimates with negligible to low bias for all coefficients except for the intercepts in all scenarios. In one scenario, the unweighted continuation-ratio logit model also produced estimates with low bias. The weighted stereotype model and adjacent-category logit model also produced estimates with lower relative root mean square errors compared to the unweighted models in most scenarios. In some of the scenarios with unevenly distributed categories, the weighted continuation-ratio logit model and cumulative logit model produced estimates with lower relative root mean square errors compared to the respective unweighted models. We used a study of knee osteoarthritis as an example.

研究人员通常使用结果依赖性抽样来研究暴露与结果之间的关联。当结果为二元时，病例对照研究就是一个广泛使用的依赖结果抽样的例子。当结果为序数时，当抽样分数取决于结果变量的值时，标准的序数回归模型通常会产生有偏差的系数。为了解决这个问题，我们研究了权重与抽样分数成反比的调查加权序数回归模型的性能。通过广泛的模拟研究，我们比较了在结果依赖性抽样条件下，使用和不使用抽样权重的四种序数回归模型（SM：定型模型；AC：相邻类别 logit 模型；CR：延续比率 logit 模型；CM：累积 logit 模型）的性能。我们发现，在使用权重的情况下，所有四个模型都能得出所有回归系数偏差可忽略不计的估计值。在不使用权重的情况下，只有定型模型和相邻类别 logit 模型在所有情况下对除截距以外的所有系数都产生了可忽略不计或较低偏差的估计值。在一种情况下，未加权的延续比 logit 模型产生的估计值偏差也很小。在大多数情况下，加权定型模型和相邻类别 logit 模型产生的估计值的相对均方根误差也低于非加权模型。在一些类别分布不均的情况下，加权延续比 logit 模型和累积 logit 模型得出的估计值与各自的非加权模型相比具有较低的相对均方根误差。我们以膝关节骨关节炎的研究为例。

{"title":"Applying survey weights to ordinal regression models for improved inference in outcome-dependent samples with ordinal outcomes.","authors":"Aya A Mitani, Osvaldo Espin-Garcia, Daniel Fernández, Victoria Landsman","doi":"10.1177/09622802241282091","DOIUrl":"10.1177/09622802241282091","url":null,"abstract":"Researchers often use outcome-dependent sampling to study the exposure-outcome association. The case-control study is a widely used example of outcome-dependent sampling when the outcome is binary. When the outcome is ordinal, standard ordinal regression models generally produce biased coefficients when the sampling fractions depend on the values of the outcome variable. To address this problem, we studied the performance of survey-weighted ordinal regression models with weights inversely proportional to the sampling fractions. Through an extensive simulation study, we compared the performance of four ordinal regression models (SM: stereotype model; AC: adjacent-category logit model; CR: continuation-ratio logit model; and CM: cumulative logit model), with and without sampling weights under outcome-dependent sampling. We observed that when using weights, all four models produced estimates with negligible bias of all regression coefficients. Without weights, only stereotype model and adjacent-category logit model produced estimates with negligible to low bias for all coefficients except for the intercepts in all scenarios. In one scenario, the unweighted continuation-ratio logit model also produced estimates with low bias. The weighted stereotype model and adjacent-category logit model also produced estimates with lower relative root mean square errors compared to the unweighted models in most scenarios. In some of the scenarios with unevenly distributed categories, the weighted continuation-ratio logit model and cumulative logit model produced estimates with lower relative root mean square errors compared to the respective unweighted models. We used a study of knee osteoarthritis as an example.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"2007-2026"},"PeriodicalIF":1.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11577697/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142508312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A seamless Phase I/II platform design with a time-to-event efficacy endpoint for potential COVID-19 therapies. 为潜在的 COVID-19 疗法设计了一个无缝的 I/II 期平台，其疗效终点为事件发生时间。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-11-01 Epub Date: 2024-10-14 DOI: 10.1177/09622802241288348

Thomas Jaki, Helen Barnett, Andrew Titman, Pavel Mozgunov

In the search for effective treatments for COVID-19, the initial emphasis has been on re-purposed treatments. To maximize the chances of finding successful treatments, novel treatments that have been developed for this disease in particular, are needed. In this article, we describe and evaluate the statistical design of the AGILE platform, an adaptive randomized seamless Phase I/II trial platform that seeks to quickly establish a safe range of doses and investigates treatments for potential efficacy. The bespoke Bayesian design (i) utilizes randomization during dose-finding, (ii) shares control arm information across the platform, and (iii) uses a time-to-event endpoint with a formal testing structure and error control for evaluation of potential efficacy. Both single-agent and combination treatments are considered. We find that the design can identify potential treatments that are safe and efficacious reliably with small to moderate sample sizes.

在为 COVID-19 寻找有效治疗方法的过程中，最初的重点是再利用治疗方法。为了最大限度地提高治疗成功的几率，我们需要特别针对这种疾病开发的新型疗法。在本文中，我们描述并评估了 AGILE 平台的统计设计，这是一个自适应随机无缝 I/II 期试验平台，旨在快速确定安全剂量范围并研究治疗的潜在疗效。这种定制贝叶斯设计（i）在剂量确定过程中采用随机化，（ii）在整个平台上共享对照臂信息，（iii）使用具有正式测试结构和误差控制的时间到事件终点来评估潜在疗效。单药治疗和联合治疗均在考虑之列。我们发现，该设计能以中小规模样本可靠地识别出安全有效的潜在治疗方法。

引用次数: 0

Comparison of random forest methods for conditional average treatment effect estimation with a continuous treatment. 连续治疗条件下平均治疗效果估计的随机森林方法比较。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-11-01 Epub Date: 2024-10-09 DOI: 10.1177/09622802241275401

Sami Tabib, Denis Larocque

We are addressing the problem of estimating conditional average treatment effects with a continuous treatment and a continuous response, using random forests. We explore two general approaches: building trees with a split rule that seeks to increase the heterogeneity of the treatment effect estimation and building trees to predict $Y$ as a proxy target variable. We conduct a simulation study to investigate several aspects including the presence or absence of confounding and colliding effects and the merits of locally centering the treatment and/or the response. Our study incorporates both existing and new implementations of random forests. The results indicate that locally centering both the response and treatment variables is generally the best strategy, and both general approaches are viable. Additionally, we provide an illustration using data from the 1987 National Medical Expenditure Survey.

我们正在解决利用随机森林估计连续治疗和连续反应的条件平均治疗效果的问题。我们探索了两种一般方法：用分裂规则构建树，以增加治疗效果估计的异质性；以及构建树来预测作为替代目标变量的 Y。我们进行了一项模拟研究，以调查几个方面的问题，包括是否存在混杂效应和碰撞效应，以及将处理和/或响应局部居中的优点。我们的研究结合了现有的和新的随机森林实现方法。结果表明，将响应变量和处理变量局部居中通常是最佳策略，而且这两种一般方法都是可行的。此外，我们还利用 1987 年全国医疗支出调查的数据进行了说明。

引用次数: 0

Testing for a treatment effect in a selected subgroup. 测试选定分组的治疗效果。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-11-01 Epub Date: 2024-09-25 DOI: 10.1177/09622802241277764

Nigel Stallard

There is a growing interest in clinical trials that investigate how patients may respond differently to an experimental treatment depending on the basis of some biomarker measured on a continuous scale, and in particular to identify some threshold value for the biomarker above which a positive treatment effect can be considered to have been demonstrated. This can be statistically challenging when the same data are used both to select the threshold and to test the treatment effect in the subpopulation that it defines. This paper describes a hierarchical testing framework to give familywise type I error rate control in this setting and proposes two specific tests that can be used within this framework. One, a simple test based on the estimated value from a linear regression model with treatment by biomarker interaction, is powerful but can lead to type I error rate inflation if the assumptions of the linear model are not met. The other is more robust to these assumptions, but can be slightly less powerful when the assumptions hold.

人们对临床试验的兴趣与日俱增，这些试验研究病人对试验性治疗的反应如何因连续测量的生物标志物的不同而不同，特别是要确定生物标志物的某个阈值，超过这个阈值就可以认为治疗效果得到了证实。如果使用相同的数据来选择阈值，并在阈值所定义的亚人群中测试治疗效果，这在统计学上可能具有挑战性。本文描述了一个分层检验框架，以便在这种情况下控制族类 I 型误差率，并提出了两个可在此框架内使用的具体检验方法。一种是基于线性回归模型估计值的简单检验，具有治疗与生物标志物交互作用的特点，但如果不符合线性模型的假设，则可能导致 I 型错误率膨胀。另一种方法对这些假设更为稳健，但当假设成立时，其有效性会稍差一些。

引用次数: 0

Joint regression analysis of clustered current status data with latent variables. 对带有潜变量的聚类现状数据进行联合回归分析。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-10-23 DOI: 10.1177/09622802241280792

Yanqin Feng, Sijie Wu, Jieli Ding

Clustered current status data frequently occur in many fields of survival studies. Some potential factors related to the hazards of interest cannot be directly observed but are characterized through multiple correlated observable surrogates. In this article, we propose a joint modeling method for regression analysis of clustered current status data with latent variables and potentially informative cluster sizes. The proposed models consist of a factor analysis model to characterize latent variables through their multiple surrogates and an additive hazards frailty model to investigate covariate effects on the failure time and incorporate intra-cluster correlations. We develop an estimation procedure that combines the expectation-maximization algorithm and the weighted estimating equations. The consistency and asymptotic normality of the proposed estimators are established. The finite-sample performance of the proposed method is assessed via a series of simulation studies. This procedure is applied to analyze clustered current status data from the National Toxicology Program on a tumorigenicity study given by the United States Department of Health and Human Services.

在许多领域的生存研究中，经常会出现聚类现状数据。有些与相关危害相关的潜在因素无法直接观察到，但可以通过多个相关的可观察代用指标来描述。在本文中，我们提出了一种联合建模方法，用于对具有潜变量和潜在信息聚类大小的聚类现状数据进行回归分析。我们提出的模型包括一个因子分析模型和一个加性危险虚弱模型，前者通过多个代理变量来描述潜变量的特征，后者则用于研究协变量对失败时间的影响，并纳入聚类内部的相关性。我们开发了一种结合期望最大化算法和加权估计方程的估计程序。我们建立了所提出估计器的一致性和渐近正态性。通过一系列模拟研究评估了所提方法的有限样本性能。该程序被应用于分析美国卫生与公众服务部提供的国家毒理学计划关于肿瘤致病性研究的聚类现状数据。

{"title":"Joint regression analysis of clustered current status data with latent variables.","authors":"Yanqin Feng, Sijie Wu, Jieli Ding","doi":"10.1177/09622802241280792","DOIUrl":"https://doi.org/10.1177/09622802241280792","url":null,"abstract":"Clustered current status data frequently occur in many fields of survival studies. Some potential factors related to the hazards of interest cannot be directly observed but are characterized through multiple correlated observable surrogates. In this article, we propose a joint modeling method for regression analysis of clustered current status data with latent variables and potentially informative cluster sizes. The proposed models consist of a factor analysis model to characterize latent variables through their multiple surrogates and an additive hazards frailty model to investigate covariate effects on the failure time and incorporate intra-cluster correlations. We develop an estimation procedure that combines the expectation-maximization algorithm and the weighted estimating equations. The consistency and asymptotic normality of the proposed estimators are established. The finite-sample performance of the proposed method is assessed via a series of simulation studies. This procedure is applied to analyze clustered current status data from the National Toxicology Program on a tumorigenicity study given by the United States Department of Health and Human Services.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802241280792"},"PeriodicalIF":1.6,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142508325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Graphical methods to illustrate the nature of the relation between a continuous variable and the outcome when using restricted cubic splines with a Cox proportional hazards model. 在使用限制性三次样条和 Cox 比例危险模型时，用图形方法说明连续变量和结果之间关系的性质。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-10-21 DOI: 10.1177/09622802241287707

Peter C Austin

Restricted cubic splines (RCS) allow analysts to model nonlinear relations between continuous covariates and the outcome in a regression model. When using RCS with the Cox proportional hazards model, there is no longer a single hazard ratio for the continuous variable. Instead, the hazard ratio depends on the values of the covariate for the two individuals being compared. Thus, using age as an example, when one assumes a linear relation between age and the log-hazard of the outcome there is a single hazard ratio comparing any two individuals whose age differs by 1 year. However, when allowing for a nonlinear relation between age and the log-hazard of the outcome, the hazard ratio comparing the hazard of the outcome between a 31- and a 30-year-old may differ from the hazard ratio comparing the hazard of the outcome between an 81- and an 80-year-old. We describe four methods to describe graphically the relation between a continuous variable and the outcome when using RCS with a Cox model. These graphical methods are based on plots of relative hazard ratios, cumulative incidence, hazards, and cumulative hazards against the continuous variable. Using a case study of patients presenting to hospital with heart failure and a series of mathematical derivations, we illustrate that the four methods will produce qualitatively similar conclusions about the nature of the relation between a continuous variable and the outcome. Use of these methods will allow for an intuitive communication of the nature of the relation between the variable and the outcome.

受限三次样条（RCS）允许分析师在回归模型中模拟连续协变量与结果之间的非线性关系。将 RCS 与 Cox 比例危险模型结合使用时，连续变量的危险比不再是单一的。相反，危险比取决于被比较的两个个体的协变量值。因此，以年龄为例，如果假定年龄与结果的对数危险度之间存在线性关系，那么在年龄相差 1 岁的两个人之间进行比较，就会得出单一的危险比。但是，如果考虑到年龄与结果危害对数之间的非线性关系，那么比较 31 岁与 30 岁之间结果危害的危害比可能不同于比较 81 岁与 80 岁之间结果危害的危害比。我们介绍了四种在使用 RCS 和 Cox 模型时用图形描述连续变量和结果之间关系的方法。这些图形方法基于相对危险比、累积发病率、危险度和累积危险度与连续变量的关系图。通过对心力衰竭入院患者的病例研究和一系列数学推导，我们说明了这四种方法对连续变量和结果之间关系性质的定性结论是相似的。使用这些方法可以直观地了解变量与结果之间关系的性质。

{"title":"Graphical methods to illustrate the nature of the relation between a continuous variable and the outcome when using restricted cubic splines with a Cox proportional hazards model.","authors":"Peter C Austin","doi":"10.1177/09622802241287707","DOIUrl":"https://doi.org/10.1177/09622802241287707","url":null,"abstract":"Restricted cubic splines (RCS) allow analysts to model nonlinear relations between continuous covariates and the outcome in a regression model. When using RCS with the Cox proportional hazards model, there is no longer a single hazard ratio for the continuous variable. Instead, the hazard ratio depends on the values of the covariate for the two individuals being compared. Thus, using age as an example, when one assumes a linear relation between age and the log-hazard of the outcome there is a single hazard ratio comparing any two individuals whose age differs by 1 year. However, when allowing for a nonlinear relation between age and the log-hazard of the outcome, the hazard ratio comparing the hazard of the outcome between a 31- and a 30-year-old may differ from the hazard ratio comparing the hazard of the outcome between an 81- and an 80-year-old. We describe four methods to describe graphically the relation between a continuous variable and the outcome when using RCS with a Cox model. These graphical methods are based on plots of relative hazard ratios, cumulative incidence, hazards, and cumulative hazards against the continuous variable. Using a case study of patients presenting to hospital with heart failure and a series of mathematical derivations, we illustrate that the four methods will produce qualitatively similar conclusions about the nature of the relation between a continuous variable and the outcome. Use of these methods will allow for an intuitive communication of the nature of the relation between the variable and the outcome.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802241287707"},"PeriodicalIF":1.6,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142475114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Delayed kernels for longitudinal survival analysis and dynamic prediction. 用于纵向生存分析和动态预测的延迟核。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-10-01 Epub Date: 2024-08-30 DOI: 10.1177/09622802241275382

Annabel Louisa Davies, Anthony Cc Coolen, Tobias Galla

Predicting patient survival probabilities based on observed covariates is an important assessment in clinical practice. These patient-specific covariates are often measured over multiple follow-up appointments. It is then of interest to predict survival based on the history of these longitudinal measurements, and to update predictions as more observations become available. The standard approaches to these so-called 'dynamic prediction' assessments are joint models and landmark analysis. Joint models involve high-dimensional parameterizations, and their computational complexity often prohibits including multiple longitudinal covariates. Landmark analysis is simpler, but discards a proportion of the available data at each 'landmark time'. In this work, we propose a 'delayed kernel' approach to dynamic prediction that sits somewhere in between the two standard methods in terms of complexity. By conditioning hazard rates directly on the covariate measurements over the observation time frame, we define a model that takes into account the full history of covariate measurements but is more practical and parsimonious than joint modelling. Time-dependent association kernels describe the impact of covariate changes at earlier times on the patient's hazard rate at later times. Under the constraints that our model (a) reduces to the standard Cox model for time-independent covariates, and (b) contains the instantaneous Cox model as a special case, we derive two natural kernel parameterizations. Upon application to three clinical data sets, we find that the predictive accuracy of the delayed kernel approach is comparable to that of the two existing standard methods.

根据观察到的协变量预测患者的生存概率是临床实践中的一项重要评估。这些患者特定的协变量通常是在多次随访中测量的。因此，根据这些纵向测量结果的历史预测生存率，并在获得更多观察结果后更新预测结果，是一项重要的工作。这些所谓 "动态预测 "评估的标准方法是联合模型和地标分析。联合模型涉及高维参数化，其计算复杂性往往使其无法包含多个纵向协变量。地标分析较为简单，但会在每个 "地标时间 "放弃一部分可用数据。在这项工作中，我们提出了一种 "延迟核 "动态预测方法，其复杂程度介于这两种标准方法之间。通过将危险率直接与观测时间框架内的协变量测量值挂钩，我们定义了一个模型，该模型考虑到了协变量测量值的全部历史，但比联合建模更实用、更简洁。与时间相关的关联核描述了协变量在早期的变化对患者后期危险率的影响。我们的模型(a)简化为时间无关协变量的标准 Cox 模型，(b)包含作为特例的瞬时 Cox 模型，在这两个约束条件下，我们得出了两个自然的核参数。通过对三个临床数据集的应用，我们发现延迟核方法的预测准确性与现有的两种标准方法相当。

{"title":"Delayed kernels for longitudinal survival analysis and dynamic prediction.","authors":"Annabel Louisa Davies, Anthony Cc Coolen, Tobias Galla","doi":"10.1177/09622802241275382","DOIUrl":"10.1177/09622802241275382","url":null,"abstract":"Predicting patient survival probabilities based on observed covariates is an important assessment in clinical practice. These patient-specific covariates are often measured over multiple follow-up appointments. It is then of interest to predict survival based on the history of these longitudinal measurements, and to update predictions as more observations become available. The standard approaches to these so-called 'dynamic prediction' assessments are joint models and landmark analysis. Joint models involve high-dimensional parameterizations, and their computational complexity often prohibits including multiple longitudinal covariates. Landmark analysis is simpler, but discards a proportion of the available data at each 'landmark time'. In this work, we propose a 'delayed kernel' approach to dynamic prediction that sits somewhere in between the two standard methods in terms of complexity. By conditioning hazard rates directly on the covariate measurements over the observation time frame, we define a model that takes into account the full history of covariate measurements but is more practical and parsimonious than joint modelling. Time-dependent association kernels describe the impact of covariate changes at earlier times on the patient's hazard rate at later times. Under the constraints that our model (a) reduces to the standard Cox model for time-independent covariates, and (b) contains the instantaneous Cox model as a special case, we derive two natural kernel parameterizations. Upon application to three clinical data sets, we find that the predictive accuracy of the delayed kernel approach is comparable to that of the two existing standard methods.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1836-1858"},"PeriodicalIF":1.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11577694/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142112180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0