首页 > 最新文献

Statistical Methods in Medical Research最新文献

英文 中文
Demystifying estimands in cluster-randomised trials. 解密分组随机试验中的估计值。
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-07-01 Epub Date: 2024-05-23 DOI: 10.1177/09622802241254197
Brennan C Kahan, Bryan S Blette, Michael O Harhay, Scott D Halpern, Vipul Jairath, Andrew Copas, Fan Li

Estimands can help clarify the interpretation of treatment effects and ensure that estimators are aligned with the study's objectives. Cluster-randomised trials require additional attributes to be defined within the estimand compared to individually randomised trials, including whether treatment effects are marginal or cluster-specific, and whether they are participant- or cluster-average. In this paper, we provide formal definitions of estimands encompassing both these attributes using potential outcomes notation and describe differences between them. We then provide an overview of estimators for each estimand, describe their assumptions, and show consistency (i.e. asymptotically unbiased estimation) for a series of analyses based on cluster-level summaries. Then, through a re-analysis of a published cluster-randomised trial, we demonstrate that the choice of both estimand and estimator can affect interpretation. For instance, the estimated odds ratio ranged from 1.38 (p = 0.17) to 1.83 (p = 0.03) depending on the target estimand, and for some estimands, the choice of estimator affected the conclusions by leading to smaller treatment effect estimates. We conclude that careful specification of the estimand, along with an appropriate choice of estimator, is essential to ensuring that cluster-randomised trials address the right question.

估计因子有助于明确治疗效果的解释,并确保估计因子与研究目标相一致。与单项随机试验相比,分组随机试验需要在估计因子中定义更多属性,包括治疗效果是边际的还是分组特定的,是参与者平均的还是分组平均的。在本文中,我们使用潜在结果符号提供了包含上述两种属性的估计值的正式定义,并描述了它们之间的差异。然后,我们概述了每种估算项的估算器,描述了它们的假设条件,并展示了基于聚类分析的一系列分析的一致性(即渐近无偏估算)。然后,通过对一项已发表的分组随机试验的重新分析,我们证明了估计因子和估计器的选择都会影响解释。例如,估计的几率比从 1.38(p = 0.17)到 1.83(p = 0.03)不等,这取决于目标估计因子,对于某些估计因子,估计因子的选择会导致治疗效果估计值变小,从而影响结论。我们的结论是,要确保分组随机试验能解决正确的问题,就必须仔细说明估计指标,同时选择适当的估计指标。
{"title":"Demystifying estimands in cluster-randomised trials.","authors":"Brennan C Kahan, Bryan S Blette, Michael O Harhay, Scott D Halpern, Vipul Jairath, Andrew Copas, Fan Li","doi":"10.1177/09622802241254197","DOIUrl":"10.1177/09622802241254197","url":null,"abstract":"<p><p>Estimands can help clarify the interpretation of treatment effects and ensure that estimators are aligned with the study's objectives. Cluster-randomised trials require additional attributes to be defined within the estimand compared to individually randomised trials, including whether treatment effects are <i>marginal</i> or <i>cluster-specific</i>, and whether they are <i>participant-</i> or <i>cluster-average</i>. In this paper, we provide formal definitions of estimands encompassing both these attributes using potential outcomes notation and describe differences between them. We then provide an overview of estimators for each estimand, describe their assumptions, and show consistency (i.e. asymptotically unbiased estimation) for a series of analyses based on cluster-level summaries. Then, through a re-analysis of a published cluster-randomised trial, we demonstrate that the choice of both estimand and estimator can affect interpretation. For instance, the estimated odds ratio ranged from 1.38 (<i>p</i> = 0.17) to 1.83 (<i>p</i> = 0.03) depending on the target estimand, and for some estimands, the choice of estimator affected the conclusions by leading to smaller treatment effect estimates. We conclude that careful specification of the estimand, along with an appropriate choice of estimator, is essential to ensuring that cluster-randomised trials address the right question.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1211-1232"},"PeriodicalIF":1.6,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11348634/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141082300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Goodness-of-fit tests for modified Poisson regression possibly producing fitted values exceeding one in binary outcome analysis. 对二元结果分析中可能产生拟合值超过 1 的修正泊松回归进行拟合优度测试。
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-07-01 Epub Date: 2024-05-23 DOI: 10.1177/09622802241254220
Yasuhiro Hagiwara, Yutaka Matsuyama

Modified Poisson regression, which estimates the regression parameters in the log-binomial regression model using the Poisson quasi-likelihood estimating equation and robust variance, is a useful tool for estimating the adjusted risk and prevalence ratio in binary outcome analysis. Although several goodness-of-fit tests have been developed for other binary regressions, few goodness-of-fit tests are available for modified Poisson regression. In this study, we proposed several goodness-of-fit tests for modified Poisson regression, including the modified Hosmer-Lemeshow test with empirical variance, Tsiatis test, normalized Pearson chi-square tests with binomial variance and Poisson variance, and normalized residual sum of squares test. The original Hosmer-Lemeshow test and normalized Pearson chi-square test with binomial variance are inappropriate for the modified Poisson regression, which can produce a fitted value exceeding 1 owing to the unconstrained parameter space. A simulation study revealed that the normalized residual sum of squares test performed well regarding the type I error probability and the power for a wrong link function. We applied the proposed goodness-of-fit tests to the analysis of cross-sectional data of patients with cancer. We recommend the normalized residual sum of squares test as a goodness-of-fit test in the modified Poisson regression.

修正泊松回归使用泊松准似然估计方程和稳健方差来估计对数二叉回归模型中的回归参数,是估计二元结果分析中调整风险和流行率的有用工具。虽然针对其他二元回归已开发出几种拟合优度检验方法,但针对修正泊松回归的拟合优度检验方法却寥寥无几。在本研究中,我们为修正的泊松回归提出了几种拟合优度检验,包括修正的经验方差 Hosmer-Lemeshow 检验、Tsiatis 检验、二项式方差和泊松方差的归一化皮尔逊方差检验以及归一化残差平方和检验。原始的 Hosmer-Lemeshow 检验和二项方差归一化皮尔逊卡方检验不适合修正的泊松回归,因为修正的泊松回归由于参数空间不受约束,可能产生一个超过 1 的拟合值。模拟研究表明,归一化残差平方和检验在 I 类错误概率和错误链接函数的功率方面表现良好。我们将提出的拟合优度检验应用于癌症患者横截面数据的分析。我们推荐将归一化残差平方和检验作为修正泊松回归的拟合优度检验。
{"title":"Goodness-of-fit tests for modified Poisson regression possibly producing fitted values exceeding one in binary outcome analysis.","authors":"Yasuhiro Hagiwara, Yutaka Matsuyama","doi":"10.1177/09622802241254220","DOIUrl":"10.1177/09622802241254220","url":null,"abstract":"<p><p>Modified Poisson regression, which estimates the regression parameters in the log-binomial regression model using the Poisson quasi-likelihood estimating equation and robust variance, is a useful tool for estimating the adjusted risk and prevalence ratio in binary outcome analysis. Although several goodness-of-fit tests have been developed for other binary regressions, few goodness-of-fit tests are available for modified Poisson regression. In this study, we proposed several goodness-of-fit tests for modified Poisson regression, including the modified Hosmer-Lemeshow test with empirical variance, Tsiatis test, normalized Pearson chi-square tests with binomial variance and Poisson variance, and normalized residual sum of squares test. The original Hosmer-Lemeshow test and normalized Pearson chi-square test with binomial variance are inappropriate for the modified Poisson regression, which can produce a fitted value exceeding 1 owing to the unconstrained parameter space. A simulation study revealed that the normalized residual sum of squares test performed well regarding the type I error probability and the power for a wrong link function. We applied the proposed goodness-of-fit tests to the analysis of cross-sectional data of patients with cancer. We recommend the normalized residual sum of squares test as a goodness-of-fit test in the modified Poisson regression.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1185-1196"},"PeriodicalIF":1.6,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141082304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Retraction notice. 撤稿通知。
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-06-01 Epub Date: 2015-05-17 DOI: 10.1177/0962280215586011
{"title":"Retraction notice.","authors":"","doi":"10.1177/0962280215586011","DOIUrl":"10.1177/0962280215586011","url":null,"abstract":"","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"NP1"},"PeriodicalIF":1.6,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33315488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sample size estimation for stratified cluster randomization trial with survival endpoint. 以生存为终点的分层分组随机试验的样本量估算。
IF 2.3 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-05-01 Epub Date: 2024-03-29 DOI: 10.1177/09622802241236953
Senmiao Ni, Zihang Zhong, Yang Zhao, Feng Chen, Jingwei Wu, Hao Yu, Jianling Bai

Cluster randomization trials with survival endpoint are predominantly used in drug development and clinical care research when drug treatments or interventions are delivered at a group level. Unlike conventional cluster randomization design, stratified cluster randomization design is generally considered more effective in reducing the impacts of imbalanced baseline prognostic factors and varying cluster sizes between groups when these stratification factors are adopted in the design. Failure to account for stratification and cluster size variability may lead to underpowered analysis and inaccurate sample size estimation. Apart from the sample size estimation in unstratified cluster randomization trials, there are no development of an explicit sample size formula for survival endpoint when a stratified cluster randomization design is employed. In this article, we present a closed-form sample size formula based on the stratified cluster log-rank statistics for stratified cluster randomization trials with survival endpoint. It provides an integrated solution for sample size estimation that account for cluster size variation, baseline hazard heterogeneity, and the estimated intracluster correlation coefficient based on the preliminary data. Simulation studies show that the proposed formula provides the appropriate sample size for achieving the desired statistical power under various parameter configurations. A real example of a stratified cluster randomization trial in the population with stable coronary heart disease is presented to illustrate our method.

具有生存终点的分组随机试验主要用于药物开发和临床护理研究,当药物治疗或干预措施在小组层面实施时。与传统的分组随机化设计不同,分层分组随机化设计通常被认为能更有效地减少基线预后因素不平衡和组间分组规模差异的影响。如果不考虑分层和分组规模的变化,可能会导致分析能力不足和样本量估计不准确。除了非分层分组随机试验中的样本量估算外,目前还没有针对采用分层分组随机设计的生存终点制定明确的样本量计算公式。在本文中,我们提出了一种基于分层分组对数rank统计量的封闭式样本量计算公式,适用于有生存终点的分层分组随机试验。它提供了一个综合的样本量估算方案,考虑了分组规模变化、基线危险异质性以及基于初步数据估算的分组内相关系数。模拟研究表明,在各种参数配置下,所提出的公式都能提供适当的样本量,以达到所需的统计功率。为了说明我们的方法,我们举了一个在冠心病稳定期人群中进行分层分组随机试验的实际例子。
{"title":"Sample size estimation for stratified cluster randomization trial with survival endpoint.","authors":"Senmiao Ni, Zihang Zhong, Yang Zhao, Feng Chen, Jingwei Wu, Hao Yu, Jianling Bai","doi":"10.1177/09622802241236953","DOIUrl":"10.1177/09622802241236953","url":null,"abstract":"<p><p>Cluster randomization trials with survival endpoint are predominantly used in drug development and clinical care research when drug treatments or interventions are delivered at a group level. Unlike conventional cluster randomization design, stratified cluster randomization design is generally considered more effective in reducing the impacts of imbalanced baseline prognostic factors and varying cluster sizes between groups when these stratification factors are adopted in the design. Failure to account for stratification and cluster size variability may lead to underpowered analysis and inaccurate sample size estimation. Apart from the sample size estimation in unstratified cluster randomization trials, there are no development of an explicit sample size formula for survival endpoint when a stratified cluster randomization design is employed. In this article, we present a closed-form sample size formula based on the stratified cluster log-rank statistics for stratified cluster randomization trials with survival endpoint. It provides an integrated solution for sample size estimation that account for cluster size variation, baseline hazard heterogeneity, and the estimated intracluster correlation coefficient based on the preliminary data. Simulation studies show that the proposed formula provides the appropriate sample size for achieving the desired statistical power under various parameter configurations. A real example of a stratified cluster randomization trial in the population with stable coronary heart disease is presented to illustrate our method.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"838-857"},"PeriodicalIF":2.3,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140319184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contrast-specific propensity scores for causal inference with multiple interventions. 针对多重干预因果推断的对比度倾向分数。
IF 2.3 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-05-01 Epub Date: 2024-03-18 DOI: 10.1177/09622802241236952
Shasha Han, Joel Goh, Fanwen Meng, Melvin Khee-Shing Leow, Donald B Rubin

Existing methods that use propensity scores for heterogeneous treatment effect estimation on non-experimental data do not readily extend to the case of more than two treatment options. In this work, we develop a new propensity score-based method for heterogeneous treatment effect estimation when there are three or more treatment options, and prove that it generates unbiased estimates. We demonstrate our method on a real patient registry of patients in Singapore with diabetic dyslipidemia. On this dataset, our method generates heterogeneous treatment recommendations for patients among three options: Statins, fibrates, and non-pharmacological treatment to control patients' lipid ratios (total cholesterol divided by high-density lipoprotein level). In our numerical study, our proposed method generated more stable estimates compared to a benchmark method based on a multi-dimensional propensity score.

现有的使用倾向得分对非实验数据进行异质性治疗效果估计的方法并不容易扩展到有两个以上治疗方案的情况。在这项工作中,我们开发了一种基于倾向得分的新方法,用于在有三个或更多治疗方案时进行异质性治疗效果估计,并证明该方法能产生无偏估计值。我们在新加坡糖尿病血脂异常患者的真实登记数据上演示了我们的方法。在这个数据集上,我们的方法在三种治疗方案中为患者提出了不同的治疗建议:他汀类药物、纤维素类药物和控制患者血脂比率(总胆固醇除以高密度脂蛋白水平)的非药物治疗。在我们的数值研究中,与基于多维倾向评分的基准方法相比,我们提出的方法产生的估计值更稳定。
{"title":"Contrast-specific propensity scores for causal inference with multiple interventions.","authors":"Shasha Han, Joel Goh, Fanwen Meng, Melvin Khee-Shing Leow, Donald B Rubin","doi":"10.1177/09622802241236952","DOIUrl":"10.1177/09622802241236952","url":null,"abstract":"<p><p>Existing methods that use propensity scores for heterogeneous treatment effect estimation on non-experimental data do not readily extend to the case of more than two treatment options. In this work, we develop a new propensity score-based method for heterogeneous treatment effect estimation when there are three or more treatment options, and prove that it generates unbiased estimates. We demonstrate our method on a real patient registry of patients in Singapore with diabetic dyslipidemia. On this dataset, our method generates heterogeneous treatment recommendations for patients among three options: Statins, fibrates, and non-pharmacological treatment to control patients' lipid ratios (total cholesterol divided by high-density lipoprotein level). In our numerical study, our proposed method generated more stable estimates compared to a benchmark method based on a multi-dimensional propensity score.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"825-837"},"PeriodicalIF":2.3,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140159035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Interval estimation in three-class receiver operating characteristic analysis: A fairly general approach based on the empirical likelihood. 三类接受者操作特征分析中的区间估计:基于经验似然法的通用方法。
IF 2.3 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-05-01 Epub Date: 2024-03-19 DOI: 10.1177/09622802241238998
Duc-Khanh To, Gianfranco Adimari, Monica Chiogna

The empirical likelihood is a powerful nonparametric tool, that emulates its parametric counterpart-the parametric likelihood-preserving many of its large-sample properties. This article tackles the problem of assessing the discriminatory power of three-class diagnostic tests from an empirical likelihood perspective. In particular, we concentrate on interval estimation in a three-class receiver operating characteristic analysis, where a variety of inferential tasks could be of interest. We present novel theoretical results and tailored techniques studied to efficiently solve some of such tasks. Extensive simulation experiments are provided in a supporting role, with our novel proposals compared to existing competitors, when possible. It emerges that our new proposals are extremely flexible, being able to compete with contestants and appearing suited to accommodating several distributions, such, for example, mixtures, for target populations. We illustrate the application of the novel proposals with a real data example. The article ends with a discussion and a presentation of some directions for future research.

经验似然法是一种功能强大的非参数工具,它仿效参数似然法,保留了参数似然法的许多大样本特性。本文从经验似然的角度出发,探讨了评估三类诊断检测的鉴别力问题。我们尤其关注三类接受者操作特征分析中的区间估计,在这种分析中,各种推断任务都可能引起兴趣。我们提出了新颖的理论结果和量身定制的技术,以有效解决其中的一些任务。我们还提供了大量的模拟实验作为辅助,并在可能的情况下将我们的新建议与现有的竞争者进行比较。结果表明,我们的新建议非常灵活,能够与竞争者竞争,而且似乎适合容纳多种分布,例如目标人群的混合分布。我们用一个真实数据实例来说明新建议的应用。文章最后进行了讨论,并提出了未来研究的一些方向。
{"title":"Interval estimation in three-class receiver operating characteristic analysis: A fairly general approach based on the empirical likelihood.","authors":"Duc-Khanh To, Gianfranco Adimari, Monica Chiogna","doi":"10.1177/09622802241238998","DOIUrl":"10.1177/09622802241238998","url":null,"abstract":"<p><p>The empirical likelihood is a powerful nonparametric tool, that emulates its parametric counterpart-the parametric likelihood-preserving many of its large-sample properties. This article tackles the problem of assessing the discriminatory power of three-class diagnostic tests from an empirical likelihood perspective. In particular, we concentrate on interval estimation in a three-class receiver operating characteristic analysis, where a variety of inferential tasks could be of interest. We present novel theoretical results and tailored techniques studied to efficiently solve some of such tasks. Extensive simulation experiments are provided in a supporting role, with our novel proposals compared to existing competitors, when possible. It emerges that our new proposals are extremely flexible, being able to compete with contestants and appearing suited to accommodating several distributions, such, for example, mixtures, for target populations. We illustrate the application of the novel proposals with a real data example. The article ends with a discussion and a presentation of some directions for future research.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"875-893"},"PeriodicalIF":2.3,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140159036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A matching-based machine learning approach to estimating optimal dynamic treatment regimes with time-to-event outcomes. 一种基于匹配的机器学习方法,用于估算具有时间到事件结果的最佳动态治疗方案。
IF 2.3 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-05-01 Epub Date: 2024-03-19 DOI: 10.1177/09622802241236954
Xuechen Wang, Hyejung Lee, Benjamin Haaland, Kathleen Kerrigan, Sonam Puri, Wallace Akerley, Jincheng Shen

Observational data (e.g. electronic health records) has become increasingly important in evidence-based research on dynamic treatment regimes, which tailor treatments over time to patients based on their characteristics and evolving clinical history. It is of great interest for clinicians and statisticians to identify an optimal dynamic treatment regime that can produce the best expected clinical outcome for each individual and thus maximize the treatment benefit over the population. Observational data impose various challenges for using statistical tools to estimate optimal dynamic treatment regimes. Notably, the task becomes more sophisticated when the clinical outcome of primary interest is time-to-event. Here, we propose a matching-based machine learning method to identify the optimal dynamic treatment regime with time-to-event outcomes subject to right-censoring using electronic health record data. In contrast to the established inverse probability weighting-based dynamic treatment regime methods, our proposed approach provides better protection against model misspecification and extreme weights in the context of treatment sequences, effectively addressing a prevalent challenge in the longitudinal analysis of electronic health record data. In simulations, the proposed method demonstrates robust performance across a range of scenarios. In addition, we illustrate the method with an application to estimate optimal dynamic treatment regimes for patients with advanced non-small cell lung cancer using a real-world, nationwide electronic health record database from Flatiron Health.

观察数据(如电子健康记录)在动态治疗方案的循证研究中变得越来越重要,动态治疗方案是根据患者的特征和不断变化的临床病史,在一段时间内为患者量身定制治疗方案。对于临床医生和统计学家来说,如何确定一种最佳动态治疗方案,使每个人都能获得最佳预期临床结果,从而使整个人群的治疗效益最大化,是一个非常重要的问题。观察数据给使用统计工具估算最佳动态治疗方案带来了各种挑战。值得注意的是,当主要关注的临床结果是时间到事件时,这项任务就变得更加复杂。在此,我们提出了一种基于匹配的机器学习方法,利用电子健康记录数据来识别具有时间到事件结果的最佳动态治疗方案,并对其进行右删减。与已有的基于反概率权重的动态治疗机制方法相比,我们提出的方法能更好地防止治疗序列中的模型错误规范和极端权重,有效地解决了电子健康记录数据纵向分析中普遍存在的难题。在模拟实验中,所提出的方法在各种情况下都表现出稳健的性能。此外,我们还利用 Flatiron Health 公司提供的真实世界、全国范围的电子健康记录数据库,对晚期非小细胞肺癌患者的最佳动态治疗方案进行了估算,以此来说明该方法。
{"title":"A matching-based machine learning approach to estimating optimal dynamic treatment regimes with time-to-event outcomes.","authors":"Xuechen Wang, Hyejung Lee, Benjamin Haaland, Kathleen Kerrigan, Sonam Puri, Wallace Akerley, Jincheng Shen","doi":"10.1177/09622802241236954","DOIUrl":"10.1177/09622802241236954","url":null,"abstract":"<p><p>Observational data (e.g. electronic health records) has become increasingly important in evidence-based research on dynamic treatment regimes, which tailor treatments over time to patients based on their characteristics and evolving clinical history. It is of great interest for clinicians and statisticians to identify an optimal dynamic treatment regime that can produce the best expected clinical outcome for each individual and thus maximize the treatment benefit over the population. Observational data impose various challenges for using statistical tools to estimate optimal dynamic treatment regimes. Notably, the task becomes more sophisticated when the clinical outcome of primary interest is time-to-event. Here, we propose a matching-based machine learning method to identify the optimal dynamic treatment regime with time-to-event outcomes subject to right-censoring using electronic health record data. In contrast to the established inverse probability weighting-based dynamic treatment regime methods, our proposed approach provides better protection against model misspecification and extreme weights in the context of treatment sequences, effectively addressing a prevalent challenge in the longitudinal analysis of electronic health record data. In simulations, the proposed method demonstrates robust performance across a range of scenarios. In addition, we illustrate the method with an application to estimate optimal dynamic treatment regimes for patients with advanced non-small cell lung cancer using a real-world, nationwide electronic health record database from Flatiron Health.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"794-806"},"PeriodicalIF":2.3,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140159034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sample size and power calculation for testing treatment effect heterogeneity in cluster randomized crossover designs 群组随机交叉设计中测试治疗效果异质性的样本量和功率计算
IF 2.3 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-05-01 DOI: 10.1177/09622802241247736
Xueqi Wang, Xinyuan Chen, Keith S Goldfeld, Monica Taljaard, Fan Li
The cluster randomized crossover design has been proposed to improve efficiency over the traditional parallel-arm cluster randomized design. While statistical methods have been developed for designing cluster randomized crossover trials, they have exclusively focused on testing the overall average treatment effect, with little attention to differential treatment effects across subpopulations. Recently, interest has grown in understanding whether treatment effects may vary across pre-specified patient subpopulations, such as those defined by demographic or clinical characteristics. In this article, we consider the two-treatment two-period cluster randomized crossover design under either a cross-sectional or closed-cohort sampling scheme, where it is of interest to detect the heterogeneity of treatment effect via an interaction test. Assuming a patterned correlation structure for both the covariate and the outcome, we derive new sample size formulas for testing the heterogeneity of treatment effect with continuous outcomes based on linear mixed models. Our formulas also address unequal cluster sizes and therefore allow us to analytically assess the impact of unequal cluster sizes on the power of the interaction test in cluster randomized crossover designs. We conduct simulations to confirm the accuracy of the proposed methods, and illustrate their application in two real cluster randomized crossover trials.
与传统的平行臂分组随机设计相比,分组随机交叉设计被提出来提高效率。虽然已开发出设计分组随机交叉试验的统计方法,但这些方法只侧重于测试总体平均治疗效果,而很少关注不同亚群的不同治疗效果。最近,人们越来越关注了解治疗效果是否会因预先指定的患者亚群(如根据人口统计或临床特征定义的亚群)而有所不同。在本文中,我们考虑了在横截面或封闭队列抽样方案下的两疗程两阶段群组随机交叉设计,在这种情况下,我们有兴趣通过交互检验来检测治疗效果的异质性。假设协变量和结果都具有模式化的相关结构,我们基于线性混合模型推导出了新的样本量公式,用于检验连续结果的治疗效果异质性。我们的公式还解决了不等群组规模的问题,因此可以分析评估不等群组规模对群组随机交叉设计中交互作用检验功率的影响。我们进行了模拟以证实所提方法的准确性,并在两个真实的分组随机交叉试验中说明了这些方法的应用。
{"title":"Sample size and power calculation for testing treatment effect heterogeneity in cluster randomized crossover designs","authors":"Xueqi Wang, Xinyuan Chen, Keith S Goldfeld, Monica Taljaard, Fan Li","doi":"10.1177/09622802241247736","DOIUrl":"https://doi.org/10.1177/09622802241247736","url":null,"abstract":"The cluster randomized crossover design has been proposed to improve efficiency over the traditional parallel-arm cluster randomized design. While statistical methods have been developed for designing cluster randomized crossover trials, they have exclusively focused on testing the overall average treatment effect, with little attention to differential treatment effects across subpopulations. Recently, interest has grown in understanding whether treatment effects may vary across pre-specified patient subpopulations, such as those defined by demographic or clinical characteristics. In this article, we consider the two-treatment two-period cluster randomized crossover design under either a cross-sectional or closed-cohort sampling scheme, where it is of interest to detect the heterogeneity of treatment effect via an interaction test. Assuming a patterned correlation structure for both the covariate and the outcome, we derive new sample size formulas for testing the heterogeneity of treatment effect with continuous outcomes based on linear mixed models. Our formulas also address unequal cluster sizes and therefore allow us to analytically assess the impact of unequal cluster sizes on the power of the interaction test in cluster randomized crossover designs. We conduct simulations to confirm the accuracy of the proposed methods, and illustrate their application in two real cluster randomized crossover trials.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":"31 1","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140834015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using joint models for longitudinal and time-to-event data to investigate the causal effect of salvage therapy after prostatectomy. 利用纵向数据和时间到事件数据的联合模型,研究前列腺切除术后挽救疗法的因果效应。
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-05-01 Epub Date: 2024-03-19 DOI: 10.1177/09622802241239003
Dimitris Rizopoulos, Jeremy Mg Taylor, Grigorios Papageorgiou, Todd M Morgan

Prostate cancer patients who undergo prostatectomy are closely monitored for recurrence and metastasis using routine prostate-specific antigen measurements. When prostate-specific antigen levels rise, salvage therapies are recommended in order to decrease the risk of metastasis. However, due to the side effects of these therapies and to avoid over-treatment, it is important to understand which patients and when to initiate these salvage therapies. In this work, we use the University of Michigan Prostatectomy Registry Data to tackle this question. Due to the observational nature of this data, we face the challenge that prostate-specific antigen is simultaneously a time-varying confounder and an intermediate variable for salvage therapy. We define different causal salvage therapy effects defined conditionally on different specifications of the longitudinal prostate-specific antigen history. We then illustrate how these effects can be estimated using the framework of joint models for longitudinal and time-to-event data. All proposed methodology is implemented in the freely-available R package JMbayes2.

对接受前列腺切除术的前列腺癌患者进行常规前列腺特异性抗原测量,密切监测复发和转移情况。当前列腺特异性抗原水平升高时,建议采用挽救疗法,以降低转移风险。然而,由于这些疗法存在副作用,为了避免过度治疗,了解哪些患者以及何时启动这些挽救性疗法非常重要。在这项研究中,我们利用密歇根大学前列腺切除术登记数据来解决这个问题。由于该数据的观察性质,我们面临的挑战是前列腺特异性抗原既是时变混杂因素,又是挽救疗法的中间变量。我们根据纵向前列腺特异性抗原历史的不同规格定义了不同的挽救治疗因果效应。然后,我们说明了如何利用纵向数据和时间到事件数据的联合模型框架来估算这些效应。所有建议的方法都在免费提供的 R 软件包 JMbayes2 中实现。
{"title":"Using joint models for longitudinal and time-to-event data to investigate the causal effect of salvage therapy after prostatectomy.","authors":"Dimitris Rizopoulos, Jeremy Mg Taylor, Grigorios Papageorgiou, Todd M Morgan","doi":"10.1177/09622802241239003","DOIUrl":"10.1177/09622802241239003","url":null,"abstract":"<p><p>Prostate cancer patients who undergo prostatectomy are closely monitored for recurrence and metastasis using routine prostate-specific antigen measurements. When prostate-specific antigen levels rise, salvage therapies are recommended in order to decrease the risk of metastasis. However, due to the side effects of these therapies and to avoid over-treatment, it is important to understand which patients and when to initiate these salvage therapies. In this work, we use the University of Michigan Prostatectomy Registry Data to tackle this question. Due to the observational nature of this data, we face the challenge that prostate-specific antigen is simultaneously a time-varying confounder and an intermediate variable for salvage therapy. We define different causal salvage therapy effects defined conditionally on different specifications of the longitudinal prostate-specific antigen history. We then illustrate how these effects can be estimated using the framework of joint models for longitudinal and time-to-event data. All proposed methodology is implemented in the freely-available R package <b>JMbayes2</b>.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"894-908"},"PeriodicalIF":1.6,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11041089/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140159037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparisons of various estimates of the I2 statistic for quantifying between-study heterogeneity in meta-analysis. 用于量化荟萃分析中研究间异质性的各种 I2 统计估计值的比较。
IF 2.3 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-05-01 Epub Date: 2024-03-19 DOI: 10.1177/09622802241231496
Yipeng Wang, Natalie DelRocco, Lifeng Lin

Assessing heterogeneity between studies is a critical step in determining whether studies can be combined and whether the synthesized results are reliable. The I2 statistic has been a popular measure for quantifying heterogeneity, but its usage has been challenged from various perspectives in recent years. In particular, it should not be considered an absolute measure of heterogeneity, and it could be subject to large uncertainties. As such, when using I2 to interpret the extent of heterogeneity, it is essential to account for its interval estimate. Various point and interval estimators exist for I2. This article summarizes these estimators. In addition, we performed a simulation study under different scenarios to investigate preferable point and interval estimates of I2. We found that the Sidik-Jonkman method gave precise point estimates for I2 when the between-study variance was large, while in other cases, the DerSimonian-Laird method was suggested to estimate I2. When the effect measure was the mean difference or the standardized mean difference, the Q-profile method, the Biggerstaff-Jackson method, or the Jackson method was suggested to calculate the interval estimate for I2 due to reasonable interval length and more reliable coverage probabilities than various alternatives. For the same reason, the Kulinskaya-Dollinger method was recommended to calculate the interval estimate for I2 when the effect measure was the log odds ratio.

评估研究之间的异质性是决定研究是否可以合并以及综合结果是否可靠的关键步骤。I2 统计量一直是量化异质性的常用指标,但近年来它的使用受到了多方面的质疑。特别是,它不应被视为衡量异质性的绝对指标,而且可能存在很大的不确定性。因此,在使用 I2 解释异质性程度时,必须考虑其区间估计值。I2 有多种点估计值和区间估计值。本文总结了这些估计器。此外,我们还进行了一项不同情况下的模拟研究,以探讨更可取的 I2 点估计值和区间估计值。我们发现,当研究间方差较大时,Sidik-Jonkman 方法能给出精确的 I2 点估计值,而在其他情况下,建议使用 DerSimonian-Laird 方法来估计 I2。当效应度量为均值差异或标准化均值差异时,建议采用 Q-profile法、Biggerstaff-Jackson法或Jackson法计算I2的区间估计值,因为与其他方法相比,Q-profile法的区间长度合理,覆盖概率更可靠。出于同样的原因,当效应测量值为对数几率比率时,建议使用 Kulinskaya-Dollinger 方法计算 I2 的区间估计值。
{"title":"<ArticleTitle xmlns:ns0=\"http://www.w3.org/1998/Math/MathML\">Comparisons of various estimates of the <ns0:math><ns0:mrow><ns0:msup><ns0:mi>I</ns0:mi><ns0:mn>2</ns0:mn></ns0:msup></ns0:mrow></ns0:math> statistic for quantifying between-study heterogeneity in meta-analysis.","authors":"Yipeng Wang, Natalie DelRocco, Lifeng Lin","doi":"10.1177/09622802241231496","DOIUrl":"10.1177/09622802241231496","url":null,"abstract":"<p><p>Assessing heterogeneity between studies is a critical step in determining whether studies can be combined and whether the synthesized results are reliable. The <math><mrow><msup><mi>I</mi><mn>2</mn></msup></mrow></math> statistic has been a popular measure for quantifying heterogeneity, but its usage has been challenged from various perspectives in recent years. In particular, it should not be considered an absolute measure of heterogeneity, and it could be subject to large uncertainties. As such, when using <math><mrow><msup><mi>I</mi><mn>2</mn></msup></mrow></math> to interpret the extent of heterogeneity, it is essential to account for its interval estimate. Various point and interval estimators exist for <math><mrow><msup><mi>I</mi><mn>2</mn></msup></mrow></math>. This article summarizes these estimators. In addition, we performed a simulation study under different scenarios to investigate preferable point and interval estimates of <math><mrow><msup><mi>I</mi><mn>2</mn></msup></mrow></math>. We found that the Sidik-Jonkman method gave precise point estimates for <math><mrow><msup><mi>I</mi><mn>2</mn></msup></mrow></math> when the between-study variance was large, while in other cases, the DerSimonian-Laird method was suggested to estimate <math><mrow><msup><mi>I</mi><mn>2</mn></msup></mrow></math>. When the effect measure was the mean difference or the standardized mean difference, the <math><mi>Q</mi></math>-profile method, the Biggerstaff-Jackson method, or the Jackson method was suggested to calculate the interval estimate for <math><mrow><msup><mi>I</mi><mn>2</mn></msup></mrow></math> due to reasonable interval length and more reliable coverage probabilities than various alternatives. For the same reason, the Kulinskaya-Dollinger method was recommended to calculate the interval estimate for <math><mrow><msup><mi>I</mi><mn>2</mn></msup></mrow></math> when the effect measure was the log odds ratio.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"745-764"},"PeriodicalIF":2.3,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140159017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistical Methods in Medical Research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1