以多平台随机临床试验为动力，比较贝叶斯和频数监测边界。

IF 2.2 3区医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL Clinical Trials Pub Date : 2024-12-01 Epub Date: 2024-05-17 DOI:10.1177/17407745241244801

Jungnam Joo, Eric S Leifer, Michael A Proschan, James F Troendle, Harmony R Reynolds, Erinn A Hade, Patrick R Lawler, Dong-Yun Kim, Nancy L Geller

{"title":"以多平台随机临床试验为动力，比较贝叶斯和频数监测边界。","authors":"Jungnam Joo, Eric S Leifer, Michael A Proschan, James F Troendle, Harmony R Reynolds, Erinn A Hade, Patrick R Lawler, Dong-Yun Kim, Nancy L Geller","doi":"10.1177/17407745241244801","DOIUrl":null,"url":null,"abstract":"Background: The coronavirus disease 2019 pandemic highlighted the need to conduct efficient randomized clinical trials with interim monitoring guidelines for efficacy and futility. Several randomized coronavirus disease 2019 trials, including the Multiplatform Randomized Clinical Trial (mpRCT), used Bayesian guidelines with the belief that they would lead to quicker efficacy or futility decisions than traditional \"frequentist\" guidelines, such as spending functions and conditional power. We explore this belief using an intuitive interpretation of Bayesian methods as translating prior opinion about the treatment effect into imaginary prior data. These imaginary observations are then combined with actual observations from the trial to make conclusions. Using this approach, we show that the Bayesian efficacy boundary used in mpRCT is actually quite similar to the frequentist Pocock boundary.Methods: The mpRCT's efficacy monitoring guideline considered stopping if, given the observed data, there was greater than 99% probability that the treatment was effective (odds ratio greater than 1). The mpRCT's futility monitoring guideline considered stopping if, given the observed data, there was greater than 95% probability that the treatment was less than 20% effective (odds ratio less than 1.2). The mpRCT used a normal prior distribution that can be thought of as supplementing the actual patients' data with imaginary patients' data. We explore the effects of varying probability thresholds and the prior-to-actual patient ratio in the mpRCT and compare the resulting Bayesian efficacy monitoring guidelines to the well-known frequentist Pocock and O'Brien-Fleming efficacy guidelines. We also contrast Bayesian futility guidelines with a more traditional 20% conditional power futility guideline.Results: A Bayesian efficacy and futility monitoring boundary using a neutral, weakly informative prior distribution and a fixed probability threshold at all interim analyses is more aggressive than the commonly used O'Brien-Fleming efficacy boundary coupled with a 20% conditional power threshold for futility. The trade-off is that more aggressive boundaries tend to stop trials earlier, but incur a loss of power. Interestingly, the Bayesian efficacy boundary with 99% probability threshold is very similar to the classic Pocock efficacy boundary.Conclusions: In a pandemic where quickly weeding out ineffective treatments and identifying effective treatments is paramount, aggressive monitoring may be preferred to conservative approaches, such as the O'Brien-Fleming boundary. This can be accomplished with either Bayesian or frequentist methods.","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"701-709"},"PeriodicalIF":2.2000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11530333/pdf/","citationCount":"0","resultStr":"{\"title\":\"Comparison of Bayesian and frequentist monitoring boundaries motivated by the Multiplatform Randomized Clinical Trial.\",\"authors\":\"Jungnam Joo, Eric S Leifer, Michael A Proschan, James F Troendle, Harmony R Reynolds, Erinn A Hade, Patrick R Lawler, Dong-Yun Kim, Nancy L Geller\",\"doi\":\"10.1177/17407745241244801\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: The coronavirus disease 2019 pandemic highlighted the need to conduct efficient randomized clinical trials with interim monitoring guidelines for efficacy and futility. Several randomized coronavirus disease 2019 trials, including the Multiplatform Randomized Clinical Trial (mpRCT), used Bayesian guidelines with the belief that they would lead to quicker efficacy or futility decisions than traditional \\\"frequentist\\\" guidelines, such as spending functions and conditional power. We explore this belief using an intuitive interpretation of Bayesian methods as translating prior opinion about the treatment effect into imaginary prior data. These imaginary observations are then combined with actual observations from the trial to make conclusions. Using this approach, we show that the Bayesian efficacy boundary used in mpRCT is actually quite similar to the frequentist Pocock boundary.Methods: The mpRCT's efficacy monitoring guideline considered stopping if, given the observed data, there was greater than 99% probability that the treatment was effective (odds ratio greater than 1). The mpRCT's futility monitoring guideline considered stopping if, given the observed data, there was greater than 95% probability that the treatment was less than 20% effective (odds ratio less than 1.2). The mpRCT used a normal prior distribution that can be thought of as supplementing the actual patients' data with imaginary patients' data. We explore the effects of varying probability thresholds and the prior-to-actual patient ratio in the mpRCT and compare the resulting Bayesian efficacy monitoring guidelines to the well-known frequentist Pocock and O'Brien-Fleming efficacy guidelines. We also contrast Bayesian futility guidelines with a more traditional 20% conditional power futility guideline.Results: A Bayesian efficacy and futility monitoring boundary using a neutral, weakly informative prior distribution and a fixed probability threshold at all interim analyses is more aggressive than the commonly used O'Brien-Fleming efficacy boundary coupled with a 20% conditional power threshold for futility. The trade-off is that more aggressive boundaries tend to stop trials earlier, but incur a loss of power. Interestingly, the Bayesian efficacy boundary with 99% probability threshold is very similar to the classic Pocock efficacy boundary.Conclusions: In a pandemic where quickly weeding out ineffective treatments and identifying effective treatments is paramount, aggressive monitoring may be preferred to conservative approaches, such as the O'Brien-Fleming boundary. This can be accomplished with either Bayesian or frequentist methods.\",\"PeriodicalId\":10685,\"journal\":{\"name\":\"Clinical Trials\",\"volume\":\" \",\"pages\":\"701-709\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11530333/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinical Trials\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1177/17407745241244801\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/5/17 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Trials","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/17407745241244801","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/5/17 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

摘要

背景：冠状病毒病2019年大流行凸显了开展高效随机临床试验的必要性，并对疗效和无效性制定了中期监测指南。包括 "多平台随机临床试验"（mpRCT）在内的几项冠状病毒病 2019 年随机临床试验使用了贝叶斯指南，认为与传统的 "频繁主义 "指南（如支出函数和条件幂）相比，贝叶斯指南能更快地做出疗效或无效决定。我们利用贝叶斯方法的直观解释来探讨这一信念，即把关于治疗效果的先验观点转化为假想的先验数据。然后将这些假想的观察结果与试验中的实际观察结果相结合，从而得出结论。利用这种方法，我们证明了 mpRCT 中使用的贝叶斯疗效边界实际上与频数主义的 Pocock 边界非常相似：mpRCT疗效监测指南认为，如果根据观察到的数据，治疗有效的可能性大于99%（几率比大于1），则应停止治疗。如果根据观察到的数据，治疗有效率低于 20% 的可能性大于 95%（几率比小于 1.2），则 mpRCT 的无效性监测准则认为应停止治疗。mpRCT 使用的是正态先验分布，可以认为是用假想患者数据对实际患者数据的补充。我们探讨了mpRCT中不同概率阈值和先验与实际患者比率的影响，并将得出的贝叶斯疗效监测指南与著名的频数主义Pocock和O'Brien-Fleming疗效指南进行了比较。我们还将贝叶斯无效性指南与更传统的20%条件功率无效性指南进行了对比：结果：使用中性、弱信息先验分布和所有中期分析的固定概率阈值的贝叶斯疗效和无效性监测边界比常用的奥布莱恩-弗莱明疗效边界和 20% 条件功率无效性阈值更为激进。权衡的结果是，更激进的界限往往会更早地停止试验，但却会导致功率损失。有趣的是，概率阈值为 99% 的贝叶斯疗效边界与经典的 Pocock 疗效边界非常相似：结论：在大流行病中，快速剔除无效治疗方法并确定有效治疗方法是最重要的，与保守的方法（如奥布莱恩-弗莱明边界）相比，积极的监测可能更受欢迎。这可以通过贝叶斯法或频数法来实现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Comparison of Bayesian and frequentist monitoring boundaries motivated by the Multiplatform Randomized Clinical Trial.

Background: The coronavirus disease 2019 pandemic highlighted the need to conduct efficient randomized clinical trials with interim monitoring guidelines for efficacy and futility. Several randomized coronavirus disease 2019 trials, including the Multiplatform Randomized Clinical Trial (mpRCT), used Bayesian guidelines with the belief that they would lead to quicker efficacy or futility decisions than traditional "frequentist" guidelines, such as spending functions and conditional power. We explore this belief using an intuitive interpretation of Bayesian methods as translating prior opinion about the treatment effect into imaginary prior data. These imaginary observations are then combined with actual observations from the trial to make conclusions. Using this approach, we show that the Bayesian efficacy boundary used in mpRCT is actually quite similar to the frequentist Pocock boundary.

Methods: The mpRCT's efficacy monitoring guideline considered stopping if, given the observed data, there was greater than 99% probability that the treatment was effective (odds ratio greater than 1). The mpRCT's futility monitoring guideline considered stopping if, given the observed data, there was greater than 95% probability that the treatment was less than 20% effective (odds ratio less than 1.2). The mpRCT used a normal prior distribution that can be thought of as supplementing the actual patients' data with imaginary patients' data. We explore the effects of varying probability thresholds and the prior-to-actual patient ratio in the mpRCT and compare the resulting Bayesian efficacy monitoring guidelines to the well-known frequentist Pocock and O'Brien-Fleming efficacy guidelines. We also contrast Bayesian futility guidelines with a more traditional 20% conditional power futility guideline.

Results: A Bayesian efficacy and futility monitoring boundary using a neutral, weakly informative prior distribution and a fixed probability threshold at all interim analyses is more aggressive than the commonly used O'Brien-Fleming efficacy boundary coupled with a 20% conditional power threshold for futility. The trade-off is that more aggressive boundaries tend to stop trials earlier, but incur a loss of power. Interestingly, the Bayesian efficacy boundary with 99% probability threshold is very similar to the classic Pocock efficacy boundary.

Conclusions: In a pandemic where quickly weeding out ineffective treatments and identifying effective treatments is paramount, aggressive monitoring may be preferred to conservative approaches, such as the O'Brien-Fleming boundary. This can be accomplished with either Bayesian or frequentist methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Clinical Trials 医学-医学：研究与实验

CiteScore

4.10

自引率

3.70%

发文量

审稿时长

6-12 weeks

期刊介绍： Clinical Trials is dedicated to advancing knowledge on the design and conduct of clinical trials related research methodologies. Covering the design, conduct, analysis, synthesis and evaluation of key methodologies, the journal remains on the cusp of the latest topics, including ethics, regulation and policy impact.