首页 > 最新文献

Clinical Trials最新文献

英文 中文
Concordance between clinical trial data use request proposals and corresponding publications: A cross-sectional study. 临床试验数据使用请求提案与相应出版物之间的一致性:一项横断面研究。
IF 2.2 3区 医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL Pub Date : 2025-06-01 Epub Date: 2024-12-29 DOI: 10.1177/17407745241304355
Enrique Vazquez, Joseph S Ross, Cary P Gross, Karla Childers, Stephen Bamford, Jessica D Ritchie, Joanne Waldstreicher, Harlan M Krumholz, Joshua D Wallach
<p><p>Background/AimsThe reuse of clinical trial data available through data-sharing platforms has grown over the past decade. Several prominent clinical data-sharing platforms require researchers to submit formal research proposals before granting data access, providing an opportunity to evaluate how published analyses compare with initially proposed aims. We evaluated the concordance between the included trials, study objectives, endpoints, and statistical methods specified in researchers' clinical trial data use request proposals to four clinical data-sharing platforms and their corresponding publications.MethodsWe identified all unique data request proposals with at least one corresponding peer-reviewed publication as of 31 March 2023 on four prominent clinical trial data sharing request platforms (Vivli, ClinicalStudyDataRequest.com, the Yale Open Data Access Project, and Supporting Open Access to Researchers-Bristol Myers Squibb). When data requests had multiple publications, we treated each publication-request pair as a unit. For each pair, the trials requested and analyzed were classified as fully concordant, discordant, or unclear, whereas the study objectives, primary and secondary endpoints, and statistical methods were classified as fully concordant, partially concordant, discordant, or unclear. For Vivli, ClinicalStudyDataRequest.com, and Supporting Open Access to Researchers-Bristol Myers Squibb, endpoints of publication-request pairs were not compared because the data request proposals on these platforms do not consistently report this information.ResultsOf 117 Vivli publication-request pairs, 76 (65.0%) were fully concordant for the trials requested and analyzed, 61 (52.1%) for study objectives, and 57 (48.7%) for statistical methods; 35 (29.9%) pairs were fully concordant across the 3 characteristics reported by all platforms. Of 106 ClinicalStudyDataRequest.com publication-request pairs, 66 (62.3%) were fully concordant for the trials requested and analyzed, 41 (38.7%) for study objectives, and 35 (33.0%) for statistical methods; 20 (18.9%) pairs were fully concordant across the 3 characteristics. Of 65 Yale Open Data Access Project publication-request pairs, 35 (53.8%) were fully concordant for the trials requested and analyzed, 44 (67.7%) for primary study objectives, and 25 (38.5%) for statistical methods; 15 (23.1%) pairs were fully concordant across the 3 characteristics. In addition, 26 (40.0%) and 2 (3.1%) Yale Open Data Access Project publication-request pairs were concordant for primary and secondary endpoints, respectively, such that only one (1.5%) Yale Open Data Access Project publication-request pair was fully concordant across all five characteristics reported. Of three Supporting Open Access to Researchers-Bristol Myers Squibb publication-request pairs, one (33.3%) was fully concordant for the trials requested and analyzed, two (66.6%) for primary study objectives, and two (66.6%) for statistical methods; one (33.
背景/目的通过数据共享平台获得的临床试验数据的重用在过去十年中有所增长。一些著名的临床数据共享平台要求研究人员在授予数据访问权限之前提交正式的研究提案,这为评估已发表的分析与最初提出的目标的比较提供了机会。我们评估了纳入的试验、研究目标、终点和研究人员向四个临床数据共享平台及其相应出版物提交的临床试验数据使用请求中指定的统计方法之间的一致性。方法:我们在四个著名的临床试验数据共享请求平台(Vivli、ClinicalStudyDataRequest.com、耶鲁大学开放数据获取项目和支持研究人员开放获取- bristol Myers Squibb)上识别了截至2023年3月31日至少有一篇同行评审出版物的所有独特数据请求提案。当数据请求有多个发布时,我们将每个发布-请求对视为一个单元。对于每一对,要求和分析的试验被分类为完全一致、不一致或不清楚,而研究目标、主要和次要终点和统计方法被分类为完全一致、部分一致、不一致或不清楚。对于Vivli, ClinicalStudyDataRequest.com和support Open Access to Researchers-Bristol Myers Squibb,没有比较发表请求对的端点,因为这些平台上的数据请求建议没有一致地报告这些信息。结果117对Vivli发表请求对中,76对(65.0%)与请求和分析的试验完全一致,61对(52.1%)与研究目标完全一致,57对(48.7%)与统计方法完全一致;35对(29.9%)对在所有平台报告的3个特征上完全一致。在106对ClinicalStudyDataRequest.com发表请求对中,66对(62.3%)对所请求和分析的试验完全一致,41对(38.7%)对研究目标完全一致,35对(33.0%)对统计方法完全一致;3个性状完全一致的有20对(18.9%)。在65对耶鲁开放数据获取项目发表请求对中,35对(53.8%)与请求和分析的试验完全一致,44对(67.7%)与主要研究目标完全一致,25对(38.5%)与统计方法完全一致;3个性状完全一致的有15对(23.1%)。此外,26对(40.0%)和2对(3.1%)耶鲁开放数据访问项目出版请求对分别在主要和次要终点上是一致的,因此只有1对(1.5%)耶鲁开放数据访问项目出版请求对在报告的所有五个特征上是完全一致的。在3对支持开放获取研究人员-百时美施贵宝出版请求对中,1对(33.3%)与请求和分析的试验完全一致,2对(66.6%)与主要研究目标完全一致,2对(66.6%)与统计方法完全一致;一个(33.3%)对在所有平台报告的所有三个特征上完全一致。结论在四个临床数据共享平台中,数据请求提案往往与其相应的出版物不一致,每个平台报告的所有三个关键提案特征只有25%的一致性。研究人员有机会在其出版物中描述任何数据共享请求建议偏差,平台也有机会加强对关键研究特征规范的报告。
{"title":"Concordance between clinical trial data use request proposals and corresponding publications: A cross-sectional study.","authors":"Enrique Vazquez, Joseph S Ross, Cary P Gross, Karla Childers, Stephen Bamford, Jessica D Ritchie, Joanne Waldstreicher, Harlan M Krumholz, Joshua D Wallach","doi":"10.1177/17407745241304355","DOIUrl":"10.1177/17407745241304355","url":null,"abstract":"&lt;p&gt;&lt;p&gt;Background/AimsThe reuse of clinical trial data available through data-sharing platforms has grown over the past decade. Several prominent clinical data-sharing platforms require researchers to submit formal research proposals before granting data access, providing an opportunity to evaluate how published analyses compare with initially proposed aims. We evaluated the concordance between the included trials, study objectives, endpoints, and statistical methods specified in researchers' clinical trial data use request proposals to four clinical data-sharing platforms and their corresponding publications.MethodsWe identified all unique data request proposals with at least one corresponding peer-reviewed publication as of 31 March 2023 on four prominent clinical trial data sharing request platforms (Vivli, ClinicalStudyDataRequest.com, the Yale Open Data Access Project, and Supporting Open Access to Researchers-Bristol Myers Squibb). When data requests had multiple publications, we treated each publication-request pair as a unit. For each pair, the trials requested and analyzed were classified as fully concordant, discordant, or unclear, whereas the study objectives, primary and secondary endpoints, and statistical methods were classified as fully concordant, partially concordant, discordant, or unclear. For Vivli, ClinicalStudyDataRequest.com, and Supporting Open Access to Researchers-Bristol Myers Squibb, endpoints of publication-request pairs were not compared because the data request proposals on these platforms do not consistently report this information.ResultsOf 117 Vivli publication-request pairs, 76 (65.0%) were fully concordant for the trials requested and analyzed, 61 (52.1%) for study objectives, and 57 (48.7%) for statistical methods; 35 (29.9%) pairs were fully concordant across the 3 characteristics reported by all platforms. Of 106 ClinicalStudyDataRequest.com publication-request pairs, 66 (62.3%) were fully concordant for the trials requested and analyzed, 41 (38.7%) for study objectives, and 35 (33.0%) for statistical methods; 20 (18.9%) pairs were fully concordant across the 3 characteristics. Of 65 Yale Open Data Access Project publication-request pairs, 35 (53.8%) were fully concordant for the trials requested and analyzed, 44 (67.7%) for primary study objectives, and 25 (38.5%) for statistical methods; 15 (23.1%) pairs were fully concordant across the 3 characteristics. In addition, 26 (40.0%) and 2 (3.1%) Yale Open Data Access Project publication-request pairs were concordant for primary and secondary endpoints, respectively, such that only one (1.5%) Yale Open Data Access Project publication-request pair was fully concordant across all five characteristics reported. Of three Supporting Open Access to Researchers-Bristol Myers Squibb publication-request pairs, one (33.3%) was fully concordant for the trials requested and analyzed, two (66.6%) for primary study objectives, and two (66.6%) for statistical methods; one (33.","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":"22 3","pages":"279-288"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12095927/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144109862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sequential monitoring of time-to-event safety endpoints in clinical trials. 临床试验中时间到事件安全终点的顺序监测。
IF 2.2 3区 医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL Pub Date : 2025-06-01 Epub Date: 2024-12-29 DOI: 10.1177/17407745241304119
Michael J Martens, Qinghua Lian, Nancy L Geller, Eric S Leifer, Brent R Logan
<p><p>Background/aimsSafety monitoring is a crucial requirement for Phase II and III clinical trials. To protect patients from toxicity risk, stopping rules may be implemented that will halt the study if an unexpectedly high number of events occur. These rules are constructed using statistical procedures that typically treat the toxicity data as binary occurrences. Because the exact dates of toxicities are often available, a strategy that handles these as time-to-event data may offer higher power and require less calendar time to identify excess risk. This work investigates several statistical methods for monitoring safety events as time-to-event endpoints and illustrates our R software package for designing and evaluating these procedures.MethodsThe performance metrics of safety stopping rules derived from Wang-Tsiatis tests, Bayesian Gamma-Poisson models, and sequential probability ratio tests are evaluated and contrasted in Phase II and III trial scenarios. We developed a publicly available R package "stoppingrule" for designing and assessing these stopping rules whose utility is illustrated through the design of a stopping rule for Blood and Marrow Transplant Clinical Trials Network 1204 (National Clinical Trial number NCT01998633), a multicenter, Phase II, single-arm trial that assessed the efficacy and safety of bone marrow transplant for the treatment of hemophagocytic lymphohistiocytosis and primary immune deficiencies.ResultsAs seen previously in group sequential testing settings, rules with strict stopping criteria early in a study tend to have more lenient stopping criteria late in the trial. Consequently, methods with aggressive early monitoring, such as Gamma-Poisson models with weak priors and certain choices of truncated sequential probability ratio tests, usually yield a smaller number of toxicities and lower power than ones that are more permissive at early stages, such as Gamma-Poisson models with strong priors and the O'Brien-Fleming test. The Pocock test and maximized sequential probability ratio test performed contrary to these trends, however, exhibiting both diminished power and higher numbers of toxicities than other methods due to their extremely aggressive early stopping criteria, failing to reserve adequate power to identify safety issues beyond the start of the study. In contrast to binary toxicity approaches, our time-to-event methods offer meaningful reductions in expected toxicities of up to 20% across scenarios considered.ConclusionSafety monitoring procedures aim to guard study participants from being exposed to and suffering toxicity from unsafe treatments. Toward this end, we recommend considering the time-to-event-oriented Gamma-Poisson model-weak prior model or truncated sequential probability ratio test for constructing safety stopping rules, as they performed the best in minimizing the number of toxicities in our investigations. Our R package "stoppingrule" offers procedures for creating and assessing stoppi
背景/目的安全监测是II期和III期临床试验的关键要求。为了保护患者免受毒性风险,可能会实施停止规则,如果意外发生大量事件,将停止研究。这些规则是使用统计程序构建的,这些程序通常将毒性数据视为二元事件。由于毒性的确切日期通常是可用的,因此将这些数据作为事件时间数据处理的策略可能会提供更高的能力,并且需要更少的日历时间来识别超额风险。这项工作研究了几种用于监控安全事件的统计方法,并说明了我们设计和评估这些程序的R软件包。方法通过Wang-Tsiatis检验、贝叶斯伽玛泊松模型和序贯概率比检验得出的安全停车规则的性能指标,在II期和III期试验情景下进行评价和对比。我们开发了一个公开可用的R包“停止规则”,用于设计和评估这些停止规则,其效用通过血液和骨髓移植临床试验网络1204(国家临床试验编号NCT01998633)的停止规则的设计来说明,这是一项多中心,II期,单臂试验,评估骨髓移植治疗噬血细胞淋巴组织细胞病和原发性免疫缺陷的有效性和安全性。结果如先前在组序贯试验设置中所见,在研究早期具有严格停止标准的规则往往在试验后期具有更宽松的停止标准。因此,积极的早期监测方法,如具有弱先验的伽马-泊松模型和某些截断顺序概率比测试的选择,通常比在早期阶段更允许的方法产生更少的毒性和更低的功率,如具有强先验的伽马-泊松模型和O'Brien-Fleming测试。然而,Pocock试验和最大化序列概率比试验的结果与这些趋势相反,由于其极端激进的早期停止标准,与其他方法相比,显示出功率降低和毒性数量增加,未能保留足够的功率来识别研究开始后的安全问题。与二元毒性方法相比,我们的时间-事件方法在考虑的各种情况下可将预期毒性降低20%。结论安全监测程序旨在保护研究参与者免受不安全治疗的暴露和毒性。为此,我们建议考虑以时间-事件为导向的伽马-泊松模型-弱先验模型或截断序列概率比检验来构建安全停车规则,因为在我们的研究中,它们在最小化毒性数量方面表现最好。我们的R包“停止规则”提供了创建和评估停止规则的程序,以帮助试验设计。
{"title":"Sequential monitoring of time-to-event safety endpoints in clinical trials.","authors":"Michael J Martens, Qinghua Lian, Nancy L Geller, Eric S Leifer, Brent R Logan","doi":"10.1177/17407745241304119","DOIUrl":"10.1177/17407745241304119","url":null,"abstract":"&lt;p&gt;&lt;p&gt;Background/aimsSafety monitoring is a crucial requirement for Phase II and III clinical trials. To protect patients from toxicity risk, stopping rules may be implemented that will halt the study if an unexpectedly high number of events occur. These rules are constructed using statistical procedures that typically treat the toxicity data as binary occurrences. Because the exact dates of toxicities are often available, a strategy that handles these as time-to-event data may offer higher power and require less calendar time to identify excess risk. This work investigates several statistical methods for monitoring safety events as time-to-event endpoints and illustrates our R software package for designing and evaluating these procedures.MethodsThe performance metrics of safety stopping rules derived from Wang-Tsiatis tests, Bayesian Gamma-Poisson models, and sequential probability ratio tests are evaluated and contrasted in Phase II and III trial scenarios. We developed a publicly available R package \"stoppingrule\" for designing and assessing these stopping rules whose utility is illustrated through the design of a stopping rule for Blood and Marrow Transplant Clinical Trials Network 1204 (National Clinical Trial number NCT01998633), a multicenter, Phase II, single-arm trial that assessed the efficacy and safety of bone marrow transplant for the treatment of hemophagocytic lymphohistiocytosis and primary immune deficiencies.ResultsAs seen previously in group sequential testing settings, rules with strict stopping criteria early in a study tend to have more lenient stopping criteria late in the trial. Consequently, methods with aggressive early monitoring, such as Gamma-Poisson models with weak priors and certain choices of truncated sequential probability ratio tests, usually yield a smaller number of toxicities and lower power than ones that are more permissive at early stages, such as Gamma-Poisson models with strong priors and the O'Brien-Fleming test. The Pocock test and maximized sequential probability ratio test performed contrary to these trends, however, exhibiting both diminished power and higher numbers of toxicities than other methods due to their extremely aggressive early stopping criteria, failing to reserve adequate power to identify safety issues beyond the start of the study. In contrast to binary toxicity approaches, our time-to-event methods offer meaningful reductions in expected toxicities of up to 20% across scenarios considered.ConclusionSafety monitoring procedures aim to guard study participants from being exposed to and suffering toxicity from unsafe treatments. Toward this end, we recommend considering the time-to-event-oriented Gamma-Poisson model-weak prior model or truncated sequential probability ratio test for constructing safety stopping rules, as they performed the best in minimizing the number of toxicities in our investigations. Our R package \"stoppingrule\" offers procedures for creating and assessing stoppi","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":"22 3","pages":"267-278"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12096354/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144109871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Central statistical monitoring in clinical trial management: A scoping review. 临床试验管理中的中心统计监测:范围综述。
IF 2.2 3区 医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL Pub Date : 2025-06-01 Epub Date: 2025-01-02 DOI: 10.1177/17407745241304059
Maciej Fronc, Michał Jakubczyk, Sharon B Love, Susan Talbot, Timothy Rolfe

Background: Clinical trials handle a huge amount of data which can be used during the trial to improve the ongoing study conduct. It is suggested by regulators to implement the remote approach to evaluate clinical trials by analysing collected data. Central statistical monitoring helps to achieve that by employing quantitative methods, the results of which are a basis for decision-making on quality issues.

Methods: This article presents a scoping review which is based on a systematic and iterative approach to identify and synthesise literature on central statistical monitoring methodology. In particular, we investigated the decision-making processes (with emphasis on quality issues) of central statistical monitoring methodology and its place in the clinical trial workflow. We reviewed papers published over the last 10 years in two databases (Scopus and Web of Science) with a focus on data mining algorithms of central statistical monitoring and its benefit to the quality of trials.

Results: As a result, 24 scientific papers were selected for this review, and they consider central statistical monitoring at two levels. First, the perspective of the central statistical monitoring process and its location in the study conduct in terms of quality issues. Second, central statistical monitoring methods categorised into practices applied in the industry, and innovative methods in development. The established methods are discussed through the prism of categories of their usage. In turn, the innovations refer to either research on new methods or extensions to existing ones.

Discussion: Our review suggests directions for further research into central statistical monitoring methodology - including increased application of multivariate analysis and using advanced distance metrics - and guidance on how central statistical monitoring operates in response to regulators' requirements.

背景:临床试验处理大量的数据,这些数据可以在试验期间使用,以改善正在进行的研究行为。监管机构建议实施远程方法,通过分析收集的数据来评估临床试验。中央统计监测通过采用定量方法帮助实现这一目标,其结果是就质量问题作出决策的基础。方法:本文提出了一种基于系统和迭代方法的范围审查,以识别和综合有关中央统计监测方法的文献。特别是,我们调查了中央统计监测方法的决策过程(重点是质量问题)及其在临床试验工作流程中的地位。我们回顾了过去10年在两个数据库(Scopus和Web of Science)中发表的论文,重点关注中央统计监测的数据挖掘算法及其对试验质量的好处。结果:本次综述选取了24篇科学论文,考虑了两个层面的中央统计监测。首先,从中央统计监测过程的角度及其在研究开展方面存在的质量问题。二是将中央统计监测方法分类为行业应用的实践方法和发展中的创新方法。通过其使用类别的棱镜来讨论已建立的方法。反过来,创新指的是对新方法的研究或对现有方法的扩展。讨论:我们的综述提出了进一步研究中央统计监测方法的方向——包括增加多变量分析的应用和使用先进的距离度量——以及关于中央统计监测如何响应监管机构要求的指导。
{"title":"Central statistical monitoring in clinical trial management: A scoping review.","authors":"Maciej Fronc, Michał Jakubczyk, Sharon B Love, Susan Talbot, Timothy Rolfe","doi":"10.1177/17407745241304059","DOIUrl":"10.1177/17407745241304059","url":null,"abstract":"<p><strong>Background: </strong>Clinical trials handle a huge amount of data which can be used during the trial to improve the ongoing study conduct. It is suggested by regulators to implement the remote approach to evaluate clinical trials by analysing collected data. Central statistical monitoring helps to achieve that by employing quantitative methods, the results of which are a basis for decision-making on quality issues.</p><p><strong>Methods: </strong>This article presents a scoping review which is based on a systematic and iterative approach to identify and synthesise literature on central statistical monitoring methodology. In particular, we investigated the decision-making processes (with emphasis on quality issues) of central statistical monitoring methodology and its place in the clinical trial workflow. We reviewed papers published over the last 10 years in two databases (Scopus and Web of Science) with a focus on data mining algorithms of central statistical monitoring and its benefit to the quality of trials.</p><p><strong>Results: </strong>As a result, 24 scientific papers were selected for this review, and they consider central statistical monitoring at two levels. First, the perspective of the central statistical monitoring process and its location in the study conduct in terms of quality issues. Second, central statistical monitoring methods categorised into practices applied in the industry, and innovative methods in development. The established methods are discussed through the prism of categories of their usage. In turn, the innovations refer to either research on new methods or extensions to existing ones.</p><p><strong>Discussion: </strong>Our review suggests directions for further research into central statistical monitoring methodology - including increased application of multivariate analysis and using advanced distance metrics - and guidance on how central statistical monitoring operates in response to regulators' requirements.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"342-351"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7617700/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142913824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of differences between interim and post-interim analysis populations on outcomes of a group sequential trial: Example of the MOVe-OUT study. 中期和中期后分析人群差异对一组序贯试验结果的影响:MOVe-OUT研究的例子。
IF 2.2 3区 医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL Pub Date : 2025-06-01 Epub Date: 2025-03-02 DOI: 10.1177/17407745251313925
Yoseph Caraco, Matthew G Johnson, Joseph A Chiarappa, Brian M Maas, Julie A Stone, Matthew L Rizk, Mary Vesnesky, Julie M Strizki, Angela Williams-Diaz, Michelle L Brown, Patricia Carmelitano, Hong Wan, Alison Pedley, Akshita Chawla, Dominik J Wolf, Jay A Grobler, Amanda Paschke, Carisa De Anda
<p><p>BackgroundPre-specified interim analyses allow for more timely evaluation of efficacy or futility, potentially accelerating decision-making on an investigational intervention. In such an analysis, the randomized, double-blind MOVe-OUT trial demonstrated superiority of molnupiravir over placebo for outpatient treatment of COVID-19 in high-risk patients. In the full analysis population, the point estimate of the treatment difference in the primary endpoint was notably lower than at the interim analysis. We conducted a comprehensive assessment to investigate this unexpected difference in treatment effect size, with the goal of informing future clinical research evaluating treatments for rapidly evolving infectious diseases.MethodsThe modified intention-to-treat population of the MOVe-OUT trial was divided into an interim analysis cohort (i.e. all participants included in the interim analysis; prospectively defined) and a post-interim analysis cohort (i.e. all remaining participants; retrospectively defined). Baseline characteristics (including many well-established prognostic factors for disease progression), clinical outcomes, and virologic outcomes were retrospectively evaluated. The impact of changes in baseline characteristics over time was explored using logistic regression modeling and simulations.ResultsBaseline characteristics were well-balanced between arms overall. However, between- and within-arm differences in known prognostic baseline factors (e.g. comorbidities, SARS-CoV-2 viral load, and anti-SARS-CoV-2 antibody status) were observed for the interim and post-interim analysis cohorts. For the individual factors, these differences were generally minor and otherwise not notable; as the trial progressed, however, these shifts in combination increasingly favored the placebo arm across most of the evaluated factors in the post-interim cohort. Model-based simulations confirmed that the reduction in effect size could be accounted for by these longitudinal trends toward a lower-risk study population among placebo participants. Infectivity and viral load data confirmed that molnupiravir's antiviral activity was consistent across both cohorts, which were heavily dominated by different viral clades (reflecting the rapid SARS-CoV-2 evolution).DiscussionThe cumulative effect of randomly occurring minor differences in prognostic baseline characteristics within and between arms over time, rather than virologic factors such as reduced activity of molnupiravir against evolving variants, likely impacted the observed outcomes. Our results have broader implications for group sequential trials seeking to evaluate treatments for rapidly emerging pathogens. During dynamic epidemic or pandemic conditions, adaptive trials should be designed and interpreted especially carefully, considering that they will likely rapidly enroll a large post-interim overrun population and that even small longitudinal shifts across multiple baseline variables can disproporti
背景:预先指定的中期分析可以更及时地评估疗效或无效性,从而加快对研究干预措施的决策。在这种分析中,随机双盲 MOVe-OUT 试验证明,在门诊治疗 COVID-19 的高危患者中,莫仑吡韦的疗效优于安慰剂。在全面分析人群中,主要终点的治疗差异点估计值明显低于中期分析。我们进行了一次全面评估,以调查治疗效果大小的这一意外差异,目的是为未来评估快速发展的传染病治疗方法的临床研究提供信息:MOVe-OUT试验的修改后意向治疗人群被分为中期分析队列(即所有纳入中期分析的参与者;前瞻性定义)和中期分析后队列(即所有剩余参与者;回顾性定义)。对基线特征(包括许多公认的疾病进展预后因素)、临床结果和病毒学结果进行了回顾性评估。采用逻辑回归模型和模拟方法探讨了基线特征随时间变化的影响:结果:总体而言,各组间的基线特征非常均衡。然而,在中期和中期后分析组别中,观察到已知预后基线因素(如合并症、SARS-CoV-2 病毒载量和抗 SARS-CoV-2 抗体状态)在组别间和组别内存在差异。就单个因素而言,这些差异通常较小,并不显著;但随着试验的进展,在大多数评估因素上,这些综合因素的变化越来越有利于安慰剂治疗组。基于模型的模拟证实,安慰剂参与者中风险较低的研究人群的这些纵向趋势可以解释效应大小的减少。感染率和病毒载量数据证实,molnupiravir的抗病毒活性在两个队列中都是一致的,而这两个队列中主要是不同的病毒支系(反映了SARS-CoV-2的快速演变):讨论:随着时间的推移,随机出现的各组内和各组间预后基线特征的微小差异所产生的累积效应,而不是病毒学因素(如molnupiravir对不断演变的变异株的活性降低),可能会影响观察到的结果。我们的研究结果对寻求评估快速出现的病原体治疗方法的分组序贯试验具有更广泛的意义。在流行病或大流行的动态条件下,适应性试验的设计和解释应特别谨慎,因为这些试验很可能会迅速纳入大量中期后超支人群,而且即使多个基线变量发生微小的纵向变化,也会对不同时间点的预设疗效结果产生不成比例的影响。预后因素的变化可能会带来额外的变异性,而这种变异性很难与流行病学(如致病病原体的进化变化)或疾病管理的时间趋势区分开来(ClinicalTrials.gov: NCT04575597.)。
{"title":"Impact of differences between interim and post-interim analysis populations on outcomes of a group sequential trial: Example of the MOVe-OUT study.","authors":"Yoseph Caraco, Matthew G Johnson, Joseph A Chiarappa, Brian M Maas, Julie A Stone, Matthew L Rizk, Mary Vesnesky, Julie M Strizki, Angela Williams-Diaz, Michelle L Brown, Patricia Carmelitano, Hong Wan, Alison Pedley, Akshita Chawla, Dominik J Wolf, Jay A Grobler, Amanda Paschke, Carisa De Anda","doi":"10.1177/17407745251313925","DOIUrl":"10.1177/17407745251313925","url":null,"abstract":"&lt;p&gt;&lt;p&gt;BackgroundPre-specified interim analyses allow for more timely evaluation of efficacy or futility, potentially accelerating decision-making on an investigational intervention. In such an analysis, the randomized, double-blind MOVe-OUT trial demonstrated superiority of molnupiravir over placebo for outpatient treatment of COVID-19 in high-risk patients. In the full analysis population, the point estimate of the treatment difference in the primary endpoint was notably lower than at the interim analysis. We conducted a comprehensive assessment to investigate this unexpected difference in treatment effect size, with the goal of informing future clinical research evaluating treatments for rapidly evolving infectious diseases.MethodsThe modified intention-to-treat population of the MOVe-OUT trial was divided into an interim analysis cohort (i.e. all participants included in the interim analysis; prospectively defined) and a post-interim analysis cohort (i.e. all remaining participants; retrospectively defined). Baseline characteristics (including many well-established prognostic factors for disease progression), clinical outcomes, and virologic outcomes were retrospectively evaluated. The impact of changes in baseline characteristics over time was explored using logistic regression modeling and simulations.ResultsBaseline characteristics were well-balanced between arms overall. However, between- and within-arm differences in known prognostic baseline factors (e.g. comorbidities, SARS-CoV-2 viral load, and anti-SARS-CoV-2 antibody status) were observed for the interim and post-interim analysis cohorts. For the individual factors, these differences were generally minor and otherwise not notable; as the trial progressed, however, these shifts in combination increasingly favored the placebo arm across most of the evaluated factors in the post-interim cohort. Model-based simulations confirmed that the reduction in effect size could be accounted for by these longitudinal trends toward a lower-risk study population among placebo participants. Infectivity and viral load data confirmed that molnupiravir's antiviral activity was consistent across both cohorts, which were heavily dominated by different viral clades (reflecting the rapid SARS-CoV-2 evolution).DiscussionThe cumulative effect of randomly occurring minor differences in prognostic baseline characteristics within and between arms over time, rather than virologic factors such as reduced activity of molnupiravir against evolving variants, likely impacted the observed outcomes. Our results have broader implications for group sequential trials seeking to evaluate treatments for rapidly emerging pathogens. During dynamic epidemic or pandemic conditions, adaptive trials should be designed and interpreted especially carefully, considering that they will likely rapidly enroll a large post-interim overrun population and that even small longitudinal shifts across multiple baseline variables can disproporti","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"312-324"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143536668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of correlation structure on sample size requirements of statistical methods for multiple binary outcomes: A simulation study. 相关结构对多元二元结果统计方法样本量要求的影响:模拟研究。
IF 2.2 3区 医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL Pub Date : 2025-06-01 Epub Date: 2025-01-03 DOI: 10.1177/17407745241304706
Kanako Fuyama, Kentaro Sakamaki, Kohei Uemura, Isao Yokota

BackgroundIn randomized clinical trials, multiple-testing procedures, composite endpoints, and prioritized outcome approaches are increasingly used to analyze multiple binary outcomes. Previous studies have shown that correlations between outcomes influence their sample size requirements. Although sample size is an important factor affecting the choice of statistical methods, the power and required sample sizes of methods for analyzing multiple binary outcomes have yet to be compared under the influence of outcome correlations.MethodsWe conducted simulations to evaluate the power of co-primary and multiple primary endpoints, composite endpoints, and prioritized outcome approaches based on generalized pairwise comparisons with varying correlations, marginal proportions, treatment effects, and number of outcomes. We then conducted a case study on sample size using a clinical trial of a migraine treatment as an example.ResultsThe correlations significantly affected the statistical power and sample size of composite endpoints. The power and sample size of co-primary endpoints remained relatively stable across different correlations, though their power declined substantially when treatment effects were opposite on some components or more than two components were present. While the correlations influenced the power and sample size of all methods assessed, their direction and degree of influence varied between methods. Notably, the method with the greatest power and smallest sample size also differed depending on the correlations. When the correlations were the same between arms, prioritized outcome approaches usually had higher power and smaller sample sizes than other methods.ConclusionsAnticipated correlations and their uncertainty should be considered when selecting statistical methods. Overall, co-primary endpoints remain a reliable option for evaluating the superiority of all components, although they are unsuitable for assessing the balance between treatment effects pointing in different directions. Generalized pairwise comparisons offer a useful alternative to deal with multiple prioritized outcomes, often providing the smallest sample sizes when the correlation structures are shared between the arms.

背景:在随机临床试验中,越来越多地采用多重测试程序、复合终点和优先结果法来分析多个二元结果。以往的研究表明,结果之间的相关性会影响对样本量的要求。虽然样本量是影响统计方法选择的一个重要因素,但在结果相关性的影响下,分析多个二元结果的方法的功率和所需样本量尚未进行比较:我们进行了模拟实验,以评估共同主要终点和多个主要终点、复合终点以及基于广义配对比较的优先结果方法的功率,这些方法的相关性、边际比例、治疗效果和结果数量各不相同。然后,我们以偏头痛治疗的临床试验为例,对样本量进行了案例研究:结果:相关性极大地影响了综合终点的统计能力和样本量。在不同的相关性下,共同主要终点的统计能力和样本量保持相对稳定,但当治疗效果在某些成分上相反或存在两个以上成分时,其统计能力会大幅下降。虽然相关性会影响所有评估方法的功率和样本量,但其影响方向和程度因方法而异。值得注意的是,功率最大、样本量最小的方法也因相关性而异。当两臂间的相关性相同时,优先结果方法通常比其他方法具有更高的功率和更小的样本量:结论:在选择统计方法时,应考虑预期相关性及其不确定性。总体而言,共同主要终点仍是评估所有成分优劣的可靠选择,尽管它们不适合评估指向不同方向的治疗效果之间的平衡。广义配对比较为处理多个优先结果提供了一种有用的替代方法,当两臂之间共享相关结构时,它往往能提供最小的样本量。
{"title":"Impact of correlation structure on sample size requirements of statistical methods for multiple binary outcomes: A simulation study.","authors":"Kanako Fuyama, Kentaro Sakamaki, Kohei Uemura, Isao Yokota","doi":"10.1177/17407745241304706","DOIUrl":"10.1177/17407745241304706","url":null,"abstract":"<p><p>BackgroundIn randomized clinical trials, multiple-testing procedures, composite endpoints, and prioritized outcome approaches are increasingly used to analyze multiple binary outcomes. Previous studies have shown that correlations between outcomes influence their sample size requirements. Although sample size is an important factor affecting the choice of statistical methods, the power and required sample sizes of methods for analyzing multiple binary outcomes have yet to be compared under the influence of outcome correlations.MethodsWe conducted simulations to evaluate the power of co-primary and multiple primary endpoints, composite endpoints, and prioritized outcome approaches based on generalized pairwise comparisons with varying correlations, marginal proportions, treatment effects, and number of outcomes. We then conducted a case study on sample size using a clinical trial of a migraine treatment as an example.ResultsThe correlations significantly affected the statistical power and sample size of composite endpoints. The power and sample size of co-primary endpoints remained relatively stable across different correlations, though their power declined substantially when treatment effects were opposite on some components or more than two components were present. While the correlations influenced the power and sample size of all methods assessed, their direction and degree of influence varied between methods. Notably, the method with the greatest power and smallest sample size also differed depending on the correlations. When the correlations were the same between arms, prioritized outcome approaches usually had higher power and smaller sample sizes than other methods.ConclusionsAnticipated correlations and their uncertainty should be considered when selecting statistical methods. Overall, co-primary endpoints remain a reliable option for evaluating the superiority of all components, although they are unsuitable for assessing the balance between treatment effects pointing in different directions. Generalized pairwise comparisons offer a useful alternative to deal with multiple prioritized outcomes, often providing the smallest sample sizes when the correlation structures are shared between the arms.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"301-311"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142920978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exclusion of people from oncology clinical trials based on functional status. 基于功能状态将患者排除在肿瘤临床试验之外。
IF 2.2 3区 医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL Pub Date : 2025-06-01 Epub Date: 2025-01-02 DOI: 10.1177/17407745241304114
Nicole D Agaronnik, Mary Linton B Peters, Lisa I Iezzoni

Background/aims: People with disability have higher rates of cancer, excluding skin cancer, compared with people without disability. Food and Drug Administration draft guidelines from 2024 address use of performance status criteria to determine eligibility for clinical trials, advocating for less restrictive thresholds. We examined the exclusion of people with disability from clinical trials based on performance status and other criteria.

Methods: We reviewed eligibility criteria in approved interventional Phase III and Phase IV oncology clinical trials listed on ClinicalTrails.gov between 1 January 2019 and 31 December 2023. Functional status thresholds were assessed using the Eastern Cooperative Oncology Group Performance Status Scale and Karnofsky Performance Scale in clinical trial eligibility criteria. Qualitative analysis was used to review eligibility criteria relating to functional impairments or disability.

Results: Among 96 oncology clinical trials, approximately 40% had restrictive Eastern Cooperative Oncology Group and Karnofsky Performance Scale thresholds, explicitly including only patients with Eastern Cooperative Oncology Group 0 or 1, or equivalent Karnofsky Performance Scale 70 or greater. Only 20% of studies included patients with Eastern Cooperative Oncology Group 2 and Karnofsky Performance Scale 60. Multiple studies contained miscellaneous eligibility criteria that could potentially exclude people with disability. No studies described making accommodations for people with disability to participate in the clinical trial.

Conclusion: Draft Food and Drug Administration guidelines recommend including patients with Eastern Cooperative Oncology Group scores of 2 and Karnofsky Performance Scale scores of 60 in oncology clinical trials. We found that oncology clinical trials often exclude people with more restrictive performance status scores than the draft Food and Drug Administration guidelines, as well as other criteria that relate to disability. These estimates provide baseline information for assessing how the 2024 Food and Drug Administration guidance, if finalized, might affect the inclusion of people with disability in future trials.

背景/目的:与正常人相比,残障人士患癌症(不包括皮肤癌)的几率更高。美国食品和药物管理局(fda) 2024年的指南草案解决了使用绩效状态标准来确定临床试验资格的问题,提倡减少限制性阈值。我们研究了基于表现状态和其他标准将残疾人排除在临床试验之外的情况。方法:我们回顾了2019年1月1日至2023年12月31日期间ClinicalTrails.gov上列出的已批准的III期和IV期介入肿瘤学临床试验的资格标准。在临床试验资格标准中,使用东部肿瘤合作组绩效状态量表和Karnofsky绩效量表评估功能状态阈值。定性分析用于审查与功能障碍或残疾有关的资格标准。结果:在96个肿瘤学临床试验中,约40%具有限制性的Eastern Cooperative oncology Group和Karnofsky Performance Scale阈值,明确仅包括Eastern Cooperative oncology Group 0或1,或等效Karnofsky Performance Scale 70或更高的患者。只有20%的研究纳入了Eastern Cooperative Oncology Group 2和Karnofsky Performance Scale 60的患者。多项研究包含了各种各样的资格标准,可能会将残疾人排除在外。没有研究描述为残疾人参与临床试验提供便利。结论:美国食品和药物管理局指南草案建议在肿瘤临床试验中纳入东部肿瘤合作组评分为2分和Karnofsky绩效量表评分为60分的患者。我们发现肿瘤临床试验经常排除那些比食品和药物管理局指南草案更严格的表现状态评分的人,以及其他与残疾有关的标准。这些估计为评估2024年美国食品和药物管理局指南(如果最终确定)如何影响将残疾人纳入未来试验提供了基线信息。
{"title":"Exclusion of people from oncology clinical trials based on functional status.","authors":"Nicole D Agaronnik, Mary Linton B Peters, Lisa I Iezzoni","doi":"10.1177/17407745241304114","DOIUrl":"10.1177/17407745241304114","url":null,"abstract":"<p><strong>Background/aims: </strong>People with disability have higher rates of cancer, excluding skin cancer, compared with people without disability. Food and Drug Administration draft guidelines from 2024 address use of performance status criteria to determine eligibility for clinical trials, advocating for less restrictive thresholds. We examined the exclusion of people with disability from clinical trials based on performance status and other criteria.</p><p><strong>Methods: </strong>We reviewed eligibility criteria in approved interventional Phase III and Phase IV oncology clinical trials listed on ClinicalTrails.gov between 1 January 2019 and 31 December 2023. Functional status thresholds were assessed using the Eastern Cooperative Oncology Group Performance Status Scale and Karnofsky Performance Scale in clinical trial eligibility criteria. Qualitative analysis was used to review eligibility criteria relating to functional impairments or disability.</p><p><strong>Results: </strong>Among 96 oncology clinical trials, approximately 40% had restrictive Eastern Cooperative Oncology Group and Karnofsky Performance Scale thresholds, explicitly including only patients with Eastern Cooperative Oncology Group 0 or 1, or equivalent Karnofsky Performance Scale 70 or greater. Only 20% of studies included patients with Eastern Cooperative Oncology Group 2 and Karnofsky Performance Scale 60. Multiple studies contained miscellaneous eligibility criteria that could potentially exclude people with disability. No studies described making accommodations for people with disability to participate in the clinical trial.</p><p><strong>Conclusion: </strong>Draft Food and Drug Administration guidelines recommend including patients with Eastern Cooperative Oncology Group scores of 2 and Karnofsky Performance Scale scores of 60 in oncology clinical trials. We found that oncology clinical trials often exclude people with more restrictive performance status scores than the draft Food and Drug Administration guidelines, as well as other criteria that relate to disability. These estimates provide baseline information for assessing how the 2024 Food and Drug Administration guidance, if finalized, might affect the inclusion of people with disability in future trials.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"367-373"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12199287/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142913834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Challenges in estimating the counterfactual placebo HIV incidence rate from a registration cohort: The PrEPVacc trial. 在注册队列中估计反事实安慰剂HIV发病率的挑战:PrEPVacc试验。
IF 2.2 3区 医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL Pub Date : 2025-06-01 Epub Date: 2024-12-31 DOI: 10.1177/17407745241304721
Sheila Kansiime, Christian Holm Hansen, Eugene Ruzagira, Sheena McCormack, Richard Hayes, David Dunn
<p><p>BackgroundThere is increasing recognition that the interpretation of active-controlled HIV prevention trials should consider the counterfactual placebo HIV incidence rate, that is, the rate that would have been observed if the trial had included a placebo control arm. The PrEPVacc HIV vaccine and pre-exposure prophylaxis trial (NCT04066881) incorporated a pre-trial registration cohort partly for this purpose. In this article, we describe our attempts to model the counterfactual placebo HIV incidence rate from the registration cohort.MethodsPrEPVacc was conducted at four study sites in three African countries. During the set up of the trial, potential participants were invited to join a registration cohort, which included HIV testing every 3 months. The trial included a non-inferiority comparison of two daily, oral pre-exposure prophylaxis regimens (emtricitabine/tenofovir disoproxil fumarate, emtricitabine/tenofovir alafenamide fumarate), administered for a target duration of 26 weeks (until 2 weeks after the third of four vaccinations). We developed a multi-variable Poisson regression model to estimate associations in the registration cohort between HIV incidence and baseline predictors (socio-demographic and behavioural variables) and time-dependent predictors (calendar time, time in follow-up). We then used the estimated regression coefficients together with participant characteristics in the active-controlled pre-exposure prophylaxis trial to predict a counterfactual placebo incidence rate. Sensitivity analyses regarding the effect of calendar period were conducted.ResultsA total of 3255 participants were followed up in the registration cohort between July 2018 and October 2022, and 1512 participants were enrolled in the trial between December 2020 and March 2023. In the registration cohort, 106 participants were diagnosed with HIV over 3638 person-years of follow-up (incidence rate = 2.9/100 person-years of follow-up (95% confidence interval: 2.4-3.5)). The final statistical model included terms for study site, gender, age, occupation, sex after using recreational drugs, time in follow-up, and calendar period. The estimated effect of calendar period was very strong, an overall 37% (95% confidence interval: 19-51) decline per year in adjusted analyses, with evidence that this effect varied by study site. In sensitivity analyses investigating different assumptions about the precise effect of calendar period, the predicted counterfactual placebo incidence ranged between 1.2 and 3.7/100 person-years of follow-up.ConclusionIn principle, the use of a registration cohort is one of the most straightforward and reliable methods for estimating the counterfactual placebo HIV incidence. However, the predictions in PrEPVacc are complicated by an implausibly large calendar time effect, with uncertainty as to whether this can be validly extrapolated over the period of trial follow-up. Other limitations are discussed, along with suggestions for mitiga
背景:越来越多的人认识到,对主动控制HIV预防试验的解释应考虑反事实安慰剂HIV发病率,即如果试验包括安慰剂对照组,本应观察到的发病率。PrEPVacc HIV疫苗和暴露前预防试验(NCT04066881)纳入了一个试验前注册队列,部分原因就是为了这个目的。在这篇文章中,我们描述了我们试图从注册队列中建立反事实安慰剂HIV发病率模型的尝试。方法:PrEPVacc在三个非洲国家的四个研究地点进行。在试验开始期间,潜在的参与者被邀请加入一个注册队列,其中包括每3个月进行一次艾滋病毒检测。该试验包括两种每日口服暴露前预防方案(恩曲他滨/富马酸替诺福韦二氧吡酯,恩曲他滨/富马酸替诺福韦alafenamide)的非效性比较,目标持续时间为26周(直到四次接种中的第三次接种后2周)。我们建立了一个多变量泊松回归模型来估计登记队列中HIV发病率与基线预测因子(社会人口统计学和行为变量)和时间相关预测因子(日历时间、随访时间)之间的关联。然后,我们将估计的回归系数与主动控制的暴露前预防试验中的参与者特征一起用于预测与事实相反的安慰剂发病率。对日历期的影响进行敏感性分析。结果:在2018年7月至2022年10月期间,共有3255名参与者在注册队列中进行了随访,在2020年12月至2023年3月期间,共有1512名参与者入组试验。在注册队列中,106名参与者在3638人-年的随访中被诊断为HIV(发病率= 2.9/100人-年的随访(95%可信区间:2.4-3.5))。最终的统计模型包括研究地点、性别、年龄、职业、使用消遣性药物后的性别、随访时间和日历周期。日历期的估计影响非常强,在调整后的分析中,每年总体下降37%(95%置信区间:19-51),有证据表明这种影响因研究地点而异。在敏感性分析中,研究了对日历期精确影响的不同假设,预测的反事实安慰剂发生率在1.2至3.7/100人年的随访期间。结论:原则上,使用注册队列是估计安慰剂HIV感染率的最直接和可靠的方法之一。然而,PrEPVacc中的预测由于难以置信的大日历时间效应而变得复杂,并且不确定这是否可以在试验随访期间有效地推断出来。讨论了其他限制,以及在未来研究中减轻这些限制的建议。
{"title":"Challenges in estimating the counterfactual placebo HIV incidence rate from a registration cohort: The PrEPVacc trial.","authors":"Sheila Kansiime, Christian Holm Hansen, Eugene Ruzagira, Sheena McCormack, Richard Hayes, David Dunn","doi":"10.1177/17407745241304721","DOIUrl":"10.1177/17407745241304721","url":null,"abstract":"&lt;p&gt;&lt;p&gt;BackgroundThere is increasing recognition that the interpretation of active-controlled HIV prevention trials should consider the counterfactual placebo HIV incidence rate, that is, the rate that would have been observed if the trial had included a placebo control arm. The PrEPVacc HIV vaccine and pre-exposure prophylaxis trial (NCT04066881) incorporated a pre-trial registration cohort partly for this purpose. In this article, we describe our attempts to model the counterfactual placebo HIV incidence rate from the registration cohort.MethodsPrEPVacc was conducted at four study sites in three African countries. During the set up of the trial, potential participants were invited to join a registration cohort, which included HIV testing every 3 months. The trial included a non-inferiority comparison of two daily, oral pre-exposure prophylaxis regimens (emtricitabine/tenofovir disoproxil fumarate, emtricitabine/tenofovir alafenamide fumarate), administered for a target duration of 26 weeks (until 2 weeks after the third of four vaccinations). We developed a multi-variable Poisson regression model to estimate associations in the registration cohort between HIV incidence and baseline predictors (socio-demographic and behavioural variables) and time-dependent predictors (calendar time, time in follow-up). We then used the estimated regression coefficients together with participant characteristics in the active-controlled pre-exposure prophylaxis trial to predict a counterfactual placebo incidence rate. Sensitivity analyses regarding the effect of calendar period were conducted.ResultsA total of 3255 participants were followed up in the registration cohort between July 2018 and October 2022, and 1512 participants were enrolled in the trial between December 2020 and March 2023. In the registration cohort, 106 participants were diagnosed with HIV over 3638 person-years of follow-up (incidence rate = 2.9/100 person-years of follow-up (95% confidence interval: 2.4-3.5)). The final statistical model included terms for study site, gender, age, occupation, sex after using recreational drugs, time in follow-up, and calendar period. The estimated effect of calendar period was very strong, an overall 37% (95% confidence interval: 19-51) decline per year in adjusted analyses, with evidence that this effect varied by study site. In sensitivity analyses investigating different assumptions about the precise effect of calendar period, the predicted counterfactual placebo incidence ranged between 1.2 and 3.7/100 person-years of follow-up.ConclusionIn principle, the use of a registration cohort is one of the most straightforward and reliable methods for estimating the counterfactual placebo HIV incidence. However, the predictions in PrEPVacc are complicated by an implausibly large calendar time effect, with uncertainty as to whether this can be validly extrapolated over the period of trial follow-up. Other limitations are discussed, along with suggestions for mitiga","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"289-300"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12092936/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143028103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pivotal trial characteristics and types of endpoints used to support Food and Drug Administration rare disease drug approvals between 2013 and 2022. 2013年至2022年期间用于支持美国食品和药物管理局罕见疾病药物批准的关键试验特征和终点类型。
IF 2.2 3区 医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL Pub Date : 2025-06-01 Epub Date: 2025-01-25 DOI: 10.1177/17407745241309318
Kyungwan Hong, Bridget Nugent, Abbas Bandukwala, Robert Schuck, York Tomita, Salvatore Pepe, Mary Doi, Scott Winiecki, Kerry Jo Lee

Background/aimsRare disease drug development faces unique challenges, such as genotypic and phenotypic heterogeneity within small patient populations and a lack of established outcome measures for conditions without previously successful drug development programs. These challenges complicate the process of selecting the appropriate trial endpoints and conducting clinical trials in rare diseases. In this descriptive study, we examined novel drug approvals for non-oncologic rare diseases by the U.S. Food and Drug Administration's Center for Drug Evaluation and Research over the past decade and characterized key regulatory and trial design elements with a focus on the primary efficacy endpoint utilized as the basis of approval.MethodsUsing the Food and Drug Administration's Data Analysis Search Host database, we identified novel new drug applications and biologics license applications with orphan drug designation that were approved between 2013 and 2022 for non-oncologic indications. From Food and Drug Administration review documents and other external databases, we examined characteristics of pivotal trials for the included drugs, such as therapeutic area, trial design, and type of primary efficacy endpoints. Differences in trial design elements associated with primary efficacy endpoint type were assessed such as randomization and blinding. Then, we summarized the primary efficacy endpoint types utilized in pivotal trials by therapeutic area, approval pathway, and whether the disease etiology is well defined.ResultsOne hundred and seven drugs that met our inclusion criteria were approved between 2013 and 2022. Assessment of the 107 drug development programs identified 150 pivotal trials that were subsequently analyzed. The pivotal trials were mostly randomized (80%) and blinded (69.3%). Biomarkers (41.1%) and clinical outcomes (42.1%) were commonly utilized as primary efficacy endpoints. Analysis of the use of clinical trial design elements across trials that utilized biomarkers, clinical outcomes, or composite endpoints did not reveal statistically significant differences. The choice of primary efficacy endpoint varied by the drug's therapeutic area, approval pathway, and whether the indicated disease etiology was well defined. For example, biomarkers were commonly selected as primary efficacy endpoints in hematology drug approvals (70.6%), whereas clinical outcomes were commonly selected in neurology drug approvals (69.6%). Further, if the disease etiology was well defined, biomarkers were more commonly used as primary efficacy endpoints in pivotal trials (44.7%) than if the disease etiology was not well defined (27.3%).DiscussionIn the past 10 years, numerous novel drugs have been approved to treat non-oncologic rare diseases in various therapeutic areas. To demonstrate their efficacy for regulatory approval, biomarkers and clinical outcomes were commonly utilized as primary efficacy endpoints. Biomarkers were not only frequently used as s

背景/目的:罕见病药物开发面临着独特的挑战,例如小患者群体的基因型和表型异质性,以及缺乏先前成功的药物开发计划的条件下缺乏既定的结果测量。这些挑战使选择适当的试验终点和进行罕见病临床试验的过程复杂化。在这项描述性研究中,我们检查了过去十年美国食品和药物管理局药物评估和研究中心批准的用于非肿瘤性罕见疾病的新药,并描述了关键的监管和试验设计要素,重点关注作为批准基础的主要疗效终点。方法:使用美国食品和药物管理局的数据分析搜索主机数据库,我们确定了2013年至2022年间批准的用于非肿瘤适应症的孤儿药新药申请和生物制剂许可申请。从美国食品和药物管理局的审查文件和其他外部数据库中,我们检查了所纳入药物的关键试验的特征,如治疗区域、试验设计和主要疗效终点类型。评估了与主要疗效终点类型相关的试验设计要素的差异,如随机化和盲法。然后,我们根据治疗领域、批准途径和疾病病因是否明确,总结了关键试验中使用的主要疗效终点类型。结果:在2013年至2022年期间,有107种药物符合我们的纳入标准。对107个药物开发项目的评估确定了150个关键试验,随后进行了分析。关键试验大多是随机(80%)和盲法(69.3%)。生物标志物(41.1%)和临床结果(42.1%)通常被用作主要疗效终点。对使用生物标志物、临床结果或复合终点的临床试验设计元素的分析没有显示统计学上的显著差异。主要疗效终点的选择取决于药物的治疗领域、批准途径以及适应症病因是否明确。例如,生物标志物通常被选为血液学药物批准的主要疗效终点(70.6%),而临床结果通常被选为神经学药物批准的主要疗效终点(69.6%)。此外,如果疾病病因明确,生物标志物更常被用作关键试验的主要疗效终点(44.7%),而如果疾病病因不明确(27.3%)。讨论:在过去的10年里,许多新药被批准用于治疗各个治疗领域的非肿瘤罕见病。为了证明它们的监管审批有效性,生物标志物和临床结果通常被用作主要疗效终点。生物标志物不仅经常被用作加速审批的替代疗效终点,而且在传统批准的罕见病药物中也经常被用作替代疗效终点。主要疗效终点的选择因治疗领域、批准途径和对疾病病因的了解而异。
{"title":"Pivotal trial characteristics and types of endpoints used to support Food and Drug Administration rare disease drug approvals between 2013 and 2022.","authors":"Kyungwan Hong, Bridget Nugent, Abbas Bandukwala, Robert Schuck, York Tomita, Salvatore Pepe, Mary Doi, Scott Winiecki, Kerry Jo Lee","doi":"10.1177/17407745241309318","DOIUrl":"10.1177/17407745241309318","url":null,"abstract":"<p><p>Background/aimsRare disease drug development faces unique challenges, such as genotypic and phenotypic heterogeneity within small patient populations and a lack of established outcome measures for conditions without previously successful drug development programs. These challenges complicate the process of selecting the appropriate trial endpoints and conducting clinical trials in rare diseases. In this descriptive study, we examined novel drug approvals for non-oncologic rare diseases by the U.S. Food and Drug Administration's Center for Drug Evaluation and Research over the past decade and characterized key regulatory and trial design elements with a focus on the primary efficacy endpoint utilized as the basis of approval.MethodsUsing the Food and Drug Administration's Data Analysis Search Host database, we identified novel new drug applications and biologics license applications with orphan drug designation that were approved between 2013 and 2022 for non-oncologic indications. From Food and Drug Administration review documents and other external databases, we examined characteristics of pivotal trials for the included drugs, such as therapeutic area, trial design, and type of primary efficacy endpoints. Differences in trial design elements associated with primary efficacy endpoint type were assessed such as randomization and blinding. Then, we summarized the primary efficacy endpoint types utilized in pivotal trials by therapeutic area, approval pathway, and whether the disease etiology is well defined.ResultsOne hundred and seven drugs that met our inclusion criteria were approved between 2013 and 2022. Assessment of the 107 drug development programs identified 150 pivotal trials that were subsequently analyzed. The pivotal trials were mostly randomized (80%) and blinded (69.3%). Biomarkers (41.1%) and clinical outcomes (42.1%) were commonly utilized as primary efficacy endpoints. Analysis of the use of clinical trial design elements across trials that utilized biomarkers, clinical outcomes, or composite endpoints did not reveal statistically significant differences. The choice of primary efficacy endpoint varied by the drug's therapeutic area, approval pathway, and whether the indicated disease etiology was well defined. For example, biomarkers were commonly selected as primary efficacy endpoints in hematology drug approvals (70.6%), whereas clinical outcomes were commonly selected in neurology drug approvals (69.6%). Further, if the disease etiology was well defined, biomarkers were more commonly used as primary efficacy endpoints in pivotal trials (44.7%) than if the disease etiology was not well defined (27.3%).DiscussionIn the past 10 years, numerous novel drugs have been approved to treat non-oncologic rare diseases in various therapeutic areas. To demonstrate their efficacy for regulatory approval, biomarkers and clinical outcomes were commonly utilized as primary efficacy endpoints. Biomarkers were not only frequently used as s","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"352-360"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143036919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A multilevel framework for recruitment and retention in implementation trials: An illustrative example. 实施试验中招聘和留用的多层次框架:一个说明性例子。
IF 2.2 3区 医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL Pub Date : 2025-06-01 Epub Date: 2025-01-10 DOI: 10.1177/17407745241307948
Nathaniel J Williams, Alexandra E Gomes, Nallely R Vega, Susan Esp, Mimi Choy-Brown, Rinad S Beidas

Background: Implementation and hybrid effectiveness-implementation trials aspire to speed the translation of science into practice by generating crucial evidence for improving the uptake of effective health interventions. By design, they pose unique recruitment and retention challenges due to their aims, units of analysis, and sampling plans, which typically require many clinical sites (i.e. often 20 or more) and participation by individuals who are related across multiple levels (e.g. linked organizational leaders, clinicians, and patients). In this article, we present a new multilevel, theory-informed, and relationship-centered framework for conceptualizing recruitment and retention in implementation and hybrid effectiveness-implementation trials which integrates and builds on prior work on recruitment and retention strategies in patient-focused trials. We describe the framework's application in the Working to Implement and Sustain Digital Outcome Measures hybrid type III trial, which occurred in part during the COVID-19 pandemic.

Methods: Recruitment for the Working to Implement and Sustain Digital Outcome Measures trial occurred from October 2019 to February 2022. Development of recruitment and retention strategies was guided by a newly developed multilevel framework, which targeted the capability, opportunity, and motivation of organizational leaders, clinicians, patient-facing administrative staff, and patients to engage in research. A structured assessment guide was developed and applied to refine recruitment and retention approaches throughout the trial. We describe the framework and its application amid the onset of the COVID-19 pandemic which required rapid adjustments to address numerous barriers.

Results: The Working to Implement and Sustain Digital Outcome Measures trial enrolled 21 outpatient clinics in three US states, incorporating 252 clinicians and 686 caregivers of youth (95% of patient recruitment target) across two distinct phases. Data completion rates for organizational leaders and clinicians averaged 90% over five waves spanning 18 months, despite the onset of the COVID pandemic. Caregiver completion rates of monthly follow-up assessments ranged from 80%-88% across 6 months. This article presents the multilevel framework, assessment guide, and strategies used to achieve recruitment and retention targets at each level.

Conclusion: We conducted a multi-state hybrid type III effectiveness-implementation trial that maintained high recruitment and retention across all relevant levels amid a global pandemic. The newly developed multilevel recruitment and retention framework and assessment guide presented here, which integrates behavioral theory, a relationship-focused lens, and evidence-based strategies for participant recruitment and retention at multiple levels, can be adapted and used by other researchers for implementation, hybrid, and m

背景:实施和混合有效性-实施试验旨在通过为改进有效卫生干预措施的采用提供关键证据,加速将科学转化为实践。在设计上,由于它们的目标、分析单位和抽样计划,它们提出了独特的招聘和保留挑战,这些挑战通常需要许多临床站点(即通常是20个或更多),并且需要跨多个级别(例如,关联的组织领导、临床医生和患者)相关个人的参与。在这篇文章中,我们提出了一个新的多层次的、有理论依据的、以关系为中心的框架,用于概念化实施和混合有效性实施试验中的招募和保留,该框架整合并建立在以患者为中心的试验中招募和保留策略的先前工作之上。我们描述了该框架在实施和维持数字成果措施混合III型试验中的应用,该试验部分发生在COVID-19大流行期间。方法:在2019年10月至2022年2月期间招募实施和维持数字结果措施试验的人员。招聘和保留策略的制定以新开发的多层次框架为指导,该框架针对组织领导者、临床医生、面向患者的行政人员和患者参与研究的能力、机会和动机。在整个试验过程中,制定并应用了结构化评估指南,以改进招聘和保留方法。我们描述了该框架及其在COVID-19大流行爆发期间的应用,该大流行需要快速调整以解决众多障碍。结果:努力实施和维持数字结果测量试验在美国三个州招募了21家门诊诊所,包括252名临床医生和686名青年护理人员(95%的患者招募目标),分为两个不同的阶段。尽管出现了COVID - 19大流行,但组织领导者和临床医生的数据完成率在18个月的五次浪潮中平均为90%。护理人员每月随访评估的完成率在6个月内从80%-88%不等。本文介绍了用于实现每个级别的招聘和保留目标的多层框架、评估指南和策略。结论:我们进行了一项多州混合III型有效性实施试验,该试验在全球大流行期间保持了所有相关级别的高招募和保留率。本文提出的新开发的多层次招募和保留框架和评估指南,整合了行为理论、以关系为中心的视角和基于证据的多层次参与者招募和保留策略,可以被其他研究人员用于实施、混合、多层次实用试验以及其他实施研究。
{"title":"A multilevel framework for recruitment and retention in implementation trials: An illustrative example.","authors":"Nathaniel J Williams, Alexandra E Gomes, Nallely R Vega, Susan Esp, Mimi Choy-Brown, Rinad S Beidas","doi":"10.1177/17407745241307948","DOIUrl":"10.1177/17407745241307948","url":null,"abstract":"<p><strong>Background: </strong>Implementation and hybrid effectiveness-implementation trials aspire to speed the translation of science into practice by generating crucial evidence for improving the uptake of effective health interventions. By design, they pose unique recruitment and retention challenges due to their aims, units of analysis, and sampling plans, which typically require many clinical sites (i.e. often 20 or more) and participation by individuals who are related across multiple levels (e.g. linked organizational leaders, clinicians, and patients). In this article, we present a new multilevel, theory-informed, and relationship-centered framework for conceptualizing recruitment and retention in implementation and hybrid effectiveness-implementation trials which integrates and builds on prior work on recruitment and retention strategies in patient-focused trials. We describe the framework's application in the Working to Implement and Sustain Digital Outcome Measures hybrid type III trial, which occurred in part during the COVID-19 pandemic.</p><p><strong>Methods: </strong>Recruitment for the Working to Implement and Sustain Digital Outcome Measures trial occurred from October 2019 to February 2022. Development of recruitment and retention strategies was guided by a newly developed multilevel framework, which targeted the capability, opportunity, and motivation of organizational leaders, clinicians, patient-facing administrative staff, and patients to engage in research. A structured assessment guide was developed and applied to refine recruitment and retention approaches throughout the trial. We describe the framework and its application amid the onset of the COVID-19 pandemic which required rapid adjustments to address numerous barriers.</p><p><strong>Results: </strong>The Working to Implement and Sustain Digital Outcome Measures trial enrolled 21 outpatient clinics in three US states, incorporating 252 clinicians and 686 caregivers of youth (95% of patient recruitment target) across two distinct phases. Data completion rates for organizational leaders and clinicians averaged 90% over five waves spanning 18 months, despite the onset of the COVID pandemic. Caregiver completion rates of monthly follow-up assessments ranged from 80%-88% across 6 months. This article presents the multilevel framework, assessment guide, and strategies used to achieve recruitment and retention targets at each level.</p><p><strong>Conclusion: </strong>We conducted a multi-state hybrid type III effectiveness-implementation trial that maintained high recruitment and retention across all relevant levels amid a global pandemic. The newly developed multilevel recruitment and retention framework and assessment guide presented here, which integrates behavioral theory, a relationship-focused lens, and evidence-based strategies for participant recruitment and retention at multiple levels, can be adapted and used by other researchers for implementation, hybrid, and m","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"325-341"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12094903/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142964033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating the impact of stratification on the power and cross-arm balance of randomized phase 2 clinical trials. 评估分层对随机2期临床试验的疗效和横臂平衡的影响。
IF 2.2 3区 医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL Pub Date : 2025-06-01 Epub Date: 2025-01-15 DOI: 10.1177/17407745241304065
Anna Moseley, Michael LeBlanc, Boris Freidlin, Rory M Shallis, Amer M Zeidan, David A Sallman, Harry P Erba, Richard F Little, Megan Othus

Background/aimsRandomized clinical trials often use stratification to ensure balance between arms. Analysis of primary endpoints of these trials typically uses a "stratified analysis," in which analyses are performed separately in each subgroup defined by the stratification factors, and those separate analyses are weighted and combined. In the phase 3 setting, stratified analyses based on a small number of stratification factors can provide a small increase in power. The impact on power and type-1 error of stratification in the setting of smaller sample sizes as in randomized phase 2 trials has not been well characterized.MethodsWe performed computational studies to characterize the power and cross-arm balance of modestly sized clinical trials (less than 170 patients) with varying numbers of stratification factors (0-6), sample sizes, randomization ratios (1:1 vs 2:1), and randomization methods (dynamic balancing vs stratified block).ResultsWe found that the power of unstratified analyses was minimally impacted by the number of stratification factors used in randomization. Analyses stratified by 1-3 factors maintained power over 80%, while power dropped below 80% when four or more stratification factors were used. These trends held regardless of sample size, randomization ratio, and randomization method. For a given randomization ratio and sample size, increasing the number of factors used in randomization had an adverse impact on cross-arm balance. Stratified block randomization performed worse than dynamic balancing with respect to cross-arm balance when three or more stratification factors were used.ConclusionStratified analyses can decrease power in the setting of phase 2 trials when the number of patients in a stratification subgroup is small.

背景/目的:随机临床试验通常采用分层来确保两臂之间的平衡。对这些试验的主要终点的分析通常使用“分层分析”,在分层因素定义的每个亚组中分别进行分析,并对这些单独的分析进行加权和合并。在第3阶段设置中,基于少量分层因素的分层分析可以提供少量的功率增加。在较小样本量的随机2期试验中,分层对功效和1型误差的影响尚未得到很好的表征。方法:我们进行了计算研究,以表征中等规模临床试验(少于170例患者)的功率和横臂平衡,这些试验具有不同数量的分层因素(0-6)、样本量、随机化比例(1:1 vs 2:1)和随机化方法(动态平衡vs分层块)。结果:我们发现,随机化中使用的分层因素数量对非分层分析的影响最小。采用1-3个因素分层的分析,准确率保持在80%以上,而采用4个或更多因素分层的分析,准确率下降到80%以下。这些趋势与样本量、随机化比例和随机化方法无关。对于给定的随机化比例和样本量,增加随机化中使用的因素数量会对横臂平衡产生不利影响。当使用三个或更多分层因素时,分层块随机化在横臂平衡方面的表现比动态平衡差。结论:当分层亚组中患者数量较少时,分层分析可能会降低2期试验的有效性。
{"title":"Evaluating the impact of stratification on the power and cross-arm balance of randomized phase 2 clinical trials.","authors":"Anna Moseley, Michael LeBlanc, Boris Freidlin, Rory M Shallis, Amer M Zeidan, David A Sallman, Harry P Erba, Richard F Little, Megan Othus","doi":"10.1177/17407745241304065","DOIUrl":"10.1177/17407745241304065","url":null,"abstract":"<p><p>Background/aimsRandomized clinical trials often use stratification to ensure balance between arms. Analysis of primary endpoints of these trials typically uses a \"stratified analysis,\" in which analyses are performed separately in each subgroup defined by the stratification factors, and those separate analyses are weighted and combined. In the phase 3 setting, stratified analyses based on a small number of stratification factors can provide a small increase in power. The impact on power and type-1 error of stratification in the setting of smaller sample sizes as in randomized phase 2 trials has not been well characterized.MethodsWe performed computational studies to characterize the power and cross-arm balance of modestly sized clinical trials (less than 170 patients) with varying numbers of stratification factors (0-6), sample sizes, randomization ratios (1:1 vs 2:1), and randomization methods (dynamic balancing vs stratified block).ResultsWe found that the power of unstratified analyses was minimally impacted by the number of stratification factors used in randomization. Analyses stratified by 1-3 factors maintained power over 80%, while power dropped below 80% when four or more stratification factors were used. These trends held regardless of sample size, randomization ratio, and randomization method. For a given randomization ratio and sample size, increasing the number of factors used in randomization had an adverse impact on cross-arm balance. Stratified block randomization performed worse than dynamic balancing with respect to cross-arm balance when three or more stratification factors were used.ConclusionStratified analyses can decrease power in the setting of phase 2 trials when the number of patients in a stratification subgroup is small.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"361-366"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12143638/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143001560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Clinical Trials
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1