Anna Moseley, Michael LeBlanc, Boris Freidlin, Rory M Shallis, Amer M Zeidan, David A Sallman, Harry P Erba, Richard F Little, Megan Othus
{"title":"Evaluating the impact of stratification on the power and cross-arm balance of randomized phase 2 clinical trials.","authors":"Anna Moseley, Michael LeBlanc, Boris Freidlin, Rory M Shallis, Amer M Zeidan, David A Sallman, Harry P Erba, Richard F Little, Megan Othus","doi":"10.1177/17407745241304065","DOIUrl":null,"url":null,"abstract":"<p><strong>Background/aims: </strong>Randomized clinical trials often use stratification to ensure balance between arms. Analysis of primary endpoints of these trials typically uses a \"stratified analysis,\" in which analyses are performed separately in each subgroup defined by the stratification factors, and those separate analyses are weighted and combined. In the phase 3 setting, stratified analyses based on a small number of stratification factors can provide a small increase in power. The impact on power and type-1 error of stratification in the setting of smaller sample sizes as in randomized phase 2 trials has not been well characterized.</p><p><strong>Methods: </strong>We performed computational studies to characterize the power and cross-arm balance of modestly sized clinical trials (less than 170 patients) with varying numbers of stratification factors (0-6), sample sizes, randomization ratios (1:1 vs 2:1), and randomization methods (dynamic balancing vs stratified block).</p><p><strong>Results: </strong>We found that the power of unstratified analyses was minimally impacted by the number of stratification factors used in randomization. Analyses stratified by 1-3 factors maintained power over 80%, while power dropped below 80% when four or more stratification factors were used. These trends held regardless of sample size, randomization ratio, and randomization method. For a given randomization ratio and sample size, increasing the number of factors used in randomization had an adverse impact on cross-arm balance. Stratified block randomization performed worse than dynamic balancing with respect to cross-arm balance when three or more stratification factors were used.</p><p><strong>Conclusion: </strong>Stratified analyses can decrease power in the setting of phase 2 trials when the number of patients in a stratification subgroup is small.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241304065"},"PeriodicalIF":2.2000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Trials","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/17407745241304065","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background/aims: Randomized clinical trials often use stratification to ensure balance between arms. Analysis of primary endpoints of these trials typically uses a "stratified analysis," in which analyses are performed separately in each subgroup defined by the stratification factors, and those separate analyses are weighted and combined. In the phase 3 setting, stratified analyses based on a small number of stratification factors can provide a small increase in power. The impact on power and type-1 error of stratification in the setting of smaller sample sizes as in randomized phase 2 trials has not been well characterized.
Methods: We performed computational studies to characterize the power and cross-arm balance of modestly sized clinical trials (less than 170 patients) with varying numbers of stratification factors (0-6), sample sizes, randomization ratios (1:1 vs 2:1), and randomization methods (dynamic balancing vs stratified block).
Results: We found that the power of unstratified analyses was minimally impacted by the number of stratification factors used in randomization. Analyses stratified by 1-3 factors maintained power over 80%, while power dropped below 80% when four or more stratification factors were used. These trends held regardless of sample size, randomization ratio, and randomization method. For a given randomization ratio and sample size, increasing the number of factors used in randomization had an adverse impact on cross-arm balance. Stratified block randomization performed worse than dynamic balancing with respect to cross-arm balance when three or more stratification factors were used.
Conclusion: Stratified analyses can decrease power in the setting of phase 2 trials when the number of patients in a stratification subgroup is small.
背景/目的:随机临床试验通常采用分层来确保两臂之间的平衡。对这些试验的主要终点的分析通常使用“分层分析”,在分层因素定义的每个亚组中分别进行分析,并对这些单独的分析进行加权和合并。在第3阶段设置中,基于少量分层因素的分层分析可以提供少量的功率增加。在较小样本量的随机2期试验中,分层对功效和1型误差的影响尚未得到很好的表征。方法:我们进行了计算研究,以表征中等规模临床试验(少于170例患者)的功率和横臂平衡,这些试验具有不同数量的分层因素(0-6)、样本量、随机化比例(1:1 vs 2:1)和随机化方法(动态平衡vs分层块)。结果:我们发现,随机化中使用的分层因素数量对非分层分析的影响最小。采用1-3个因素分层的分析,准确率保持在80%以上,而采用4个或更多因素分层的分析,准确率下降到80%以下。这些趋势与样本量、随机化比例和随机化方法无关。对于给定的随机化比例和样本量,增加随机化中使用的因素数量会对横臂平衡产生不利影响。当使用三个或更多分层因素时,分层块随机化在横臂平衡方面的表现比动态平衡差。结论:当分层亚组中患者数量较少时,分层分析可能会降低2期试验的有效性。
期刊介绍:
Clinical Trials is dedicated to advancing knowledge on the design and conduct of clinical trials related research methodologies. Covering the design, conduct, analysis, synthesis and evaluation of key methodologies, the journal remains on the cusp of the latest topics, including ethics, regulation and policy impact.