{"title":"Risk of Bias When Using Early Failure Criteria in Randomized Clinical Trials With Stereoacuity Outcomes.","authors":"Meet Panjwani, Jonathan M Holmes","doi":"10.1167/tvst.13.12.1","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>The purpose of this study was to explore the effects of early failure criteria for participants in randomized clinical trials (RCTs) on overall trial conclusions.</p><p><strong>Method: </strong>We simulated 10,000 hypothetical RCTs with 2 treatments, 1 linear improvement and 1 with increasing rate of improvement and 6 follow-up visits. Each RCT had 400 participants, with the same baseline stereoacuity distribution. We incorporated random test-retest noise for every visit, and scores were rounded to the nearest observable score. Early failure was defined as worsening of two or more levels. We compared mean outcome stereoacuity between treatment groups, with and without the failure rule, using the two-sample t-test and the proportion of erroneous RCTs (significantly different mean outcome values, where truth is known to be no different). Sensitivity analyses were performed to explore the influence of sample size, baseline distribution of stereoacuity, overall magnitude of mean improvement, magnitude of change for the failure rule, and distribution of noise.</p><p><strong>Results: </strong>A greater proportion of 10,000 simulated RCTs had an erroneous mean difference in outcome with the early failure rule than without (5.49%, 95% confidence interval [CI] = 5.05% to 5.94% vs. 0, 0%, 95% CI = 0% to 0.000001%, difference 5.49%, P < 0.0001). Sensitivity analysis revealed that increased sample size and wider distribution of noise had the greatest influence on increasing proportions of erroneous RCT conclusions.</p><p><strong>Conclusions: </strong>Study designs incorporating participant-level early failure rules increase the risk of erroneous RCT conclusions and should be avoided.</p><p><strong>Translational relevance: </strong>We provide data informing the design of future clinical trials. Earlier failure rules at the participant level should be avoided.</p>","PeriodicalId":23322,"journal":{"name":"Translational Vision Science & Technology","volume":"13 12","pages":"1"},"PeriodicalIF":2.6000,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11614026/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational Vision Science & Technology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1167/tvst.13.12.1","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: The purpose of this study was to explore the effects of early failure criteria for participants in randomized clinical trials (RCTs) on overall trial conclusions.
Method: We simulated 10,000 hypothetical RCTs with 2 treatments, 1 linear improvement and 1 with increasing rate of improvement and 6 follow-up visits. Each RCT had 400 participants, with the same baseline stereoacuity distribution. We incorporated random test-retest noise for every visit, and scores were rounded to the nearest observable score. Early failure was defined as worsening of two or more levels. We compared mean outcome stereoacuity between treatment groups, with and without the failure rule, using the two-sample t-test and the proportion of erroneous RCTs (significantly different mean outcome values, where truth is known to be no different). Sensitivity analyses were performed to explore the influence of sample size, baseline distribution of stereoacuity, overall magnitude of mean improvement, magnitude of change for the failure rule, and distribution of noise.
Results: A greater proportion of 10,000 simulated RCTs had an erroneous mean difference in outcome with the early failure rule than without (5.49%, 95% confidence interval [CI] = 5.05% to 5.94% vs. 0, 0%, 95% CI = 0% to 0.000001%, difference 5.49%, P < 0.0001). Sensitivity analysis revealed that increased sample size and wider distribution of noise had the greatest influence on increasing proportions of erroneous RCT conclusions.
Conclusions: Study designs incorporating participant-level early failure rules increase the risk of erroneous RCT conclusions and should be avoided.
Translational relevance: We provide data informing the design of future clinical trials. Earlier failure rules at the participant level should be avoided.
期刊介绍:
Translational Vision Science & Technology (TVST), an official journal of the Association for Research in Vision and Ophthalmology (ARVO), an international organization whose purpose is to advance research worldwide into understanding the visual system and preventing, treating and curing its disorders, is an online, open access, peer-reviewed journal emphasizing multidisciplinary research that bridges the gap between basic research and clinical care. A highly qualified and diverse group of Associate Editors and Editorial Board Members is led by Editor-in-Chief Marco Zarbin, MD, PhD, FARVO.
The journal covers a broad spectrum of work, including but not limited to:
Applications of stem cell technology for regenerative medicine,
Development of new animal models of human diseases,
Tissue bioengineering,
Chemical engineering to improve virus-based gene delivery,
Nanotechnology for drug delivery,
Design and synthesis of artificial extracellular matrices,
Development of a true microsurgical operating environment,
Refining data analysis algorithms to improve in vivo imaging technology,
Results of Phase 1 clinical trials,
Reverse translational ("bedside to bench") research.
TVST seeks manuscripts from scientists and clinicians with diverse backgrounds ranging from basic chemistry to ophthalmic surgery that will advance or change the way we understand and/or treat vision-threatening diseases. TVST encourages the use of color, multimedia, hyperlinks, program code and other digital enhancements.