Pub Date : 2025-06-01Epub Date: 2025-01-02DOI: 10.1177/17407745241304059
Maciej Fronc, Michał Jakubczyk, Sharon B Love, Susan Talbot, Timothy Rolfe
Background: Clinical trials handle a huge amount of data which can be used during the trial to improve the ongoing study conduct. It is suggested by regulators to implement the remote approach to evaluate clinical trials by analysing collected data. Central statistical monitoring helps to achieve that by employing quantitative methods, the results of which are a basis for decision-making on quality issues.
Methods: This article presents a scoping review which is based on a systematic and iterative approach to identify and synthesise literature on central statistical monitoring methodology. In particular, we investigated the decision-making processes (with emphasis on quality issues) of central statistical monitoring methodology and its place in the clinical trial workflow. We reviewed papers published over the last 10 years in two databases (Scopus and Web of Science) with a focus on data mining algorithms of central statistical monitoring and its benefit to the quality of trials.
Results: As a result, 24 scientific papers were selected for this review, and they consider central statistical monitoring at two levels. First, the perspective of the central statistical monitoring process and its location in the study conduct in terms of quality issues. Second, central statistical monitoring methods categorised into practices applied in the industry, and innovative methods in development. The established methods are discussed through the prism of categories of their usage. In turn, the innovations refer to either research on new methods or extensions to existing ones.
Discussion: Our review suggests directions for further research into central statistical monitoring methodology - including increased application of multivariate analysis and using advanced distance metrics - and guidance on how central statistical monitoring operates in response to regulators' requirements.
背景:临床试验处理大量的数据,这些数据可以在试验期间使用,以改善正在进行的研究行为。监管机构建议实施远程方法,通过分析收集的数据来评估临床试验。中央统计监测通过采用定量方法帮助实现这一目标,其结果是就质量问题作出决策的基础。方法:本文提出了一种基于系统和迭代方法的范围审查,以识别和综合有关中央统计监测方法的文献。特别是,我们调查了中央统计监测方法的决策过程(重点是质量问题)及其在临床试验工作流程中的地位。我们回顾了过去10年在两个数据库(Scopus和Web of Science)中发表的论文,重点关注中央统计监测的数据挖掘算法及其对试验质量的好处。结果:本次综述选取了24篇科学论文,考虑了两个层面的中央统计监测。首先,从中央统计监测过程的角度及其在研究开展方面存在的质量问题。二是将中央统计监测方法分类为行业应用的实践方法和发展中的创新方法。通过其使用类别的棱镜来讨论已建立的方法。反过来,创新指的是对新方法的研究或对现有方法的扩展。讨论:我们的综述提出了进一步研究中央统计监测方法的方向——包括增加多变量分析的应用和使用先进的距离度量——以及关于中央统计监测如何响应监管机构要求的指导。
{"title":"Central statistical monitoring in clinical trial management: A scoping review.","authors":"Maciej Fronc, Michał Jakubczyk, Sharon B Love, Susan Talbot, Timothy Rolfe","doi":"10.1177/17407745241304059","DOIUrl":"10.1177/17407745241304059","url":null,"abstract":"<p><strong>Background: </strong>Clinical trials handle a huge amount of data which can be used during the trial to improve the ongoing study conduct. It is suggested by regulators to implement the remote approach to evaluate clinical trials by analysing collected data. Central statistical monitoring helps to achieve that by employing quantitative methods, the results of which are a basis for decision-making on quality issues.</p><p><strong>Methods: </strong>This article presents a scoping review which is based on a systematic and iterative approach to identify and synthesise literature on central statistical monitoring methodology. In particular, we investigated the decision-making processes (with emphasis on quality issues) of central statistical monitoring methodology and its place in the clinical trial workflow. We reviewed papers published over the last 10 years in two databases (Scopus and Web of Science) with a focus on data mining algorithms of central statistical monitoring and its benefit to the quality of trials.</p><p><strong>Results: </strong>As a result, 24 scientific papers were selected for this review, and they consider central statistical monitoring at two levels. First, the perspective of the central statistical monitoring process and its location in the study conduct in terms of quality issues. Second, central statistical monitoring methods categorised into practices applied in the industry, and innovative methods in development. The established methods are discussed through the prism of categories of their usage. In turn, the innovations refer to either research on new methods or extensions to existing ones.</p><p><strong>Discussion: </strong>Our review suggests directions for further research into central statistical monitoring methodology - including increased application of multivariate analysis and using advanced distance metrics - and guidance on how central statistical monitoring operates in response to regulators' requirements.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"342-351"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7617700/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142913824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-03-02DOI: 10.1177/17407745251313925
Yoseph Caraco, Matthew G Johnson, Joseph A Chiarappa, Brian M Maas, Julie A Stone, Matthew L Rizk, Mary Vesnesky, Julie M Strizki, Angela Williams-Diaz, Michelle L Brown, Patricia Carmelitano, Hong Wan, Alison Pedley, Akshita Chawla, Dominik J Wolf, Jay A Grobler, Amanda Paschke, Carisa De Anda
<p><p>BackgroundPre-specified interim analyses allow for more timely evaluation of efficacy or futility, potentially accelerating decision-making on an investigational intervention. In such an analysis, the randomized, double-blind MOVe-OUT trial demonstrated superiority of molnupiravir over placebo for outpatient treatment of COVID-19 in high-risk patients. In the full analysis population, the point estimate of the treatment difference in the primary endpoint was notably lower than at the interim analysis. We conducted a comprehensive assessment to investigate this unexpected difference in treatment effect size, with the goal of informing future clinical research evaluating treatments for rapidly evolving infectious diseases.MethodsThe modified intention-to-treat population of the MOVe-OUT trial was divided into an interim analysis cohort (i.e. all participants included in the interim analysis; prospectively defined) and a post-interim analysis cohort (i.e. all remaining participants; retrospectively defined). Baseline characteristics (including many well-established prognostic factors for disease progression), clinical outcomes, and virologic outcomes were retrospectively evaluated. The impact of changes in baseline characteristics over time was explored using logistic regression modeling and simulations.ResultsBaseline characteristics were well-balanced between arms overall. However, between- and within-arm differences in known prognostic baseline factors (e.g. comorbidities, SARS-CoV-2 viral load, and anti-SARS-CoV-2 antibody status) were observed for the interim and post-interim analysis cohorts. For the individual factors, these differences were generally minor and otherwise not notable; as the trial progressed, however, these shifts in combination increasingly favored the placebo arm across most of the evaluated factors in the post-interim cohort. Model-based simulations confirmed that the reduction in effect size could be accounted for by these longitudinal trends toward a lower-risk study population among placebo participants. Infectivity and viral load data confirmed that molnupiravir's antiviral activity was consistent across both cohorts, which were heavily dominated by different viral clades (reflecting the rapid SARS-CoV-2 evolution).DiscussionThe cumulative effect of randomly occurring minor differences in prognostic baseline characteristics within and between arms over time, rather than virologic factors such as reduced activity of molnupiravir against evolving variants, likely impacted the observed outcomes. Our results have broader implications for group sequential trials seeking to evaluate treatments for rapidly emerging pathogens. During dynamic epidemic or pandemic conditions, adaptive trials should be designed and interpreted especially carefully, considering that they will likely rapidly enroll a large post-interim overrun population and that even small longitudinal shifts across multiple baseline variables can disproporti
{"title":"Impact of differences between interim and post-interim analysis populations on outcomes of a group sequential trial: Example of the MOVe-OUT study.","authors":"Yoseph Caraco, Matthew G Johnson, Joseph A Chiarappa, Brian M Maas, Julie A Stone, Matthew L Rizk, Mary Vesnesky, Julie M Strizki, Angela Williams-Diaz, Michelle L Brown, Patricia Carmelitano, Hong Wan, Alison Pedley, Akshita Chawla, Dominik J Wolf, Jay A Grobler, Amanda Paschke, Carisa De Anda","doi":"10.1177/17407745251313925","DOIUrl":"10.1177/17407745251313925","url":null,"abstract":"<p><p>BackgroundPre-specified interim analyses allow for more timely evaluation of efficacy or futility, potentially accelerating decision-making on an investigational intervention. In such an analysis, the randomized, double-blind MOVe-OUT trial demonstrated superiority of molnupiravir over placebo for outpatient treatment of COVID-19 in high-risk patients. In the full analysis population, the point estimate of the treatment difference in the primary endpoint was notably lower than at the interim analysis. We conducted a comprehensive assessment to investigate this unexpected difference in treatment effect size, with the goal of informing future clinical research evaluating treatments for rapidly evolving infectious diseases.MethodsThe modified intention-to-treat population of the MOVe-OUT trial was divided into an interim analysis cohort (i.e. all participants included in the interim analysis; prospectively defined) and a post-interim analysis cohort (i.e. all remaining participants; retrospectively defined). Baseline characteristics (including many well-established prognostic factors for disease progression), clinical outcomes, and virologic outcomes were retrospectively evaluated. The impact of changes in baseline characteristics over time was explored using logistic regression modeling and simulations.ResultsBaseline characteristics were well-balanced between arms overall. However, between- and within-arm differences in known prognostic baseline factors (e.g. comorbidities, SARS-CoV-2 viral load, and anti-SARS-CoV-2 antibody status) were observed for the interim and post-interim analysis cohorts. For the individual factors, these differences were generally minor and otherwise not notable; as the trial progressed, however, these shifts in combination increasingly favored the placebo arm across most of the evaluated factors in the post-interim cohort. Model-based simulations confirmed that the reduction in effect size could be accounted for by these longitudinal trends toward a lower-risk study population among placebo participants. Infectivity and viral load data confirmed that molnupiravir's antiviral activity was consistent across both cohorts, which were heavily dominated by different viral clades (reflecting the rapid SARS-CoV-2 evolution).DiscussionThe cumulative effect of randomly occurring minor differences in prognostic baseline characteristics within and between arms over time, rather than virologic factors such as reduced activity of molnupiravir against evolving variants, likely impacted the observed outcomes. Our results have broader implications for group sequential trials seeking to evaluate treatments for rapidly emerging pathogens. During dynamic epidemic or pandemic conditions, adaptive trials should be designed and interpreted especially carefully, considering that they will likely rapidly enroll a large post-interim overrun population and that even small longitudinal shifts across multiple baseline variables can disproporti","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"312-324"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143536668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
BackgroundIn randomized clinical trials, multiple-testing procedures, composite endpoints, and prioritized outcome approaches are increasingly used to analyze multiple binary outcomes. Previous studies have shown that correlations between outcomes influence their sample size requirements. Although sample size is an important factor affecting the choice of statistical methods, the power and required sample sizes of methods for analyzing multiple binary outcomes have yet to be compared under the influence of outcome correlations.MethodsWe conducted simulations to evaluate the power of co-primary and multiple primary endpoints, composite endpoints, and prioritized outcome approaches based on generalized pairwise comparisons with varying correlations, marginal proportions, treatment effects, and number of outcomes. We then conducted a case study on sample size using a clinical trial of a migraine treatment as an example.ResultsThe correlations significantly affected the statistical power and sample size of composite endpoints. The power and sample size of co-primary endpoints remained relatively stable across different correlations, though their power declined substantially when treatment effects were opposite on some components or more than two components were present. While the correlations influenced the power and sample size of all methods assessed, their direction and degree of influence varied between methods. Notably, the method with the greatest power and smallest sample size also differed depending on the correlations. When the correlations were the same between arms, prioritized outcome approaches usually had higher power and smaller sample sizes than other methods.ConclusionsAnticipated correlations and their uncertainty should be considered when selecting statistical methods. Overall, co-primary endpoints remain a reliable option for evaluating the superiority of all components, although they are unsuitable for assessing the balance between treatment effects pointing in different directions. Generalized pairwise comparisons offer a useful alternative to deal with multiple prioritized outcomes, often providing the smallest sample sizes when the correlation structures are shared between the arms.
{"title":"Impact of correlation structure on sample size requirements of statistical methods for multiple binary outcomes: A simulation study.","authors":"Kanako Fuyama, Kentaro Sakamaki, Kohei Uemura, Isao Yokota","doi":"10.1177/17407745241304706","DOIUrl":"10.1177/17407745241304706","url":null,"abstract":"<p><p>BackgroundIn randomized clinical trials, multiple-testing procedures, composite endpoints, and prioritized outcome approaches are increasingly used to analyze multiple binary outcomes. Previous studies have shown that correlations between outcomes influence their sample size requirements. Although sample size is an important factor affecting the choice of statistical methods, the power and required sample sizes of methods for analyzing multiple binary outcomes have yet to be compared under the influence of outcome correlations.MethodsWe conducted simulations to evaluate the power of co-primary and multiple primary endpoints, composite endpoints, and prioritized outcome approaches based on generalized pairwise comparisons with varying correlations, marginal proportions, treatment effects, and number of outcomes. We then conducted a case study on sample size using a clinical trial of a migraine treatment as an example.ResultsThe correlations significantly affected the statistical power and sample size of composite endpoints. The power and sample size of co-primary endpoints remained relatively stable across different correlations, though their power declined substantially when treatment effects were opposite on some components or more than two components were present. While the correlations influenced the power and sample size of all methods assessed, their direction and degree of influence varied between methods. Notably, the method with the greatest power and smallest sample size also differed depending on the correlations. When the correlations were the same between arms, prioritized outcome approaches usually had higher power and smaller sample sizes than other methods.ConclusionsAnticipated correlations and their uncertainty should be considered when selecting statistical methods. Overall, co-primary endpoints remain a reliable option for evaluating the superiority of all components, although they are unsuitable for assessing the balance between treatment effects pointing in different directions. Generalized pairwise comparisons offer a useful alternative to deal with multiple prioritized outcomes, often providing the smallest sample sizes when the correlation structures are shared between the arms.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"301-311"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142920978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-01-02DOI: 10.1177/17407745241304114
Nicole D Agaronnik, Mary Linton B Peters, Lisa I Iezzoni
Background/aims: People with disability have higher rates of cancer, excluding skin cancer, compared with people without disability. Food and Drug Administration draft guidelines from 2024 address use of performance status criteria to determine eligibility for clinical trials, advocating for less restrictive thresholds. We examined the exclusion of people with disability from clinical trials based on performance status and other criteria.
Methods: We reviewed eligibility criteria in approved interventional Phase III and Phase IV oncology clinical trials listed on ClinicalTrails.gov between 1 January 2019 and 31 December 2023. Functional status thresholds were assessed using the Eastern Cooperative Oncology Group Performance Status Scale and Karnofsky Performance Scale in clinical trial eligibility criteria. Qualitative analysis was used to review eligibility criteria relating to functional impairments or disability.
Results: Among 96 oncology clinical trials, approximately 40% had restrictive Eastern Cooperative Oncology Group and Karnofsky Performance Scale thresholds, explicitly including only patients with Eastern Cooperative Oncology Group 0 or 1, or equivalent Karnofsky Performance Scale 70 or greater. Only 20% of studies included patients with Eastern Cooperative Oncology Group 2 and Karnofsky Performance Scale 60. Multiple studies contained miscellaneous eligibility criteria that could potentially exclude people with disability. No studies described making accommodations for people with disability to participate in the clinical trial.
Conclusion: Draft Food and Drug Administration guidelines recommend including patients with Eastern Cooperative Oncology Group scores of 2 and Karnofsky Performance Scale scores of 60 in oncology clinical trials. We found that oncology clinical trials often exclude people with more restrictive performance status scores than the draft Food and Drug Administration guidelines, as well as other criteria that relate to disability. These estimates provide baseline information for assessing how the 2024 Food and Drug Administration guidance, if finalized, might affect the inclusion of people with disability in future trials.
{"title":"Exclusion of people from oncology clinical trials based on functional status.","authors":"Nicole D Agaronnik, Mary Linton B Peters, Lisa I Iezzoni","doi":"10.1177/17407745241304114","DOIUrl":"10.1177/17407745241304114","url":null,"abstract":"<p><strong>Background/aims: </strong>People with disability have higher rates of cancer, excluding skin cancer, compared with people without disability. Food and Drug Administration draft guidelines from 2024 address use of performance status criteria to determine eligibility for clinical trials, advocating for less restrictive thresholds. We examined the exclusion of people with disability from clinical trials based on performance status and other criteria.</p><p><strong>Methods: </strong>We reviewed eligibility criteria in approved interventional Phase III and Phase IV oncology clinical trials listed on ClinicalTrails.gov between 1 January 2019 and 31 December 2023. Functional status thresholds were assessed using the Eastern Cooperative Oncology Group Performance Status Scale and Karnofsky Performance Scale in clinical trial eligibility criteria. Qualitative analysis was used to review eligibility criteria relating to functional impairments or disability.</p><p><strong>Results: </strong>Among 96 oncology clinical trials, approximately 40% had restrictive Eastern Cooperative Oncology Group and Karnofsky Performance Scale thresholds, explicitly including only patients with Eastern Cooperative Oncology Group 0 or 1, or equivalent Karnofsky Performance Scale 70 or greater. Only 20% of studies included patients with Eastern Cooperative Oncology Group 2 and Karnofsky Performance Scale 60. Multiple studies contained miscellaneous eligibility criteria that could potentially exclude people with disability. No studies described making accommodations for people with disability to participate in the clinical trial.</p><p><strong>Conclusion: </strong>Draft Food and Drug Administration guidelines recommend including patients with Eastern Cooperative Oncology Group scores of 2 and Karnofsky Performance Scale scores of 60 in oncology clinical trials. We found that oncology clinical trials often exclude people with more restrictive performance status scores than the draft Food and Drug Administration guidelines, as well as other criteria that relate to disability. These estimates provide baseline information for assessing how the 2024 Food and Drug Administration guidance, if finalized, might affect the inclusion of people with disability in future trials.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"367-373"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12199287/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142913834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2024-12-31DOI: 10.1177/17407745241304721
Sheila Kansiime, Christian Holm Hansen, Eugene Ruzagira, Sheena McCormack, Richard Hayes, David Dunn
<p><p>BackgroundThere is increasing recognition that the interpretation of active-controlled HIV prevention trials should consider the counterfactual placebo HIV incidence rate, that is, the rate that would have been observed if the trial had included a placebo control arm. The PrEPVacc HIV vaccine and pre-exposure prophylaxis trial (NCT04066881) incorporated a pre-trial registration cohort partly for this purpose. In this article, we describe our attempts to model the counterfactual placebo HIV incidence rate from the registration cohort.MethodsPrEPVacc was conducted at four study sites in three African countries. During the set up of the trial, potential participants were invited to join a registration cohort, which included HIV testing every 3 months. The trial included a non-inferiority comparison of two daily, oral pre-exposure prophylaxis regimens (emtricitabine/tenofovir disoproxil fumarate, emtricitabine/tenofovir alafenamide fumarate), administered for a target duration of 26 weeks (until 2 weeks after the third of four vaccinations). We developed a multi-variable Poisson regression model to estimate associations in the registration cohort between HIV incidence and baseline predictors (socio-demographic and behavioural variables) and time-dependent predictors (calendar time, time in follow-up). We then used the estimated regression coefficients together with participant characteristics in the active-controlled pre-exposure prophylaxis trial to predict a counterfactual placebo incidence rate. Sensitivity analyses regarding the effect of calendar period were conducted.ResultsA total of 3255 participants were followed up in the registration cohort between July 2018 and October 2022, and 1512 participants were enrolled in the trial between December 2020 and March 2023. In the registration cohort, 106 participants were diagnosed with HIV over 3638 person-years of follow-up (incidence rate = 2.9/100 person-years of follow-up (95% confidence interval: 2.4-3.5)). The final statistical model included terms for study site, gender, age, occupation, sex after using recreational drugs, time in follow-up, and calendar period. The estimated effect of calendar period was very strong, an overall 37% (95% confidence interval: 19-51) decline per year in adjusted analyses, with evidence that this effect varied by study site. In sensitivity analyses investigating different assumptions about the precise effect of calendar period, the predicted counterfactual placebo incidence ranged between 1.2 and 3.7/100 person-years of follow-up.ConclusionIn principle, the use of a registration cohort is one of the most straightforward and reliable methods for estimating the counterfactual placebo HIV incidence. However, the predictions in PrEPVacc are complicated by an implausibly large calendar time effect, with uncertainty as to whether this can be validly extrapolated over the period of trial follow-up. Other limitations are discussed, along with suggestions for mitiga
{"title":"Challenges in estimating the counterfactual placebo HIV incidence rate from a registration cohort: The PrEPVacc trial.","authors":"Sheila Kansiime, Christian Holm Hansen, Eugene Ruzagira, Sheena McCormack, Richard Hayes, David Dunn","doi":"10.1177/17407745241304721","DOIUrl":"10.1177/17407745241304721","url":null,"abstract":"<p><p>BackgroundThere is increasing recognition that the interpretation of active-controlled HIV prevention trials should consider the counterfactual placebo HIV incidence rate, that is, the rate that would have been observed if the trial had included a placebo control arm. The PrEPVacc HIV vaccine and pre-exposure prophylaxis trial (NCT04066881) incorporated a pre-trial registration cohort partly for this purpose. In this article, we describe our attempts to model the counterfactual placebo HIV incidence rate from the registration cohort.MethodsPrEPVacc was conducted at four study sites in three African countries. During the set up of the trial, potential participants were invited to join a registration cohort, which included HIV testing every 3 months. The trial included a non-inferiority comparison of two daily, oral pre-exposure prophylaxis regimens (emtricitabine/tenofovir disoproxil fumarate, emtricitabine/tenofovir alafenamide fumarate), administered for a target duration of 26 weeks (until 2 weeks after the third of four vaccinations). We developed a multi-variable Poisson regression model to estimate associations in the registration cohort between HIV incidence and baseline predictors (socio-demographic and behavioural variables) and time-dependent predictors (calendar time, time in follow-up). We then used the estimated regression coefficients together with participant characteristics in the active-controlled pre-exposure prophylaxis trial to predict a counterfactual placebo incidence rate. Sensitivity analyses regarding the effect of calendar period were conducted.ResultsA total of 3255 participants were followed up in the registration cohort between July 2018 and October 2022, and 1512 participants were enrolled in the trial between December 2020 and March 2023. In the registration cohort, 106 participants were diagnosed with HIV over 3638 person-years of follow-up (incidence rate = 2.9/100 person-years of follow-up (95% confidence interval: 2.4-3.5)). The final statistical model included terms for study site, gender, age, occupation, sex after using recreational drugs, time in follow-up, and calendar period. The estimated effect of calendar period was very strong, an overall 37% (95% confidence interval: 19-51) decline per year in adjusted analyses, with evidence that this effect varied by study site. In sensitivity analyses investigating different assumptions about the precise effect of calendar period, the predicted counterfactual placebo incidence ranged between 1.2 and 3.7/100 person-years of follow-up.ConclusionIn principle, the use of a registration cohort is one of the most straightforward and reliable methods for estimating the counterfactual placebo HIV incidence. However, the predictions in PrEPVacc are complicated by an implausibly large calendar time effect, with uncertainty as to whether this can be validly extrapolated over the period of trial follow-up. Other limitations are discussed, along with suggestions for mitiga","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"289-300"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12092936/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143028103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-01-25DOI: 10.1177/17407745241309318
Kyungwan Hong, Bridget Nugent, Abbas Bandukwala, Robert Schuck, York Tomita, Salvatore Pepe, Mary Doi, Scott Winiecki, Kerry Jo Lee
Background/aimsRare disease drug development faces unique challenges, such as genotypic and phenotypic heterogeneity within small patient populations and a lack of established outcome measures for conditions without previously successful drug development programs. These challenges complicate the process of selecting the appropriate trial endpoints and conducting clinical trials in rare diseases. In this descriptive study, we examined novel drug approvals for non-oncologic rare diseases by the U.S. Food and Drug Administration's Center for Drug Evaluation and Research over the past decade and characterized key regulatory and trial design elements with a focus on the primary efficacy endpoint utilized as the basis of approval.MethodsUsing the Food and Drug Administration's Data Analysis Search Host database, we identified novel new drug applications and biologics license applications with orphan drug designation that were approved between 2013 and 2022 for non-oncologic indications. From Food and Drug Administration review documents and other external databases, we examined characteristics of pivotal trials for the included drugs, such as therapeutic area, trial design, and type of primary efficacy endpoints. Differences in trial design elements associated with primary efficacy endpoint type were assessed such as randomization and blinding. Then, we summarized the primary efficacy endpoint types utilized in pivotal trials by therapeutic area, approval pathway, and whether the disease etiology is well defined.ResultsOne hundred and seven drugs that met our inclusion criteria were approved between 2013 and 2022. Assessment of the 107 drug development programs identified 150 pivotal trials that were subsequently analyzed. The pivotal trials were mostly randomized (80%) and blinded (69.3%). Biomarkers (41.1%) and clinical outcomes (42.1%) were commonly utilized as primary efficacy endpoints. Analysis of the use of clinical trial design elements across trials that utilized biomarkers, clinical outcomes, or composite endpoints did not reveal statistically significant differences. The choice of primary efficacy endpoint varied by the drug's therapeutic area, approval pathway, and whether the indicated disease etiology was well defined. For example, biomarkers were commonly selected as primary efficacy endpoints in hematology drug approvals (70.6%), whereas clinical outcomes were commonly selected in neurology drug approvals (69.6%). Further, if the disease etiology was well defined, biomarkers were more commonly used as primary efficacy endpoints in pivotal trials (44.7%) than if the disease etiology was not well defined (27.3%).DiscussionIn the past 10 years, numerous novel drugs have been approved to treat non-oncologic rare diseases in various therapeutic areas. To demonstrate their efficacy for regulatory approval, biomarkers and clinical outcomes were commonly utilized as primary efficacy endpoints. Biomarkers were not only frequently used as s
{"title":"Pivotal trial characteristics and types of endpoints used to support Food and Drug Administration rare disease drug approvals between 2013 and 2022.","authors":"Kyungwan Hong, Bridget Nugent, Abbas Bandukwala, Robert Schuck, York Tomita, Salvatore Pepe, Mary Doi, Scott Winiecki, Kerry Jo Lee","doi":"10.1177/17407745241309318","DOIUrl":"10.1177/17407745241309318","url":null,"abstract":"<p><p>Background/aimsRare disease drug development faces unique challenges, such as genotypic and phenotypic heterogeneity within small patient populations and a lack of established outcome measures for conditions without previously successful drug development programs. These challenges complicate the process of selecting the appropriate trial endpoints and conducting clinical trials in rare diseases. In this descriptive study, we examined novel drug approvals for non-oncologic rare diseases by the U.S. Food and Drug Administration's Center for Drug Evaluation and Research over the past decade and characterized key regulatory and trial design elements with a focus on the primary efficacy endpoint utilized as the basis of approval.MethodsUsing the Food and Drug Administration's Data Analysis Search Host database, we identified novel new drug applications and biologics license applications with orphan drug designation that were approved between 2013 and 2022 for non-oncologic indications. From Food and Drug Administration review documents and other external databases, we examined characteristics of pivotal trials for the included drugs, such as therapeutic area, trial design, and type of primary efficacy endpoints. Differences in trial design elements associated with primary efficacy endpoint type were assessed such as randomization and blinding. Then, we summarized the primary efficacy endpoint types utilized in pivotal trials by therapeutic area, approval pathway, and whether the disease etiology is well defined.ResultsOne hundred and seven drugs that met our inclusion criteria were approved between 2013 and 2022. Assessment of the 107 drug development programs identified 150 pivotal trials that were subsequently analyzed. The pivotal trials were mostly randomized (80%) and blinded (69.3%). Biomarkers (41.1%) and clinical outcomes (42.1%) were commonly utilized as primary efficacy endpoints. Analysis of the use of clinical trial design elements across trials that utilized biomarkers, clinical outcomes, or composite endpoints did not reveal statistically significant differences. The choice of primary efficacy endpoint varied by the drug's therapeutic area, approval pathway, and whether the indicated disease etiology was well defined. For example, biomarkers were commonly selected as primary efficacy endpoints in hematology drug approvals (70.6%), whereas clinical outcomes were commonly selected in neurology drug approvals (69.6%). Further, if the disease etiology was well defined, biomarkers were more commonly used as primary efficacy endpoints in pivotal trials (44.7%) than if the disease etiology was not well defined (27.3%).DiscussionIn the past 10 years, numerous novel drugs have been approved to treat non-oncologic rare diseases in various therapeutic areas. To demonstrate their efficacy for regulatory approval, biomarkers and clinical outcomes were commonly utilized as primary efficacy endpoints. Biomarkers were not only frequently used as s","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"352-360"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143036919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-01-10DOI: 10.1177/17407745241307948
Nathaniel J Williams, Alexandra E Gomes, Nallely R Vega, Susan Esp, Mimi Choy-Brown, Rinad S Beidas
Background: Implementation and hybrid effectiveness-implementation trials aspire to speed the translation of science into practice by generating crucial evidence for improving the uptake of effective health interventions. By design, they pose unique recruitment and retention challenges due to their aims, units of analysis, and sampling plans, which typically require many clinical sites (i.e. often 20 or more) and participation by individuals who are related across multiple levels (e.g. linked organizational leaders, clinicians, and patients). In this article, we present a new multilevel, theory-informed, and relationship-centered framework for conceptualizing recruitment and retention in implementation and hybrid effectiveness-implementation trials which integrates and builds on prior work on recruitment and retention strategies in patient-focused trials. We describe the framework's application in the Working to Implement and Sustain Digital Outcome Measures hybrid type III trial, which occurred in part during the COVID-19 pandemic.
Methods: Recruitment for the Working to Implement and Sustain Digital Outcome Measures trial occurred from October 2019 to February 2022. Development of recruitment and retention strategies was guided by a newly developed multilevel framework, which targeted the capability, opportunity, and motivation of organizational leaders, clinicians, patient-facing administrative staff, and patients to engage in research. A structured assessment guide was developed and applied to refine recruitment and retention approaches throughout the trial. We describe the framework and its application amid the onset of the COVID-19 pandemic which required rapid adjustments to address numerous barriers.
Results: The Working to Implement and Sustain Digital Outcome Measures trial enrolled 21 outpatient clinics in three US states, incorporating 252 clinicians and 686 caregivers of youth (95% of patient recruitment target) across two distinct phases. Data completion rates for organizational leaders and clinicians averaged 90% over five waves spanning 18 months, despite the onset of the COVID pandemic. Caregiver completion rates of monthly follow-up assessments ranged from 80%-88% across 6 months. This article presents the multilevel framework, assessment guide, and strategies used to achieve recruitment and retention targets at each level.
Conclusion: We conducted a multi-state hybrid type III effectiveness-implementation trial that maintained high recruitment and retention across all relevant levels amid a global pandemic. The newly developed multilevel recruitment and retention framework and assessment guide presented here, which integrates behavioral theory, a relationship-focused lens, and evidence-based strategies for participant recruitment and retention at multiple levels, can be adapted and used by other researchers for implementation, hybrid, and m
{"title":"A multilevel framework for recruitment and retention in implementation trials: An illustrative example.","authors":"Nathaniel J Williams, Alexandra E Gomes, Nallely R Vega, Susan Esp, Mimi Choy-Brown, Rinad S Beidas","doi":"10.1177/17407745241307948","DOIUrl":"10.1177/17407745241307948","url":null,"abstract":"<p><strong>Background: </strong>Implementation and hybrid effectiveness-implementation trials aspire to speed the translation of science into practice by generating crucial evidence for improving the uptake of effective health interventions. By design, they pose unique recruitment and retention challenges due to their aims, units of analysis, and sampling plans, which typically require many clinical sites (i.e. often 20 or more) and participation by individuals who are related across multiple levels (e.g. linked organizational leaders, clinicians, and patients). In this article, we present a new multilevel, theory-informed, and relationship-centered framework for conceptualizing recruitment and retention in implementation and hybrid effectiveness-implementation trials which integrates and builds on prior work on recruitment and retention strategies in patient-focused trials. We describe the framework's application in the Working to Implement and Sustain Digital Outcome Measures hybrid type III trial, which occurred in part during the COVID-19 pandemic.</p><p><strong>Methods: </strong>Recruitment for the Working to Implement and Sustain Digital Outcome Measures trial occurred from October 2019 to February 2022. Development of recruitment and retention strategies was guided by a newly developed multilevel framework, which targeted the capability, opportunity, and motivation of organizational leaders, clinicians, patient-facing administrative staff, and patients to engage in research. A structured assessment guide was developed and applied to refine recruitment and retention approaches throughout the trial. We describe the framework and its application amid the onset of the COVID-19 pandemic which required rapid adjustments to address numerous barriers.</p><p><strong>Results: </strong>The Working to Implement and Sustain Digital Outcome Measures trial enrolled 21 outpatient clinics in three US states, incorporating 252 clinicians and 686 caregivers of youth (95% of patient recruitment target) across two distinct phases. Data completion rates for organizational leaders and clinicians averaged 90% over five waves spanning 18 months, despite the onset of the COVID pandemic. Caregiver completion rates of monthly follow-up assessments ranged from 80%-88% across 6 months. This article presents the multilevel framework, assessment guide, and strategies used to achieve recruitment and retention targets at each level.</p><p><strong>Conclusion: </strong>We conducted a multi-state hybrid type III effectiveness-implementation trial that maintained high recruitment and retention across all relevant levels amid a global pandemic. The newly developed multilevel recruitment and retention framework and assessment guide presented here, which integrates behavioral theory, a relationship-focused lens, and evidence-based strategies for participant recruitment and retention at multiple levels, can be adapted and used by other researchers for implementation, hybrid, and m","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"325-341"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12094903/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142964033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-01-15DOI: 10.1177/17407745241304065
Anna Moseley, Michael LeBlanc, Boris Freidlin, Rory M Shallis, Amer M Zeidan, David A Sallman, Harry P Erba, Richard F Little, Megan Othus
Background/aimsRandomized clinical trials often use stratification to ensure balance between arms. Analysis of primary endpoints of these trials typically uses a "stratified analysis," in which analyses are performed separately in each subgroup defined by the stratification factors, and those separate analyses are weighted and combined. In the phase 3 setting, stratified analyses based on a small number of stratification factors can provide a small increase in power. The impact on power and type-1 error of stratification in the setting of smaller sample sizes as in randomized phase 2 trials has not been well characterized.MethodsWe performed computational studies to characterize the power and cross-arm balance of modestly sized clinical trials (less than 170 patients) with varying numbers of stratification factors (0-6), sample sizes, randomization ratios (1:1 vs 2:1), and randomization methods (dynamic balancing vs stratified block).ResultsWe found that the power of unstratified analyses was minimally impacted by the number of stratification factors used in randomization. Analyses stratified by 1-3 factors maintained power over 80%, while power dropped below 80% when four or more stratification factors were used. These trends held regardless of sample size, randomization ratio, and randomization method. For a given randomization ratio and sample size, increasing the number of factors used in randomization had an adverse impact on cross-arm balance. Stratified block randomization performed worse than dynamic balancing with respect to cross-arm balance when three or more stratification factors were used.ConclusionStratified analyses can decrease power in the setting of phase 2 trials when the number of patients in a stratification subgroup is small.
背景/目的:随机临床试验通常采用分层来确保两臂之间的平衡。对这些试验的主要终点的分析通常使用“分层分析”,在分层因素定义的每个亚组中分别进行分析,并对这些单独的分析进行加权和合并。在第3阶段设置中,基于少量分层因素的分层分析可以提供少量的功率增加。在较小样本量的随机2期试验中,分层对功效和1型误差的影响尚未得到很好的表征。方法:我们进行了计算研究,以表征中等规模临床试验(少于170例患者)的功率和横臂平衡,这些试验具有不同数量的分层因素(0-6)、样本量、随机化比例(1:1 vs 2:1)和随机化方法(动态平衡vs分层块)。结果:我们发现,随机化中使用的分层因素数量对非分层分析的影响最小。采用1-3个因素分层的分析,准确率保持在80%以上,而采用4个或更多因素分层的分析,准确率下降到80%以下。这些趋势与样本量、随机化比例和随机化方法无关。对于给定的随机化比例和样本量,增加随机化中使用的因素数量会对横臂平衡产生不利影响。当使用三个或更多分层因素时,分层块随机化在横臂平衡方面的表现比动态平衡差。结论:当分层亚组中患者数量较少时,分层分析可能会降低2期试验的有效性。
{"title":"Evaluating the impact of stratification on the power and cross-arm balance of randomized phase 2 clinical trials.","authors":"Anna Moseley, Michael LeBlanc, Boris Freidlin, Rory M Shallis, Amer M Zeidan, David A Sallman, Harry P Erba, Richard F Little, Megan Othus","doi":"10.1177/17407745241304065","DOIUrl":"10.1177/17407745241304065","url":null,"abstract":"<p><p>Background/aimsRandomized clinical trials often use stratification to ensure balance between arms. Analysis of primary endpoints of these trials typically uses a \"stratified analysis,\" in which analyses are performed separately in each subgroup defined by the stratification factors, and those separate analyses are weighted and combined. In the phase 3 setting, stratified analyses based on a small number of stratification factors can provide a small increase in power. The impact on power and type-1 error of stratification in the setting of smaller sample sizes as in randomized phase 2 trials has not been well characterized.MethodsWe performed computational studies to characterize the power and cross-arm balance of modestly sized clinical trials (less than 170 patients) with varying numbers of stratification factors (0-6), sample sizes, randomization ratios (1:1 vs 2:1), and randomization methods (dynamic balancing vs stratified block).ResultsWe found that the power of unstratified analyses was minimally impacted by the number of stratification factors used in randomization. Analyses stratified by 1-3 factors maintained power over 80%, while power dropped below 80% when four or more stratification factors were used. These trends held regardless of sample size, randomization ratio, and randomization method. For a given randomization ratio and sample size, increasing the number of factors used in randomization had an adverse impact on cross-arm balance. Stratified block randomization performed worse than dynamic balancing with respect to cross-arm balance when three or more stratification factors were used.ConclusionStratified analyses can decrease power in the setting of phase 2 trials when the number of patients in a stratification subgroup is small.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"361-366"},"PeriodicalIF":2.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12143638/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143001560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-04-01Epub Date: 2024-12-18DOI: 10.1177/17407745241297162
Christian J Wiedermann
Background: Enoximone, a phosphodiesterase III inhibitor, was approved in Germany in 1989 and initially proposed for heart failure and perioperative cardiac conditions. The research of Joachim Boldt and his supervisor Gunter Hempelmann came under scrutiny after investigations revealed systematic scientific misconduct leading to numerous retractions. Therefore, early enoximone studies by Boldt and Hempelmann from 1988 to 1991 were reviewed.
Methods: PubMed-listed publications and dissertations on enoximone from the Justus-Liebig-University of Giessen were analyzed for study design, participant demographics, methods, and outcomes. The data were screened for duplications and inconsistencies.
Results: Of seven randomized controlled trial articles identified, two were retracted. Five of the non-retracted articles reported similarly designed studies and included similar patient cohorts. The analysis revealed overlap in patient demographics and reported outcomes and inconsistencies in cardiac index values between trials, suggesting data duplication and manipulation. Several other articles have been retracted. The analysis results of misconduct and co-authorship of retracted studies during Boldt's late formative years indicate inadequate mentorship. The university's slow response in supporting the retraction of publications involving scientific misconduct indicates systemic oversight problems.
Conclusion: All five publications analyzed remained active and warrant retraction to maintain the integrity of the scientific record. This analysis highlights the need for improved institutional supervision. The current guidelines of the Committee on Publication Ethics for retraction are inadequate for large-scale scientific misconduct. Comprehensive ethics training, regular audits, and transparent reporting are essential to ensure the credibility of published research.
背景:Enoximone是一种磷酸二酯酶III抑制剂,于1989年在德国被批准,最初用于心力衰竭和围手术期心脏病。Joachim Boldt和他的导师Gunter Hempelmann的研究在调查发现系统性的科学不端行为导致大量撤回后受到审查。因此,本文回顾了Boldt和Hempelmann从1988年到1991年的早期依诺西酮研究。方法:对来自德国吉森大学(Justus-Liebig-University of Giessen)的pubmed收录的关于依诺西蒙的出版物和论文进行研究设计、参与者人口统计、方法和结果分析。对数据进行了重复和不一致的筛选。结果:在确定的7篇随机对照试验文章中,2篇被撤回。未撤回的文章中有5篇报道了类似设计的研究,包括类似的患者队列。分析显示患者人口统计学和报告的结果重叠,试验之间心脏指数值不一致,表明数据重复和操纵。其他几篇文章也被撤回。在Boldt的后期形成时期,不当行为和撤回研究的共同作者的分析结果表明指导不足。该大学在支持撤回涉及科学不端行为的出版物方面反应迟缓,这表明系统监管存在问题。结论:所分析的所有五篇出版物都保持活跃,值得撤回,以保持科学记录的完整性。这一分析强调了改善机构监督的必要性。出版伦理委员会目前关于撤稿的指导方针不足以应对大规模的科学不端行为。全面的伦理培训、定期审计和透明的报告对于确保已发表研究的可信度至关重要。
{"title":"Assessing institutional responsibility in scientific misconduct: A case study of enoximone research by Joachim Boldt.","authors":"Christian J Wiedermann","doi":"10.1177/17407745241297162","DOIUrl":"10.1177/17407745241297162","url":null,"abstract":"<p><strong>Background: </strong>Enoximone, a phosphodiesterase III inhibitor, was approved in Germany in 1989 and initially proposed for heart failure and perioperative cardiac conditions. The research of Joachim Boldt and his supervisor Gunter Hempelmann came under scrutiny after investigations revealed systematic scientific misconduct leading to numerous retractions. Therefore, early enoximone studies by Boldt and Hempelmann from 1988 to 1991 were reviewed.</p><p><strong>Methods: </strong>PubMed-listed publications and dissertations on enoximone from the Justus-Liebig-University of Giessen were analyzed for study design, participant demographics, methods, and outcomes. The data were screened for duplications and inconsistencies.</p><p><strong>Results: </strong>Of seven randomized controlled trial articles identified, two were retracted. Five of the non-retracted articles reported similarly designed studies and included similar patient cohorts. The analysis revealed overlap in patient demographics and reported outcomes and inconsistencies in cardiac index values between trials, suggesting data duplication and manipulation. Several other articles have been retracted. The analysis results of misconduct and co-authorship of retracted studies during Boldt's late formative years indicate inadequate mentorship. The university's slow response in supporting the retraction of publications involving scientific misconduct indicates systemic oversight problems.</p><p><strong>Conclusion: </strong>All five publications analyzed remained active and warrant retraction to maintain the integrity of the scientific record. This analysis highlights the need for improved institutional supervision. The current guidelines of the Committee on Publication Ethics for retraction are inadequate for large-scale scientific misconduct. Comprehensive ethics training, regular audits, and transparent reporting are essential to ensure the credibility of published research.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"239-247"},"PeriodicalIF":2.2,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142853387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-04-01Epub Date: 2025-01-10DOI: 10.1177/17407745241302488
Elizabeth J Conroy, Jane M Blazeby, Girvan Burnside, Jonathan A Cook, Carrol Gamble
Background/aimsWhen conducting a randomised controlled trial in surgery, it is important to consider surgical learning, where surgeons' familiarity with one, or both, of the interventions increases during the trial. If present, learning may compromise trial validity. We demonstrate a statistical investigation into surgical learning within a trial of cleft palate repair.MethodsThe Timing of Primary Surgery compared primary surgery, using the Sommerlad technique, for cleft palate repair delivered at 6 or 12 months of age. Participating surgeons had varying levels of experience with the intervention and in repair across the age groups. Trial design aimed to reduce the surgical learning via pre-trial surgical technique training and balancing the randomisation process by surgeon. We explore residual learning effects by applying visual methods and statistical models to a surgical outcome (fistula formation) and a process indicator (operation time).ResultsNotably, 26 surgeons operated on 521 infants. As the trial progressed, operation time reduced for surgeons with no pre-trial Sommerlad experience (n = 2), before plateauing at 30 operations, whereas it remained stable for those with prior experience. Fistula rates remained stable regardless of technique experience. Pre-trial age for primary surgery experience had no impact on either measures.ConclusionManaging learning effects through design was not fully achieved but balanced between trial arms, and residual effects were minimal. This investigation explores the presence of learning, within a randomised controlled trial that may be valuable for future trials. We recommend such investigations are undertaken to aid trial interpretation and generalisability, and determine success of trial design measures.
{"title":"Investigating the presence of surgical learning in the Timing of Primary Surgery for cleft palate randomised trial.","authors":"Elizabeth J Conroy, Jane M Blazeby, Girvan Burnside, Jonathan A Cook, Carrol Gamble","doi":"10.1177/17407745241302488","DOIUrl":"10.1177/17407745241302488","url":null,"abstract":"<p><p>Background/aimsWhen conducting a randomised controlled trial in surgery, it is important to consider surgical learning, where surgeons' familiarity with one, or both, of the interventions increases during the trial. If present, learning may compromise trial validity. We demonstrate a statistical investigation into surgical learning within a trial of cleft palate repair.MethodsThe Timing of Primary Surgery compared primary surgery, using the Sommerlad technique, for cleft palate repair delivered at 6 or 12 months of age. Participating surgeons had varying levels of experience with the intervention and in repair across the age groups. Trial design aimed to reduce the surgical learning via pre-trial surgical technique training and balancing the randomisation process by surgeon. We explore residual learning effects by applying visual methods and statistical models to a surgical outcome (fistula formation) and a process indicator (operation time).ResultsNotably, 26 surgeons operated on 521 infants. As the trial progressed, operation time reduced for surgeons with no pre-trial Sommerlad experience (n = 2), before plateauing at 30 operations, whereas it remained stable for those with prior experience. Fistula rates remained stable regardless of technique experience. Pre-trial age for primary surgery experience had no impact on either measures.ConclusionManaging learning effects through design was not fully achieved but balanced between trial arms, and residual effects were minimal. This investigation explores the presence of learning, within a randomised controlled trial that may be valuable for future trials. We recommend such investigations are undertaken to aid trial interpretation and generalisability, and determine success of trial design measures.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"200-208"},"PeriodicalIF":2.2,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11986075/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142945660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}