Pub Date : 2025-01-25DOI: 10.1177/17407745241302477
Charles Swanton, Velicia Bachtiar, Chris Mathews, Adam R Brentnall, Ian Lowenhoff, Jo Waller, Martine Bomb, Sean McPhail, Heather Pinches, Rebecca Smittenaar, Sara Hiom, Richard D Neal, Peter Sasieni
Background/aims: Certain sociodemographic groups are routinely underrepresented in clinical trials, limiting generalisability. Here, we describe the extent to which enriched enrolment approaches yielded a diverse trial population enriched for older age in a randomised controlled trial of a blood-based multi-cancer early detection test (NCT05611632).
Methods: Participants aged 50-77 years were recruited from eight Cancer Alliance regions in England. Most were identified and invited from centralised health service lists; a dynamic invitation algorithm was used to target those in older and more deprived groups. Others were invited by their general practice surgery (GP-based Participant Identification Centres in selected regions); towards the end of recruitment, specifically Asian and Black individuals were invited via this route, as part of a concerted effort to encourage enrolment among these individuals. Some participants self-referred, often following engagement activities involving community organisations. Enrolment took place in 11 mobile clinics at 151 locations that were generally more socioeconomically deprived and ethnically diverse than the England average. We reduced logistical barriers to trial participation by offering language interpretation and translation and disabled access measures. After enrolment, we examined (1) sociodemographic distribution of participants versus England and Cancer Alliance populations, and (2) number needed to invite (NNI; the number of invitations sent to enrol one participant) by age, sex, index of multiple deprivation (IMD) and ethnicity, and GP surgery-level bowel screening participation.
Results: Approximately 1.5 million individuals were invited and 142,924 enrolled (98% via centralised health service lists/invitation algorithm) in 10.5 months. The enrolled population was older and more deprived than the England population aged 50-77 years (73.3% vs 56.8% aged 60-77 years; 42.3% vs 35.3% in IMD groups 1-2). Ethnic diversity was lower in the trial than the England population (1.4% vs 2.8% Black; 3.3% vs 5.3% Asian). NNI was highest in Black (32.8), Asian (28.2) and most-deprived (21.5) groups, and lowest in mixed ethnicity (8.1) and least-deprived (4.6) groups.
Conclusions: Enrolment approaches used in the NHS-Galleri trial enabled recruitment of an older, socioeconomically diverse participant population relatively rapidly. Compared with the England and Cancer Alliance populations, the enrolled population was enriched for those in older age and more deprived groups. Better ethnicity data availability in central health service records could enable better invitation targeting to further enhance ethnically diverse recruitment. Future research should evaluate approaches used to facilitate recruitment from underrepresented groups in clinical trials.
{"title":"NHS-Galleri trial: Enriched enrolment approaches and sociodemographic characteristics of enrolled participants.","authors":"Charles Swanton, Velicia Bachtiar, Chris Mathews, Adam R Brentnall, Ian Lowenhoff, Jo Waller, Martine Bomb, Sean McPhail, Heather Pinches, Rebecca Smittenaar, Sara Hiom, Richard D Neal, Peter Sasieni","doi":"10.1177/17407745241302477","DOIUrl":"https://doi.org/10.1177/17407745241302477","url":null,"abstract":"<p><strong>Background/aims: </strong>Certain sociodemographic groups are routinely underrepresented in clinical trials, limiting generalisability. Here, we describe the extent to which enriched enrolment approaches yielded a diverse trial population enriched for older age in a randomised controlled trial of a blood-based multi-cancer early detection test (NCT05611632).</p><p><strong>Methods: </strong>Participants aged 50-77 years were recruited from eight Cancer Alliance regions in England. Most were identified and invited from centralised health service lists; a dynamic invitation algorithm was used to target those in older and more deprived groups. Others were invited by their general practice surgery (GP-based Participant Identification Centres in selected regions); towards the end of recruitment, specifically Asian and Black individuals were invited via this route, as part of a concerted effort to encourage enrolment among these individuals. Some participants self-referred, often following engagement activities involving community organisations. Enrolment took place in 11 mobile clinics at 151 locations that were generally more socioeconomically deprived and ethnically diverse than the England average. We reduced logistical barriers to trial participation by offering language interpretation and translation and disabled access measures. After enrolment, we examined (1) sociodemographic distribution of participants versus England and Cancer Alliance populations, and (2) number needed to invite (NNI; the number of invitations sent to enrol one participant) by age, sex, index of multiple deprivation (IMD) and ethnicity, and GP surgery-level bowel screening participation.</p><p><strong>Results: </strong>Approximately 1.5 million individuals were invited and 142,924 enrolled (98% via centralised health service lists/invitation algorithm) in 10.5 months. The enrolled population was older and more deprived than the England population aged 50-77 years (73.3% vs 56.8% aged 60-77 years; 42.3% vs 35.3% in IMD groups 1-2). Ethnic diversity was lower in the trial than the England population (1.4% vs 2.8% Black; 3.3% vs 5.3% Asian). NNI was highest in Black (32.8), Asian (28.2) and most-deprived (21.5) groups, and lowest in mixed ethnicity (8.1) and least-deprived (4.6) groups.</p><p><strong>Conclusions: </strong>Enrolment approaches used in the NHS-Galleri trial enabled recruitment of an older, socioeconomically diverse participant population relatively rapidly. Compared with the England and Cancer Alliance populations, the enrolled population was enriched for those in older age and more deprived groups. Better ethnicity data availability in central health service records could enable better invitation targeting to further enhance ethnically diverse recruitment. Future research should evaluate approaches used to facilitate recruitment from underrepresented groups in clinical trials.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241302477"},"PeriodicalIF":2.2,"publicationDate":"2025-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143036914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-25DOI: 10.1177/17407745241309056
Xinlin Lu, Guogen Shan
Introduction: The sequential parallel comparison design has emerged as a valuable tool in clinical trials with high placebo response rates. To further enhance its efficiency and effectiveness, adaptive strategies, such as sample size adjustment and allocation ratio modification can be employed.
Methods: We compared the performance of Jennison and Turnbull's method and the Promising Zone approach for sample size adjustment in a two-phase sequential parallel comparison design study. We also evaluated the impact of allocation ratio adjustments using Neyman and Optimal allocation strategies. Various scenarios were simulated to assess the effects of different design parameters, including weight in the test statistic, initial randomization ratio, and interim analysis timing.
Results: The Promising Zone approach demonstrated superior or comparable power to Jennison and Turnbull's method at equivalent expected sample sizes while maintaining the intuitive property that more promising interim results lead to smaller required follow-up sample sizes. However, the Promising Zone approach may require a larger maximum possible sample size in some cases. The addition of allocation ratio adjustments offered minimal improvements overall, but showed potential benefits when the variance in the treatment group was larger than that in the placebo group. We also applied our findings to a real-world example from the AVP-923 trial in patients with Alzheimer's disease-related agitation, demonstrating the practical implications of adaptive sequential parallel comparison designs in clinical research.
Discussion: Adaptive strategies can significantly enhance the efficiency of sequential parallel comparison designs. The choice between sample size adjustment methods should consider trade-offs between power, expected sample size, and maximum adjusted sample size. Although allocation ratio adjustments showed limited overall impact, they may be beneficial in specific scenarios. Future research should explore the application of these adaptive strategies to binary and survival outcomes in sequential parallel comparison designs.
{"title":"Adaptive promising zone design for sequential parallel comparison design with continuous outcomes.","authors":"Xinlin Lu, Guogen Shan","doi":"10.1177/17407745241309056","DOIUrl":"https://doi.org/10.1177/17407745241309056","url":null,"abstract":"<p><strong>Introduction: </strong>The sequential parallel comparison design has emerged as a valuable tool in clinical trials with high placebo response rates. To further enhance its efficiency and effectiveness, adaptive strategies, such as sample size adjustment and allocation ratio modification can be employed.</p><p><strong>Methods: </strong>We compared the performance of Jennison and Turnbull's method and the Promising Zone approach for sample size adjustment in a two-phase sequential parallel comparison design study. We also evaluated the impact of allocation ratio adjustments using Neyman and Optimal allocation strategies. Various scenarios were simulated to assess the effects of different design parameters, including weight in the test statistic, initial randomization ratio, and interim analysis timing.</p><p><strong>Results: </strong>The Promising Zone approach demonstrated superior or comparable power to Jennison and Turnbull's method at equivalent expected sample sizes while maintaining the intuitive property that more promising interim results lead to smaller required follow-up sample sizes. However, the Promising Zone approach may require a larger maximum possible sample size in some cases. The addition of allocation ratio adjustments offered minimal improvements overall, but showed potential benefits when the variance in the treatment group was larger than that in the placebo group. We also applied our findings to a real-world example from the AVP-923 trial in patients with Alzheimer's disease-related agitation, demonstrating the practical implications of adaptive sequential parallel comparison designs in clinical research.</p><p><strong>Discussion: </strong>Adaptive strategies can significantly enhance the efficiency of sequential parallel comparison designs. The choice between sample size adjustment methods should consider trade-offs between power, expected sample size, and maximum adjusted sample size. Although allocation ratio adjustments showed limited overall impact, they may be beneficial in specific scenarios. Future research should explore the application of these adaptive strategies to binary and survival outcomes in sequential parallel comparison designs.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241309056"},"PeriodicalIF":2.2,"publicationDate":"2025-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143036899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-25DOI: 10.1177/17407745241309318
Kyungwan Hong, Bridget Nugent, Abbas Bandukwala, Robert Schuck, York Tomita, Salvatore Pepe, Mary Doi, Scott Winiecki, Kerry Jo Lee
<p><strong>Background/aims: </strong>Rare disease drug development faces unique challenges, such as genotypic and phenotypic heterogeneity within small patient populations and a lack of established outcome measures for conditions without previously successful drug development programs. These challenges complicate the process of selecting the appropriate trial endpoints and conducting clinical trials in rare diseases. In this descriptive study, we examined novel drug approvals for non-oncologic rare diseases by the U.S. Food and Drug Administration's Center for Drug Evaluation and Research over the past decade and characterized key regulatory and trial design elements with a focus on the primary efficacy endpoint utilized as the basis of approval.</p><p><strong>Methods: </strong>Using the Food and Drug Administration's Data Analysis Search Host database, we identified novel new drug applications and biologics license applications with orphan drug designation that were approved between 2013 and 2022 for non-oncologic indications. From Food and Drug Administration review documents and other external databases, we examined characteristics of pivotal trials for the included drugs, such as therapeutic area, trial design, and type of primary efficacy endpoints. Differences in trial design elements associated with primary efficacy endpoint type were assessed such as randomization and blinding. Then, we summarized the primary efficacy endpoint types utilized in pivotal trials by therapeutic area, approval pathway, and whether the disease etiology is well defined.</p><p><strong>Results: </strong>One hundred and seven drugs that met our inclusion criteria were approved between 2013 and 2022. Assessment of the 107 drug development programs identified 150 pivotal trials that were subsequently analyzed. The pivotal trials were mostly randomized (80%) and blinded (69.3%). Biomarkers (41.1%) and clinical outcomes (42.1%) were commonly utilized as primary efficacy endpoints. Analysis of the use of clinical trial design elements across trials that utilized biomarkers, clinical outcomes, or composite endpoints did not reveal statistically significant differences. The choice of primary efficacy endpoint varied by the drug's therapeutic area, approval pathway, and whether the indicated disease etiology was well defined. For example, biomarkers were commonly selected as primary efficacy endpoints in hematology drug approvals (70.6%), whereas clinical outcomes were commonly selected in neurology drug approvals (69.6%). Further, if the disease etiology was well defined, biomarkers were more commonly used as primary efficacy endpoints in pivotal trials (44.7%) than if the disease etiology was not well defined (27.3%).</p><p><strong>Discussion: </strong>In the past 10 years, numerous novel drugs have been approved to treat non-oncologic rare diseases in various therapeutic areas. To demonstrate their efficacy for regulatory approval, biomarkers and clinical outcomes were
{"title":"Pivotal trial characteristics and types of endpoints used to support Food and Drug Administration rare disease drug approvals between 2013 and 2022.","authors":"Kyungwan Hong, Bridget Nugent, Abbas Bandukwala, Robert Schuck, York Tomita, Salvatore Pepe, Mary Doi, Scott Winiecki, Kerry Jo Lee","doi":"10.1177/17407745241309318","DOIUrl":"https://doi.org/10.1177/17407745241309318","url":null,"abstract":"<p><strong>Background/aims: </strong>Rare disease drug development faces unique challenges, such as genotypic and phenotypic heterogeneity within small patient populations and a lack of established outcome measures for conditions without previously successful drug development programs. These challenges complicate the process of selecting the appropriate trial endpoints and conducting clinical trials in rare diseases. In this descriptive study, we examined novel drug approvals for non-oncologic rare diseases by the U.S. Food and Drug Administration's Center for Drug Evaluation and Research over the past decade and characterized key regulatory and trial design elements with a focus on the primary efficacy endpoint utilized as the basis of approval.</p><p><strong>Methods: </strong>Using the Food and Drug Administration's Data Analysis Search Host database, we identified novel new drug applications and biologics license applications with orphan drug designation that were approved between 2013 and 2022 for non-oncologic indications. From Food and Drug Administration review documents and other external databases, we examined characteristics of pivotal trials for the included drugs, such as therapeutic area, trial design, and type of primary efficacy endpoints. Differences in trial design elements associated with primary efficacy endpoint type were assessed such as randomization and blinding. Then, we summarized the primary efficacy endpoint types utilized in pivotal trials by therapeutic area, approval pathway, and whether the disease etiology is well defined.</p><p><strong>Results: </strong>One hundred and seven drugs that met our inclusion criteria were approved between 2013 and 2022. Assessment of the 107 drug development programs identified 150 pivotal trials that were subsequently analyzed. The pivotal trials were mostly randomized (80%) and blinded (69.3%). Biomarkers (41.1%) and clinical outcomes (42.1%) were commonly utilized as primary efficacy endpoints. Analysis of the use of clinical trial design elements across trials that utilized biomarkers, clinical outcomes, or composite endpoints did not reveal statistically significant differences. The choice of primary efficacy endpoint varied by the drug's therapeutic area, approval pathway, and whether the indicated disease etiology was well defined. For example, biomarkers were commonly selected as primary efficacy endpoints in hematology drug approvals (70.6%), whereas clinical outcomes were commonly selected in neurology drug approvals (69.6%). Further, if the disease etiology was well defined, biomarkers were more commonly used as primary efficacy endpoints in pivotal trials (44.7%) than if the disease etiology was not well defined (27.3%).</p><p><strong>Discussion: </strong>In the past 10 years, numerous novel drugs have been approved to treat non-oncologic rare diseases in various therapeutic areas. To demonstrate their efficacy for regulatory approval, biomarkers and clinical outcomes were ","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241309318"},"PeriodicalIF":2.2,"publicationDate":"2025-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143036919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-22DOI: 10.1177/17407745241309293
Amos J de Jong, Helga Gardarsdottir, Yared Santa-Ana-Tellez, Anthonius de Boer, Mira Gp Zuidgeest
Background/aims: Low-intervention clinical trials have been established under the European Union Clinical Trials Regulation (EU 536/2014) which aims to simplify the conduct of clinical trials with authorized medicinal products. There is limited experience with conducting low-intervention trials. Therefore, this study aims to report on experiences and perceived (dis)advantages of low-intervention trials.
Methods: We surveyed representatives of all individual clinical trials registered on the public website of the European Union Clinical Trials Information System between 31 January 2022 and 1 December 2023 that evaluated authorized investigational medicinal products and had at least one investigative site in the European Union. These representatives were approached between June 2023 and January 2024.
Results: We received 70 responses (response rate 21%). Of the respondents, 31 represented a trial registered as low-intervention trial, and 39 represented a trial not registered as a low-intervention trial (hereafter "regular trials"). Simplified clinical trial monitoring and an easier regulatory approval process were perceived as the main advantages of low-intervention trials, with respectively 44% and 34% of the respondents indicating this to be an advantage in low-intervention trials. However, the respondents experienced that stringent and unclear regulatory requirements impeded the conduct of low-intervention trials. Respondents involved with regular trials indicated that 39% of the regular trials met the criteria of a low-intervention trial but were not registered as such, among others due to unfamiliarity with this trial category.
Conclusions: We argue that the simplified procedures for low-intervention trials should be more detailed-for example in regulatory guidance-in the future to further simplify the conduct of clinical trials with authorized investigational medicinal products.
{"title":"Experiences with low-intervention clinical trials-the new category under the European Union Clinical Trials Regulation.","authors":"Amos J de Jong, Helga Gardarsdottir, Yared Santa-Ana-Tellez, Anthonius de Boer, Mira Gp Zuidgeest","doi":"10.1177/17407745241309293","DOIUrl":"https://doi.org/10.1177/17407745241309293","url":null,"abstract":"<p><strong>Background/aims: </strong>Low-intervention clinical trials have been established under the European Union Clinical Trials Regulation (EU 536/2014) which aims to simplify the conduct of clinical trials with authorized medicinal products. There is limited experience with conducting low-intervention trials. Therefore, this study aims to report on experiences and perceived (dis)advantages of low-intervention trials.</p><p><strong>Methods: </strong>We surveyed representatives of all individual clinical trials registered on the public website of the European Union Clinical Trials Information System between 31 January 2022 and 1 December 2023 that evaluated authorized investigational medicinal products and had at least one investigative site in the European Union. These representatives were approached between June 2023 and January 2024.</p><p><strong>Results: </strong>We received 70 responses (response rate 21%). Of the respondents, 31 represented a trial registered as low-intervention trial, and 39 represented a trial not registered as a low-intervention trial (hereafter \"regular trials\"). Simplified clinical trial monitoring and an easier regulatory approval process were perceived as the main advantages of low-intervention trials, with respectively 44% and 34% of the respondents indicating this to be an advantage in low-intervention trials. However, the respondents experienced that stringent and unclear regulatory requirements impeded the conduct of low-intervention trials. Respondents involved with regular trials indicated that 39% of the regular trials met the criteria of a low-intervention trial but were not registered as such, among others due to unfamiliarity with this trial category.</p><p><strong>Conclusions: </strong>We argue that the simplified procedures for low-intervention trials should be more detailed-for example in regulatory guidance-in the future to further simplify the conduct of clinical trials with authorized investigational medicinal products.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241309293"},"PeriodicalIF":2.2,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143022383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-15DOI: 10.1177/17407745241309054
Kamil Malshy, Alexis Steinmetz, Kit Yuen, Jathin Bandari, Ronald Rabinowitz
Clinical trials of drugs, procedures, and other therapies play a crucial role in advancing medical science by evaluating the safety, efficacy, and optimal use of medical interventions. The design and implementation of these trials have evolved significantly over time, reflecting advancements in medicine, ethics, and methodology. Early historical examples, such as King Nebuchadnezzar II's and his captives' dietary experiment and Ambroise Paré's treatment of gunshot wounds, laid some foundational principles of trial design. The momentum of clinical trial development increased notably with James Lind's 1747 trial for scurvy and continued to progress during World War I with innovations in blood transfusion techniques. World War II (WWII) marked a pivotal moment with breakthroughs in oncology, including the development of the first modern chemotherapeutic agents derived from mustard gas and the introduction of the randomized controlled trial, credited to British epidemiologist Austin Bradford Hill, which revolutionized trial design. More recent conflicts, such as those in Vietnam, Iraq, and Afghanistan, have driven advancements in trauma care, heroin addiction treatment, and hemorrhage management. In response to historical abuses committed by the Nazis during WWII, the evolution of clinical trials has increasingly emphasized ethical standards, particularly informed consent, starting with the Doctors' Trial and the Nuremberg Code. This article discusses how military needs and wartime innovations have shaped modern clinical research, highlighting the interplay between military imperatives and medical progress. Ultimately, clinical trials play an essential role in advancing medical science and improving patient outcomes.
{"title":"Military influences on the evolution of clinical trials throughout history.","authors":"Kamil Malshy, Alexis Steinmetz, Kit Yuen, Jathin Bandari, Ronald Rabinowitz","doi":"10.1177/17407745241309054","DOIUrl":"https://doi.org/10.1177/17407745241309054","url":null,"abstract":"<p><p>Clinical trials of drugs, procedures, and other therapies play a crucial role in advancing medical science by evaluating the safety, efficacy, and optimal use of medical interventions. The design and implementation of these trials have evolved significantly over time, reflecting advancements in medicine, ethics, and methodology. Early historical examples, such as King Nebuchadnezzar II's and his captives' dietary experiment and Ambroise Paré's treatment of gunshot wounds, laid some foundational principles of trial design. The momentum of clinical trial development increased notably with James Lind's 1747 trial for scurvy and continued to progress during World War I with innovations in blood transfusion techniques. World War II (WWII) marked a pivotal moment with breakthroughs in oncology, including the development of the first modern chemotherapeutic agents derived from mustard gas and the introduction of the randomized controlled trial, credited to British epidemiologist Austin Bradford Hill, which revolutionized trial design. More recent conflicts, such as those in Vietnam, Iraq, and Afghanistan, have driven advancements in trauma care, heroin addiction treatment, and hemorrhage management. In response to historical abuses committed by the Nazis during WWII, the evolution of clinical trials has increasingly emphasized ethical standards, particularly informed consent, starting with the Doctors' Trial and the Nuremberg Code. This article discusses how military needs and wartime innovations have shaped modern clinical research, highlighting the interplay between military imperatives and medical progress. Ultimately, clinical trials play an essential role in advancing medical science and improving patient outcomes.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241309054"},"PeriodicalIF":2.2,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143001576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-15DOI: 10.1177/17407745241304065
Anna Moseley, Michael LeBlanc, Boris Freidlin, Rory M Shallis, Amer M Zeidan, David A Sallman, Harry P Erba, Richard F Little, Megan Othus
Background/aims: Randomized clinical trials often use stratification to ensure balance between arms. Analysis of primary endpoints of these trials typically uses a "stratified analysis," in which analyses are performed separately in each subgroup defined by the stratification factors, and those separate analyses are weighted and combined. In the phase 3 setting, stratified analyses based on a small number of stratification factors can provide a small increase in power. The impact on power and type-1 error of stratification in the setting of smaller sample sizes as in randomized phase 2 trials has not been well characterized.
Methods: We performed computational studies to characterize the power and cross-arm balance of modestly sized clinical trials (less than 170 patients) with varying numbers of stratification factors (0-6), sample sizes, randomization ratios (1:1 vs 2:1), and randomization methods (dynamic balancing vs stratified block).
Results: We found that the power of unstratified analyses was minimally impacted by the number of stratification factors used in randomization. Analyses stratified by 1-3 factors maintained power over 80%, while power dropped below 80% when four or more stratification factors were used. These trends held regardless of sample size, randomization ratio, and randomization method. For a given randomization ratio and sample size, increasing the number of factors used in randomization had an adverse impact on cross-arm balance. Stratified block randomization performed worse than dynamic balancing with respect to cross-arm balance when three or more stratification factors were used.
Conclusion: Stratified analyses can decrease power in the setting of phase 2 trials when the number of patients in a stratification subgroup is small.
背景/目的:随机临床试验通常采用分层来确保两臂之间的平衡。对这些试验的主要终点的分析通常使用“分层分析”,在分层因素定义的每个亚组中分别进行分析,并对这些单独的分析进行加权和合并。在第3阶段设置中,基于少量分层因素的分层分析可以提供少量的功率增加。在较小样本量的随机2期试验中,分层对功效和1型误差的影响尚未得到很好的表征。方法:我们进行了计算研究,以表征中等规模临床试验(少于170例患者)的功率和横臂平衡,这些试验具有不同数量的分层因素(0-6)、样本量、随机化比例(1:1 vs 2:1)和随机化方法(动态平衡vs分层块)。结果:我们发现,随机化中使用的分层因素数量对非分层分析的影响最小。采用1-3个因素分层的分析,准确率保持在80%以上,而采用4个或更多因素分层的分析,准确率下降到80%以下。这些趋势与样本量、随机化比例和随机化方法无关。对于给定的随机化比例和样本量,增加随机化中使用的因素数量会对横臂平衡产生不利影响。当使用三个或更多分层因素时,分层块随机化在横臂平衡方面的表现比动态平衡差。结论:当分层亚组中患者数量较少时,分层分析可能会降低2期试验的有效性。
{"title":"Evaluating the impact of stratification on the power and cross-arm balance of randomized phase 2 clinical trials.","authors":"Anna Moseley, Michael LeBlanc, Boris Freidlin, Rory M Shallis, Amer M Zeidan, David A Sallman, Harry P Erba, Richard F Little, Megan Othus","doi":"10.1177/17407745241304065","DOIUrl":"https://doi.org/10.1177/17407745241304065","url":null,"abstract":"<p><strong>Background/aims: </strong>Randomized clinical trials often use stratification to ensure balance between arms. Analysis of primary endpoints of these trials typically uses a \"stratified analysis,\" in which analyses are performed separately in each subgroup defined by the stratification factors, and those separate analyses are weighted and combined. In the phase 3 setting, stratified analyses based on a small number of stratification factors can provide a small increase in power. The impact on power and type-1 error of stratification in the setting of smaller sample sizes as in randomized phase 2 trials has not been well characterized.</p><p><strong>Methods: </strong>We performed computational studies to characterize the power and cross-arm balance of modestly sized clinical trials (less than 170 patients) with varying numbers of stratification factors (0-6), sample sizes, randomization ratios (1:1 vs 2:1), and randomization methods (dynamic balancing vs stratified block).</p><p><strong>Results: </strong>We found that the power of unstratified analyses was minimally impacted by the number of stratification factors used in randomization. Analyses stratified by 1-3 factors maintained power over 80%, while power dropped below 80% when four or more stratification factors were used. These trends held regardless of sample size, randomization ratio, and randomization method. For a given randomization ratio and sample size, increasing the number of factors used in randomization had an adverse impact on cross-arm balance. Stratified block randomization performed worse than dynamic balancing with respect to cross-arm balance when three or more stratification factors were used.</p><p><strong>Conclusion: </strong>Stratified analyses can decrease power in the setting of phase 2 trials when the number of patients in a stratification subgroup is small.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241304065"},"PeriodicalIF":2.2,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143001560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-10DOI: 10.1177/17407745241302488
Elizabeth J Conroy, Jane M Blazeby, Girvan Burnside, Jonathan A Cook, Carrol Gamble
Background/aims: When conducting a randomised controlled trial in surgery, it is important to consider surgical learning, where surgeons' familiarity with one, or both, of the interventions increases during the trial. If present, learning may compromise trial validity. We demonstrate a statistical investigation into surgical learning within a trial of cleft palate repair.
Methods: The Timing of Primary Surgery compared primary surgery, using the Sommerlad technique, for cleft palate repair delivered at 6 or 12 months of age. Participating surgeons had varying levels of experience with the intervention and in repair across the age groups. Trial design aimed to reduce the surgical learning via pre-trial surgical technique training and balancing the randomisation process by surgeon. We explore residual learning effects by applying visual methods and statistical models to a surgical outcome (fistula formation) and a process indicator (operation time).
Results: Notably, 26 surgeons operated on 521 infants. As the trial progressed, operation time reduced for surgeons with no pre-trial Sommerlad experience (n = 2), before plateauing at 30 operations, whereas it remained stable for those with prior experience. Fistula rates remained stable regardless of technique experience. Pre-trial age for primary surgery experience had no impact on either measures.
Conclusion: Managing learning effects through design was not fully achieved but balanced between trial arms, and residual effects were minimal. This investigation explores the presence of learning, within a randomised controlled trial that may be valuable for future trials. We recommend such investigations are undertaken to aid trial interpretation and generalisability, and determine success of trial design measures.
{"title":"Investigating the presence of surgical learning in the Timing of Primary Surgery for cleft palate randomised trial.","authors":"Elizabeth J Conroy, Jane M Blazeby, Girvan Burnside, Jonathan A Cook, Carrol Gamble","doi":"10.1177/17407745241302488","DOIUrl":"https://doi.org/10.1177/17407745241302488","url":null,"abstract":"<p><strong>Background/aims: </strong>When conducting a randomised controlled trial in surgery, it is important to consider surgical learning, where surgeons' familiarity with one, or both, of the interventions increases during the trial. If present, learning may compromise trial validity. We demonstrate a statistical investigation into surgical learning within a trial of cleft palate repair.</p><p><strong>Methods: </strong>The Timing of Primary Surgery compared primary surgery, using the Sommerlad technique, for cleft palate repair delivered at 6 or 12 months of age. Participating surgeons had varying levels of experience with the intervention and in repair across the age groups. Trial design aimed to reduce the surgical learning via pre-trial surgical technique training and balancing the randomisation process by surgeon. We explore residual learning effects by applying visual methods and statistical models to a surgical outcome (fistula formation) and a process indicator (operation time).</p><p><strong>Results: </strong>Notably, 26 surgeons operated on 521 infants. As the trial progressed, operation time reduced for surgeons with no pre-trial Sommerlad experience (n = 2), before plateauing at 30 operations, whereas it remained stable for those with prior experience. Fistula rates remained stable regardless of technique experience. Pre-trial age for primary surgery experience had no impact on either measures.</p><p><strong>Conclusion: </strong>Managing learning effects through design was not fully achieved but balanced between trial arms, and residual effects were minimal. This investigation explores the presence of learning, within a randomised controlled trial that may be valuable for future trials. We recommend such investigations are undertaken to aid trial interpretation and generalisability, and determine success of trial design measures.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241302488"},"PeriodicalIF":2.2,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142945660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-10DOI: 10.1177/17407745241307948
Nathaniel J Williams, Alexandra E Gomes, Nallely R Vega, Susan Esp, Mimi Choy-Brown, Rinad S Beidas
<p><strong>Background: </strong>Implementation and hybrid effectiveness-implementation trials aspire to speed the translation of science into practice by generating crucial evidence for improving the uptake of effective health interventions. By design, they pose unique recruitment and retention challenges due to their aims, units of analysis, and sampling plans, which typically require many clinical sites (i.e. often 20 or more) and participation by individuals who are related across multiple levels (e.g. linked organizational leaders, clinicians, and patients). In this article, we present a new multilevel, theory-informed, and relationship-centered framework for conceptualizing recruitment and retention in implementation and hybrid effectiveness-implementation trials which integrates and builds on prior work on recruitment and retention strategies in patient-focused trials. We describe the framework's application in the Working to Implement and Sustain Digital Outcome Measures hybrid type III trial, which occurred in part during the COVID-19 pandemic.</p><p><strong>Methods: </strong>Recruitment for the Working to Implement and Sustain Digital Outcome Measures trial occurred from October 2019 to February 2022. Development of recruitment and retention strategies was guided by a newly developed multilevel framework, which targeted the capability, opportunity, and motivation of organizational leaders, clinicians, patient-facing administrative staff, and patients to engage in research. A structured assessment guide was developed and applied to refine recruitment and retention approaches throughout the trial. We describe the framework and its application amid the onset of the COVID-19 pandemic which required rapid adjustments to address numerous barriers.</p><p><strong>Results: </strong>The Working to Implement and Sustain Digital Outcome Measures trial enrolled 21 outpatient clinics in three US states, incorporating 252 clinicians and 686 caregivers of youth (95% of patient recruitment target) across two distinct phases. Data completion rates for organizational leaders and clinicians averaged 90% over five waves spanning 18 months, despite the onset of the COVID pandemic. Caregiver completion rates of monthly follow-up assessments ranged from 80%-88% across 6 months. This article presents the multilevel framework, assessment guide, and strategies used to achieve recruitment and retention targets at each level.</p><p><strong>Conclusion: </strong>We conducted a multi-state hybrid type III effectiveness-implementation trial that maintained high recruitment and retention across all relevant levels amid a global pandemic. The newly developed multilevel recruitment and retention framework and assessment guide presented here, which integrates behavioral theory, a relationship-focused lens, and evidence-based strategies for participant recruitment and retention at multiple levels, can be adapted and used by other researchers for implementation, hybrid, and m
{"title":"A multilevel framework for recruitment and retention in implementation trials: An illustrative example.","authors":"Nathaniel J Williams, Alexandra E Gomes, Nallely R Vega, Susan Esp, Mimi Choy-Brown, Rinad S Beidas","doi":"10.1177/17407745241307948","DOIUrl":"https://doi.org/10.1177/17407745241307948","url":null,"abstract":"<p><strong>Background: </strong>Implementation and hybrid effectiveness-implementation trials aspire to speed the translation of science into practice by generating crucial evidence for improving the uptake of effective health interventions. By design, they pose unique recruitment and retention challenges due to their aims, units of analysis, and sampling plans, which typically require many clinical sites (i.e. often 20 or more) and participation by individuals who are related across multiple levels (e.g. linked organizational leaders, clinicians, and patients). In this article, we present a new multilevel, theory-informed, and relationship-centered framework for conceptualizing recruitment and retention in implementation and hybrid effectiveness-implementation trials which integrates and builds on prior work on recruitment and retention strategies in patient-focused trials. We describe the framework's application in the Working to Implement and Sustain Digital Outcome Measures hybrid type III trial, which occurred in part during the COVID-19 pandemic.</p><p><strong>Methods: </strong>Recruitment for the Working to Implement and Sustain Digital Outcome Measures trial occurred from October 2019 to February 2022. Development of recruitment and retention strategies was guided by a newly developed multilevel framework, which targeted the capability, opportunity, and motivation of organizational leaders, clinicians, patient-facing administrative staff, and patients to engage in research. A structured assessment guide was developed and applied to refine recruitment and retention approaches throughout the trial. We describe the framework and its application amid the onset of the COVID-19 pandemic which required rapid adjustments to address numerous barriers.</p><p><strong>Results: </strong>The Working to Implement and Sustain Digital Outcome Measures trial enrolled 21 outpatient clinics in three US states, incorporating 252 clinicians and 686 caregivers of youth (95% of patient recruitment target) across two distinct phases. Data completion rates for organizational leaders and clinicians averaged 90% over five waves spanning 18 months, despite the onset of the COVID pandemic. Caregiver completion rates of monthly follow-up assessments ranged from 80%-88% across 6 months. This article presents the multilevel framework, assessment guide, and strategies used to achieve recruitment and retention targets at each level.</p><p><strong>Conclusion: </strong>We conducted a multi-state hybrid type III effectiveness-implementation trial that maintained high recruitment and retention across all relevant levels amid a global pandemic. The newly developed multilevel recruitment and retention framework and assessment guide presented here, which integrates behavioral theory, a relationship-focused lens, and evidence-based strategies for participant recruitment and retention at multiple levels, can be adapted and used by other researchers for implementation, hybrid, and m","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241307948"},"PeriodicalIF":2.2,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142964033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: In randomized clinical trials, multiple-testing procedures, composite endpoints, and prioritized outcome approaches are increasingly used to analyze multiple binary outcomes. Previous studies have shown that correlations between outcomes influence their sample size requirements. Although sample size is an important factor affecting the choice of statistical methods, the power and required sample sizes of methods for analyzing multiple binary outcomes have yet to be compared under the influence of outcome correlations.
Methods: We conducted simulations to evaluate the power of co-primary and multiple primary endpoints, composite endpoints, and prioritized outcome approaches based on generalized pairwise comparisons with varying correlations, marginal proportions, treatment effects, and number of outcomes. We then conducted a case study on sample size using a clinical trial of a migraine treatment as an example.
Results: The correlations significantly affected the statistical power and sample size of composite endpoints. The power and sample size of co-primary endpoints remained relatively stable across different correlations, though their power declined substantially when treatment effects were opposite on some components or more than two components were present. While the correlations influenced the power and sample size of all methods assessed, their direction and degree of influence varied between methods. Notably, the method with the greatest power and smallest sample size also differed depending on the correlations. When the correlations were the same between arms, prioritized outcome approaches usually had higher power and smaller sample sizes than other methods.
Conclusions: Anticipated correlations and their uncertainty should be considered when selecting statistical methods. Overall, co-primary endpoints remain a reliable option for evaluating the superiority of all components, although they are unsuitable for assessing the balance between treatment effects pointing in different directions. Generalized pairwise comparisons offer a useful alternative to deal with multiple prioritized outcomes, often providing the smallest sample sizes when the correlation structures are shared between the arms.
{"title":"Impact of correlation structure on sample size requirements of statistical methods for multiple binary outcomes: A simulation study.","authors":"Kanako Fuyama, Kentaro Sakamaki, Kohei Uemura, Isao Yokota","doi":"10.1177/17407745241304706","DOIUrl":"https://doi.org/10.1177/17407745241304706","url":null,"abstract":"<p><strong>Background: </strong>In randomized clinical trials, multiple-testing procedures, composite endpoints, and prioritized outcome approaches are increasingly used to analyze multiple binary outcomes. Previous studies have shown that correlations between outcomes influence their sample size requirements. Although sample size is an important factor affecting the choice of statistical methods, the power and required sample sizes of methods for analyzing multiple binary outcomes have yet to be compared under the influence of outcome correlations.</p><p><strong>Methods: </strong>We conducted simulations to evaluate the power of co-primary and multiple primary endpoints, composite endpoints, and prioritized outcome approaches based on generalized pairwise comparisons with varying correlations, marginal proportions, treatment effects, and number of outcomes. We then conducted a case study on sample size using a clinical trial of a migraine treatment as an example.</p><p><strong>Results: </strong>The correlations significantly affected the statistical power and sample size of composite endpoints. The power and sample size of co-primary endpoints remained relatively stable across different correlations, though their power declined substantially when treatment effects were opposite on some components or more than two components were present. While the correlations influenced the power and sample size of all methods assessed, their direction and degree of influence varied between methods. Notably, the method with the greatest power and smallest sample size also differed depending on the correlations. When the correlations were the same between arms, prioritized outcome approaches usually had higher power and smaller sample sizes than other methods.</p><p><strong>Conclusions: </strong>Anticipated correlations and their uncertainty should be considered when selecting statistical methods. Overall, co-primary endpoints remain a reliable option for evaluating the superiority of all components, although they are unsuitable for assessing the balance between treatment effects pointing in different directions. Generalized pairwise comparisons offer a useful alternative to deal with multiple prioritized outcomes, often providing the smallest sample sizes when the correlation structures are shared between the arms.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241304706"},"PeriodicalIF":2.2,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142920978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-02DOI: 10.1177/17407745241304284
Subodh Selukar, David K Prince, Susanne May
Background: N-of-1 trials compare two or more treatment options for a single participant. These trials have been used to study options for chronic conditions such as arthritis and attention deficit hyperactivity disorder. In addition, they have been suggested as a means to study interventions in rare populations that may not be tractable to include in standard clinical trials, such as treatment options for HIV-positive patients in need of organ transplant. Sequential monitoring of accruing data has been well-studied in traditional clinical trials, but these methods have not yet been implemented in N-of-1 trials. However, the option to validly stop an N-of-1 trial early could deliver faster decisions that could directly improve the patient's health.
Methods: In this work, we propose and evaluate a framework to (1) facilitate sequential monitoring in individual N-of-1 trials with a continuous outcome and (2) combine results across a series of already-completed sequentially monitored N-of-1 trials. By employing the block structure common to N-of-1 trials, we suggest that existing approaches to sequential monitoring may be employed when data from one N-of-1 trial are analyzed with a linear mixed-effects model. To combine results across a series of already-completed sequentially monitored N-of-1 trials, we propose combining the naive estimates from constituent trials in a random-effects model with inverse-variance weighting. We evaluate these proposals via simulation.
Results: We find that type 1 error can be substantially inflated for N-of-1 trials with a small number of planned blocks but can reach the nominal rate for trials with more planned blocks or those with larger numbers of periods per block or by using a -value correction. For those settings with acceptable type 1 error, sequential monitoring results in similar power and on average earlier stopping compared with trials with no sequential monitoring. And, as expected, we find that including a larger number of constituent trials in a series reduces the mean-squared error of the combined point estimator.
Conclusion: Under suitable design considerations, our proposed framework for sequential monitoring can support clinicians in providing important decisions earlier, on average, for patients engaged in N-of-1 trials.
{"title":"A framework for sequential monitoring of individual N-of-1 trials and combining results across a series of sequentially monitored N-of-1 trials.","authors":"Subodh Selukar, David K Prince, Susanne May","doi":"10.1177/17407745241304284","DOIUrl":"10.1177/17407745241304284","url":null,"abstract":"<p><strong>Background: </strong>N-of-1 trials compare two or more treatment options for a single participant. These trials have been used to study options for chronic conditions such as arthritis and attention deficit hyperactivity disorder. In addition, they have been suggested as a means to study interventions in rare populations that may not be tractable to include in standard clinical trials, such as treatment options for HIV-positive patients in need of organ transplant. Sequential monitoring of accruing data has been well-studied in traditional clinical trials, but these methods have not yet been implemented in N-of-1 trials. However, the option to validly stop an N-of-1 trial early could deliver faster decisions that could directly improve the patient's health.</p><p><strong>Methods: </strong>In this work, we propose and evaluate a framework to (1) facilitate sequential monitoring in individual N-of-1 trials with a continuous outcome and (2) combine results across a series of already-completed sequentially monitored N-of-1 trials. By employing the block structure common to N-of-1 trials, we suggest that existing approaches to sequential monitoring may be employed when data from one N-of-1 trial are analyzed with a linear mixed-effects model. To combine results across a series of already-completed sequentially monitored N-of-1 trials, we propose combining the naive estimates from constituent trials in a random-effects model with inverse-variance weighting. We evaluate these proposals via simulation.</p><p><strong>Results: </strong>We find that type 1 error can be substantially inflated for N-of-1 trials with a small number of planned blocks but can reach the nominal rate for trials with more planned blocks or those with larger numbers of periods per block or by using a <math><mrow><mi>t</mi></mrow></math>-value correction. For those settings with acceptable type 1 error, sequential monitoring results in similar power and on average earlier stopping compared with trials with no sequential monitoring. And, as expected, we find that including a larger number of constituent trials in a series reduces the mean-squared error of the combined point estimator.</p><p><strong>Conclusion: </strong>Under suitable design considerations, our proposed framework for sequential monitoring can support clinicians in providing important decisions earlier, on average, for patients engaged in N-of-1 trials.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241304284"},"PeriodicalIF":2.2,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142913822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}