Pub Date : 2024-10-23DOI: 10.1177/17407745241286065
Carolyn Mead-Harvey, Ethan Basch, Lauren J Rogak, Blake T Langlais, Gita Thanarajasingam, Brenda F Ginos, Minji K Lee, Claire Yee, Sandra A Mitchell, Lori M Minasian, Tito R Mendoza, Antonia V Bennett, Deborah Schrag, Amylou C Dueck, Gina L Mazza
Background/aims: The Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE®) was developed to capture symptomatic adverse events from the patient perspective. We aim to describe statistical properties of PRO-CTCAE items and summary scores and to provide evidence for recommendations regarding PRO-CTCAE administration and reporting.
Methods: Using data from the PRO-CTCAE validation study (NCT02158637), prevalence, means, and standard deviations of PRO-CTCAE items, composite scores, and mean and maximum scores across attributes (frequency, severity, and/or interference) of symptomatic adverse events were calculated. For each adverse event, correlations and agreement between attributes, correlations between attributes and composite scores, and correlations between composite, mean, and maximum scores were estimated.
Results: PRO-CTCAE items were completed by 899 patients with various cancer types. Most patients reported experiencing one or more adverse events, with the most prevalent being fatigue (87.7%), sad/unhappy feelings (66.0%), anxiety (63.6%), pain (63.2%), insomnia (61.8%), and dry mouth (60.0%). Attributes were moderately to strongly correlated within an adverse event (r = 0.53 to 0.77, all p < 0.001) but not fully concordant (κweighted = 0.26 to 0.60, all p < 0.001), with interference demonstrating lowest mean scores and prevalence among attributes of the same adverse event. Attributes were moderately to strongly correlated with composite scores (r = 0.67 to 0.97, all p < 0.001). Composite scores were moderately to strongly correlated with mean and maximum scores for the same adverse event (r = 0.69 to 0.94, all p < 0.001). Correlations between composite scores of different adverse events varied widely (r = 0.04 to 0.68) but were moderate to strong for conceptually related adverse events.
Conclusions: Results provide evidence for PRO-CTCAE administration and reporting recommendations that the full complement of attributes be administered for each adverse event, and that attributes as well as summary scores be reported.
背景/目的:患者报告结果版不良事件通用术语标准(PRO-CTCAE®)旨在从患者角度捕捉症状性不良事件。我们旨在描述 PRO-CTCAE 项目和总分的统计特性,并为有关 PRO-CTCAE 管理和报告的建议提供证据:利用 PRO-CTCAE 验证研究(NCT02158637)的数据,计算了 PRO-CTCAE 项目、综合评分以及症状性不良事件不同属性(频率、严重程度和/或干扰)的平均分和最高分的流行率、平均值和标准偏差。对每种不良事件的属性之间的相关性和一致性、属性与综合评分之间的相关性以及综合评分、平均分和最高分之间的相关性进行了估算:899名不同癌症类型的患者完成了PRO-CTCAE项目。大多数患者报告经历了一种或多种不良事件,其中最普遍的不良事件是疲劳(87.7%)、悲伤/不开心(66.0%)、焦虑(63.6%)、疼痛(63.2%)、失眠(61.8%)和口干(60.0%)。在一个不良事件中,属性的相关性为中度到高度相关(r = 0.53 到 0.77,所有 p 加权 = 0.26 到 0.60,所有 p r = 0.67 到 0.97,所有 p r = 0.69 到 0.94,所有 p r = 0.04 到 0.68),但在概念相关的不良事件中,属性的相关性为中度到高度相关:结论:研究结果为 PRO-CTCAE 的管理和报告建议提供了证据,建议对每种不良事件进行全套属性管理,并报告属性和总分。
{"title":"Statistical properties of items and summary scores from the Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE<sup>®</sup>) in a diverse cancer sample.","authors":"Carolyn Mead-Harvey, Ethan Basch, Lauren J Rogak, Blake T Langlais, Gita Thanarajasingam, Brenda F Ginos, Minji K Lee, Claire Yee, Sandra A Mitchell, Lori M Minasian, Tito R Mendoza, Antonia V Bennett, Deborah Schrag, Amylou C Dueck, Gina L Mazza","doi":"10.1177/17407745241286065","DOIUrl":"10.1177/17407745241286065","url":null,"abstract":"<p><strong>Background/aims: </strong>The Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE<sup>®</sup>) was developed to capture symptomatic adverse events from the patient perspective. We aim to describe statistical properties of PRO-CTCAE items and summary scores and to provide evidence for recommendations regarding PRO-CTCAE administration and reporting.</p><p><strong>Methods: </strong>Using data from the PRO-CTCAE validation study (NCT02158637), prevalence, means, and standard deviations of PRO-CTCAE items, composite scores, and mean and maximum scores across attributes (frequency, severity, and/or interference) of symptomatic adverse events were calculated. For each adverse event, correlations and agreement between attributes, correlations between attributes and composite scores, and correlations between composite, mean, and maximum scores were estimated.</p><p><strong>Results: </strong>PRO-CTCAE items were completed by 899 patients with various cancer types. Most patients reported experiencing one or more adverse events, with the most prevalent being fatigue (87.7%), sad/unhappy feelings (66.0%), anxiety (63.6%), pain (63.2%), insomnia (61.8%), and dry mouth (60.0%). Attributes were moderately to strongly correlated within an adverse event (<i>r</i> = 0.53 to 0.77, all <i>p</i> < 0.001) but not fully concordant (κ<sub>weighted</sub> = 0.26 to 0.60, all <i>p</i> < 0.001), with interference demonstrating lowest mean scores and prevalence among attributes of the same adverse event. Attributes were moderately to strongly correlated with composite scores (<i>r</i> = 0.67 to 0.97, all <i>p</i> < 0.001). Composite scores were moderately to strongly correlated with mean and maximum scores for the same adverse event (<i>r</i> = 0.69 to 0.94, all <i>p</i> < 0.001). Correlations between composite scores of different adverse events varied widely (<i>r</i> = 0.04 to 0.68) but were moderate to strong for conceptually related adverse events.</p><p><strong>Conclusions: </strong>Results provide evidence for PRO-CTCAE administration and reporting recommendations that the full complement of attributes be administered for each adverse event, and that attributes as well as summary scores be reported.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241286065"},"PeriodicalIF":2.2,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142496568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-15DOI: 10.1177/17407745241286147
Natansh D Modi, Lee X Li, Jessica M Logan, Michael D Wiese, Ahmad Y Abuhelwa, Ross A McKinnon, Andrew Rowland, Michael J Sorich, Ashley M Hopkins
Background: Amid growing emphasis from pharmaceutical companies, advocacy groups, and regulatory bodies for sharing of individual participant data, recent audits reveal limited sharing, particularly for high-revenue medicines. Therefore, this study aimed to assess the individual participant data-sharing eligibility of clinical trials supporting the Food and Drug Administration approval of the top 30 highest-revenue medicines for 2021.
Methods: A cross-sectional analysis was conducted on 316 clinical trials supporting approval of the top 30 revenue-generating medicines of 2021. The study assessed whether these trials were eligible for individual participant data sharing, defined as being publicly listed on a data-sharing platform or confirmed by the trial sponsors as in scope for independent researcher individual participant data investigations. Information was gathered from various sources including ClinicalTrials.gov, the European Union Clinical Trials Register, and PubMed. Key factors such as the trial phase, completion dates, and the nature of the data-sharing process were also examined.
Results: Of the 316 trials, 201 (64%) were confirmed eligible for sharing, meaning they were either publicly listed on a data-sharing platform or confirmed by the trial sponsors as in scope for independent researcher individual participant data investigations. A total of 102 (32%) were confirmed ineligible, and for 13 (4%), the sponsor indicated that a full research proposal would be required to determine eligibility. The analysis also revealed a higher rate of individual participant data sharing among companies that utilized independent platforms, such as Vivli, for managing their individual participant data-sharing process. Trials not marked as completed had significantly lower eligibility for individual participant data sharing.
Conclusion: This study highlights that a substantial portion of trials for top revenue-generating medicines are eligible for individual participant data sharing. However, challenges persist, particularly for trials that are marked as ongoing and for trials where the sharing processes are managed internally by pharmaceutical companies. Data-sharing rates could be improved by adopting open-access individual participant data-sharing models or using independent platforms. Standardizing policies to facilitate immediate individual participant data availability for approved medicines is necessary.
{"title":"The state of individual participant data sharing for the highest-revenue medicines.","authors":"Natansh D Modi, Lee X Li, Jessica M Logan, Michael D Wiese, Ahmad Y Abuhelwa, Ross A McKinnon, Andrew Rowland, Michael J Sorich, Ashley M Hopkins","doi":"10.1177/17407745241286147","DOIUrl":"https://doi.org/10.1177/17407745241286147","url":null,"abstract":"<p><strong>Background: </strong>Amid growing emphasis from pharmaceutical companies, advocacy groups, and regulatory bodies for sharing of individual participant data, recent audits reveal limited sharing, particularly for high-revenue medicines. Therefore, this study aimed to assess the individual participant data-sharing eligibility of clinical trials supporting the Food and Drug Administration approval of the top 30 highest-revenue medicines for 2021.</p><p><strong>Methods: </strong>A cross-sectional analysis was conducted on 316 clinical trials supporting approval of the top 30 revenue-generating medicines of 2021. The study assessed whether these trials were eligible for individual participant data sharing, defined as being publicly listed on a data-sharing platform or confirmed by the trial sponsors as in scope for independent researcher individual participant data investigations. Information was gathered from various sources including ClinicalTrials.gov, the European Union Clinical Trials Register, and PubMed. Key factors such as the trial phase, completion dates, and the nature of the data-sharing process were also examined.</p><p><strong>Results: </strong>Of the 316 trials, 201 (64%) were confirmed eligible for sharing, meaning they were either publicly listed on a data-sharing platform or confirmed by the trial sponsors as in scope for independent researcher individual participant data investigations. A total of 102 (32%) were confirmed ineligible, and for 13 (4%), the sponsor indicated that a full research proposal would be required to determine eligibility. The analysis also revealed a higher rate of individual participant data sharing among companies that utilized independent platforms, such as Vivli, for managing their individual participant data-sharing process. Trials not marked as completed had significantly lower eligibility for individual participant data sharing.</p><p><strong>Conclusion: </strong>This study highlights that a substantial portion of trials for top revenue-generating medicines are eligible for individual participant data sharing. However, challenges persist, particularly for trials that are marked as ongoing and for trials where the sharing processes are managed internally by pharmaceutical companies. Data-sharing rates could be improved by adopting open-access individual participant data-sharing models or using independent platforms. Standardizing policies to facilitate immediate individual participant data availability for approved medicines is necessary.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241286147"},"PeriodicalIF":2.2,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142459903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-15DOI: 10.1177/17407745241284044
Andrew J Vickers, Behfar Ehdaie, Hanae K Tokita, Jonas Nelson, Evan Matros, Andrea L Pusic, Michael D'Angelica
Background: Concerns about low accrual have long been a standard part of the discourse on cancer clinical trials, reaching even as far as the news media. Indeed, so many trials are closed before completing accrual that a cottage industry has recently developed creating statistical models to predict trial failure. We previously proposed four methodologic fixes for the current crisis in clinical trials: (1) dramatically reducing the number of eligibility criteria, (2) using data routinely collected in clinical practice for trial endpoints; then lowering barriers to accrual by (3) cluster randomization or (4) staged consent.
Methods: We report our practical experience of applying these fixes to randomized trials at Memorial Sloan Kettering Cancer Center.
Results: We have completed seven single-center randomized trials, with several more underway and accruing rapidly, with a total accrual approaching 10,000. Many of the trials have compared surgical interventions, an area where trials have traditionally been hard to complete. Only one of these trials was externally funded. While low costs were possible due to the existing research infrastructure at our institution, such infrastructure is common at major cancer centers.
Conclusions: Further research on innovative clinical trial designs is warranted, particularly in higher-stakes settings, and in trials of medical and radiotherapy interventions.
{"title":"Successful completion of large, low-cost randomized cancer trials at an academic cancer center.","authors":"Andrew J Vickers, Behfar Ehdaie, Hanae K Tokita, Jonas Nelson, Evan Matros, Andrea L Pusic, Michael D'Angelica","doi":"10.1177/17407745241284044","DOIUrl":"https://doi.org/10.1177/17407745241284044","url":null,"abstract":"<p><strong>Background: </strong>Concerns about low accrual have long been a standard part of the discourse on cancer clinical trials, reaching even as far as the news media. Indeed, so many trials are closed before completing accrual that a cottage industry has recently developed creating statistical models to predict trial failure. We previously proposed four methodologic fixes for the current crisis in clinical trials: (1) dramatically reducing the number of eligibility criteria, (2) using data routinely collected in clinical practice for trial endpoints; then lowering barriers to accrual by (3) cluster randomization or (4) staged consent.</p><p><strong>Methods: </strong>We report our practical experience of applying these fixes to randomized trials at Memorial Sloan Kettering Cancer Center.</p><p><strong>Results: </strong>We have completed seven single-center randomized trials, with several more underway and accruing rapidly, with a total accrual approaching 10,000. Many of the trials have compared surgical interventions, an area where trials have traditionally been hard to complete. Only one of these trials was externally funded. While low costs were possible due to the existing research infrastructure at our institution, such infrastructure is common at major cancer centers.</p><p><strong>Conclusions: </strong>Further research on innovative clinical trial designs is warranted, particularly in higher-stakes settings, and in trials of medical and radiotherapy interventions.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241284044"},"PeriodicalIF":2.2,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142459902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-15DOI: 10.1177/17407745241284798
Hayden P Nix, Charles Weijer, Monica Taljaard
Background: Randomized controlled trials with pragmatic intent aim to generate evidence that directly informs clinical decisions. Some have argued that the ethical protection of informed consent can be in tension with the goals of pragmatism. But the impact of other ethical protections on trial pragmatism has yet to be explored.
Purpose: In this article, we analyze the relationship between additional ethical protections for vulnerable participants and the degree of pragmatism within the PRagmatic Explanatory Continuum Indicator Summary-2 (PRECIS-2) domains of trial design.
Methods: We analyze three example trials with pragmatic intent that include vulnerable participants.
Conclusion: The relationship between ethical protections and trial pragmatism is complex. In some cases, additional ethical protections for vulnerable participants can promote the pragmatism of some of the PRECIS-2 domains of trial design. When designing trials with pragmatic intent, researchers ought to look for opportunities wherein ethical protections enhance the degree of pragmatism.
{"title":"Are pragmatism and ethical protections in clinical trials a zero-sum game?","authors":"Hayden P Nix, Charles Weijer, Monica Taljaard","doi":"10.1177/17407745241284798","DOIUrl":"10.1177/17407745241284798","url":null,"abstract":"<p><strong>Background: </strong>Randomized controlled trials with pragmatic intent aim to generate evidence that directly informs clinical decisions. Some have argued that the ethical protection of informed consent can be in tension with the goals of pragmatism. But the impact of other ethical protections on trial pragmatism has yet to be explored.</p><p><strong>Purpose: </strong>In this article, we analyze the relationship between additional ethical protections for vulnerable participants and the degree of pragmatism within the PRagmatic Explanatory Continuum Indicator Summary-2 (PRECIS-2) domains of trial design.</p><p><strong>Methods: </strong>We analyze three example trials with pragmatic intent that include vulnerable participants.</p><p><strong>Conclusion: </strong>The relationship between ethical protections and trial pragmatism is complex. In some cases, additional ethical protections for vulnerable participants can promote the pragmatism of some of the PRECIS-2 domains of trial design. When designing trials with pragmatic intent, researchers ought to look for opportunities wherein ethical protections enhance the degree of pragmatism.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241284798"},"PeriodicalIF":2.2,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142459901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-12DOI: 10.1177/17407745241284786
Lingyun Ji, Todd A Alonzo
Background/aims: For cancers with low incidence, low event rates, and a time-to-event endpoint, a randomized non-inferiority trial designed based on the logrank test can require a large sample size with significantly prolonged enrollment duration, making such a non-inferiority trial not feasible. This article evaluates a design based on a non-inferiority test of proportions, compares its required sample size to the non-inferiority logrank test, assesses whether there are scenarios for which a non-inferiority test of proportions can be more efficient, and provides guidelines in usage of a non-inferiority test of proportions.
Methods: This article describes the sample size calculation for a randomized non-inferiority trial based on a non-inferiority logrank test or a non-inferiority test of proportions. The sample size required by the two design methods are compared for a wide range of scenarios, varying the underlying Weibull survival functions, the non-inferiority margin, and loss to follow-up rate.
Results: Our results showed that there are scenarios for which the non-inferiority test of proportions can have significantly reduced sample size. Specifically, the non-inferiority test of proportions can be considered for cancers with more than 80% long-term survival rate. We provide guidance in choice of this design approach based on parameters of the Weibull survival functions, the non-inferiority margin, and loss to follow-up rate.
Conclusion: For cancers with low incidence and low event rates, a non-inferiority trial based on the logrank test is not feasible due to its large required sample size and prolonged enrollment duration. The use of a non-inferiority test of proportions can make a randomized non-inferiority Phase III trial feasible.
{"title":"Using non-inferiority test of proportions in design of randomized non-inferiority trials with time-to-event endpoint with a focus on low-event-rate setting.","authors":"Lingyun Ji, Todd A Alonzo","doi":"10.1177/17407745241284786","DOIUrl":"10.1177/17407745241284786","url":null,"abstract":"<p><strong>Background/aims: </strong>For cancers with low incidence, low event rates, and a time-to-event endpoint, a randomized non-inferiority trial designed based on the logrank test can require a large sample size with significantly prolonged enrollment duration, making such a non-inferiority trial not feasible. This article evaluates a design based on a non-inferiority test of proportions, compares its required sample size to the non-inferiority logrank test, assesses whether there are scenarios for which a non-inferiority test of proportions can be more efficient, and provides guidelines in usage of a non-inferiority test of proportions.</p><p><strong>Methods: </strong>This article describes the sample size calculation for a randomized non-inferiority trial based on a non-inferiority logrank test or a non-inferiority test of proportions. The sample size required by the two design methods are compared for a wide range of scenarios, varying the underlying Weibull survival functions, the non-inferiority margin, and loss to follow-up rate.</p><p><strong>Results: </strong>Our results showed that there are scenarios for which the non-inferiority test of proportions can have significantly reduced sample size. Specifically, the non-inferiority test of proportions can be considered for cancers with more than 80% long-term survival rate. We provide guidance in choice of this design approach based on parameters of the Weibull survival functions, the non-inferiority margin, and loss to follow-up rate.</p><p><strong>Conclusion: </strong>For cancers with low incidence and low event rates, a non-inferiority trial based on the logrank test is not feasible due to its large required sample size and prolonged enrollment duration. The use of a non-inferiority test of proportions can make a randomized non-inferiority Phase III trial feasible.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241284786"},"PeriodicalIF":2.2,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142459904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-10DOI: 10.1177/17407745241276130
Pedro Nascimento Martins, Mateus Henrique Toledo Lourenço, Gabriel Paz Souza Mota, Alexandre Biasi Cavalcanti, Ana Carolina Peçanha Antonio, Fredi Alexander Diaz-Quijano
Background/aims: This study aimed to determine the prevalence of ordinal, binary, and numerical composite endpoints among coronavirus disease 2019 trials and the potential bias attributable to their use.
Methods: We systematically reviewed the Cochrane COVID-19 Study Register to assess the prevalence, characteristics, and bias associated with using composite endpoints in coronavirus disease 2019 randomized clinical trials. We compared the effect measure (relative risk) of composite outcomes and that of its most critical component (i.e. death) by estimating the Bias Attributable to Composite Outcomes index [ln(relative risk for the composite outcome)/ln(relative risk for death)].
Results: Composite endpoints accounted for 152 out of 417 primary endpoints in coronavirus disease 2019 randomized trials, being more frequent among studies published in high-impact journals. Ordinal endpoints were the most common (54% of all composites), followed by binary or time-to-event (34%), numerical (11%), and hierarchical (1%). Composites predominated among trials enrolling patients with severe disease when compared to trials with a mild or moderate case mix (odds ratio = 1.72). Adaptations of the seven-point World Health Organization scale occurred in 40% of the ordinal primary endpoints, which frequently underwent dichotomization for the statistical analyses. Mortality accounted for a median of 24% (interquartile range: 6%-48%) of all events when included in the composite. The median point estimate of the Bias Attributable to Composite Outcomes index was 0.3 (interquartile range: -0.1 to 0.7), being significantly lower than 1 in 5 of 24 comparisons.
Discussion: Composite endpoints were used in a significant proportion of coronavirus disease 2019 trials, especially those involving severely ill patients. This is likely due to the higher anticipated rates of competing events, such as death, in such studies. Ordinal composites were common but often not fully appreciated, reducing the potential gains in information and statistical efficiency. For studies with binary composites, death was the most frequent component, and, unexpectedly, composite outcome estimates were often closer to the null when compared to those for mortality death. Numerical composites were less common, and only two trials used hierarchical endpoints. These newer approaches may offer advantages over traditional binary and ordinal composites; however, their potential benefits warrant further scrutiny.
Conclusion: Composite endpoints accounted for more than a third of coronavirus disease 2019 trials' primary endpoints; their use was more common among studies that included patients with severe disease and their point effect estimates tended to underestimate those for mortality.
{"title":"Composite endpoints in COVID-19 randomized controlled trials: a systematic review.","authors":"Pedro Nascimento Martins, Mateus Henrique Toledo Lourenço, Gabriel Paz Souza Mota, Alexandre Biasi Cavalcanti, Ana Carolina Peçanha Antonio, Fredi Alexander Diaz-Quijano","doi":"10.1177/17407745241276130","DOIUrl":"https://doi.org/10.1177/17407745241276130","url":null,"abstract":"<p><strong>Background/aims: </strong>This study aimed to determine the prevalence of ordinal, binary, and numerical composite endpoints among coronavirus disease 2019 trials and the potential bias attributable to their use.</p><p><strong>Methods: </strong>We systematically reviewed the Cochrane COVID-19 Study Register to assess the prevalence, characteristics, and bias associated with using composite endpoints in coronavirus disease 2019 randomized clinical trials. We compared the effect measure (relative risk) of composite outcomes and that of its most critical component (i.e. death) by estimating the Bias Attributable to Composite Outcomes index [ln(relative risk for the composite outcome)/ln(relative risk for death)].</p><p><strong>Results: </strong>Composite endpoints accounted for 152 out of 417 primary endpoints in coronavirus disease 2019 randomized trials, being more frequent among studies published in high-impact journals. Ordinal endpoints were the most common (54% of all composites), followed by binary or time-to-event (34%), numerical (11%), and hierarchical (1%). Composites predominated among trials enrolling patients with severe disease when compared to trials with a mild or moderate case mix (odds ratio = 1.72). Adaptations of the seven-point World Health Organization scale occurred in 40% of the ordinal primary endpoints, which frequently underwent dichotomization for the statistical analyses. Mortality accounted for a median of 24% (interquartile range: 6%-48%) of all events when included in the composite. The median point estimate of the Bias Attributable to Composite Outcomes index was 0.3 (interquartile range: -0.1 to 0.7), being significantly lower than 1 in 5 of 24 comparisons.</p><p><strong>Discussion: </strong>Composite endpoints were used in a significant proportion of coronavirus disease 2019 trials, especially those involving severely ill patients. This is likely due to the higher anticipated rates of competing events, such as death, in such studies. Ordinal composites were common but often not fully appreciated, reducing the potential gains in information and statistical efficiency. For studies with binary composites, death was the most frequent component, and, unexpectedly, composite outcome estimates were often closer to the null when compared to those for mortality death. Numerical composites were less common, and only two trials used hierarchical endpoints. These newer approaches may offer advantages over traditional binary and ordinal composites; however, their potential benefits warrant further scrutiny.</p><p><strong>Conclusion: </strong>Composite endpoints accounted for more than a third of coronavirus disease 2019 trials' primary endpoints; their use was more common among studies that included patients with severe disease and their point effect estimates tended to underestimate those for mortality.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241276130"},"PeriodicalIF":2.2,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142399650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-08DOI: 10.1177/17407745241276137
Guangyu Tong, Pascale Nevins, Mary Ryan, Kendra Davis-Plourde, Yongdong Ouyang, Jules Antoine Pereira Macedo, Can Meng, Xueqi Wang, Agnès Caille, Fan Li, Monica Taljaard
<p><strong>Background/aims: </strong>Stepped-wedge cluster randomized trials tend to require fewer clusters than standard parallel-arm designs due to the switches between control and intervention conditions, but there are no recommendations for the minimum number of clusters. Trials randomizing an extremely small number of clusters are not uncommon, but the justification for small numbers of clusters is often unclear and appropriate analysis is often lacking. In addition, stepped-wedge cluster randomized trials are methodologically more complex due to their longitudinal correlation structure, and ignoring the distinct within- and between-period intracluster correlations can underestimate the sample size in small stepped-wedge cluster randomized trials. We conducted a review of published small stepped-wedge cluster randomized trials to understand how and why they are used, and to characterize approaches used in their design and analysis.</p><p><strong>Methods: </strong>Electronic searches were used to identify primary reports of full-scale stepped-wedge cluster randomized trials published during the period 2016-2022; the subset that randomized two to six clusters was identified. Two reviewers independently extracted information from each report and any available protocol. Disagreements were resolved through discussion.</p><p><strong>Results: </strong>We identified 61 stepped-wedge cluster randomized trials that randomized two to six clusters: median sample size (Q1-Q3) 1426 (420-7553) participants. Twelve (19.7%) gave some indication that the evaluation was considered a "preliminary" evaluation and 16 (26.2%) recognized the small number of clusters as a limitation. Sixteen (26.2%) provided an explanation for the limited number of clusters: the need to minimize contamination (e.g. by merging adjacent units), limited availability of clusters, and logistical considerations were common explanations. Majority (51, 83.6%) presented sample size or power calculations, but only one assumed distinct within- and between-period intracluster correlations. Few (10, 16.4%) utilized restricted randomization methods; more than half (34, 55.7%) identified baseline imbalances. The most common statistical method for analysis was the generalized linear mixed model (44, 72.1%). Only four trials (6.6%) reported statistical analyses considering small numbers of clusters: one used generalized estimating equations with small-sample correction, two used generalized linear mixed model with small-sample correction, and one used Bayesian analysis. Another eight (13.1%) used fixed-effects regression, the performance of which requires further evaluation under stepped-wedge cluster randomized trials with small numbers of clusters. None used permutation tests or cluster-period level analysis.</p><p><strong>Conclusion: </strong>Methods appropriate for the design and analysis of small stepped-wedge cluster randomized trials have not been widely adopted in practice. Greater awareness
{"title":"A review of current practice in the design and analysis of extremely small stepped-wedge cluster randomized trials.","authors":"Guangyu Tong, Pascale Nevins, Mary Ryan, Kendra Davis-Plourde, Yongdong Ouyang, Jules Antoine Pereira Macedo, Can Meng, Xueqi Wang, Agnès Caille, Fan Li, Monica Taljaard","doi":"10.1177/17407745241276137","DOIUrl":"10.1177/17407745241276137","url":null,"abstract":"<p><strong>Background/aims: </strong>Stepped-wedge cluster randomized trials tend to require fewer clusters than standard parallel-arm designs due to the switches between control and intervention conditions, but there are no recommendations for the minimum number of clusters. Trials randomizing an extremely small number of clusters are not uncommon, but the justification for small numbers of clusters is often unclear and appropriate analysis is often lacking. In addition, stepped-wedge cluster randomized trials are methodologically more complex due to their longitudinal correlation structure, and ignoring the distinct within- and between-period intracluster correlations can underestimate the sample size in small stepped-wedge cluster randomized trials. We conducted a review of published small stepped-wedge cluster randomized trials to understand how and why they are used, and to characterize approaches used in their design and analysis.</p><p><strong>Methods: </strong>Electronic searches were used to identify primary reports of full-scale stepped-wedge cluster randomized trials published during the period 2016-2022; the subset that randomized two to six clusters was identified. Two reviewers independently extracted information from each report and any available protocol. Disagreements were resolved through discussion.</p><p><strong>Results: </strong>We identified 61 stepped-wedge cluster randomized trials that randomized two to six clusters: median sample size (Q1-Q3) 1426 (420-7553) participants. Twelve (19.7%) gave some indication that the evaluation was considered a \"preliminary\" evaluation and 16 (26.2%) recognized the small number of clusters as a limitation. Sixteen (26.2%) provided an explanation for the limited number of clusters: the need to minimize contamination (e.g. by merging adjacent units), limited availability of clusters, and logistical considerations were common explanations. Majority (51, 83.6%) presented sample size or power calculations, but only one assumed distinct within- and between-period intracluster correlations. Few (10, 16.4%) utilized restricted randomization methods; more than half (34, 55.7%) identified baseline imbalances. The most common statistical method for analysis was the generalized linear mixed model (44, 72.1%). Only four trials (6.6%) reported statistical analyses considering small numbers of clusters: one used generalized estimating equations with small-sample correction, two used generalized linear mixed model with small-sample correction, and one used Bayesian analysis. Another eight (13.1%) used fixed-effects regression, the performance of which requires further evaluation under stepped-wedge cluster randomized trials with small numbers of clusters. None used permutation tests or cluster-period level analysis.</p><p><strong>Conclusion: </strong>Methods appropriate for the design and analysis of small stepped-wedge cluster randomized trials have not been widely adopted in practice. Greater awareness","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"17407745241276137"},"PeriodicalIF":2.2,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142388716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-01Epub Date: 2024-10-08DOI: 10.1177/17407745241271939
Ionut Bebu, Rebecca A Betensky, Michael P Fay
{"title":"15th Annual University of Pennsylvania conference on statistical issues in clinical trial/advances in time-to-event analyses in clinical trials (afternoon panel discussion).","authors":"Ionut Bebu, Rebecca A Betensky, Michael P Fay","doi":"10.1177/17407745241271939","DOIUrl":"10.1177/17407745241271939","url":null,"abstract":"","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"612-622"},"PeriodicalIF":2.2,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142388715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-01Epub Date: 2024-08-08DOI: 10.1177/17407745241267999
Rachel Marceau West, Gregory Golm, Devan V Mehrotra
Composite time-to-event endpoints are commonly used in cardiovascular outcome trials. For example, the IMPROVE-IT trial comparing ezetimibe+simvastatin to placebo+simvastatin in 18,144 patients with acute coronary syndrome used a primary composite endpoint with five component outcomes: (1) cardiovascular death, (2) non-fatal stroke, (3) non-fatal myocardial infarction, (4) coronary revascularization ≥30 days after randomization, and (5) unstable angina requiring hospitalization. In such settings, the traditional analysis compares treatments using the observed time to the occurrence of the first (i.e. earliest) component outcome for each patient. This approach ignores information for subsequent outcome(s), possibly leading to reduced power to demonstrate the benefit of the test versus the control treatment. We use real data examples and simulations to contrast the traditional approach with several alternative approaches that use data for all the intra-patient component outcomes, not just the first.
{"title":"Analysis of composite time-to-event endpoints in cardiovascular outcome trials.","authors":"Rachel Marceau West, Gregory Golm, Devan V Mehrotra","doi":"10.1177/17407745241267999","DOIUrl":"10.1177/17407745241267999","url":null,"abstract":"<p><p>Composite time-to-event endpoints are commonly used in cardiovascular outcome trials. For example, the IMPROVE-IT trial comparing ezetimibe+simvastatin to placebo+simvastatin in 18,144 patients with acute coronary syndrome used a primary composite endpoint with five component outcomes: (1) cardiovascular death, (2) non-fatal stroke, (3) non-fatal myocardial infarction, (4) coronary revascularization ≥30 days after randomization, and (5) unstable angina requiring hospitalization. In such settings, the traditional analysis compares treatments using the observed time to the occurrence of the first (i.e. earliest) component outcome for each patient. This approach ignores information for subsequent outcome(s), possibly leading to reduced power to demonstrate the benefit of the test versus the control treatment. We use real data examples and simulations to contrast the traditional approach with several alternative approaches that use data for all the intra-patient component outcomes, not just the first.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"576-583"},"PeriodicalIF":2.2,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141906134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-01Epub Date: 2024-02-02DOI: 10.1177/17407745231222448
Devan V Mehrotra, Rachel Marceau West
In randomized clinical trials, analyses of time-to-event data without risk stratification, or with stratification based on pre-selected factors revealed at the end of the trial to be at most weakly associated with risk, are quite common. We caution that such analyses are likely delivering hazard ratio estimates that unwittingly dilute the evidence of benefit for the test relative to the control treatment. To make our case, first, we use a hypothetical scenario to contrast risk-unstratified and risk-stratified hazard ratios. Thereafter, we draw attention to the previously published 5-step stratified testing and amalgamation routine (5-STAR) approach in which a pre-specified treatment-blinded algorithm is applied to survival times from the trial to partition patients into well-separated risk strata using baseline covariates determined to be jointly strongly prognostic for event risk. After treatment unblinding, a treatment comparison is done within each risk stratum and stratum-level results are averaged for overall inference. For illustration, we use 5-STAR to reanalyze data for the primary and key secondary time-to-event endpoints from three published cardiovascular outcomes trials. The results show that the 5-STAR estimate is typically smaller (i.e. more in favor of the test treatment) than the originally reported (traditional) estimate. This is not surprising because 5-STAR mitigates the presumed dilution bias in the traditional hazard ratio estimate caused by no or inadequate risk stratification, as evidenced by two detailed examples. Pre-selection of stratification factors at the trial design stage to achieve adequate risk stratification for the analysis will often be challenging. In such settings, an objective risk stratification approach such as 5-STAR, which is partly aligned with guidance from the US Food and Drug Administration on covariate-adjustment in clinical trials, is worthy of consideration.
{"title":"Is inadequate risk stratification diluting hazard ratio estimates in randomized clinical trials?","authors":"Devan V Mehrotra, Rachel Marceau West","doi":"10.1177/17407745231222448","DOIUrl":"10.1177/17407745231222448","url":null,"abstract":"<p><p>In randomized clinical trials, analyses of time-to-event data without risk stratification, or with stratification based on pre-selected factors revealed at the end of the trial to be at most weakly associated with risk, are quite common. We caution that such analyses are likely delivering hazard ratio estimates that unwittingly dilute the evidence of benefit for the test relative to the control treatment. To make our case, first, we use a hypothetical scenario to contrast risk-unstratified and risk-stratified hazard ratios. Thereafter, we draw attention to the previously published 5-step stratified testing and amalgamation routine (5-STAR) approach in which a pre-specified treatment-blinded algorithm is applied to survival times from the trial to partition patients into well-separated risk strata using baseline covariates determined to be jointly strongly prognostic for event risk. After treatment unblinding, a treatment comparison is done within each risk stratum and stratum-level results are averaged for overall inference. For illustration, we use 5-STAR to reanalyze data for the primary and key secondary time-to-event endpoints from three published cardiovascular outcomes trials. The results show that the 5-STAR estimate is typically smaller (i.e. more in favor of the test treatment) than the originally reported (traditional) estimate. This is not surprising because 5-STAR mitigates the presumed dilution bias in the traditional hazard ratio estimate caused by no or inadequate risk stratification, as evidenced by two detailed examples. Pre-selection of stratification factors at the trial design stage to achieve adequate risk stratification for the analysis will often be challenging. In such settings, an objective risk stratification approach such as 5-STAR, which is partly aligned with guidance from the US Food and Drug Administration on covariate-adjustment in clinical trials, is worthy of consideration.</p>","PeriodicalId":10685,"journal":{"name":"Clinical Trials","volume":" ","pages":"571-575"},"PeriodicalIF":2.2,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139671450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}