BMC Medical Research Methodology最新文献_第10页

Determining doses for backfill cohorts based on patient-reported outcome. 根据患者报告的结果确定后补队列的剂量。

IF 3.9 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

BMC Medical Research Methodology

Pub Date : 2024-11-08 DOI: 10.1186/s12874-024-02398-w

Xin Chen, Jingyi Zhang, Bosheng Li, Fangrong Yan

Background: Incorporating backfill cohorts in phase I oncology trials is a recently developed strategy for dose optimization. However, the efficacy assessment window is long in general, causing a lag in identifying ineffective doses and more patients being backfilled to those doses. There is necessity to investigate how to use patient-reported outcomes (PRO) to determine doses for backfill cohorts.

Methods: We propose a unified Bayesian design framework, called 'Backfill-QoL', to utilize patient-reported quality of life (QoL) data into phase I oncology trials with backfill cohorts, including methods for trial monitoring, algorithm for dose-finding, and criteria for dose selection. Simulation studies and sensitivity analyses are conducted to evaluate the proposed Backfill-QoL design.

Results: The simulation studies demonstrate that the Backfill-QoL design is more efficient than traditional dose-expansion strategy, and fewer patients would be allocated to doses with unacceptable QoL profiles. A user-friendly Windows desktop application is developed and freely available for implementing the proposed design.

Conclusions: The Backfill-QoL design enables continuous monitoring of safety, efficacy and QoL outcomes, and the recommended phase II dose (RP2D) can be identified in a more patient-centered perspective.

背景：在 I 期肿瘤试验中纳入后补队列是最近开发的一种剂量优化策略。然而，疗效评估窗口期一般较长，导致确定无效剂量的工作滞后，更多的患者被回补到这些剂量。因此，有必要研究如何利用患者报告结果（PRO）来确定后补队列的剂量：方法：我们提出了一个统一的贝叶斯设计框架，称为 "回填-QoL"，将患者报告的生活质量（QoL）数据用于具有回填队列的一期肿瘤试验，包括试验监测方法、剂量寻找算法和剂量选择标准。我们进行了模拟研究和敏感性分析，以评估拟议的 "回填-QoL "设计：模拟研究表明，Backfill-QoL 设计比传统的剂量扩展策略更有效，而且更少的患者会被分配到无法接受 QoL 资料的剂量上。我们开发了一个用户友好的 Windows 桌面应用程序，并免费提供给用户使用，以实现所提出的设计：Backfill-QoL设计可对安全性、疗效和QoL结果进行持续监测，并可从更以患者为中心的角度确定II期推荐剂量（RP2D）。

{"title":"Determining doses for backfill cohorts based on patient-reported outcome.","authors":"Xin Chen, Jingyi Zhang, Bosheng Li, Fangrong Yan","doi":"10.1186/s12874-024-02398-w","DOIUrl":"10.1186/s12874-024-02398-w","url":null,"abstract":"Background: Incorporating backfill cohorts in phase I oncology trials is a recently developed strategy for dose optimization. However, the efficacy assessment window is long in general, causing a lag in identifying ineffective doses and more patients being backfilled to those doses. There is necessity to investigate how to use patient-reported outcomes (PRO) to determine doses for backfill cohorts.Methods: We propose a unified Bayesian design framework, called 'Backfill-QoL', to utilize patient-reported quality of life (QoL) data into phase I oncology trials with backfill cohorts, including methods for trial monitoring, algorithm for dose-finding, and criteria for dose selection. Simulation studies and sensitivity analyses are conducted to evaluate the proposed Backfill-QoL design.Results: The simulation studies demonstrate that the Backfill-QoL design is more efficient than traditional dose-expansion strategy, and fewer patients would be allocated to doses with unacceptable QoL profiles. A user-friendly Windows desktop application is developed and freely available for implementing the proposed design.Conclusions: The Backfill-QoL design enables continuous monitoring of safety, efficacy and QoL outcomes, and the recommended phase II dose (RP2D) can be identified in a more patient-centered perspective.","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"270"},"PeriodicalIF":3.9,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11546322/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142614529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Challenges in measurement of adolescent mental health: how are gender patterns affected when level of symptoms is analysed simultaneously with impairment? 衡量青少年心理健康的挑战：同时分析症状水平和损伤时，性别模式会受到什么影响？

IF 3.9 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

BMC Medical Research Methodology

Pub Date : 2024-11-07 DOI: 10.1186/s12874-024-02385-1

Curt Hagquist

Background: Adolescent mental health surveys in public health are sometimes questioned because of their main focus on self-reported symptoms, lacking data on impairment, e.g. the consequences on everyday life of the mental health problems. While public health studies typically reveal higher prevalence rates of internalising problems for girls than boys, there are indications that the gender pattern may change when self-reported data on symptoms are analysed simultaneously with impairment. The purpose is to determine how gender patterns of adolescent mental health solely based on symptoms are affected when level of symptoms is analysed simultaneously with impairment.

Methods: Questionnaire data on adolescent mental health were collected in schools by Statistics Sweden in the autumn of 2009 as part of a national total population study in grades 6 and 9 in Sweden. In this study only data from grade 9 students are used (n = 91 627; response rate = 80 per cent). Psychosomatic symptoms were measured with the Psychosomatic Problems scale including eight items. Impairment was measured with four items included in the SDQ impact supplement. The associations between these key constructs were analysed with logistic regression and contingency tables.

Results: When analysing variables on psychosomatic symptoms and impairment independently, the results are consistent with typical findings of gender patterns in adolescent internalising mental health. Girls report both more psychosomatic symptoms, and more negative consequences in everyday life, than boys. The gender patterns are, however, strongly affected when impairment is conditioned on level of psychosomatic symptoms. Except for the Home Life setting, in the settings of Friendships, Classroom Learning and Leisure Activities, the previously reported gender pattern favoring higher disturbances among girls becomes partly reversed implying that boys report more negative consequences than girls. Hence, while girls report a higher prevalence of psychosomatic symptoms, boys appear to suffer from such symptoms more than, or as much as, girls in three out of four everyday life settings.

Conclusions: The study confirms the insufficiency of solely including data on symptoms in the measurement of adolescent mental health. Regardless of the causes of the complex gender pattern shown in this study, the results highlight the importance of simultaneous inclusion of indicators of impairment as well as symptom counts and frequency in the measurement of adolescent mental health.

背景：在公共卫生领域开展的青少年心理健康调查有时会受到质疑，因为这些调查主要侧重于自我报告的症状，而缺乏有关障碍的数据，例如心理健康问题对日常生活的影响。虽然公共卫生研究通常显示女孩的内化问题发生率高于男孩，但有迹象表明，如果同时分析自我报告的症状数据和受损情况，性别模式可能会发生变化。本研究的目的是要确定，当症状水平与受损程度同时进行分析时，仅以症状为依据的青少年心理健康性别模式会受到怎样的影响：方法：2009 年秋季，瑞典统计局在学校收集了有关青少年心理健康的问卷数据，作为瑞典六年级和九年级全国总人口研究的一部分。本研究仅使用了九年级学生的数据（n = 91 627；回复率 = 80%）。心身症状用心身问题量表测量，包括八个项目。影响则通过 SDQ 影响补充量表中的四个项目进行测量。通过逻辑回归和或然率表分析了这些关键结构之间的关联：结果：在对心身症状和损伤变量进行独立分析时，结果与青少年内化性心理健康中性别模式的典型发现一致。与男孩相比，女孩报告的心身症状和日常生活中的负面影响都更多。然而，当受损程度以心身症状的程度为条件时，性别模式就会受到很大影响。除家庭生活环境外，在朋友关系、课堂学习和休闲活动环境中，先前报告的性别模式在一定程度上发生了逆转，即女孩的干扰程度更高，这意味着男孩比女孩报告了更多的负面后果。因此，虽然女生报告的心身症状发生率较高，但在四种日常生活环境中的三种环境中，男生的心身症状似乎比女生更严重，甚至与女生一样严重：这项研究证实，在衡量青少年心理健康时，仅将症状数据包括在内是不够的。无论本研究中显示的复杂性别模式的原因是什么，研究结果都强调了在测量青少年心理健康时，同时纳入损伤指标以及症状计数和频率的重要性。

{"title":"Challenges in measurement of adolescent mental health: how are gender patterns affected when level of symptoms is analysed simultaneously with impairment?","authors":"Curt Hagquist","doi":"10.1186/s12874-024-02385-1","DOIUrl":"10.1186/s12874-024-02385-1","url":null,"abstract":"Background: Adolescent mental health surveys in public health are sometimes questioned because of their main focus on self-reported symptoms, lacking data on impairment, e.g. the consequences on everyday life of the mental health problems. While public health studies typically reveal higher prevalence rates of internalising problems for girls than boys, there are indications that the gender pattern may change when self-reported data on symptoms are analysed simultaneously with impairment. The purpose is to determine how gender patterns of adolescent mental health solely based on symptoms are affected when level of symptoms is analysed simultaneously with impairment.Methods: Questionnaire data on adolescent mental health were collected in schools by Statistics Sweden in the autumn of 2009 as part of a national total population study in grades 6 and 9 in Sweden. In this study only data from grade 9 students are used (n = 91 627; response rate = 80 per cent). Psychosomatic symptoms were measured with the Psychosomatic Problems scale including eight items. Impairment was measured with four items included in the SDQ impact supplement. The associations between these key constructs were analysed with logistic regression and contingency tables.Results: When analysing variables on psychosomatic symptoms and impairment independently, the results are consistent with typical findings of gender patterns in adolescent internalising mental health. Girls report both more psychosomatic symptoms, and more negative consequences in everyday life, than boys. The gender patterns are, however, strongly affected when impairment is conditioned on level of psychosomatic symptoms. Except for the Home Life setting, in the settings of Friendships, Classroom Learning and Leisure Activities, the previously reported gender pattern favoring higher disturbances among girls becomes partly reversed implying that boys report more negative consequences than girls. Hence, while girls report a higher prevalence of psychosomatic symptoms, boys appear to suffer from such symptoms more than, or as much as, girls in three out of four everyday life settings.Conclusions: The study confirms the insufficiency of solely including data on symptoms in the measurement of adolescent mental health. Regardless of the causes of the complex gender pattern shown in this study, the results highlight the importance of simultaneous inclusion of indicators of impairment as well as symptom counts and frequency in the measurement of adolescent mental health.","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"268"},"PeriodicalIF":3.9,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11542466/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142603223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Motivations for enrollment in a COVID-19 ring-based post-exposure prophylaxis trial: qualitative examination of participant experiences. 参加基于 COVID-19 环的暴露后预防试验的动机：对参与者经历的定性研究。

IF 3.9 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

BMC Medical Research Methodology

Pub Date : 2024-11-05 DOI: 10.1186/s12874-024-02394-0

Julien Brisson, Rebecca Balasa, Andrea Bowra, David C Hill, Aarti S Doshi, Darrell H S Tan, Amaya Perez-Brumer

Background: Ring-based studies are a novel research design commonly used for research involving infectious diseases: contacts of newly infected individuals form a ring that is targeted for interventions (e.g., vaccine, post-exposure prophylaxis). Given the novelty of the research design, it is critical to obtain feedback from participants on their experiences with ring-based studies to help with the development of future trials.

Methods: In 2021, we conducted 26 semi-structured interviews with adult participants of a COVID-19 ring-based post-exposure prophylaxis trial based in Canada. We applied a purposive sampling approach and electronically recruited participants who tested positive for COVID-19 (Index Cases) and either agreed or declined for the study team to contact their potentially exposed contacts. We also included individuals who participated in the trial after being potentially exposed to an Index Case (known as Ring Members), and those who declined to participate after potential exposure. The methodological design of semi-structured interviews allowed participants to share their opinions and experiences in the trial (e.g., elements they enjoyed and disliked regarding their participation in the study).

Results: The majority of participants in our study were women (62%) and the average age was 37.3 years (SD = 13.2). Overall, participants reported being highly satisfied with partaking in the ring-based trial. Notably, no substantial complaints were voiced about the trial's design involving contact after exposure. The most common reason of satisfaction was the knowledge of potentially helping others by advancing knowledge for a greater cause (e.g., development of potential treatment to prevent SARS-CoV-2 infection). Other reasons were curiosity about participating in a trial, and an activity to fill free time during the pandemic. A central element of dislike was confusion about instructions with the trial (e.g., independent at home SARS-CoV-2 testing). Additionally, maintaining confidentiality was a crucial concern for participants, who sought assurance that their data would not be shared beyond the scope of the study.

Conclusions: Our results have the potential to inform future research, including clinical trials such as ring-based studies, by incorporating insights from participants' experiences into the development of study protocols. Despite some protocol-related challenges, participants expressed high satisfaction, driven by the desire to advance science and potentially aid others.

背景：环状研究是一种新颖的研究设计，常用于涉及传染病的研究：新感染者的接触者组成一个环，作为干预（如疫苗、暴露后预防）的目标。鉴于该研究设计的新颖性，从参与者处获得他们对基于环的研究的经验反馈至关重要，这有助于未来试验的发展：2021 年，我们在加拿大对 COVID-19 环基暴露后预防试验的成年参与者进行了 26 次半结构化访谈。我们采用了有目的的抽样方法，并通过电子方式招募了 COVID-19 检测呈阳性且同意或拒绝研究小组与其潜在接触者联系的参与者（索引病例）。我们还纳入了在可能暴露于指数病例后参与试验的个人（称为环状成员），以及在可能暴露后拒绝参与试验的个人。半结构式访谈的方法设计允许参与者分享他们在试验中的意见和经验（例如，他们喜欢和不喜欢参与研究的因素）：大部分参与者为女性（62%），平均年龄为 37.3 岁（SD = 13.2）。总体而言，参与者对参加环基试验非常满意。值得注意的是，参与者对试验中涉及暴露后接触的设计没有提出实质性的抱怨。最常见的满意原因是知道自己有可能通过增进知识来帮助他人，从而实现更大的目标（例如，开发预防 SARS-CoV-2 感染的潜在治疗方法）。其他原因包括对参与试验的好奇心，以及在大流行期间填补空闲时间的活动。不喜欢试验的一个主要原因是对试验说明（如在家独立进行 SARS-CoV-2 检测）感到困惑。此外，保密也是参与者非常关心的一个问题，他们希望得到保证，他们的数据不会在研究范围之外被共享：我们的研究结果有可能为未来的研究提供参考，包括临床试验，如基于环的研究，将参与者的经验纳入研究方案的制定中。尽管存在一些与方案相关的挑战，但参与者在推动科学发展和帮助他人的愿望驱使下表示非常满意。

{"title":"Motivations for enrollment in a COVID-19 ring-based post-exposure prophylaxis trial: qualitative examination of participant experiences.","authors":"Julien Brisson, Rebecca Balasa, Andrea Bowra, David C Hill, Aarti S Doshi, Darrell H S Tan, Amaya Perez-Brumer","doi":"10.1186/s12874-024-02394-0","DOIUrl":"10.1186/s12874-024-02394-0","url":null,"abstract":"Background: Ring-based studies are a novel research design commonly used for research involving infectious diseases: contacts of newly infected individuals form a ring that is targeted for interventions (e.g., vaccine, post-exposure prophylaxis). Given the novelty of the research design, it is critical to obtain feedback from participants on their experiences with ring-based studies to help with the development of future trials.Methods: In 2021, we conducted 26 semi-structured interviews with adult participants of a COVID-19 ring-based post-exposure prophylaxis trial based in Canada. We applied a purposive sampling approach and electronically recruited participants who tested positive for COVID-19 (Index Cases) and either agreed or declined for the study team to contact their potentially exposed contacts. We also included individuals who participated in the trial after being potentially exposed to an Index Case (known as Ring Members), and those who declined to participate after potential exposure. The methodological design of semi-structured interviews allowed participants to share their opinions and experiences in the trial (e.g., elements they enjoyed and disliked regarding their participation in the study).Results: The majority of participants in our study were women (62%) and the average age was 37.3 years (SD = 13.2). Overall, participants reported being highly satisfied with partaking in the ring-based trial. Notably, no substantial complaints were voiced about the trial's design involving contact after exposure. The most common reason of satisfaction was the knowledge of potentially helping others by advancing knowledge for a greater cause (e.g., development of potential treatment to prevent SARS-CoV-2 infection). Other reasons were curiosity about participating in a trial, and an activity to fill free time during the pandemic. A central element of dislike was confusion about instructions with the trial (e.g., independent at home SARS-CoV-2 testing). Additionally, maintaining confidentiality was a crucial concern for participants, who sought assurance that their data would not be shared beyond the scope of the study.Conclusions: Our results have the potential to inform future research, including clinical trials such as ring-based studies, by incorporating insights from participants' experiences into the development of study protocols. Despite some protocol-related challenges, participants expressed high satisfaction, driven by the desire to advance science and potentially aid others.","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"267"},"PeriodicalIF":3.9,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11536907/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142581959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Concordance between humans and GPT-4 in appraising the methodological quality of case reports and case series using the Murad tool. 使用 Murad 工具评估病例报告和系列病例的方法学质量时，人类与 GPT-4 之间的一致性。

IF 3.9 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

BMC Medical Research Methodology

Pub Date : 2024-11-04 DOI: 10.1186/s12874-024-02372-6

Zin Tarakji, Adel Kanaan, Samer Saadi, Mohammed Firwana, Adel Kabbara Allababidi, Mohamed F Abusalih, Rami Basmaci, Tamim I Rajjo, Zhen Wang, M Hassan Murad, Bashar Hasan

Background: Assessing the methodological quality of case reports and case series is challenging due to human judgment variability and time constraints. We evaluated the agreement in judgments between human reviewers and GPT-4 when applying a standard methodological quality assessment tool designed for case reports and series.

Methods: We searched Scopus for systematic reviews published in 2023-2024 that cited the appraisal tool by Murad et al. A GPT-4 based agent was developed to assess the methodological quality using the 8 signaling questions of the tool. Observed agreement and agreement coefficient were estimated comparing published judgments of human reviewers to GPT-4 assessment.

Results: We included 797 case reports and series. The observed agreement ranged between 41.91% and 80.93% across the eight questions (agreement coefficient ranged from 25.39 to 79.72%). The lowest agreement was noted in the first signaling question about selection of cases. The agreement was similar in articles published in journals with impact factor < 5 vs. ≥ 5, and when excluding systematic reviews that did not use 3 causality questions. Repeating the analysis using the same prompts demonstrated high agreement between the two GPT-4 attempts except for the first question about selection of cases.

Conclusions: The study demonstrates a moderate agreement between GPT-4 and human reviewers in assessing the methodological quality of case series and reports using the Murad tool. The current performance of GPT-4 seems promising but unlikely to be sufficient for the rigor of a systematic review and pairing the model with a human reviewer is required.

背景：由于人为判断的多变性和时间限制，评估病例报告和系列病例的方法学质量具有挑战性。我们评估了人类审稿人和 GPT-4 在应用专为病例报告和系列病例设计的标准方法学质量评估工具时的判断一致性：我们在 Scopus 上搜索了 2023-2024 年间发表的系统性综述，这些综述都引用了 Murad 等人的评估工具。将已发表的人类审稿人的判断与 GPT-4 评估进行比较，估算出观察到的一致性和一致性系数：结果：我们纳入了 797 份病例报告和系列报告。在八个问题中，观察到的一致性在 41.91% 到 80.93% 之间（一致性系数在 25.39% 到 79.72% 之间）。第一个信号问题 "病例的选择 "的一致性最低。在影响因子为结论的期刊上发表的文章的一致性相似：研究表明，在使用 Murad 工具评估病例系列和报告的方法学质量时，GPT-4 和人类审稿人之间的一致性适中。GPT-4 目前的表现似乎很有希望，但不太可能满足系统综述的严谨性要求，因此需要将该模型与人类审稿人配对使用。

{"title":"Concordance between humans and GPT-4 in appraising the methodological quality of case reports and case series using the Murad tool.","authors":"Zin Tarakji, Adel Kanaan, Samer Saadi, Mohammed Firwana, Adel Kabbara Allababidi, Mohamed F Abusalih, Rami Basmaci, Tamim I Rajjo, Zhen Wang, M Hassan Murad, Bashar Hasan","doi":"10.1186/s12874-024-02372-6","DOIUrl":"10.1186/s12874-024-02372-6","url":null,"abstract":"Background: Assessing the methodological quality of case reports and case series is challenging due to human judgment variability and time constraints. We evaluated the agreement in judgments between human reviewers and GPT-4 when applying a standard methodological quality assessment tool designed for case reports and series.Methods: We searched Scopus for systematic reviews published in 2023-2024 that cited the appraisal tool by Murad et al. A GPT-4 based agent was developed to assess the methodological quality using the 8 signaling questions of the tool. Observed agreement and agreement coefficient were estimated comparing published judgments of human reviewers to GPT-4 assessment.Results: We included 797 case reports and series. The observed agreement ranged between 41.91% and 80.93% across the eight questions (agreement coefficient ranged from 25.39 to 79.72%). The lowest agreement was noted in the first signaling question about selection of cases. The agreement was similar in articles published in journals with impact factor < 5 vs. ≥ 5, and when excluding systematic reviews that did not use 3 causality questions. Repeating the analysis using the same prompts demonstrated high agreement between the two GPT-4 attempts except for the first question about selection of cases.Conclusions: The study demonstrates a moderate agreement between GPT-4 and human reviewers in assessing the methodological quality of case series and reports using the Murad tool. The current performance of GPT-4 seems promising but unlikely to be sufficient for the rigor of a systematic review and pairing the model with a human reviewer is required.","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"266"},"PeriodicalIF":3.9,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11533388/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142575047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Incorporating external controls in the design of randomized clinical trials: a case study in solid tumors. 在随机临床试验设计中纳入外部控制：实体瘤案例研究。

IF 3.9 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

BMC Medical Research Methodology

Pub Date : 2024-11-01 DOI: 10.1186/s12874-024-02383-3

Emily M Damone, Jiawen Zhu, Herbert Pang, Xiao Li, Yinqi Zhao, Evan Kwiatkowski, Lisa A Carey, Joseph G Ibrahim

Background: The use of historical external control data in clinical trials has grown in interest and needs when considering the design of future trials. Hybrid control designs can be more efficient to achieve the same power with fewer patients and limited resources. The literature is sparse on appropriate statistical methods which can account for the differences between historical external controls and the control patients in a study. In this article, we illustrate the analysis framework of a clinical trial if a hybrid control design was used after determining an RCT may not be feasible.

Methods: We utilize two previously completed RCTs in nonsquamous NSCLC and a nationwide electronic health record derived de-identified database as examples and compare 5 analysis methods on each trial, as well as a set of simulations to determine operating characteristics of such designs.

Results: In single trial estimation, the Case Weighted Adaptive Power Prior provided estimated treatment hazard ratios consistent with the original trial's conclusions with narrower confidence intervals. The simulation studies showed that the Case Weighted Adaptive Power Prior achieved the highest power (and well controlled type-1 error) across all 5 methods with consistent study sample size.

Conclusions: By following the proposed hybrid control framework, one can design a hybrid control trial transparently and accounting for differences between control groups while controlling type-1 error and still achieving efficiency gains from the additional contribution from external controls.

背景：在考虑未来试验设计时，临床试验中使用历史外部对照数据的兴趣和需求日益增长。混合对照设计可以更有效地利用更少的患者和有限的资源达到相同的效果。目前有关适当统计方法的文献很少，而这些方法可以解释研究中历史外部对照和对照患者之间的差异。在本文中，我们将说明在确定 RCT 不可行后，如果采用混合对照设计，临床试验的分析框架：方法：我们以之前完成的两项非鳞癌 NSCLC 的 RCT 和一个全国性的电子健康记录衍生去标识数据库为例，比较了每项试验的 5 种分析方法，并进行了一组模拟，以确定此类设计的运行特征：结果：在单一试验估算中，病例加权自适应功率先验法估算出的治疗危险比与原始试验结论一致，但置信区间较窄。模拟研究表明，在研究样本量一致的情况下，病例加权自适应功率先验法在所有 5 种方法中取得了最高的功率（并很好地控制了 1 类误差）：通过遵循所提出的混合控制框架，我们可以透明地设计混合控制试验，并考虑到控制组之间的差异，同时控制类型-1 误差，并从外部控制的额外贡献中获得效率收益。

{"title":"Incorporating external controls in the design of randomized clinical trials: a case study in solid tumors.","authors":"Emily M Damone, Jiawen Zhu, Herbert Pang, Xiao Li, Yinqi Zhao, Evan Kwiatkowski, Lisa A Carey, Joseph G Ibrahim","doi":"10.1186/s12874-024-02383-3","DOIUrl":"10.1186/s12874-024-02383-3","url":null,"abstract":"Background: The use of historical external control data in clinical trials has grown in interest and needs when considering the design of future trials. Hybrid control designs can be more efficient to achieve the same power with fewer patients and limited resources. The literature is sparse on appropriate statistical methods which can account for the differences between historical external controls and the control patients in a study. In this article, we illustrate the analysis framework of a clinical trial if a hybrid control design was used after determining an RCT may not be feasible.Methods: We utilize two previously completed RCTs in nonsquamous NSCLC and a nationwide electronic health record derived de-identified database as examples and compare 5 analysis methods on each trial, as well as a set of simulations to determine operating characteristics of such designs.Results: In single trial estimation, the Case Weighted Adaptive Power Prior provided estimated treatment hazard ratios consistent with the original trial's conclusions with narrower confidence intervals. The simulation studies showed that the Case Weighted Adaptive Power Prior achieved the highest power (and well controlled type-1 error) across all 5 methods with consistent study sample size.Conclusions: By following the proposed hybrid control framework, one can design a hybrid control trial transparently and accounting for differences between control groups while controlling type-1 error and still achieving efficiency gains from the additional contribution from external controls.","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"264"},"PeriodicalIF":3.9,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11529009/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142563712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Recruiting and retaining healthcare workers in Scotland to a longitudinal COVID-19 study: a descriptive analysis. 苏格兰医护人员的招聘和留用：COVID-19 纵向研究的描述性分析。

IF 3.9 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

BMC Medical Research Methodology

Pub Date : 2024-11-01 DOI: 10.1186/s12874-024-02380-6

Josie Mm Evans, Nicole Sergenson, Melanie Dembinsky, Lynne Haahr, Jen Bishop, Anna Howells, Katie Munro, Lesley Price

Background: Rapid timescales for the design and delivery of research were common during the COVID-19 pandemic. The recruitment and retention of healthcare workers (HCWs) as participants in research studies are notoriously challenging, but this was exacerbated during the pandemic by the unprecedented demand placed on the workforce. The SARS-CoV-2 Immunity and Reinfection Evaluation (SIREN study) is a prospective multicentre cohort study following HCWs in the UK. This paper discusses the strategies and challenges associated with recruitment and retention of HCW participants in Scotland.

Methods: There were 44,546 HCWs recruited to the SIREN study, of whom 6,285 were recruited by research teams at ten different research sites in Scotland between October 2020 and March 2021. Information on target and actual sample size, availability of resource, recruitment rate, and recruitment and engagement strategies by site was collated from SIREN study documentation and discussions with local key SIREN site staff. Individual-level data from 6,153 HCW participants with ongoing consent for all data usage were also collated, including socio-demographic data and information on withdrawal (in first year) and opt-in to a study extension after one year. Factors associated with these outcomes were explored in logistic regression analyses.

Results: Different recruitment strategies were used in each site according to local agreements, protocol and staff capacity, with the recruitment period ranging from 13 to 160 days. The locally-agreed recruitment target was met in four sites. The proportion of participants who withdrew in the first year ranged from 3.1 to 24.8% by site, while subsequent opt-in to a 12-month study extension ranged from 28.6 to 74.8%. The sites with the highest proportions of withdrawals were the same four sites with lowest proportions of opt-in. On an individual level, there was a lower level of retention among younger participants, and those from lower socio-economic backgrounds and minority ethnic groups.

Conclusions: Site-specific factors including research-readiness likely had a significant influence on recruitment and retention, more so than the specific recruitment or retention strategies employed. Independent of site factors, individual-level variables influenced recruitment and retention, suggesting targeted strategies may be needed to promote research engagement among particular socio-demographic groups.

背景：在 COVID-19 大流行期间，设计和开展研究的快速时间表很常见。招募和留住参与研究的医护人员（HCWs）是一项众所周知的挑战，但在大流行期间，对医护人员的需求空前高涨，加剧了这一挑战。SARS-CoV-2 免疫和再感染评估（SIREN 研究）是一项前瞻性多中心队列研究，研究对象是英国的医护人员。本文讨论了在苏格兰招募和留住高危卫生工作者参与者的相关策略和挑战：SIREN 研究共招募了 44,546 名高危职业妇女，其中 6,285 名是由苏格兰 10 个不同研究地点的研究团队在 2020 年 10 月至 2021 年 3 月期间招募的。有关各研究点的目标和实际样本量、可用资源、招募率以及招募和参与策略的信息均来自 SIREN 研究文件以及与当地主要 SIREN 研究点工作人员的讨论。此外，还整理了 6,153 名人机工程参与者的个人数据，其中包括社会人口学数据以及关于退出（第一年）和一年后选择延长研究的信息。通过逻辑回归分析探讨了与这些结果相关的因素：各研究机构根据当地协议、规程和人员能力采取了不同的招募策略，招募时间从 13 天到 160 天不等。有四个研究机构达到了当地商定的招募目标。各研究机构在第一年退出研究的参与者比例从3.1%到24.8%不等，而随后选择延长12个月研究的比例从28.6%到74.8%不等。退出比例最高的四个地点也是选择加入比例最低的四个地点。就个人而言，年轻参与者、社会经济背景较差者和少数民族群体的保留率较低：结论：研究机构的具体因素（包括研究准备程度）可能对招募和留住参与者有重大影响，其影响程度超过了所采用的具体招募或留住策略。与研究地点因素无关，个人层面的变量也会影响招募和留用情况，这表明可能需要采取有针对性的策略来促进特定社会人口群体参与研究。

{"title":"Recruiting and retaining healthcare workers in Scotland to a longitudinal COVID-19 study: a descriptive analysis.","authors":"Josie Mm Evans, Nicole Sergenson, Melanie Dembinsky, Lynne Haahr, Jen Bishop, Anna Howells, Katie Munro, Lesley Price","doi":"10.1186/s12874-024-02380-6","DOIUrl":"10.1186/s12874-024-02380-6","url":null,"abstract":"Background: Rapid timescales for the design and delivery of research were common during the COVID-19 pandemic. The recruitment and retention of healthcare workers (HCWs) as participants in research studies are notoriously challenging, but this was exacerbated during the pandemic by the unprecedented demand placed on the workforce. The SARS-CoV-2 Immunity and Reinfection Evaluation (SIREN study) is a prospective multicentre cohort study following HCWs in the UK. This paper discusses the strategies and challenges associated with recruitment and retention of HCW participants in Scotland.Methods: There were 44,546 HCWs recruited to the SIREN study, of whom 6,285 were recruited by research teams at ten different research sites in Scotland between October 2020 and March 2021. Information on target and actual sample size, availability of resource, recruitment rate, and recruitment and engagement strategies by site was collated from SIREN study documentation and discussions with local key SIREN site staff. Individual-level data from 6,153 HCW participants with ongoing consent for all data usage were also collated, including socio-demographic data and information on withdrawal (in first year) and opt-in to a study extension after one year. Factors associated with these outcomes were explored in logistic regression analyses.Results: Different recruitment strategies were used in each site according to local agreements, protocol and staff capacity, with the recruitment period ranging from 13 to 160 days. The locally-agreed recruitment target was met in four sites. The proportion of participants who withdrew in the first year ranged from 3.1 to 24.8% by site, while subsequent opt-in to a 12-month study extension ranged from 28.6 to 74.8%. The sites with the highest proportions of withdrawals were the same four sites with lowest proportions of opt-in. On an individual level, there was a lower level of retention among younger participants, and those from lower socio-economic backgrounds and minority ethnic groups.Conclusions: Site-specific factors including research-readiness likely had a significant influence on recruitment and retention, more so than the specific recruitment or retention strategies employed. Independent of site factors, individual-level variables influenced recruitment and retention, suggesting targeted strategies may be needed to promote research engagement among particular socio-demographic groups.","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"265"},"PeriodicalIF":3.9,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11529010/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142563718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Variability of relative treatment effect among populations with low, moderate and high control group event rates: a meta-epidemiological study. 低、中、高对照组事件发生率人群中相对治疗效果的差异：一项荟萃流行病学研究。

IF 3.9 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

BMC Medical Research Methodology

Pub Date : 2024-11-01 DOI: 10.1186/s12874-024-02388-y

M Hassan Murad, Zhen Wang, Mengli Xiao, Haitao Chu, Lifeng Lin

Background: The current practice in guideline development is to use the control group event rate (CR) as a surrogate of baseline risk and to assume portability of the relative treatment effect across populations with low, moderate and high baseline risk. We sought to emulate this practice in a very large sample of meta-analyses.

Methods: We retrieved data from all meta-analyses published in the Cochrane Database of Systematic Reviews (2003-2020) that evaluated a binary outcome, reported 2 × 2 data for each individual study and included at least 4 studies. We excluded studies with no events. We conducted meta-analyses with odds ratios and relative risks and performed subgroup analyses based on tertiles of CR. In sensitivity analyses, we evaluated the use of total event rate (TR) instead of CR and using quartiles instead of tertiles.

Results: The analysis included 2,531 systematic reviews (27,692 meta-analyses, 226,975 studies, 25,669,783 patients).The percentages of meta-analyses with statistically significant interaction (P < 0.05) based on CR tertile or quartile ranged 12-18% across various sensitivity analyses. This percentage increased as the number of studies or range of CR per meta-analysis increased, reflecting increased power of the subgroup test. The percentages of meta-analyses with statistically significant interaction (P < 0.05) with TR quantiles were lower than those with CR but remained higher than expected by chance.

Conclusion: This analysis suggests that when CR or TR are used as surrogates for baseline risk, relative treatment effects may not be portable across populations with varying baseline risks in many meta-analyses. Categroization of the continuous CR variable and not addressing measurement error limit inferences from such analyses and imply that CR is an undesirable source for baseline risk. Guideline developers and decision-makers should be provided with relative and absolute treatment effects that are conditioned on the baseline risk or derived from studies with similar baseline risk to their target populations.

背景：指南制定的现行做法是使用对照组事件发生率（CR）作为基线风险的代用指标，并假定相对治疗效果在低、中、高基线风险人群中的可移植性。我们试图在大量的荟萃分析样本中效仿这一做法：我们检索了科克伦系统综述数据库（2003-2020 年）中发表的所有荟萃分析数据，这些数据评估了二元结果，报告了每项研究的 2 × 2 数据，并至少纳入了 4 项研究。我们排除了无事件的研究。我们用几率比和相对风险进行了荟萃分析，并根据 CR 的 tertiles 进行了亚组分析。在敏感性分析中，我们评估了使用总事件率（TR）代替CR以及使用四分位数代替三分位数的情况：分析包括 2,531 篇系统综述（27,692 项元分析，226,975 项研究，25,669,783 名患者）：该分析表明，当使用 CR 或 TR 作为基线风险的替代指标时，在许多荟萃分析中，基线风险不同的人群的相对治疗效果可能并不可移植。对连续 CR 变量进行分类以及不解决测量误差问题限制了此类分析的推论，并暗示 CR 是基线风险的不理想来源。应向指南制定者和决策者提供以基线风险为条件的相对和绝对治疗效果，或从与目标人群基线风险相似的研究中得出的相对和绝对治疗效果。

{"title":"Variability of relative treatment effect among populations with low, moderate and high control group event rates: a meta-epidemiological study.","authors":"M Hassan Murad, Zhen Wang, Mengli Xiao, Haitao Chu, Lifeng Lin","doi":"10.1186/s12874-024-02388-y","DOIUrl":"10.1186/s12874-024-02388-y","url":null,"abstract":"Background: The current practice in guideline development is to use the control group event rate (CR) as a surrogate of baseline risk and to assume portability of the relative treatment effect across populations with low, moderate and high baseline risk. We sought to emulate this practice in a very large sample of meta-analyses.Methods: We retrieved data from all meta-analyses published in the Cochrane Database of Systematic Reviews (2003-2020) that evaluated a binary outcome, reported 2 × 2 data for each individual study and included at least 4 studies. We excluded studies with no events. We conducted meta-analyses with odds ratios and relative risks and performed subgroup analyses based on tertiles of CR. In sensitivity analyses, we evaluated the use of total event rate (TR) instead of CR and using quartiles instead of tertiles.Results: The analysis included 2,531 systematic reviews (27,692 meta-analyses, 226,975 studies, 25,669,783 patients).The percentages of meta-analyses with statistically significant interaction (P < 0.05) based on CR tertile or quartile ranged 12-18% across various sensitivity analyses. This percentage increased as the number of studies or range of CR per meta-analysis increased, reflecting increased power of the subgroup test. The percentages of meta-analyses with statistically significant interaction (P < 0.05) with TR quantiles were lower than those with CR but remained higher than expected by chance.Conclusion: This analysis suggests that when CR or TR are used as surrogates for baseline risk, relative treatment effects may not be portable across populations with varying baseline risks in many meta-analyses. Categroization of the continuous CR variable and not addressing measurement error limit inferences from such analyses and imply that CR is an undesirable source for baseline risk. Guideline developers and decision-makers should be provided with relative and absolute treatment effects that are conditioned on the baseline risk or derived from studies with similar baseline risk to their target populations.","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"263"},"PeriodicalIF":3.9,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11529075/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142563724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Bayesian additive regression trees for predicting childhood asthma in the CHILD cohort study. 贝叶斯加法回归树用于预测 CHILD 队列研究中的儿童哮喘。

IF 3.9 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

BMC Medical Research Methodology

Pub Date : 2024-11-01 DOI: 10.1186/s12874-024-02376-2

Mojtaba Ahmadiankalati, Himani Boury, Padmaja Subbarao, Wendy Lou, Zihang Lu

Background: Asthma is a heterogeneous disease that affects millions of children and adults. There is a lack of objective gold standard diagnosis that spans the ages; instead, diagnoses are made by clinician assessment based on a cluster of signs, symptoms and objective tests dependent on age. Yet, there is a clear morbidity associated with chronic asthma symptoms. Machine learning has become a popular tool to improve asthma diagnosis and classification. There is a paucity of literature on the use of Bayesian machine learning algorithms to predict asthma diagnosis in children. This paper develops a prediction model using the Bayesian additive regression trees (BART) and compares its performance to various machine learning algorithms in predicting the diagnosis of childhood asthma.

Methods: Clinically relevant variables collected at or before 3 years of age from 2794 participants in the CHILD Cohort Study were used to predict physician-diagnosed asthma at age 5. BART and six other commonly used machine learning algorithms, namely adaptive boosting, logistic regression, decision tree, neural network, random forest, and support vector machine were trained. Measures of performance including sensitivity, specificity, and area under the receiver operating characteristic (ROC) curve were calculated. The confidence intervals were calculated using Bootstrapping samples. Important predictors and interaction effects associated with asthma were also identified using BART.

Results: BART, logistic regression and random forest showed the highest area under the ROC curve compared to other machine learning algorithms. Based on BART, recurrent wheeze, respiratory infection and food sensitization at 3 years of age were the most important predictors. The three most important interaction effects were found to be interaction terms of respiratory infection at 3 years and recurrent wheezing at 3 years, maternal asthma and paternal asthma, and maternal wheezing and inhalant sensitization of child at 3 years.

Conclusions: BART demonstrated promising prediction performance when compared to other machine learning algorithms. Future research could validate the BART in an external cohort to evaluate its reliability and generalizability.

背景：哮喘是一种影响数百万儿童和成人的异质性疾病。目前缺乏跨年龄段的客观金标准诊断；相反，诊断是由临床医生根据一系列体征、症状和与年龄相关的客观测试进行评估得出的。然而，与慢性哮喘症状相关的发病率非常明显。机器学习已成为改善哮喘诊断和分类的流行工具。关于使用贝叶斯机器学习算法预测儿童哮喘诊断的文献还很少。本文利用贝叶斯加性回归树（BART）建立了一个预测模型，并比较了它与各种机器学习算法在预测儿童哮喘诊断方面的性能：方法：我们利用从 CHILD 队列研究的 2794 名参与者中收集到的 3 岁时或 3 岁前的临床相关变量来预测 5 岁时医生诊断的哮喘。对 BART 和其他六种常用的机器学习算法（即自适应提升、逻辑回归、决策树、神经网络、随机森林和支持向量机）进行了训练。计算了灵敏度、特异性和接收者操作特征曲线（ROC）下面积等性能指标。使用 Bootstrapping 样本计算置信区间。还使用 BART 确定了与哮喘相关的重要预测因素和交互效应：结果：与其他机器学习算法相比，BART、逻辑回归和随机森林的 ROC 曲线下面积最大。根据 BART，反复喘息、呼吸道感染和 3 岁时食物过敏是最重要的预测因素。3岁时呼吸道感染与3岁时反复喘息、母亲哮喘与父亲哮喘、母亲喘息与3岁时儿童吸入物过敏的交互项是三个最重要的交互效应：与其他机器学习算法相比，BART具有良好的预测性能。未来的研究可以在外部队列中验证 BART，以评估其可靠性和可推广性。

{"title":"Bayesian additive regression trees for predicting childhood asthma in the CHILD cohort study.","authors":"Mojtaba Ahmadiankalati, Himani Boury, Padmaja Subbarao, Wendy Lou, Zihang Lu","doi":"10.1186/s12874-024-02376-2","DOIUrl":"10.1186/s12874-024-02376-2","url":null,"abstract":"Background: Asthma is a heterogeneous disease that affects millions of children and adults. There is a lack of objective gold standard diagnosis that spans the ages; instead, diagnoses are made by clinician assessment based on a cluster of signs, symptoms and objective tests dependent on age. Yet, there is a clear morbidity associated with chronic asthma symptoms. Machine learning has become a popular tool to improve asthma diagnosis and classification. There is a paucity of literature on the use of Bayesian machine learning algorithms to predict asthma diagnosis in children. This paper develops a prediction model using the Bayesian additive regression trees (BART) and compares its performance to various machine learning algorithms in predicting the diagnosis of childhood asthma.Methods: Clinically relevant variables collected at or before 3 years of age from 2794 participants in the CHILD Cohort Study were used to predict physician-diagnosed asthma at age 5. BART and six other commonly used machine learning algorithms, namely adaptive boosting, logistic regression, decision tree, neural network, random forest, and support vector machine were trained. Measures of performance including sensitivity, specificity, and area under the receiver operating characteristic (ROC) curve were calculated. The confidence intervals were calculated using Bootstrapping samples. Important predictors and interaction effects associated with asthma were also identified using BART.Results: BART, logistic regression and random forest showed the highest area under the ROC curve compared to other machine learning algorithms. Based on BART, recurrent wheeze, respiratory infection and food sensitization at 3 years of age were the most important predictors. The three most important interaction effects were found to be interaction terms of respiratory infection at 3 years and recurrent wheezing at 3 years, maternal asthma and paternal asthma, and maternal wheezing and inhalant sensitization of child at 3 years.Conclusions: BART demonstrated promising prediction performance when compared to other machine learning algorithms. Future research could validate the BART in an external cohort to evaluate its reliability and generalizability.","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"262"},"PeriodicalIF":3.9,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11529447/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142563703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Comparing methods for risk prediction of multicategory outcomes: dichotomized logistic regression vs. multinomial logit regression. 比较多类别结果的风险预测方法：二分法逻辑回归与多项式逻辑回归。

IF 3.9 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

BMC Medical Research Methodology

Pub Date : 2024-10-31 DOI: 10.1186/s12874-024-02389-x

Lei Li, Matthew A Rysavy, Georgiy Bobashev, Abhik Das

Background: Medical outcomes of interest to clinicians may have multiple categories. Researchers face several options for risk prediction of such outcomes, including dichotomized logistic regression and multinomial logit regression modeling. We aimed to compare these methods and provide guidance needed for practice.

Methods: We described dichotomized logistic regression, multinomial continuation-ratio logit regression, which is an alternative to standard multinomial logit regression for ordinal outcomes, and logistic competing risks regression. We then applied these methods to develop prediction models of survival and neurodevelopmental outcomes based on the NICHD Extremely Preterm Birth Outcome Tool model. The statistical and practical advantages and flaws of these methods were examined. Both discrimination and calibration of the estimated logistic models of dichotomized outcomes and continuation-ratio logit model were assessed.

Results: The dichotomized logistic models and multinomial continuation-ratio logit model had similar discrimination and calibration in predicting death and survival without neurodevelopmental impairment. But the continuation-ratio logit model had better discrimination and calibration in predicting neurodevelopmental impairment. The sum of predicted probabilities of outcome categories from the dichotomized logistic models could deviate from 100% substantially, ranging from 87.7 to 124.0%, and the dichotomized logistic model of neurodevelopmental impairment greatly overpredicted low risks and underpredicted high risks.

Conclusions: Estimating multiple logistic regression models of dichotomized outcomes may result in poorly calibrated predictions for an outcome with multiple ordinal categories. Multinomial continuation-ratio logit regression produces better calibrated predictions, constrains the sum of predicted probabilities to 100%, and has the advantages of simplicity in model interpretation, flexibility to include outcome category-specific predictors and random-effect terms for patient heterogeneity by hospital. It also accounts for mutual dependence among multiple categories and accommodates competing risks.

背景：临床医生感兴趣的医疗结果可能有多个类别。研究人员面临着对此类结果进行风险预测的几种选择，包括二分法逻辑回归和多项式逻辑回归模型。我们旨在对这些方法进行比较，并提供实践所需的指导：我们介绍了二分法逻辑回归、多二项延续比逻辑回归（这是针对序数结果的标准多二项逻辑回归的替代方法）和逻辑竞争风险回归。然后，我们应用这些方法开发了基于美国国家儿童疾病防治中心极早产儿结局工具模型的存活率和神经发育结局预测模型。我们研究了这些方法在统计学和实用性方面的优势和缺陷。对估计的二分法结果逻辑模型和延续比逻辑模型的区分度和校准进行了评估：结果：在预测无神经发育障碍的死亡和存活率方面，二分法逻辑模型和多二项式延续比对数模型具有相似的辨别力和校准性。但连续比对数模型在预测神经发育障碍方面具有更好的区分度和校准性。二分法逻辑模型对结果类别的预测概率之和与100%的偏差很大，从87.7%到124.0%不等，神经发育障碍的二分法逻辑模型大大高估了低风险，低估了高风险：结论：估计二分法结果的多重逻辑回归模型可能会导致对具有多个序数类别的结果的预测校准不良。多项式延续比 logit 回归能得出更好的校准预测结果，将预测概率之和限制在 100%，并且具有模型解释简单、可灵活纳入结果类别特异性预测因子和医院患者异质性随机效应项等优点。它还考虑了多个类别之间的相互依赖性，并适应竞争风险。

{"title":"Comparing methods for risk prediction of multicategory outcomes: dichotomized logistic regression vs. multinomial logit regression.","authors":"Lei Li, Matthew A Rysavy, Georgiy Bobashev, Abhik Das","doi":"10.1186/s12874-024-02389-x","DOIUrl":"10.1186/s12874-024-02389-x","url":null,"abstract":"Background: Medical outcomes of interest to clinicians may have multiple categories. Researchers face several options for risk prediction of such outcomes, including dichotomized logistic regression and multinomial logit regression modeling. We aimed to compare these methods and provide guidance needed for practice.Methods: We described dichotomized logistic regression, multinomial continuation-ratio logit regression, which is an alternative to standard multinomial logit regression for ordinal outcomes, and logistic competing risks regression. We then applied these methods to develop prediction models of survival and neurodevelopmental outcomes based on the NICHD Extremely Preterm Birth Outcome Tool model. The statistical and practical advantages and flaws of these methods were examined. Both discrimination and calibration of the estimated logistic models of dichotomized outcomes and continuation-ratio logit model were assessed.Results: The dichotomized logistic models and multinomial continuation-ratio logit model had similar discrimination and calibration in predicting death and survival without neurodevelopmental impairment. But the continuation-ratio logit model had better discrimination and calibration in predicting neurodevelopmental impairment. The sum of predicted probabilities of outcome categories from the dichotomized logistic models could deviate from 100% substantially, ranging from 87.7 to 124.0%, and the dichotomized logistic model of neurodevelopmental impairment greatly overpredicted low risks and underpredicted high risks.Conclusions: Estimating multiple logistic regression models of dichotomized outcomes may result in poorly calibrated predictions for an outcome with multiple ordinal categories. Multinomial continuation-ratio logit regression produces better calibrated predictions, constrains the sum of predicted probabilities to 100%, and has the advantages of simplicity in model interpretation, flexibility to include outcome category-specific predictors and random-effect terms for patient heterogeneity by hospital. It also accounts for mutual dependence among multiple categories and accommodates competing risks.","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"261"},"PeriodicalIF":3.9,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11526521/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142557124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Barriers and facilitators to conducting randomised controlled trials within routine care of neurorehabilitation centres: a qualitative study. 在神经康复中心的常规护理中开展随机对照试验的障碍和促进因素：一项定性研究。

IF 3.9 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

BMC Medical Research Methodology

Pub Date : 2024-10-30 DOI: 10.1186/s12874-024-02386-0

Isabella Hotz, Nathalie Ernst, Christian Brenneis, Gudrun Diermayr, Barbara Seebacher

Background: Randomised controlled trials (RCTs) are considered the gold standard for generating clinical evidence. The focus on high internal validity in RCTs challenges the external validity and generalisability of findings, potentially hindering their application in routine care. In neurorehabilitation, limited literature addresses conducting RCTs feasibly and efficiently. We investigated barriers and facilitators to conducting RCTs within routine care of neurorehabilitation centres from the perspective of stakeholders in neurorehabilitation in Germany and Austria.

Methods: We conducted semi-structured interviews with stakeholders in neurorehabilitation from four centres in Germany and Austria, informed by the Theoretical Domains Framework (TDF) and the Capability, Opportunity, Motivation and Behaviour model (COM-B). Employing a hybrid approach, the interview analysis integrated both deductive, theory-driven analysis based on the TDF domains and COM-B model and inductive, reflexive thematic analysis.

Results: Twelve stakeholders (4 physicians, 4 therapy managers, 4 therapists; 5 females, 7 males; with research experience spanning 0-40 years) were interviewed. Key barriers to conducting RCTs in neurological rehabilitation centres include limited financial, human, and time resources, high clinical workloads, and a lack of interest of some therapists. Ineffective leadership, perceived lack of research expertise, and communication issues were also significant barriers. Social influence factors such as lack of employer support and inadequate training access further contributed to the challenges. Additionally, barriers included insufficient research infrastructure, limited space, internal power struggles, and rigid cost bearer specifications. Key facilitators included physicians' and therapists' motivation to advance the field, contribute to knowledge, and to prioritise patient health. Support from supervisors, joint decision-making, and efficient organisation were crucial facilitators. Flexible therapy planning, mutual support, and interdisciplinary collaboration also played important roles.

Conclusion: Our results suggest that increasing professional development and understanding, along with providing adequate financial, human, time, and spatial resources to support research endeavours, implementing effective communication strategies to enhance interdisciplinary collaboration and coordination among team members may contribute to increased motivation and facilitate RCTs within the setting of neurorehabilitation centres.

Trial registration: This study was prospectively registered with the German Clinical Trials Register (08.04.2021 DRKSID DRKS00024982).

背景：随机对照试验（RCT）被认为是产生临床证据的黄金标准。随机对照试验注重高度的内部有效性，这就对研究结果的外部有效性和可推广性提出了挑战，可能会阻碍其在常规护理中的应用。在神经康复领域，有关可行、高效地开展 RCT 的文献十分有限。我们从德国和奥地利神经康复领域利益相关者的角度出发，调查了在神经康复中心常规护理中开展 RCT 的障碍和促进因素：我们以理论领域框架（TDF）和能力、机会、动机和行为模型（COM-B）为依据，对德国和奥地利四个神经康复中心的利益相关者进行了半结构化访谈。访谈分析采用混合方法，既有基于 TDF 领域和 COM-B 模型的演绎式理论驱动分析，也有归纳式反思主题分析：12 位相关人员（4 位医生、4 位治疗经理、4 位治疗师；5 位女性、7 位男性；研究经验跨度为 0-40 年）接受了访谈。在神经康复中心开展 RCT 的主要障碍包括财力、人力和时间资源有限，临床工作量大，以及一些治疗师缺乏兴趣。领导不力、缺乏研究专长和沟通问题也是主要障碍。缺乏雇主支持和培训机会不足等社会影响因素进一步加剧了这些挑战。此外，障碍还包括研究基础设施不足、空间有限、内部权力斗争以及僵化的成本承担规格。主要的促进因素包括医生和治疗师推动该领域发展、贡献知识以及优先考虑患者健康的积极性。上司的支持、共同决策和高效的组织是重要的促进因素。灵活的治疗计划、相互支持和跨学科合作也发挥了重要作用：我们的研究结果表明，加强专业发展和理解，同时提供充足的财力、人力、时间和空间资源以支持研究工作，实施有效的沟通策略以加强跨学科合作和团队成员之间的协调，可能有助于提高积极性，并促进神经康复中心开展 RCT：本研究已在德国临床试验登记处进行了前瞻性登记（2021 年 4 月 8 日 DRKSID DRKS00024982）。

{"title":"Barriers and facilitators to conducting randomised controlled trials within routine care of neurorehabilitation centres: a qualitative study.","authors":"Isabella Hotz, Nathalie Ernst, Christian Brenneis, Gudrun Diermayr, Barbara Seebacher","doi":"10.1186/s12874-024-02386-0","DOIUrl":"10.1186/s12874-024-02386-0","url":null,"abstract":"Background: Randomised controlled trials (RCTs) are considered the gold standard for generating clinical evidence. The focus on high internal validity in RCTs challenges the external validity and generalisability of findings, potentially hindering their application in routine care. In neurorehabilitation, limited literature addresses conducting RCTs feasibly and efficiently. We investigated barriers and facilitators to conducting RCTs within routine care of neurorehabilitation centres from the perspective of stakeholders in neurorehabilitation in Germany and Austria.Methods: We conducted semi-structured interviews with stakeholders in neurorehabilitation from four centres in Germany and Austria, informed by the Theoretical Domains Framework (TDF) and the Capability, Opportunity, Motivation and Behaviour model (COM-B). Employing a hybrid approach, the interview analysis integrated both deductive, theory-driven analysis based on the TDF domains and COM-B model and inductive, reflexive thematic analysis.Results: Twelve stakeholders (4 physicians, 4 therapy managers, 4 therapists; 5 females, 7 males; with research experience spanning 0-40 years) were interviewed. Key barriers to conducting RCTs in neurological rehabilitation centres include limited financial, human, and time resources, high clinical workloads, and a lack of interest of some therapists. Ineffective leadership, perceived lack of research expertise, and communication issues were also significant barriers. Social influence factors such as lack of employer support and inadequate training access further contributed to the challenges. Additionally, barriers included insufficient research infrastructure, limited space, internal power struggles, and rigid cost bearer specifications. Key facilitators included physicians' and therapists' motivation to advance the field, contribute to knowledge, and to prioritise patient health. Support from supervisors, joint decision-making, and efficient organisation were crucial facilitators. Flexible therapy planning, mutual support, and interdisciplinary collaboration also played important roles.Conclusion: Our results suggest that increasing professional development and understanding, along with providing adequate financial, human, time, and spatial resources to support research endeavours, implementing effective communication strategies to enhance interdisciplinary collaboration and coordination among team members may contribute to increased motivation and facilitate RCTs within the setting of neurorehabilitation centres.Trial registration: This study was prospectively registered with the German Clinical Trials Register (08.04.2021 DRKSID DRKS00024982).","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"258"},"PeriodicalIF":3.9,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11523832/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142543590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0