首页 > 最新文献

Journal of Biopharmaceutical Statistics最新文献

英文 中文
The 2009 FDA PRO guidance, Potential Type I error, Descriptive Statistics and Pragmatic estimation of the number of interviews for item elicitation. 2009 年 FDA PRO 指南、潜在的 I 类错误、描述性统计和项目征询访谈次数的实用估算。
IF 1.2 4区 医学 Q4 PHARMACOLOGY & PHARMACY Pub Date : 2025-08-01 Epub Date: 2024-11-24 DOI: 10.1080/10543406.2024.2420642
Josh Fleckner, Chris Barker

A statistical methodology named "capture recapture", a Kaplan-Meier Summary Statistic, and an urn model framework are presented to describe the elicitation, then estimate both the number of interviews and the total number of items ("codes") that will be elicited during patient interviews, and present a summary graphical statistic that "saturation" has occurred. This methodology is developed to address a gap in the FDA 2009 PRO and 2012 PFDD guidance for determining the number of interviews (sample size). This estimate of the number of interviews (sample size) uses a two-step procedure. The estimate of the total number of items is then used to estimate the number of interviews to elicit all items. A framework called an urn model is a framework for describing the elicitation and demonstrate the algorithm for declaring saturation "first interview with zero new codes". A caveat emptor is that due to independence assumptions, the urn model is not used as a method for estimating probabilities. The URN model provides a framework to demonstrate that an algorithm such as "first interview with zero new codes" may establish that all codes have been elicited. The limitations of the Urn model, capture recapture, and Kaplan-Meier are summarized. The statistical methods and the estimates supplement but do not replace expert judgement and declaration of "saturation." A graphical summary statistic is presented to summarize "saturation," after expert declaration for two algorithms. An example of a capture-recapture estimate, using simulated data is provided. The example suggests that the estimate of total number of codes may be accurate when prepared as early as the second interview. A second simulation is presented with an URN model, under a strong assumption of independence that an algorithm such as 'first interview with zero new codes" may fail to identify all codes. Potential errors in declaration of saturation are presented. Recommendations are presented for additional research and the use of the algorithm "first interview with zero new codes."

本文介绍了一种名为 "捕获再捕获 "的统计方法、Kaplan-Meier 统计摘要和瓮模型框架,用于描述诱导过程,然后估算访谈次数和患者访谈期间将诱导出的项目("代码")总数,并以图形统计摘要的方式说明 "饱和 "已经发生。此方法的开发是为了弥补 FDA 2009 PRO 和 2012 PFDD 指南在确定访谈次数(样本大小)方面的不足。对访谈次数(样本量)的估算采用两步程序。首先估算项目总数,然后使用估算的项目总数估算获取所有项目的访谈次数。一个称为 "urn 模型 "的框架可用于描述诱导过程,并演示宣布 "首次访谈无新代码 "为饱和的算法。需要注意的是,由于存在独立性假设,瓮模型不能用作估计概率的方法。瓮模型提供了一个框架,可以证明 "首次访谈无新代码 "这样的算法可以确定所有代码都已引出。本文总结了瓮模型、捕获再捕获和 Kaplan-Meier 的局限性。统计方法和估算结果是对专家判断和 "饱和 "声明的补充,但不能取代专家判断和 "饱和 "声明。在专家宣布两种算法的 "饱和度 "后,提出了一个图解统计摘要。提供了一个使用模拟数据进行捕获-再捕获估算的例子。该示例表明,如果早在第二次访谈时就做好准备,对代码总数的估计可能是准确的。在 "第一次访谈无新代码 "等算法可能无法识别所有代码的独立性假设下,使用 URN 模型进行了第二次模拟。还介绍了在宣布饱和时可能出现的错误。提出了关于进一步研究和使用 "首次访谈零新代码 "算法的建议。
{"title":"The 2009 FDA PRO guidance, Potential Type I error, Descriptive Statistics and Pragmatic estimation of the number of interviews for item elicitation.","authors":"Josh Fleckner, Chris Barker","doi":"10.1080/10543406.2024.2420642","DOIUrl":"10.1080/10543406.2024.2420642","url":null,"abstract":"<p><p>A statistical methodology named \"capture recapture\", a Kaplan-Meier Summary Statistic, and an urn model framework are presented to describe the elicitation, then estimate both the number of interviews and the total number of items (\"codes\") that will be elicited during patient interviews, and present a summary graphical statistic that \"saturation\" has occurred. This methodology is developed to address a gap in the FDA 2009 PRO and 2012 PFDD guidance for determining the number of interviews (sample size). This estimate of the number of interviews (sample size) uses a two-step procedure. The estimate of the total number of items is then used to estimate the number of interviews to elicit all items. A framework called an urn model is a framework for describing the elicitation and demonstrate the algorithm for declaring saturation \"first interview with zero new codes\". A caveat emptor is that due to independence assumptions, the urn model is not used as a method for estimating probabilities. The URN model provides a framework to demonstrate that an algorithm such as \"first interview with zero new codes\" may establish that all codes have been elicited. The limitations of the Urn model, capture recapture, and Kaplan-Meier are summarized. The statistical methods and the estimates supplement but do not replace expert judgement and declaration of \"saturation.\" A graphical summary statistic is presented to summarize \"saturation,\" after expert declaration for two algorithms. An example of a capture-recapture estimate, using simulated data is provided. The example suggests that the estimate of total number of codes may be accurate when prepared as early as the second interview. A second simulation is presented with an URN model, under a strong assumption of independence that an algorithm such as 'first interview with zero new codes\" may fail to identify all codes. Potential errors in declaration of saturation are presented. Recommendations are presented for additional research and the use of the algorithm \"first interview with zero new codes.\"</p>","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"872-887"},"PeriodicalIF":1.2,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142711180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Introduction to the special issue Advances in statistical methods for the assessment of patient-centered outcomes. 特刊导论以患者为中心的结果评估的统计方法进展。
IF 1.2 4区 医学 Q4 PHARMACOLOGY & PHARMACY Pub Date : 2025-08-01 Epub Date: 2025-05-15 DOI: 10.1080/10543406.2025.2472801
Jessica Roydhouse, Nunzio Camerlingo, Joseph C Cappelleri
{"title":"Introduction to the special issue <i>Advances in statistical methods for the assessment of patient-centered outcomes</i>.","authors":"Jessica Roydhouse, Nunzio Camerlingo, Joseph C Cappelleri","doi":"10.1080/10543406.2025.2472801","DOIUrl":"10.1080/10543406.2025.2472801","url":null,"abstract":"","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"777-781"},"PeriodicalIF":1.2,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144081629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Weighted sum and order statistics methods for dynamic information borrowing in basket trials. 篮子试验中动态信息借用的加权和序统计方法。
IF 1.2 4区 医学 Q4 PHARMACOLOGY & PHARMACY Pub Date : 2025-08-01 DOI: 10.1080/10543406.2025.2537088
Cheng Huang, Chenghao Chu, Yimeng Lu, Bingming Yi, Ming-Hui Chen

In basket trials, the same investigational therapy is studied on multiple sub-populations simultaneously under a single protocol. The goal of basket trials is to identify the sub-populations in which the therapy is effective. Basket trials have become a popular and generally accepted study design in disease areas including but not limited to oncology and rare diseases, for their advantages in operation and ethical considerations. Extensive research work on information borrowing has been conducted to explore the statistical efficiency in basket trials. In this paper, two novel frequentist methods for basket trials are proposed. The first method borrows information to minimize the mean squared errors in the treatment effect estimation. The second method uses information across all baskets to optimize the multiple testing task in detecting the treatment effects in each basket. Extensive simulation studies show that the proposed methods substantially improved statistical efficiency in basket trials while limiting family-wise error rate inflation. Both methods can be implemented with common statistical models with or without adjustment for covariates.

在篮子试验中,在单一方案下同时对多个亚群体研究相同的研究性治疗。篮子试验的目标是确定治疗有效的亚人群。篮子试验因其在操作和伦理方面的优势,已成为包括但不限于肿瘤和罕见疾病在内的疾病领域流行和普遍接受的研究设计。为了探索篮子试验的统计效率,开展了大量的信息借用研究工作。本文提出了筐试验的两种新的频域方法。第一种方法是利用信息最小化处理效果估计中的均方误差。第二种方法使用所有篮子中的信息来优化检测每个篮子中的处理效果的多重测试任务。大量的模拟研究表明,所提出的方法大大提高了篮子试验的统计效率,同时限制了家庭误差率膨胀。这两种方法都可以用有或没有协变量调整的常见统计模型来实现。
{"title":"Weighted sum and order statistics methods for dynamic information borrowing in basket trials.","authors":"Cheng Huang, Chenghao Chu, Yimeng Lu, Bingming Yi, Ming-Hui Chen","doi":"10.1080/10543406.2025.2537088","DOIUrl":"https://doi.org/10.1080/10543406.2025.2537088","url":null,"abstract":"<p><p>In basket trials, the same investigational therapy is studied on multiple sub-populations simultaneously under a single protocol. The goal of basket trials is to identify the sub-populations in which the therapy is effective. Basket trials have become a popular and generally accepted study design in disease areas including but not limited to oncology and rare diseases, for their advantages in operation and ethical considerations. Extensive research work on information borrowing has been conducted to explore the statistical efficiency in basket trials. In this paper, two novel frequentist methods for basket trials are proposed. The first method borrows information to minimize the mean squared errors in the treatment effect estimation. The second method uses information across all baskets to optimize the multiple testing task in detecting the treatment effects in each basket. Extensive simulation studies show that the proposed methods substantially improved statistical efficiency in basket trials while limiting family-wise error rate inflation. Both methods can be implemented with common statistical models with or without adjustment for covariates.</p>","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"1-20"},"PeriodicalIF":1.2,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144762352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of Clinically Meaningful Change on Patient-Reported Outcomes: Renewed Insights About Covariate Adjustment. 分析患者报告结果的临床意义变化:关于协变量调整的新见解。
IF 1.2 4区 医学 Q4 PHARMACOLOGY & PHARMACY Pub Date : 2025-08-01 Epub Date: 2023-08-01 DOI: 10.1080/10543406.2023.2237115
Joseph C Cappelleri, Paul R Cislo

Determining clinically meaningful change (CMC) in a patient-reported (PRO) measure is central to its existence in gauging how patients feel and function, especially for evaluating a treatment effect. Anchor-based approaches are recommended to estimate a CMC threshold on a PRO measure. Determination of CMC involves linking changes or differences in the target PRO measure to that in an external (anchor) measure that is easier to interpret than and appreciably associated with the PRO measure. One type of anchor-based approach for CMC is the "mean change method" where the mean change in score of the target PRO measure within a particular anchor transition level (e.g. one-category improvement) is subtracted from the mean change in score of within an adjacent anchor category (e.g. no change category). In the literature, the mean change method has been applied with and without an adjustment for the baseline scores for the PRO of interest. This article provides the analytic rationale and conceptual justification for keeping the analysis unadjusted and not controlling for baseline PRO scores. Two illustrative examples are highlighted. The current research is essentially a variation of Lord's paradox (where whether to adjust for a baseline variable depends on the research question) placed in a new context. Once the adjustment is made, the resulting CMC estimate reflects an artificial case where the anchor transition levels are forced to have the same average baseline PRO score. The unadjusted estimate acknowledges that the anchor transition levels are naturally occurring (not randomized) groups and thus maintains external validity.

在患者报告(PRO)测量中确定临床有意义的变化(CMC)是衡量患者感觉和功能的核心,特别是评估治疗效果。建议使用基于锚点的方法来估计PRO测量的CMC阈值。CMC的确定涉及将目标PRO测量的变化或差异与外部(锚定)测量的变化或差异联系起来,外部(锚定)测量比PRO测量更容易解释并明显与PRO测量相关。CMC的一种基于锚点的方法是“平均变化法”,即在特定锚点过渡水平(例如一类改进)内的目标PRO测量得分的平均变化从相邻锚点类别(例如无变化类别)内的得分平均变化中减去。在文献中,对于感兴趣的PRO的基线分数,在有或没有调整的情况下都应用了平均变化法。本文提供了保持分析不调整和不控制基线PRO分数的分析原理和概念上的理由。重点介绍了两个说明性的例子。当前的研究本质上是洛德悖论(是否调整基线变量取决于研究问题)在新背景下的变化。一旦进行调整,所得到的CMC估计反映了一个人为的情况,即锚点过渡水平被迫具有相同的平均基线PRO分数。未经调整的估计承认锚点过渡水平是自然发生的(不是随机的)群体,因此保持外部有效性。
{"title":"Analysis of Clinically Meaningful Change on Patient-Reported Outcomes: Renewed Insights About Covariate Adjustment.","authors":"Joseph C Cappelleri, Paul R Cislo","doi":"10.1080/10543406.2023.2237115","DOIUrl":"10.1080/10543406.2023.2237115","url":null,"abstract":"<p><p>Determining clinically meaningful change (CMC) in a patient-reported (PRO) measure is central to its existence in gauging how patients feel and function, especially for evaluating a treatment effect. Anchor-based approaches are recommended to estimate a CMC threshold on a PRO measure. Determination of CMC involves linking changes or differences in the target PRO measure to that in an external (anchor) measure that is easier to interpret than and appreciably associated with the PRO measure. One type of anchor-based approach for CMC is the \"mean change method\" where the mean change in score of the target PRO measure within a particular anchor transition level (e.g. one-category improvement) is subtracted from the mean change in score of within an adjacent anchor category (e.g. no change category). In the literature, the mean change method has been applied with and without an adjustment for the baseline scores for the PRO of interest. This article provides the analytic rationale and conceptual justification for keeping the analysis unadjusted and not controlling for baseline PRO scores. Two illustrative examples are highlighted. The current research is essentially a variation of Lord's paradox (where whether to adjust for a baseline variable depends on the research question) placed in a new context. Once the adjustment is made, the resulting CMC estimate reflects an artificial case where the anchor transition levels are forced to have the same average baseline PRO score. The unadjusted estimate acknowledges that the anchor transition levels are naturally occurring (not randomized) groups and thus maintains external validity.</p>","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"812-825"},"PeriodicalIF":1.2,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10269465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction. 修正。
IF 1.2 4区 医学 Q4 PHARMACOLOGY & PHARMACY Pub Date : 2025-08-01 Epub Date: 2025-07-09 DOI: 10.1080/10543406.2025.2529762
{"title":"Correction.","authors":"","doi":"10.1080/10543406.2025.2529762","DOIUrl":"10.1080/10543406.2025.2529762","url":null,"abstract":"","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"i"},"PeriodicalIF":1.2,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144592979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recurrent neural networks and attention scores for personalized prediction and interpretation of patient-reported outcomes. 递归神经网络和注意力评分用于个性化预测和解释患者报告的结果。
IF 1.2 4区 医学 Q4 PHARMACOLOGY & PHARMACY Pub Date : 2025-08-01 Epub Date: 2025-03-13 DOI: 10.1080/10543406.2025.2469884
Jinxiang Hu, Mohsen Nayebi Kerdabadi, Xiaohang Mei, Joseph Cappelleri, Richard Barohn, Zijun Yao

We proposed an Interpretable Personalized Artificial Intelligence (AI) model for PRO measures via Recurrent Neural Networks (RNN) and attention scores, with data from an open label randomized clinical trial of pain in 402 participants with cryptogenic sensory polyneuropathy at 40 neurology care clinics. All patients were assigned to four treatment groups: nortriptyline, duloxetine, pregabalin, and mexiletine. Each patient had 4 PRO measures (quality of life SF-12; PROMIS: pain interference, fatigue, sleep disturbance) at 4 time points (baseline, week 4, week 8, and week 12). We included 201 patients without missing values. Participants were 30 years or older and 106 (52.7%) were men, majority were White (164, 81.6%). We fitted an RNN model with attention scores to the data to predict the PROMIS pain interference score. We evaluated the model performance with Mean Absolute Error (MAE) and R-square statistics. We also used attention scores to explain the global variable importance at model level, and at individual level for each patient. The best predictor of pain score was the SF-12 item (physical and emotional health interfere with social activities) and fatigue item (push yourself to get things done), the biggest drug-level contributor was mexiletine, the biggest time-level contributor was week 12. Overall, the model fit well (MAE = 3.7, R2 = 63%). Attention-RNN is a feasible and reliable model for predicting PRO outcomes utilizing longitudinal data, such as pain, and can provide personalized individual level interpretation.

我们提出了一个可解释的个性化人工智能(AI)模型,通过循环神经网络(RNN)和注意力评分来测量PRO,数据来自40家神经病学护理诊所的402名隐源性感觉多发性神经病患者的开放标签随机疼痛临床试验。所有患者被分配到四个治疗组:去甲替林、度洛西汀、普瑞巴林和美西汀。每位患者进行4项PRO测量(生活质量SF-12;承诺:疼痛干扰、疲劳、睡眠障碍)在4个时间点(基线、第4周、第8周和第12周)。我们纳入了201例无缺失值的患者。参与者年龄在30岁及以上,男性106人(52.7%),多数为白人(164人,81.6%)。我们对数据拟合了一个带有注意力分数的RNN模型来预测PROMIS疼痛干扰评分。我们用平均绝对误差(MAE)和r平方统计量来评估模型的性能。我们还使用注意力分数来解释模型水平和每个患者个体水平的全局变量重要性。疼痛评分的最佳预测因子是SF-12项目(身心健康对社交活动的干扰)和疲劳项目(强迫自己完成任务),药物水平的最大影响因子是美西汀,时间水平的最大影响因子是第12周。总体而言,模型拟合良好(MAE = 3.7, R2 = 63%)。注意- rnn是利用纵向数据(如疼痛)预测PRO结果的可行且可靠的模型,可以提供个性化的个体水平解释。
{"title":"Recurrent neural networks and attention scores for personalized prediction and interpretation of patient-reported outcomes.","authors":"Jinxiang Hu, Mohsen Nayebi Kerdabadi, Xiaohang Mei, Joseph Cappelleri, Richard Barohn, Zijun Yao","doi":"10.1080/10543406.2025.2469884","DOIUrl":"10.1080/10543406.2025.2469884","url":null,"abstract":"<p><p>We proposed an Interpretable Personalized Artificial Intelligence (AI) model for PRO measures via Recurrent Neural Networks (RNN) and attention scores, with data from an open label randomized clinical trial of pain in 402 participants with cryptogenic sensory polyneuropathy at 40 neurology care clinics. All patients were assigned to four treatment groups: nortriptyline, duloxetine, pregabalin, and mexiletine. Each patient had 4 PRO measures (quality of life SF-12; PROMIS: pain interference, fatigue, sleep disturbance) at 4 time points (baseline, week 4, week 8, and week 12). We included 201 patients without missing values. Participants were 30 years or older and 106 (52.7%) were men, majority were White (164, 81.6%). We fitted an RNN model with attention scores to the data to predict the PROMIS pain interference score. We evaluated the model performance with Mean Absolute Error (MAE) and R-square statistics. We also used attention scores to explain the global variable importance at model level, and at individual level for each patient. The best predictor of pain score was the SF-12 item (physical and emotional health interfere with social activities) and fatigue item (push yourself to get things done), the biggest drug-level contributor was mexiletine, the biggest time-level contributor was week 12. Overall, the model fit well (MAE = 3.7, R2 = 63%). Attention-RNN is a feasible and reliable model for predicting PRO outcomes utilizing longitudinal data, such as pain, and can provide personalized individual level interpretation.</p>","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"933-943"},"PeriodicalIF":1.2,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143626027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Use of Bayesian decision analysis to maximize value in patient-centered randomized clinical trials in Parkinson's disease. 使用贝叶斯决策分析,在以患者为中心的帕金森病随机临床试验中实现价值最大化。
IF 1.2 4区 医学 Q4 PHARMACOLOGY & PHARMACY Pub Date : 2025-08-01 Epub Date: 2023-03-02 DOI: 10.1080/10543406.2023.2170400
Shomesh E Chaudhuri, Zied Ben Chaouch, Brett Hauber, Brennan Mange, Mo Zhou, Stephanie Christopher, Dawn Bardot, Margaret Sheehan, Anne Donnelly, Lauren McLaughlin, Brittany Caldwell, Heather L Benz, Martin Ho, Anindita Saha, Katrina Gwinn, Murray Sheldon, Andrew W Lo

A fixed one-sided significance level of 5% is commonly used to interpret the statistical significance of randomized clinical trial (RCT) outcomes. While it is necessary to reduce the false positive rate, the threshold used could be chosen quantitatively and transparently to specifically reflect patient preferences regarding benefit-risk tradeoffs as well as other considerations. How can patient preferences be explicitly incorporated into RCTs in Parkinson's disease (PD), and what is the impact on statistical thresholds for device approval? In this analysis, we apply Bayesian decision analysis (BDA) to PD patient preference scores elicited from survey data. BDA allows us to choose a sample size (n) and significance level (α) that maximizes the overall expected value to patients of a balanced two-arm fixed-sample RCT, where the expected value is computed under both null and alternative hypotheses. For PD patients who had previously received deep brain stimulation (DBS) treatment, the BDA-optimal significance levels fell between 4.0% and 10.0%, similar to or greater than the traditional value of 5%. Conversely, for patients who had never received DBS, the optimal significance level ranged from 0.2% to 4.4%. In both of these populations, the optimal significance level increased with the severity of the patients' cognitive and motor function symptoms. By explicitly incorporating patient preferences into clinical trial designs and the regulatory decision-making process, BDA provides a quantitative and transparent approach to combine clinical and statistical significance. For PD patients who have never received DBS treatment, a 5% significance threshold may not be conservative enough to reflect their risk-aversion level. However, this study shows that patients who previously received DBS treatment present a higher tolerance to accept therapeutic risks in exchange for improved efficacy which is reflected in a higher statistical threshold.

在解释随机临床试验(RCT)结果的统计学意义时,通常采用 5%的固定单侧显著性水平。虽然有必要降低假阳性率,但可以定量、透明地选择所使用的阈值,以具体反映患者在获益与风险权衡方面的偏好以及其他考虑因素。如何将患者的偏好明确纳入帕金森病(PD)的 RCT 中,以及对器械审批的统计阈值有何影响?在本分析中,我们将贝叶斯决策分析(BDA)应用于从调查数据中得出的帕金森病患者偏好评分。通过贝叶斯决策分析,我们可以选择样本量(n)和显著性水平(α),使平衡双臂固定样本 RCT 对患者的总体预期值最大化,其中预期值是在零假设和备择假设下计算的。对于曾接受过脑深部刺激(DBS)治疗的帕金森病患者,BDA 最佳显著性水平介于 4.0% 和 10.0% 之间,类似于或大于传统的 5%。相反,对于从未接受过 DBS 治疗的患者,最佳显著性水平在 0.2% 到 4.4% 之间。在这两类人群中,最佳显著性水平随着患者认知和运动功能症状的严重程度而增加。通过将患者偏好明确纳入临床试验设计和监管决策过程,BDA 提供了一种定量、透明的方法,将临床意义和统计意义结合起来。对于从未接受过 DBS 治疗的帕金森病患者来说,5% 的显著性阈值可能不够保守,不足以反映他们的风险规避水平。然而,本研究表明,曾经接受过 DBS 治疗的患者接受治疗风险的容忍度更高,以换取更好的疗效,这反映在更高的统计阈值上。
{"title":"Use of Bayesian decision analysis to maximize value in patient-centered randomized clinical trials in Parkinson's disease.","authors":"Shomesh E Chaudhuri, Zied Ben Chaouch, Brett Hauber, Brennan Mange, Mo Zhou, Stephanie Christopher, Dawn Bardot, Margaret Sheehan, Anne Donnelly, Lauren McLaughlin, Brittany Caldwell, Heather L Benz, Martin Ho, Anindita Saha, Katrina Gwinn, Murray Sheldon, Andrew W Lo","doi":"10.1080/10543406.2023.2170400","DOIUrl":"10.1080/10543406.2023.2170400","url":null,"abstract":"<p><p>A fixed one-sided significance level of 5% is commonly used to interpret the statistical significance of randomized clinical trial (RCT) outcomes. While it is necessary to reduce the false positive rate, the threshold used could be chosen quantitatively and transparently to specifically reflect patient preferences regarding benefit-risk tradeoffs as well as other considerations. How can patient preferences be explicitly incorporated into RCTs in Parkinson's disease (PD), and what is the impact on statistical thresholds for device approval? In this analysis, we apply Bayesian decision analysis (BDA) to PD patient preference scores elicited from survey data. BDA allows us to choose a sample size (<math><mi>n</mi></math>) and significance level (<math><mi>α</mi></math>) that maximizes the overall expected value to patients of a balanced two-arm fixed-sample RCT, where the expected value is computed under both null and alternative hypotheses. For PD patients who had previously received deep brain stimulation (DBS) treatment, the BDA-optimal significance levels fell between 4.0% and 10.0%, similar to or greater than the traditional value of 5%. Conversely, for patients who had never received DBS, the optimal significance level ranged from 0.2% to 4.4%. In both of these populations, the optimal significance level increased with the severity of the patients' cognitive and motor function symptoms. By explicitly incorporating patient preferences into clinical trial designs and the regulatory decision-making process, BDA provides a quantitative and transparent approach to combine clinical and statistical significance. For PD patients who have never received DBS treatment, a 5% significance threshold may not be conservative enough to reflect their risk-aversion level. However, this study shows that patients who previously received DBS treatment present a higher tolerance to accept therapeutic risks in exchange for improved efficacy which is reflected in a higher statistical threshold.</p>","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"981-1000"},"PeriodicalIF":1.2,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10873547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CORRECTION. 更正。
IF 1.2 4区 医学 Q4 PHARMACOLOGY & PHARMACY Pub Date : 2025-08-01 Epub Date: 2023-09-16 DOI: 10.1080/10543406.2023.2258646
{"title":"CORRECTION.","authors":"","doi":"10.1080/10543406.2023.2258646","DOIUrl":"10.1080/10543406.2023.2258646","url":null,"abstract":"","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"1001-1002"},"PeriodicalIF":1.2,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10269100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reflections on estimands for patient-reported outcomes in cancer clinical trials. 对癌症临床试验中患者报告结果估计的思考。
IF 1.2 4区 医学 Q4 PHARMACOLOGY & PHARMACY Pub Date : 2025-08-01 Epub Date: 2023-11-19 DOI: 10.1080/10543406.2023.2280628
Rachael Lawrance, Konstantina Skaltsa, Antoine Regnault, Lysbeth Floden

It is common and important to include the patient's perspective of the impact of treatment on health-related quality of life (HRQoL) outcomes. In this commentary, we focus on applying the new addendum to ICH E9 guideline E9 (R1) relating to the estimand framework to Patient Reported Outcomes (PROs) collected in cancer clinical trials, from a statistician's viewpoint. Currently, common practice for statistical analysis of PRO endpoints of published cancer clinical trials demonstrates ambiguity, leaving critical questions unspecified, hindering conclusions about the effect of treatment on PRO endpoints as well as comparability between clinical trials. To avoid this scenario, we advocate the systematic use of the estimand framework which requires the prospective definition of clear PRO research questions. Among the five attributes of the estimands framework, the definition of the endpoint (what is the right PRO measure and timeframe to target and why?), the intercurrent event identification and management (what happens with PRO data post-disease progression, what is the impact of death?) and the population-level summary (what is an acceptable statistical summary for PRO data?) require the most attention for PRO estimands. We identify good practice and highlight discussion points including the challenges of statistical analysis in the presence of missing and/or unobservable data and in relation to death. Through this discussion we highlight that there is no "statistical magic", but that the estimand framework will help you find out what you really want to know when quantifying the benefit of treatments from the patients' perspective.

纳入患者对治疗对健康相关生活质量(HRQoL)结果影响的看法是常见且重要的。在这篇评论中,我们将从统计学家的角度,重点讨论如何应用ICH E9指南E9 (R1)的新附录,该附录与癌症临床试验中收集的患者报告结果(PROs)的估计框架有关。目前,对已发表的癌症临床试验的PRO终点进行统计分析的常见做法存在模糊性,使关键问题未明确,阻碍了对治疗对PRO终点的影响的结论以及临床试验之间的可比性。为了避免这种情况,我们提倡系统地使用评估框架,这需要对明确的PRO研究问题进行前瞻性定义。在估计框架的五个属性中,终点的定义(什么是正确的PRO测量和目标时间框架,为什么?)、并发事件的识别和管理(疾病进展后PRO数据发生了什么,死亡的影响是什么?)和人群水平的总结(什么是PRO数据可接受的统计总结?)需要对PRO估计最关注。我们确定了良好做法,并强调了讨论要点,包括在存在缺失和/或不可观察数据以及与死亡有关的情况下进行统计分析的挑战。通过这次讨论,我们强调没有“统计魔术”,但是估算框架将帮助您从患者的角度量化治疗的益处时找到您真正想知道的东西。
{"title":"Reflections on estimands for patient-reported outcomes in cancer clinical trials.","authors":"Rachael Lawrance, Konstantina Skaltsa, Antoine Regnault, Lysbeth Floden","doi":"10.1080/10543406.2023.2280628","DOIUrl":"10.1080/10543406.2023.2280628","url":null,"abstract":"<p><p>It is common and important to include the patient's perspective of the impact of treatment on health-related quality of life (HRQoL) outcomes. In this commentary, we focus on applying the new addendum to ICH E9 guideline E9 (R1) relating to the estimand framework to Patient Reported Outcomes (PROs) collected in cancer clinical trials, from a statistician's viewpoint. Currently, common practice for statistical analysis of PRO endpoints of published cancer clinical trials demonstrates ambiguity, leaving critical questions unspecified, hindering conclusions about the effect of treatment on PRO endpoints as well as comparability between clinical trials. To avoid this scenario, we advocate the systematic use of the estimand framework which requires the prospective definition of clear PRO research questions. Among the five attributes of the estimands framework, the definition of the endpoint (what is the right PRO measure and timeframe to target and why?), the intercurrent event identification and management (what happens with PRO data post-disease progression, what is the impact of death?) and the population-level summary (what is an acceptable statistical summary for PRO data?) require the most attention for PRO estimands. We identify good practice and highlight discussion points including the challenges of statistical analysis in the presence of missing and/or unobservable data and in relation to death. Through this discussion we highlight that there is no \"statistical magic\", but that the estimand framework will help you find out what you really want to know when quantifying the benefit of treatments from the patients' perspective.</p>","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"782-792"},"PeriodicalIF":1.2,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138048839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A demonstration of estimands and sensitivity analyses for time-to-deterioration of patient reported outcomes. [特刊 PRO]患者报告结果恶化时间的估计值和敏感性分析演示。
IF 1.2 4区 医学 Q4 PHARMACOLOGY & PHARMACY Pub Date : 2025-08-01 Epub Date: 2024-04-30 DOI: 10.1080/10543406.2024.2341649
Lysbeth Floden, Michael DeRosa, Jessica Roydhouse, Jennifer L Beaumont, Stacie Hudgens

In oncology trials, health-related quality of life (HRQoL), specifically patient-reported symptom burden and functional status, can support the interpretation of survival endpoints, such as progression-free survival. However, applying time-to-event endpoints to patient-reported outcomes (PRO) data is challenging. For example, in time-to-deterioration analyses clinical events such as disease progression are common in many settings and are often handled through censoring the patient at the time of occurrence; however, disease progression and HRQoL are often related leading to informative censoring. Special consideration to the definition of events and intercurrent events (ICEs) is necessary. In this work, we demonstrate time-to-deterioration of PRO estimands and sensitivity analyses to answer research questions using composite, hypothetical, and treatment policy strategies applied to a single endpoint of disease-related symptoms. Multiple imputation methods under both the missing-at-random and missing-not-at-random assumptions are used as sensitivity analyses of primary estimands. Hazard ratios ranged from 0.52 to 0.66 over all the estimands and sensitivity analyses modeling a robust treatment effect favoring the treatment in time to disease symptom deterioration or death. Differences in the estimands include how people who experience disease progression or discontinue the randomized treatment due to AEs are accounted for in the analysis. We use the estimand framework to define interpretable and principled approaches for different time-to-deterioration research questions and provide practical recommendations. Reporting the proportions of patient events and patient censoring by reason helps understand the mechanisms that drive the results, allowing for optimal interpretation.

在肿瘤学试验中,与健康相关的生活质量(HRQoL),特别是患者报告的症状负担和功能状态,可以为解释无进展生存期等生存终点提供支持。然而,将时间到事件终点应用于患者报告的结果(PRO)数据具有挑战性。例如,在从时间到恶化的分析中,疾病进展等临床事件在许多情况下都很常见,通常通过在发生时对患者进行剔除来处理;然而,疾病进展和 HRQoL 通常是相关的,这就导致了信息剔除。有必要对事件和并发事件(ICEs)的定义进行特别考虑。在这项工作中,我们展示了PRO估计值的恶化时间和敏感性分析,使用复合、假设和治疗策略来回答研究问题,这些策略适用于疾病相关症状的单一终点。在随机缺失和非随机缺失假设下的多重估算方法被用作主要估计指标的敏感性分析。所有估计指标的危险比在 0.52 到 0.66 之间,敏感性分析表明,在疾病症状恶化或死亡时间方面,治疗效果显著。估计指标的差异包括在分析中如何考虑疾病进展或因不良反应而中断随机治疗的患者。我们利用估计值框架为不同的恶化时间研究问题定义了可解释的原则性方法,并提供了实用建议。按原因报告患者事件和患者剔除的比例有助于了解结果的驱动机制,从而实现最佳解释。
{"title":"A demonstration of estimands and sensitivity analyses for time-to-deterioration of patient reported outcomes.","authors":"Lysbeth Floden, Michael DeRosa, Jessica Roydhouse, Jennifer L Beaumont, Stacie Hudgens","doi":"10.1080/10543406.2024.2341649","DOIUrl":"10.1080/10543406.2024.2341649","url":null,"abstract":"<p><p>In oncology trials, health-related quality of life (HRQoL), specifically patient-reported symptom burden and functional status, can support the interpretation of survival endpoints, such as progression-free survival. However, applying time-to-event endpoints to patient-reported outcomes (PRO) data is challenging. For example, in time-to-deterioration analyses clinical events such as disease progression are common in many settings and are often handled through censoring the patient at the time of occurrence; however, disease progression and HRQoL are often related leading to informative censoring. Special consideration to the definition of events and intercurrent events (ICEs) is necessary. In this work, we demonstrate time-to-deterioration of PRO estimands and sensitivity analyses to answer research questions using composite, hypothetical, and treatment policy strategies applied to a single endpoint of disease-related symptoms. Multiple imputation methods under both the missing-at-random and missing-not-at-random assumptions are used as sensitivity analyses of primary estimands. Hazard ratios ranged from 0.52 to 0.66 over all the estimands and sensitivity analyses modeling a robust treatment effect favoring the treatment in time to disease symptom deterioration or death. Differences in the estimands include how people who experience disease progression or discontinue the randomized treatment due to AEs are accounted for in the analysis. We use the estimand framework to define interpretable and principled approaches for different time-to-deterioration research questions and provide practical recommendations. Reporting the proportions of patient events and patient censoring by reason helps understand the mechanisms that drive the results, allowing for optimal interpretation.</p>","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"918-932"},"PeriodicalIF":1.2,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140857008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Biopharmaceutical Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1