首页 > 最新文献

Biostatistics最新文献

英文 中文
Semi-supervised mixture multi-source exchangeability model for leveraging real-world data in clinical trials. 用于在临床试验中利用真实世界数据的半监督混合多源可交换性模型。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad024
Lillian M F Haine, Thomas A Murry, Raquel Nahra, Giota Touloumi, Eduardo Fernández-Cruz, Kathy Petoumenos, Joseph S Koopmeiners

The traditional trial paradigm is often criticized as being slow, inefficient, and costly. Statistical approaches that leverage external trial data have emerged to make trials more efficient by augmenting the sample size. However, these approaches assume that external data are from previously conducted trials, leaving a rich source of untapped real-world data (RWD) that cannot yet be effectively leveraged. We propose a semi-supervised mixture (SS-MIX) multisource exchangeability model (MEM); a flexible, two-step Bayesian approach for incorporating RWD into randomized controlled trial analyses. The first step is a SS-MIX model on a modified propensity score and the second step is a MEM. The first step targets a representative subgroup of individuals from the trial population and the second step avoids borrowing when there are substantial differences in outcomes among the trial sample and the representative observational sample. When comparing the proposed approach to competing borrowing approaches in a simulation study, we find that our approach borrows efficiently when the trial and RWD are consistent, while mitigating bias when the trial and external data differ on either measured or unmeasured covariates. We illustrate the proposed approach with an application to a randomized controlled trial investigating intravenous hyperimmune immunoglobulin in hospitalized patients with influenza, while leveraging data from an external observational study to supplement a subgroup analysis by influenza subtype.

传统的试验模式经常被批评为缓慢、低效和昂贵。利用外部试验数据的统计方法应运而生,通过扩大样本量来提高试验效率。然而,这些方法假定外部数据来自以前进行的试验,这就留下了尚未有效利用的丰富的真实世界数据(RWD)来源。我们提出了一种半监督混合(SS-MIX)多源可交换性模型(MEM);这是一种灵活的两步贝叶斯方法,可将 RWD 纳入随机对照试验分析。第一步是基于修正倾向得分的 SS-MIX 模型,第二步是 MEM。第一步以试验人群中具有代表性的个体子群为目标,第二步在试验样本与具有代表性的观察样本的结果存在实质性差异时避免借用。在一项模拟研究中,我们将所提出的方法与其他借用方法进行了比较,发现当试验数据与 RWD 数据一致时,我们的方法能有效地进行借用,而当试验数据与外部数据在测量或非测量协变量上存在差异时,我们的方法则能减轻偏差。我们将所提出的方法应用于一项随机对照试验,调查流感住院患者静脉注射超敏免疫球蛋白的情况,同时利用外部观察研究的数据来补充按流感亚型进行的亚组分析。
{"title":"Semi-supervised mixture multi-source exchangeability model for leveraging real-world data in clinical trials.","authors":"Lillian M F Haine, Thomas A Murry, Raquel Nahra, Giota Touloumi, Eduardo Fernández-Cruz, Kathy Petoumenos, Joseph S Koopmeiners","doi":"10.1093/biostatistics/kxad024","DOIUrl":"10.1093/biostatistics/kxad024","url":null,"abstract":"<p><p>The traditional trial paradigm is often criticized as being slow, inefficient, and costly. Statistical approaches that leverage external trial data have emerged to make trials more efficient by augmenting the sample size. However, these approaches assume that external data are from previously conducted trials, leaving a rich source of untapped real-world data (RWD) that cannot yet be effectively leveraged. We propose a semi-supervised mixture (SS-MIX) multisource exchangeability model (MEM); a flexible, two-step Bayesian approach for incorporating RWD into randomized controlled trial analyses. The first step is a SS-MIX model on a modified propensity score and the second step is a MEM. The first step targets a representative subgroup of individuals from the trial population and the second step avoids borrowing when there are substantial differences in outcomes among the trial sample and the representative observational sample. When comparing the proposed approach to competing borrowing approaches in a simulation study, we find that our approach borrows efficiently when the trial and RWD are consistent, while mitigating bias when the trial and external data differ on either measured or unmeasured covariates. We illustrate the proposed approach with an application to a randomized controlled trial investigating intravenous hyperimmune immunoglobulin in hospitalized patients with influenza, while leveraging data from an external observational study to supplement a subgroup analysis by influenza subtype.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"617-632"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11247180/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10268326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint modeling in presence of informative censoring on the retrospective time scale with application to palliative care research. 在回顾性时间尺度上存在信息审查的情况下进行联合建模,并应用于姑息治疗研究。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad028
Quran Wu, Michael Daniels, Areej El-Jawahri, Marie Bakitas, Zhigang Li

Joint modeling of longitudinal data such as quality of life data and survival data is important for palliative care researchers to draw efficient inferences because it can account for the associations between those two types of data. Modeling quality of life on a retrospective from death time scale is useful for investigators to interpret the analysis results of palliative care studies which have relatively short life expectancies. However, informative censoring remains a complex challenge for modeling quality of life on the retrospective time scale although it has been addressed for joint models on the prospective time scale. To fill this gap, we develop a novel joint modeling approach that can address the challenge by allowing informative censoring events to be dependent on patients' quality of life and survival through a random effect. There are two sub-models in our approach: a linear mixed effect model for the longitudinal quality of life and a competing-risk model for the death time and dropout time that share the same random effect as the longitudinal model. Our approach can provide unbiased estimates for parameters of interest by appropriately modeling the informative censoring time. Model performance is assessed with a simulation study and compared with existing approaches. A real-world study is presented to illustrate the application of the new approach.

对生活质量数据和生存率数据等纵向数据进行联合建模,对于姑息治疗研究人员进行有效推断很重要,因为它可以解释这两类数据之间的关联。对死亡时间尺度的回顾性生活质量建模有助于研究人员解释预期寿命相对较短的姑息治疗研究的分析结果。然而,信息审查仍然是在回顾性时间尺度上建模生活质量的一个复杂挑战,尽管它已经在前瞻性时间尺度的联合模型中得到了解决。为了填补这一空白,我们开发了一种新的联合建模方法,通过允许信息审查事件通过随机效应依赖于患者的生活质量和生存率来应对这一挑战。我们的方法中有两个子模型:纵向生活质量的线性混合效应模型和死亡时间和辍学时间的竞争风险模型,它们与纵向模型具有相同的随机效应。我们的方法可以通过对信息审查时间进行适当建模,为感兴趣的参数提供无偏估计。模型性能通过模拟研究进行评估,并与现有方法进行比较。给出了一个真实世界的研究来说明新方法的应用。
{"title":"Joint modeling in presence of informative censoring on the retrospective time scale with application to palliative care research.","authors":"Quran Wu, Michael Daniels, Areej El-Jawahri, Marie Bakitas, Zhigang Li","doi":"10.1093/biostatistics/kxad028","DOIUrl":"10.1093/biostatistics/kxad028","url":null,"abstract":"<p><p>Joint modeling of longitudinal data such as quality of life data and survival data is important for palliative care researchers to draw efficient inferences because it can account for the associations between those two types of data. Modeling quality of life on a retrospective from death time scale is useful for investigators to interpret the analysis results of palliative care studies which have relatively short life expectancies. However, informative censoring remains a complex challenge for modeling quality of life on the retrospective time scale although it has been addressed for joint models on the prospective time scale. To fill this gap, we develop a novel joint modeling approach that can address the challenge by allowing informative censoring events to be dependent on patients' quality of life and survival through a random effect. There are two sub-models in our approach: a linear mixed effect model for the longitudinal quality of life and a competing-risk model for the death time and dropout time that share the same random effect as the longitudinal model. Our approach can provide unbiased estimates for parameters of interest by appropriately modeling the informative censoring time. Model performance is assessed with a simulation study and compared with existing approaches. A real-world study is presented to illustrate the application of the new approach.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"754-768"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11247190/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41161763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved fMRI-based pain prediction using Bayesian group-wise functional registration. 使用贝叶斯分组功能配准改进了基于fMRI的疼痛预测。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad026
Guoqing Wang, Abhirup Datta, Martin A Lindquist

In recent years, the field of neuroimaging has undergone a paradigm shift, moving away from the traditional brain mapping approach towards the development of integrated, multivariate brain models that can predict categories of mental events. However, large interindividual differences in both brain anatomy and functional localization after standard anatomical alignment remain a major limitation in performing this type of analysis, as it leads to feature misalignment across subjects in subsequent predictive models. This article addresses this problem by developing and validating a new computational technique for reducing misalignment across individuals in functional brain systems by spatially transforming each subject's functional data to a common latent template map. Our proposed Bayesian functional group-wise registration approach allows us to assess differences in brain function across subjects and individual differences in activation topology. We achieve the probabilistic registration with inverse-consistency by utilizing the generalized Bayes framework with a loss function for the symmetric group-wise registration. It models the latent template with a Gaussian process, which helps capture spatial features in the template, producing a more precise estimation. We evaluate the method in simulation studies and apply it to data from an fMRI study of thermal pain, with the goal of using functional brain activity to predict physical pain. We find that the proposed approach allows for improved prediction of reported pain scores over conventional approaches. Received on 2 January 2017. Editorial decision on 8 June 2021.

近年来,神经成像领域发生了范式转变,从传统的大脑映射方法转向开发能够预测各类心理事件的综合、多变量大脑模型。然而,标准解剖比对后,大脑解剖和功能定位的巨大个体差异仍然是进行此类分析的主要限制,因为这会导致后续预测模型中受试者之间的特征错位。本文通过开发和验证一种新的计算技术来解决这个问题,该技术通过将每个受试者的功能数据空间转换为一个公共的潜在模板图来减少大脑功能系统中个体之间的错位。我们提出的贝叶斯功能分组配准方法使我们能够评估受试者大脑功能的差异以及激活拓扑的个体差异。我们利用具有损失函数的广义贝叶斯框架实现了具有逆一致性的概率配准。它使用高斯过程对潜在模板进行建模,这有助于捕捉模板中的空间特征,从而产生更精确的估计。我们在模拟研究中评估了这种方法,并将其应用于热疼痛功能磁共振成像研究的数据,目的是利用大脑功能活动来预测身体疼痛。我们发现,与传统方法相比,所提出的方法可以改进对报告的疼痛评分的预测。2017年1月2日收到。2021年6月8日的编辑决定。
{"title":"Improved fMRI-based pain prediction using Bayesian group-wise functional registration.","authors":"Guoqing Wang, Abhirup Datta, Martin A Lindquist","doi":"10.1093/biostatistics/kxad026","DOIUrl":"10.1093/biostatistics/kxad026","url":null,"abstract":"<p><p>In recent years, the field of neuroimaging has undergone a paradigm shift, moving away from the traditional brain mapping approach towards the development of integrated, multivariate brain models that can predict categories of mental events. However, large interindividual differences in both brain anatomy and functional localization after standard anatomical alignment remain a major limitation in performing this type of analysis, as it leads to feature misalignment across subjects in subsequent predictive models. This article addresses this problem by developing and validating a new computational technique for reducing misalignment across individuals in functional brain systems by spatially transforming each subject's functional data to a common latent template map. Our proposed Bayesian functional group-wise registration approach allows us to assess differences in brain function across subjects and individual differences in activation topology. We achieve the probabilistic registration with inverse-consistency by utilizing the generalized Bayes framework with a loss function for the symmetric group-wise registration. It models the latent template with a Gaussian process, which helps capture spatial features in the template, producing a more precise estimation. We evaluate the method in simulation studies and apply it to data from an fMRI study of thermal pain, with the goal of using functional brain activity to predict physical pain. We find that the proposed approach allows for improved prediction of reported pain scores over conventional approaches. Received on 2 January 2017. Editorial decision on 8 June 2021.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"885-903"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41170769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An intersectional framework for counterfactual fairness in risk prediction. 风险预测中反事实公平性的交叉框架。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad021
Solvejg Wastvedt, Jared D Huling, Julian Wolfson

Along with the increasing availability of health data has come the rise of data-driven models to inform decision making and policy. These models have the potential to benefit both patients and health care providers but can also exacerbate health inequities. Existing "algorithmic fairness" methods for measuring and correcting model bias fall short of what is needed for health policy in two key ways. First, methods typically focus on a single grouping along which discrimination may occur rather than considering multiple, intersecting groups. Second, in clinical applications, risk prediction is typically used to guide treatment, creating distinct statistical issues that invalidate most existing techniques. We present novel unfairness metrics that address both challenges. We also develop a complete framework of estimation and inference tools for our metrics, including the unfairness value ("u-value"), used to determine the relative extremity of unfairness, and standard errors and confidence intervals employing an alternative to the standard bootstrap. We demonstrate application of our framework to a COVID-19 risk prediction model deployed in a major Midwestern health system.

随着健康数据的日益普及,为决策和政策提供信息的数据驱动模型也随之兴起。这些模型有可能使患者和医疗服务提供者受益,但也可能加剧健康不平等。现有的衡量和纠正模型偏差的 "算法公平性 "方法在两个关键方面无法满足卫生政策的需要。首先,这些方法通常只关注可能出现歧视的单一分组,而不是考虑多个交叉分组。其次,在临床应用中,风险预测通常用于指导治疗,这就产生了明显的统计问题,使大多数现有技术失效。我们提出了新的不公平度量方法来应对这两个挑战。我们还为我们的指标开发了一个完整的估算和推理工具框架,包括不公平值("u 值")(用于确定不公平的相对极值),以及标准误差和置信区间(采用标准自举法的替代方法)。我们展示了我们的框架在中西部一家大型医疗系统部署的 COVID-19 风险预测模型中的应用。
{"title":"An intersectional framework for counterfactual fairness in risk prediction.","authors":"Solvejg Wastvedt, Jared D Huling, Julian Wolfson","doi":"10.1093/biostatistics/kxad021","DOIUrl":"10.1093/biostatistics/kxad021","url":null,"abstract":"<p><p>Along with the increasing availability of health data has come the rise of data-driven models to inform decision making and policy. These models have the potential to benefit both patients and health care providers but can also exacerbate health inequities. Existing \"algorithmic fairness\" methods for measuring and correcting model bias fall short of what is needed for health policy in two key ways. First, methods typically focus on a single grouping along which discrimination may occur rather than considering multiple, intersecting groups. Second, in clinical applications, risk prediction is typically used to guide treatment, creating distinct statistical issues that invalidate most existing techniques. We present novel unfairness metrics that address both challenges. We also develop a complete framework of estimation and inference tools for our metrics, including the unfairness value (\"u-value\"), used to determine the relative extremity of unfairness, and standard errors and confidence intervals employing an alternative to the standard bootstrap. We demonstrate application of our framework to a COVID-19 risk prediction model deployed in a major Midwestern health system.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"702-717"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10499741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DP2LM: leveraging deep learning approach for estimation and hypothesis testing on mediation effects with high-dimensional mediators and complex confounders. DP2LM:利用深度学习方法对具有高维中介因素和复杂混杂因素的中介效应进行估计和假设检验。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad037
Shuoyang Wang, Yuan Huang

Traditional linear mediation analysis has inherent limitations when it comes to handling high-dimensional mediators. Particularly, accurately estimating and rigorously inferring mediation effects is challenging, primarily due to the intertwined nature of the mediator selection issue. Despite recent developments, the existing methods are inadequate for addressing the complex relationships introduced by confounders. To tackle these challenges, we propose a novel approach called DP2LM (Deep neural network-based Penalized Partially Linear Mediation). This approach incorporates deep neural network techniques to account for nonlinear effects in confounders and utilizes the penalized partially linear model to accommodate high dimensionality. Unlike most existing works that concentrate on mediator selection, our method prioritizes estimation and inference on mediation effects. Specifically, we develop test procedures for testing the direct and indirect mediation effects. Theoretical analysis shows that the tests maintain the Type-I error rate. In simulation studies, DP2LM demonstrates its superior performance as a modeling tool for complex data, outperforming existing approaches in a wide range of settings and providing reliable estimation and inference in scenarios involving a considerable number of mediators. Further, we apply DP2LM to investigate the mediation effect of DNA methylation on cortisol stress reactivity in individuals who experienced childhood trauma, uncovering new insights through a comprehensive analysis.

传统的线性调解分析在处理高维调解因子时存在固有的局限性。特别是,准确估计和严格推断中介效应具有挑战性,这主要是由于中介选择问题具有交织性。尽管最近取得了一些进展,但现有方法仍不足以处理混杂因素带来的复杂关系。为了应对这些挑战,我们提出了一种名为 DP2LM(基于深度神经网络的惩罚性部分线性中介)的新方法。这种方法结合了深度神经网络技术来考虑混杂因素的非线性效应,并利用惩罚性部分线性模型来适应高维度。与大多数专注于中介选择的现有研究不同,我们的方法优先考虑中介效应的估计和推断。具体来说,我们开发了直接和间接中介效应测试程序。理论分析表明,测试能保持 I 类错误率。在模拟研究中,DP2LM 展示了其作为复杂数据建模工具的优越性能,在各种环境下均优于现有方法,并在涉及大量中介因子的情况下提供可靠的估计和推断。此外,我们还应用 DP2LM 研究了 DNA 甲基化对经历过童年创伤的个体皮质醇应激反应性的中介效应,通过综合分析发现了新的见解。
{"title":"DP2LM: leveraging deep learning approach for estimation and hypothesis testing on mediation effects with high-dimensional mediators and complex confounders.","authors":"Shuoyang Wang, Yuan Huang","doi":"10.1093/biostatistics/kxad037","DOIUrl":"10.1093/biostatistics/kxad037","url":null,"abstract":"<p><p>Traditional linear mediation analysis has inherent limitations when it comes to handling high-dimensional mediators. Particularly, accurately estimating and rigorously inferring mediation effects is challenging, primarily due to the intertwined nature of the mediator selection issue. Despite recent developments, the existing methods are inadequate for addressing the complex relationships introduced by confounders. To tackle these challenges, we propose a novel approach called DP2LM (Deep neural network-based Penalized Partially Linear Mediation). This approach incorporates deep neural network techniques to account for nonlinear effects in confounders and utilizes the penalized partially linear model to accommodate high dimensionality. Unlike most existing works that concentrate on mediator selection, our method prioritizes estimation and inference on mediation effects. Specifically, we develop test procedures for testing the direct and indirect mediation effects. Theoretical analysis shows that the tests maintain the Type-I error rate. In simulation studies, DP2LM demonstrates its superior performance as a modeling tool for complex data, outperforming existing approaches in a wide range of settings and providing reliable estimation and inference in scenarios involving a considerable number of mediators. Further, we apply DP2LM to investigate the mediation effect of DNA methylation on cortisol stress reactivity in individuals who experienced childhood trauma, uncovering new insights through a comprehensive analysis.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"818-832"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139708546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian joint models for multi-regional clinical trials. 用于多地区临床试验的贝叶斯联合模型。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad023
Nathan W Bean, Joseph G Ibrahim, Matthew A Psioda

In recent years, multi-regional clinical trials (MRCTs) have increased in popularity in the pharmaceutical industry due to their ability to accelerate the global drug development process. To address potential challenges with MRCTs, the International Council for Harmonisation released the E17 guidance document which suggests the use of statistical methods that utilize information borrowing across regions if regional sample sizes are small. We develop an approach that allows for information borrowing via Bayesian model averaging in the context of a joint analysis of survival and longitudinal data from MRCTs. In this novel application of joint models to MRCTs, we use Laplace's method to integrate over subject-specific random effects and to approximate posterior distributions for region-specific treatment effects on the time-to-event outcome. Through simulation studies, we demonstrate that the joint modeling approach can result in an increased rejection rate when testing the global treatment effect compared with methods that analyze survival data alone. We then apply the proposed approach to data from a cardiovascular outcomes MRCT.

近年来,多地区临床试验(MRCT)在制药行业越来越受欢迎,因为它能够加快全球药物开发进程。为了应对 MRCT 可能面临的挑战,国际协调理事会发布了 E17 指导文件,建议在地区样本量较小的情况下,使用跨地区信息借用的统计方法。我们开发了一种方法,在对来自 MRCT 的生存和纵向数据进行联合分析时,通过贝叶斯模型平均法实现信息借用。在这种将联合模型应用于 MRCT 的新方法中,我们使用拉普拉斯方法对特定受试者的随机效应进行整合,并逼近特定地区治疗对时间到事件结果影响的后验分布。通过模拟研究,我们证明与单独分析生存数据的方法相比,联合建模方法在测试总体治疗效果时可提高拒绝率。然后,我们将提出的方法应用于心血管结果 MRCT 数据。
{"title":"Bayesian joint models for multi-regional clinical trials.","authors":"Nathan W Bean, Joseph G Ibrahim, Matthew A Psioda","doi":"10.1093/biostatistics/kxad023","DOIUrl":"10.1093/biostatistics/kxad023","url":null,"abstract":"<p><p>In recent years, multi-regional clinical trials (MRCTs) have increased in popularity in the pharmaceutical industry due to their ability to accelerate the global drug development process. To address potential challenges with MRCTs, the International Council for Harmonisation released the E17 guidance document which suggests the use of statistical methods that utilize information borrowing across regions if regional sample sizes are small. We develop an approach that allows for information borrowing via Bayesian model averaging in the context of a joint analysis of survival and longitudinal data from MRCTs. In this novel application of joint models to MRCTs, we use Laplace's method to integrate over subject-specific random effects and to approximate posterior distributions for region-specific treatment effects on the time-to-event outcome. Through simulation studies, we demonstrate that the joint modeling approach can result in an increased rejection rate when testing the global treatment effect compared with methods that analyze survival data alone. We then apply the proposed approach to data from a cardiovascular outcomes MRCT.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"852-866"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11247186/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10152517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian nonparametric approach to correct for underreporting in count data. 一种贝叶斯非参数方法,用于纠正计数数据中的漏报。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad027
Serena Arima, Silvia Polettini, Giuseppe Pasculli, Loreto Gesualdo, Francesco Pesce, Deni-Aldo Procaccini

We propose a nonparametric compound Poisson model for underreported count data that introduces a latent clustering structure for the reporting probabilities. The latter are estimated with the model's parameters based on experts' opinion and exploiting a proxy for the reporting process. The proposed model is used to estimate the prevalence of chronic kidney disease in Apulia, Italy, based on a unique statistical database covering information on m = 258 municipalities obtained by integrating multisource register information. Accurate prevalence estimates are needed for monitoring, surveillance, and management purposes; yet, counts are deemed to be considerably underreported, especially in some areas of Apulia, one of the most deprived and heterogeneous regions in Italy. Our results agree with previous findings and highlight interesting geographical patterns of the disease. We compare our model to existing approaches in the literature using simulated as well as real data on early neonatal mortality risk in Brazil, described in previous research: the proposed approach proves to be accurate and particularly suitable when partial information about data quality is available.

我们提出了一个用于少报计数数据的非参数复合泊松模型,该模型引入了报告概率的潜在聚类结构。后者是根据专家的意见和报告过程中的代理使用模型参数进行估计的。所提出的模型用于估计意大利阿普利亚的慢性肾脏疾病患病率,基于一个独特的统计数据库,该数据库涵盖了通过整合多源登记信息获得的m=258个市镇的信息。为了监测、监测和管理目的,需要准确的流行率估计;然而,统计数字被认为被严重低估,尤其是在意大利最贫困、最异质的地区之一阿普利亚的一些地区。我们的研究结果与之前的发现一致,并突出了该疾病有趣的地理模式。我们使用先前研究中描述的巴西早期新生儿死亡率风险的模拟和真实数据,将我们的模型与文献中的现有方法进行了比较:当获得有关数据质量的部分信息时,所提出的方法被证明是准确的,特别适合。
{"title":"A Bayesian nonparametric approach to correct for underreporting in count data.","authors":"Serena Arima, Silvia Polettini, Giuseppe Pasculli, Loreto Gesualdo, Francesco Pesce, Deni-Aldo Procaccini","doi":"10.1093/biostatistics/kxad027","DOIUrl":"10.1093/biostatistics/kxad027","url":null,"abstract":"<p><p>We propose a nonparametric compound Poisson model for underreported count data that introduces a latent clustering structure for the reporting probabilities. The latter are estimated with the model's parameters based on experts' opinion and exploiting a proxy for the reporting process. The proposed model is used to estimate the prevalence of chronic kidney disease in Apulia, Italy, based on a unique statistical database covering information on m = 258 municipalities obtained by integrating multisource register information. Accurate prevalence estimates are needed for monitoring, surveillance, and management purposes; yet, counts are deemed to be considerably underreported, especially in some areas of Apulia, one of the most deprived and heterogeneous regions in Italy. Our results agree with previous findings and highlight interesting geographical patterns of the disease. We compare our model to existing approaches in the literature using simulated as well as real data on early neonatal mortality risk in Brazil, described in previous research: the proposed approach proves to be accurate and particularly suitable when partial information about data quality is available.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"904-918"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41161396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analyzing microbial evolution through gene and genome phylogenies. 通过基因和基因组系统发育分析微生物进化。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad025
Sarah Teichman, Michael D Lee, Amy D Willis

Microbiome scientists critically need modern tools to explore and analyze microbial evolution. Often this involves studying the evolution of microbial genomes as a whole. However, different genes in a single genome can be subject to different evolutionary pressures, which can result in distinct gene-level evolutionary histories. To address this challenge, we propose to treat estimated gene-level phylogenies as data objects, and present an interactive method for the analysis of a collection of gene phylogenies. We use a local linear approximation of phylogenetic tree space to visualize estimated gene trees as points in low-dimensional Euclidean space, and address important practical limitations of existing related approaches, allowing an intuitive visualization of complex data objects. We demonstrate the utility of our proposed approach through microbial data analyses, including by identifying outlying gene histories in strains of Prevotella, and by contrasting Streptococcus phylogenies estimated using different gene sets. Our method is available as an open-source R package, and assists with estimating, visualizing, and interacting with a collection of bacterial gene phylogenies.

微生物组科学家迫切需要现代工具来探索和分析微生物进化。这通常涉及到从整体上研究微生物基因组的进化。然而,单个基因组中的不同基因可能受到不同的进化压力,这可能导致不同的基因水平进化史。为了应对这一挑战,我们建议将估计的基因水平系统发育视为数据对象,并提出一种用于分析基因系统发育集合的交互式方法。我们使用系统发育树空间的局部线性近似来将估计的基因树可视化为低维欧几里得空间中的点,并解决现有相关方法的重要实际局限性,从而实现复杂数据对象的直观可视化。我们通过微生物数据分析证明了我们提出的方法的实用性,包括通过鉴定普雷沃氏菌菌株中的外围基因史,以及通过对比使用不同基因集估计的链球菌系统发育。我们的方法是一个开源的R包,有助于估计、可视化和与细菌基因系统发育的集合相互作用。
{"title":"Analyzing microbial evolution through gene and genome phylogenies.","authors":"Sarah Teichman, Michael D Lee, Amy D Willis","doi":"10.1093/biostatistics/kxad025","DOIUrl":"10.1093/biostatistics/kxad025","url":null,"abstract":"<p><p>Microbiome scientists critically need modern tools to explore and analyze microbial evolution. Often this involves studying the evolution of microbial genomes as a whole. However, different genes in a single genome can be subject to different evolutionary pressures, which can result in distinct gene-level evolutionary histories. To address this challenge, we propose to treat estimated gene-level phylogenies as data objects, and present an interactive method for the analysis of a collection of gene phylogenies. We use a local linear approximation of phylogenetic tree space to visualize estimated gene trees as points in low-dimensional Euclidean space, and address important practical limitations of existing related approaches, allowing an intuitive visualization of complex data objects. We demonstrate the utility of our proposed approach through microbial data analyses, including by identifying outlying gene histories in strains of Prevotella, and by contrasting Streptococcus phylogenies estimated using different gene sets. Our method is available as an open-source R package, and assists with estimating, visualizing, and interacting with a collection of bacterial gene phylogenies.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"786-800"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11247178/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66784613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian nonparametric approach for multiple mediators with applications in mental health studies. 应用于心理健康研究的贝叶斯非参数多重中介方法。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad038
Samrat Roy, Michael J Daniels, Jason Roy

Mediation analysis with contemporaneously observed multiple mediators is a significant area of causal inference. Recent approaches for multiple mediators are often based on parametric models and thus may suffer from model misspecification. Also, much of the existing literature either only allow estimation of the joint mediation effect or estimate the joint mediation effect just as the sum of individual mediator effects, ignoring the interaction among the mediators. In this article, we propose a novel Bayesian nonparametric method that overcomes the two aforementioned drawbacks. We model the joint distribution of the observed data (outcome, mediators, treatment, and confounders) flexibly using an enriched Dirichlet process mixture with three levels. We use standardization (g-computation) to compute all possible mediation effects, including pairwise and all other possible interaction among the mediators. We thoroughly explore our method via simulations and apply our method to a mental health data from Wisconsin Longitudinal Study, where we estimate how the effect of births from unintended pregnancies on later life mental depression (CES-D) among the mothers is mediated through lack of self-acceptance and autonomy, employment instability, lack of social participation, and increased family stress. Our method identified significant individual mediators, along with some significant pairwise effects.

利用同时观测到的多个中介因子进行中介分析是因果推断的一个重要领域。最近针对多中介因素的方法通常基于参数模型,因此可能存在模型规范错误的问题。此外,大部分现有文献要么只允许估计联合中介效应,要么只将联合中介效应估计为单个中介效应之和,而忽略了中介效应之间的相互作用。在本文中,我们提出了一种新颖的贝叶斯非参数方法,克服了上述两个缺点。我们使用一个具有三个层次的富集 Dirichlet 过程混合物,对观测数据(结果、中介效应、治疗和混杂因素)的联合分布进行灵活建模。我们使用标准化(g-计算)来计算所有可能的中介效应,包括成对的中介效应和中介间所有其他可能的相互作用。我们通过模拟对我们的方法进行了深入探讨,并将我们的方法应用于威斯康星纵向研究的心理健康数据中,我们估计了意外怀孕生育对母亲日后精神抑郁(CES-D)的影响是如何通过缺乏自我接纳和自主、就业不稳定、缺乏社会参与和家庭压力增大等因素进行中介的。我们的方法确定了重要的个体中介因素,以及一些重要的配对效应。
{"title":"A Bayesian nonparametric approach for multiple mediators with applications in mental health studies.","authors":"Samrat Roy, Michael J Daniels, Jason Roy","doi":"10.1093/biostatistics/kxad038","DOIUrl":"10.1093/biostatistics/kxad038","url":null,"abstract":"<p><p>Mediation analysis with contemporaneously observed multiple mediators is a significant area of causal inference. Recent approaches for multiple mediators are often based on parametric models and thus may suffer from model misspecification. Also, much of the existing literature either only allow estimation of the joint mediation effect or estimate the joint mediation effect just as the sum of individual mediator effects, ignoring the interaction among the mediators. In this article, we propose a novel Bayesian nonparametric method that overcomes the two aforementioned drawbacks. We model the joint distribution of the observed data (outcome, mediators, treatment, and confounders) flexibly using an enriched Dirichlet process mixture with three levels. We use standardization (g-computation) to compute all possible mediation effects, including pairwise and all other possible interaction among the mediators. We thoroughly explore our method via simulations and apply our method to a mental health data from Wisconsin Longitudinal Study, where we estimate how the effect of births from unintended pregnancies on later life mental depression (CES-D) among the mothers is mediated through lack of self-acceptance and autonomy, employment instability, lack of social participation, and increased family stress. Our method identified significant individual mediators, along with some significant pairwise effects.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"919-932"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11247183/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139708545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variable selection in high dimensions for discrete-outcome individualized treatment rules: Reducing severity of depression symptoms. 离散结果的高维度变量选择个性化治疗规则:降低抑郁症状的严重程度。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad022
Erica E M Moodie, Zeyu Bian, Janie Coulombe, Yi Lian, Archer Y Yang, Susan M Shortreed

Despite growing interest in estimating individualized treatment rules, little attention has been given the binary outcome setting. Estimation is challenging with nonlinear link functions, especially when variable selection is needed. We use a new computational approach to solve a recently proposed doubly robust regularized estimating equation to accomplish this difficult task in a case study of depression treatment. We demonstrate an application of this new approach in combination with a weighted and penalized estimating equation to this challenging binary outcome setting. We demonstrate the double robustness of the method and its effectiveness for variable selection. The work is motivated by and applied to an analysis of treatment for unipolar depression using a population of patients treated at Kaiser Permanente Washington.

尽管人们对评估个体化治疗规则越来越感兴趣,但很少关注二元结果设置。非线性链接函数的估计具有挑战性,尤其是在需要变量选择的情况下。在抑郁症治疗的案例研究中,我们使用一种新的计算方法来求解最近提出的双鲁棒正则化估计方程,以完成这项艰巨的任务。我们展示了这种新方法与加权和惩罚估计方程相结合在这种具有挑战性的二元结果设置中的应用。我们证明了该方法的双重稳健性及其对变量选择的有效性。这项工作的动机是利用在华盛顿凯撒永久医院接受治疗的患者群体对单极性抑郁症的治疗进行分析。
{"title":"Variable selection in high dimensions for discrete-outcome individualized treatment rules: Reducing severity of depression symptoms.","authors":"Erica E M Moodie, Zeyu Bian, Janie Coulombe, Yi Lian, Archer Y Yang, Susan M Shortreed","doi":"10.1093/biostatistics/kxad022","DOIUrl":"10.1093/biostatistics/kxad022","url":null,"abstract":"<p><p>Despite growing interest in estimating individualized treatment rules, little attention has been given the binary outcome setting. Estimation is challenging with nonlinear link functions, especially when variable selection is needed. We use a new computational approach to solve a recently proposed doubly robust regularized estimating equation to accomplish this difficult task in a case study of depression treatment. We demonstrate an application of this new approach in combination with a weighted and penalized estimating equation to this challenging binary outcome setting. We demonstrate the double robustness of the method and its effectiveness for variable selection. The work is motivated by and applied to an analysis of treatment for unipolar depression using a population of patients treated at Kaiser Permanente Washington.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"633-647"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10201574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biostatistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1