首页 > 最新文献

Biostatistics最新文献

英文 中文
Recoverability of causal effects under presence of missing data: a longitudinal case study. 数据缺失情况下因果效应的可恢复性:纵向案例研究。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-18 DOI: 10.1093/biostatistics/kxae044
Anastasiia Holovchak, Helen McIlleron, Paolo Denti, Michael Schomaker

Missing data in multiple variables is a common issue. We investigate the applicability of the framework of graphical models for handling missing data to a complex longitudinal pharmacological study of children with HIV treated with an efavirenz-based regimen as part of the CHAPAS-3 trial. Specifically, we examine whether the causal effects of interest, defined through static interventions on multiple continuous variables, can be recovered (estimated consistently) from the available data only. So far, no general algorithms are available to decide on recoverability, and decisions have to be made on a case-by-case basis. We emphasize the sensitivity of recoverability to even the smallest changes in the graph structure, and present recoverability results for three plausible missingness-directed acyclic graphs (m-DAGs) in the CHAPAS-3 study, informed by clinical knowledge. Furthermore, we propose the concept of a "closed missingness mechanism": if missing data are generated based on this mechanism, an available case analysis is admissible for consistent estimation of any statistical or causal estimand, even if data are missing not at random. Both simulations and theoretical considerations demonstrate how, in the assumed MNAR setting of our study, a complete or available case analysis can be superior to multiple imputation, and estimation results vary depending on the assumed missingness DAG. Our analyses demonstrate an innovative application of missingness DAGs to complex longitudinal real-world data, while highlighting the sensitivity of the results with respect to the assumed causal model.

多个变量的缺失数据是一个常见问题。我们研究了处理缺失数据的图形模型框架在一项复杂的纵向药理学研究中的适用性,该研究是 CHAPAS-3 试验的一部分,研究对象是接受以依非韦伦为基础的方案治疗的 HIV 感染儿童。具体来说,我们研究了通过对多个连续变量的静态干预所确定的相关因果效应是否可以仅从现有数据中恢复(一致估计)。到目前为止,还没有可用来决定可恢复性的通用算法,必须根据具体情况做出决定。我们强调了可恢复性对图结构中最小变化的敏感性,并介绍了 CHAPAS-3 研究中三个可信的缺失指向无环图(m-DAG)的可恢复性结果,这些结果是以临床知识为基础的。此外,我们还提出了 "封闭缺失机制 "的概念:如果缺失数据是基于这种机制产生的,那么即使数据不是随机缺失,也可以通过可用的病例分析对任何统计或因果估计进行一致的估计。模拟和理论考虑都表明,在我们研究的假定 MNAR 设置中,完整或可用案例分析如何优于多重估算,估算结果因假定的缺失 DAG 而异。我们的分析展示了缺失 DAG 在复杂的纵向真实世界数据中的创新应用,同时强调了结果对假定因果模型的敏感性。
{"title":"Recoverability of causal effects under presence of missing data: a longitudinal case study.","authors":"Anastasiia Holovchak, Helen McIlleron, Paolo Denti, Michael Schomaker","doi":"10.1093/biostatistics/kxae044","DOIUrl":"https://doi.org/10.1093/biostatistics/kxae044","url":null,"abstract":"<p><p>Missing data in multiple variables is a common issue. We investigate the applicability of the framework of graphical models for handling missing data to a complex longitudinal pharmacological study of children with HIV treated with an efavirenz-based regimen as part of the CHAPAS-3 trial. Specifically, we examine whether the causal effects of interest, defined through static interventions on multiple continuous variables, can be recovered (estimated consistently) from the available data only. So far, no general algorithms are available to decide on recoverability, and decisions have to be made on a case-by-case basis. We emphasize the sensitivity of recoverability to even the smallest changes in the graph structure, and present recoverability results for three plausible missingness-directed acyclic graphs (m-DAGs) in the CHAPAS-3 study, informed by clinical knowledge. Furthermore, we propose the concept of a \"closed missingness mechanism\": if missing data are generated based on this mechanism, an available case analysis is admissible for consistent estimation of any statistical or causal estimand, even if data are missing not at random. Both simulations and theoretical considerations demonstrate how, in the assumed MNAR setting of our study, a complete or available case analysis can be superior to multiple imputation, and estimation results vary depending on the assumed missingness DAG. Our analyses demonstrate an innovative application of missingness DAGs to complex longitudinal real-world data, while highlighting the sensitivity of the results with respect to the assumed causal model.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142649856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast standard error estimation for joint models of longitudinal and time-to-event data based on stochastic EM algorithms. 基于随机 EM 算法的纵向数据和时间到事件数据联合模型的快速标准误差估计。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-11 DOI: 10.1093/biostatistics/kxae043
Tingting Yu, Lang Wu, Ronald J Bosch, Davey M Smith, Rui Wang

Maximum likelihood inference can often become computationally intensive when performing joint modeling of longitudinal and time-to-event data, due to the intractable integrals in the joint likelihood function. The computational challenges escalate further when modeling HIV-1 viral load data, owing to the nonlinear trajectories and the presence of left-censored data resulting from the assay's lower limit of quantification. In this paper, for a joint model comprising a nonlinear mixed-effect model and a Cox Proportional Hazards model, we develop a computationally efficient Stochastic EM (StEM) algorithm for parameter estimation. Furthermore, we propose a novel technique for fast standard error estimation, which directly estimates standard errors from the results of StEM iterations and is broadly applicable to various joint modeling settings, such as those containing generalized linear mixed-effect models, parametric survival models, or joint models with more than two submodels. We evaluate the performance of the proposed methods through simulation studies and apply them to HIV-1 viral load data from six AIDS Clinical Trials Group studies to characterize viral rebound trajectories following the interruption of antiretroviral therapy (ART), accounting for the informative duration of off-ART periods.

在对纵向数据和时间到事件数据进行联合建模时,由于联合似然函数中的积分难以处理,最大似然推断往往会变得计算密集。在对 HIV-1 病毒载量数据建模时,由于非线性轨迹和检测定量下限导致的左删失数据的存在,计算挑战进一步升级。本文针对由非线性混合效应模型和 Cox 比例危害模型组成的联合模型,开发了一种计算高效的随机 EM(StEM)算法,用于参数估计。此外,我们还提出了一种快速标准误差估计的新技术,该技术可直接从 StEM 迭代结果中估计标准误差,广泛适用于各种联合建模环境,如包含广义线性混合效应模型、参数生存模型或具有两个以上子模型的联合模型。我们通过模拟研究评估了所提方法的性能,并将其应用于六项艾滋病临床试验组研究中的 HIV-1 病毒载量数据,以描述抗逆转录病毒疗法(ART)中断后的病毒反弹轨迹,同时考虑到非抗病毒治疗期的信息持续时间。
{"title":"Fast standard error estimation for joint models of longitudinal and time-to-event data based on stochastic EM algorithms.","authors":"Tingting Yu, Lang Wu, Ronald J Bosch, Davey M Smith, Rui Wang","doi":"10.1093/biostatistics/kxae043","DOIUrl":"https://doi.org/10.1093/biostatistics/kxae043","url":null,"abstract":"<p><p>Maximum likelihood inference can often become computationally intensive when performing joint modeling of longitudinal and time-to-event data, due to the intractable integrals in the joint likelihood function. The computational challenges escalate further when modeling HIV-1 viral load data, owing to the nonlinear trajectories and the presence of left-censored data resulting from the assay's lower limit of quantification. In this paper, for a joint model comprising a nonlinear mixed-effect model and a Cox Proportional Hazards model, we develop a computationally efficient Stochastic EM (StEM) algorithm for parameter estimation. Furthermore, we propose a novel technique for fast standard error estimation, which directly estimates standard errors from the results of StEM iterations and is broadly applicable to various joint modeling settings, such as those containing generalized linear mixed-effect models, parametric survival models, or joint models with more than two submodels. We evaluate the performance of the proposed methods through simulation studies and apply them to HIV-1 viral load data from six AIDS Clinical Trials Group studies to characterize viral rebound trajectories following the interruption of antiretroviral therapy (ART), accounting for the informative duration of off-ART periods.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142632694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The impact of coarsening an exposure on partial identifiability in instrumental variable settings. 在工具变量设置中,粗化暴露对部分可识别性的影响。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-09 DOI: 10.1093/biostatistics/kxae042
Erin E Gabriel, Michael C Sachs, Arvid Sjölander

In instrumental variable (IV) settings, such as imperfect randomized trials and observational studies with Mendelian randomization, one may encounter a continuous exposure, the causal effect of which is not of true interest. Instead, scientific interest may lie in a coarsened version of this exposure. Although there is a lengthy literature on the impact of coarsening of an exposure with several works focusing specifically on IV settings, all methods proposed in this literature require parametric assumptions. Instead, just as in the standard IV setting, one can consider partial identification via bounds making no parametric assumptions. This was first pointed out in Alexander Balke's PhD dissertation. We extend and clarify his work and derive novel bounds in several settings, including for a three-level IV, which will most likely be the case in Mendelian randomization. We demonstrate our findings in two real data examples, a randomized trial for peanut allergy in infants and a Mendelian randomization setting investigating the effect of homocysteine on cardiovascular disease.

在工具变量(IV)环境中,如不完全随机试验和孟德尔随机化的观察研究中,我们可能会遇到一个连续的暴露因子,但其因果效应并不是我们真正感兴趣的。相反,科学兴趣可能在于这种暴露的粗略版本。尽管有大量文献研究了粗略化暴露的影响,其中有几部著作特别关注 IV 设置,但这些文献中提出的所有方法都需要参数假设。相反,就像在标准 IV 设置中一样,我们可以通过不带参数假设的约束来考虑部分识别。Alexander Balke 的博士论文首次指出了这一点。我们对他的工作进行了扩展和澄清,并在几种情况下推导出了新的边界,包括三层 IV,这很可能是孟德尔随机化的情况。我们在两个真实数据示例中展示了我们的发现,一个是针对婴儿花生过敏的随机试验,另一个是调查同型半胱氨酸对心血管疾病影响的孟德尔随机设置。
{"title":"The impact of coarsening an exposure on partial identifiability in instrumental variable settings.","authors":"Erin E Gabriel, Michael C Sachs, Arvid Sjölander","doi":"10.1093/biostatistics/kxae042","DOIUrl":"https://doi.org/10.1093/biostatistics/kxae042","url":null,"abstract":"<p><p>In instrumental variable (IV) settings, such as imperfect randomized trials and observational studies with Mendelian randomization, one may encounter a continuous exposure, the causal effect of which is not of true interest. Instead, scientific interest may lie in a coarsened version of this exposure. Although there is a lengthy literature on the impact of coarsening of an exposure with several works focusing specifically on IV settings, all methods proposed in this literature require parametric assumptions. Instead, just as in the standard IV setting, one can consider partial identification via bounds making no parametric assumptions. This was first pointed out in Alexander Balke's PhD dissertation. We extend and clarify his work and derive novel bounds in several settings, including for a three-level IV, which will most likely be the case in Mendelian randomization. We demonstrate our findings in two real data examples, a randomized trial for peanut allergy in infants and a Mendelian randomization setting investigating the effect of homocysteine on cardiovascular disease.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142632696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying predictors of resilience to stressors in single-arm studies of pre-post change. 在前后变化的单臂研究中确定对压力的恢复能力的预测因素。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-10-01 DOI: 10.1093/biostatistics/kxad018
Ravi Varadhan, Jiafeng Zhu, Karen Bandeen-Roche

Many older adults experience a major stressor at some point in their lives. The ability to recover well after a major stressor is known as resilience. An important goal of geriatric research is to identify factors that influence resilience to stressors. Studies of resilience in older adults are typically conducted with a single-arm where everyone experiences the stressor. The simplistic approach of regressing change versus baseline yields biased estimates due to mathematical coupling and regression to the mean (RTM). We develop a method to correct the bias. We extend the method to include covariates. Our approach considers a counterfactual control group and involves sensitivity analyses to evaluate different settings of control group parameters. Only minimal distributional assumptions are required. Simulation studies demonstrate the validity of the method. We illustrate the method using a large, registry of older adults (N  =7239) who underwent total knee replacement (TKR). We demonstrate how external data can be utilized to constrain the sensitivity analysis. Naive analyses implicated several treatment effect modifiers including baseline function, age, body-mass index (BMI), gender, number of comorbidities, income, and race. Corrected analysis revealed that baseline (pre-stressor) function was not strongly linked to recovery after TKR and among the covariates, only age and number of comorbidities were consistently and negatively associated with post-stressor recovery in all functional domains. Correction of mathematical coupling and RTM is necessary for drawing valid inferences regarding the effect of covariates and baseline status on pre-post change. Our method provides a simple estimator to this end.

许多老年人在一生中都会遇到重大压力。在经历重大压力后能够很好地恢复的能力被称为恢复力。老年医学研究的一个重要目标就是找出影响压力恢复能力的因素。对老年人复原力的研究通常采用单臂法,即每个人都经历压力源。由于数学耦合和均值回归(RTM)的原因,将变化与基线进行回归的简单方法会产生有偏差的估计值。我们开发了一种方法来纠正这种偏差。我们将该方法扩展到包括协变量。我们的方法考虑了反事实对照组,并进行了敏感性分析,以评估对照组参数的不同设置。只需要最低限度的分布假设。模拟研究证明了该方法的有效性。我们使用一个接受全膝关节置换术(TKR)的大型老年人登记册(N = 7239)来说明该方法。我们展示了如何利用外部数据来限制敏感性分析。原始分析揭示了多个治疗效果调节因素,包括基线功能、年龄、体重指数 (BMI)、性别、合并症数量、收入和种族。校正分析表明,基线(应激前)功能与 TKR 术后恢复的关系不大,在协变量中,只有年龄和合并症数量与应激后所有功能领域的恢复持续负相关。为了有效推断协变量和基线状态对前后变化的影响,有必要对数学耦合和 RTM 进行校正。我们的方法为此提供了一个简单的估算器。
{"title":"Identifying predictors of resilience to stressors in single-arm studies of pre-post change.","authors":"Ravi Varadhan, Jiafeng Zhu, Karen Bandeen-Roche","doi":"10.1093/biostatistics/kxad018","DOIUrl":"10.1093/biostatistics/kxad018","url":null,"abstract":"<p><p>Many older adults experience a major stressor at some point in their lives. The ability to recover well after a major stressor is known as resilience. An important goal of geriatric research is to identify factors that influence resilience to stressors. Studies of resilience in older adults are typically conducted with a single-arm where everyone experiences the stressor. The simplistic approach of regressing change versus baseline yields biased estimates due to mathematical coupling and regression to the mean (RTM). We develop a method to correct the bias. We extend the method to include covariates. Our approach considers a counterfactual control group and involves sensitivity analyses to evaluate different settings of control group parameters. Only minimal distributional assumptions are required. Simulation studies demonstrate the validity of the method. We illustrate the method using a large, registry of older adults (N  =7239) who underwent total knee replacement (TKR). We demonstrate how external data can be utilized to constrain the sensitivity analysis. Naive analyses implicated several treatment effect modifiers including baseline function, age, body-mass index (BMI), gender, number of comorbidities, income, and race. Corrected analysis revealed that baseline (pre-stressor) function was not strongly linked to recovery after TKR and among the covariates, only age and number of comorbidities were consistently and negatively associated with post-stressor recovery in all functional domains. Correction of mathematical coupling and RTM is necessary for drawing valid inferences regarding the effect of covariates and baseline status on pre-post change. Our method provides a simple estimator to this end.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"1094-1111"},"PeriodicalIF":1.8,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10297247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating dynamic and predictive discrimination for recurrent event models: use of a time-dependent C-index. 评估动态和预测判别的反复事件模型:使用时间相关的c指数。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-10-01 DOI: 10.1093/biostatistics/kxad031
Jian Wang, Xinyang Jiang, Jing Ning

Interest in analyzing recurrent event data has increased over the past few decades. One essential aspect of a risk prediction model for recurrent event data is to accurately distinguish individuals with different risks of developing a recurrent event. Although the concordance index (C-index) effectively evaluates the overall discriminative ability of a regression model for recurrent event data, a local measure is also desirable to capture dynamic performance of the regression model over time. Therefore, in this study, we propose a time-dependent C-index measure for inferring the model's discriminative ability locally. We formulated the C-index as a function of time using a flexible parametric model and constructed a concordance-based likelihood for estimation and inference. We adapted a perturbation-resampling procedure for variance estimation. Extensive simulations were conducted to investigate the proposed time-dependent C-index's finite-sample performance and estimation procedure. We applied the time-dependent C-index to three regression models of a study of re-hospitalization in patients with colorectal cancer to evaluate the models' discriminative capability.

在过去的几十年里,人们对分析周期性事件数据的兴趣越来越大。复发事件数据风险预测模型的一个重要方面是准确区分具有不同复发事件风险的个体。虽然一致性指数(C-index)有效地评估了回归模型对周期性事件数据的整体判别能力,但也需要一个局部度量来捕捉回归模型随时间的动态性能。因此,在本研究中,我们提出了一个与时间相关的c指数测度来推断模型的局部判别能力。我们使用一个灵活的参数模型将c指数表述为时间的函数,并构建了一个基于一致性的似然估计和推断。我们采用了一种扰动重采样方法来估计方差。我们进行了大量的模拟,以研究所提出的时变c指数的有限样本性能和估计过程。我们将时间依赖的c指数应用于一项结直肠癌患者再住院研究的三个回归模型,以评估模型的判别能力。
{"title":"Evaluating dynamic and predictive discrimination for recurrent event models: use of a time-dependent C-index.","authors":"Jian Wang, Xinyang Jiang, Jing Ning","doi":"10.1093/biostatistics/kxad031","DOIUrl":"10.1093/biostatistics/kxad031","url":null,"abstract":"<p><p>Interest in analyzing recurrent event data has increased over the past few decades. One essential aspect of a risk prediction model for recurrent event data is to accurately distinguish individuals with different risks of developing a recurrent event. Although the concordance index (C-index) effectively evaluates the overall discriminative ability of a regression model for recurrent event data, a local measure is also desirable to capture dynamic performance of the regression model over time. Therefore, in this study, we propose a time-dependent C-index measure for inferring the model's discriminative ability locally. We formulated the C-index as a function of time using a flexible parametric model and constructed a concordance-based likelihood for estimation and inference. We adapted a perturbation-resampling procedure for variance estimation. Extensive simulations were conducted to investigate the proposed time-dependent C-index's finite-sample performance and estimation procedure. We applied the time-dependent C-index to three regression models of a study of re-hospitalization in patients with colorectal cancer to evaluate the models' discriminative capability.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"1140-1155"},"PeriodicalIF":1.8,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11471962/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89720651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Signal detection statistics of adverse drug events in hierarchical structure for matched case-control data. 匹配病例对照数据的分级结构中药物不良事件的信号检测统计。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-10-01 DOI: 10.1093/biostatistics/kxad029
Seok-Jae Heo, Sohee Jeong, Dagyeom Jung, Inkyung Jung

The tree-based scan statistic is a data mining method used to identify signals of adverse drug reactions in a database of spontaneous reporting systems. It is particularly beneficial when dealing with hierarchical data structures. One may use a retrospective case-control study design from spontaneous reporting systems (SRS) to investigate whether a specific adverse event of interest is associated with certain drugs. However, the existing Bernoulli model of the tree-based scan statistic may not be suitable as it fails to adequately account for dependencies within matched pairs. In this article, we propose signal detection statistics for matched case-control data based on McNemar's test, Wald test for conditional logistic regression, and the likelihood ratio test for a multinomial distribution. Through simulation studies, we demonstrate that our proposed methods outperform the existing approach in terms of the type I error rate, power, sensitivity, and false detection rate. To illustrate our proposed approach, we applied the three methods and the existing method to detect drug signals for dizziness-related adverse events related to antihypertensive drugs using the database of the Korea Adverse Event Reporting System.

基于树的扫描统计是一种数据挖掘方法,用于在自发报告系统的数据库中识别药物不良反应的信号。它在处理分层数据结构时特别有益。可以使用自发报告系统(SRS)的回顾性病例对照研究设计来调查感兴趣的特定不良事件是否与某些药物有关。然而,现有的基于树的扫描统计的伯努利模型可能不合适,因为它不能充分考虑匹配对内的依赖性。在本文中,我们提出了基于McNemar检验、条件逻辑回归的Wald检验和多项式分布的似然比检验的匹配病例对照数据的信号检测统计。通过仿真研究,我们证明了我们提出的方法在I型错误率、功率、灵敏度和错误检测率方面优于现有方法。为了说明我们提出的方法,我们使用韩国不良事件报告系统的数据库,应用这三种方法和现有方法来检测与降压药相关的头晕相关不良事件的药物信号。
{"title":"Signal detection statistics of adverse drug events in hierarchical structure for matched case-control data.","authors":"Seok-Jae Heo, Sohee Jeong, Dagyeom Jung, Inkyung Jung","doi":"10.1093/biostatistics/kxad029","DOIUrl":"10.1093/biostatistics/kxad029","url":null,"abstract":"<p><p>The tree-based scan statistic is a data mining method used to identify signals of adverse drug reactions in a database of spontaneous reporting systems. It is particularly beneficial when dealing with hierarchical data structures. One may use a retrospective case-control study design from spontaneous reporting systems (SRS) to investigate whether a specific adverse event of interest is associated with certain drugs. However, the existing Bernoulli model of the tree-based scan statistic may not be suitable as it fails to adequately account for dependencies within matched pairs. In this article, we propose signal detection statistics for matched case-control data based on McNemar's test, Wald test for conditional logistic regression, and the likelihood ratio test for a multinomial distribution. Through simulation studies, we demonstrate that our proposed methods outperform the existing approach in terms of the type I error rate, power, sensitivity, and false detection rate. To illustrate our proposed approach, we applied the three methods and the existing method to detect drug signals for dizziness-related adverse events related to antihypertensive drugs using the database of the Korea Adverse Event Reporting System.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"1112-1121"},"PeriodicalIF":1.8,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"54232410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multivariate spatiotemporal functional principal component analysis for modeling hospitalization and mortality rates in the dialysis population. 透析人群住院率和死亡率建模的多变量时空功能主成分分析
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad013
Qi Qian, Danh V Nguyen, Donatello Telesca, Esra Kurum, Connie M Rhee, Sudipto Banerjee, Yihao Li, Damla Senturk

Dialysis patients experience frequent hospitalizations and a higher mortality rate compared to other Medicare populations, in whom hospitalizations are a major contributor to morbidity, mortality, and healthcare costs. Patients also typically remain on dialysis for the duration of their lives or until kidney transplantation. Hence, there is growing interest in studying the spatiotemporal trends in the correlated outcomes of hospitalization and mortality among dialysis patients as a function of time starting from transition to dialysis across the United States Utilizing national data from the United States Renal Data System (USRDS), we propose a novel multivariate spatiotemporal functional principal component analysis model to study the joint spatiotemporal patterns of hospitalization and mortality rates among dialysis patients. The proposal is based on a multivariate Karhunen-Loéve expansion that describes leading directions of variation across time and induces spatial correlations among region-specific scores. An efficient estimation procedure is proposed using only univariate principal components decompositions and a Markov Chain Monte Carlo framework for targeting the spatial correlations. The finite sample performance of the proposed method is studied through simulations. Novel applications to the USRDS data highlight hot spots across the United States with higher hospitalization and/or mortality rates and time periods of elevated risk.

与其他医疗保险人群相比,透析患者经历频繁的住院治疗和更高的死亡率,在其他人群中,住院治疗是发病率、死亡率和医疗费用的主要因素。患者通常在其一生中或直到肾移植前都要进行透析。因此,人们越来越有兴趣研究透析患者住院和死亡率相关结果的时空趋势,作为美国各地从过渡到透析的时间的函数。我们提出了一种新的多元时空功能主成分分析模型来研究透析患者住院率和死亡率的联合时空模式。该建议基于多元karhunen - losamade扩展,该扩展描述了跨时间变化的主要方向,并诱导了区域特定分数之间的空间相关性。提出了一种仅使用单变量主成分分解和马尔可夫链蒙特卡罗框架针对空间相关性的有效估计方法。通过仿真研究了该方法的有限样本性能。对USRDS数据的新应用突出了美国各地住院率和/或死亡率较高的热点地区以及风险升高的时间段。
{"title":"Multivariate spatiotemporal functional principal component analysis for modeling hospitalization and mortality rates in the dialysis population.","authors":"Qi Qian, Danh V Nguyen, Donatello Telesca, Esra Kurum, Connie M Rhee, Sudipto Banerjee, Yihao Li, Damla Senturk","doi":"10.1093/biostatistics/kxad013","DOIUrl":"10.1093/biostatistics/kxad013","url":null,"abstract":"<p><p>Dialysis patients experience frequent hospitalizations and a higher mortality rate compared to other Medicare populations, in whom hospitalizations are a major contributor to morbidity, mortality, and healthcare costs. Patients also typically remain on dialysis for the duration of their lives or until kidney transplantation. Hence, there is growing interest in studying the spatiotemporal trends in the correlated outcomes of hospitalization and mortality among dialysis patients as a function of time starting from transition to dialysis across the United States Utilizing national data from the United States Renal Data System (USRDS), we propose a novel multivariate spatiotemporal functional principal component analysis model to study the joint spatiotemporal patterns of hospitalization and mortality rates among dialysis patients. The proposal is based on a multivariate Karhunen-Loéve expansion that describes leading directions of variation across time and induces spatial correlations among region-specific scores. An efficient estimation procedure is proposed using only univariate principal components decompositions and a Markov Chain Monte Carlo framework for targeting the spatial correlations. The finite sample performance of the proposed method is studied through simulations. Novel applications to the USRDS data highlight hot spots across the United States with higher hospitalization and/or mortality rates and time periods of elevated risk.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"718-735"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11358256/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10019524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A scalable approach for continuous time Markov models with covariates. 带有协变量的连续时间马尔可夫模型的可扩展方法
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad012
Farhad Hatami, Alex Ocampo, Gordon Graham, Thomas E Nichols, Habib Ganjgahi

Existing methods for fitting continuous time Markov models (CTMM) in the presence of covariates suffer from scalability issues due to high computational cost of matrix exponentials calculated for each observation. In this article, we propose an optimization technique for CTMM which uses a stochastic gradient descent algorithm combined with differentiation of the matrix exponential using a Padé approximation. This approach makes fitting large scale data feasible. We present two methods for computing standard errors, one novel approach using the Padé expansion and the other using power series expansion of the matrix exponential. Through simulations, we find improved performance relative to existing CTMM methods, and we demonstrate the method on the large-scale multiple sclerosis NO.MS data set.

在存在协变量的情况下,现有的连续时间马尔可夫模型(CTMM)拟合方法存在可扩展性问题,原因是为每个观测值计算矩阵指数的计算成本很高。在本文中,我们提出了一种 CTMM 的优化技术,该技术使用随机梯度下降算法,并结合使用 Padé 近似对矩阵指数进行微分。这种方法可以拟合大规模数据。我们提出了两种计算标准误差的方法,一种是使用 Padé 扩展的新方法,另一种是使用矩阵指数的幂级数扩展。通过模拟,我们发现相对于现有的 CTMM 方法,该方法的性能有所提高,我们还在大规模多发性硬化 NO.MS 数据集上演示了该方法。
{"title":"A scalable approach for continuous time Markov models with covariates.","authors":"Farhad Hatami, Alex Ocampo, Gordon Graham, Thomas E Nichols, Habib Ganjgahi","doi":"10.1093/biostatistics/kxad012","DOIUrl":"10.1093/biostatistics/kxad012","url":null,"abstract":"<p><p>Existing methods for fitting continuous time Markov models (CTMM) in the presence of covariates suffer from scalability issues due to high computational cost of matrix exponentials calculated for each observation. In this article, we propose an optimization technique for CTMM which uses a stochastic gradient descent algorithm combined with differentiation of the matrix exponential using a Padé approximation. This approach makes fitting large scale data feasible. We present two methods for computing standard errors, one novel approach using the Padé expansion and the other using power series expansion of the matrix exponential. Through simulations, we find improved performance relative to existing CTMM methods, and we demonstrate the method on the large-scale multiple sclerosis NO.MS data set.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"681-701"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11247187/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9770094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semi-supervised mixture multi-source exchangeability model for leveraging real-world data in clinical trials. 用于在临床试验中利用真实世界数据的半监督混合多源可交换性模型。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad024
Lillian M F Haine, Thomas A Murry, Raquel Nahra, Giota Touloumi, Eduardo Fernández-Cruz, Kathy Petoumenos, Joseph S Koopmeiners

The traditional trial paradigm is often criticized as being slow, inefficient, and costly. Statistical approaches that leverage external trial data have emerged to make trials more efficient by augmenting the sample size. However, these approaches assume that external data are from previously conducted trials, leaving a rich source of untapped real-world data (RWD) that cannot yet be effectively leveraged. We propose a semi-supervised mixture (SS-MIX) multisource exchangeability model (MEM); a flexible, two-step Bayesian approach for incorporating RWD into randomized controlled trial analyses. The first step is a SS-MIX model on a modified propensity score and the second step is a MEM. The first step targets a representative subgroup of individuals from the trial population and the second step avoids borrowing when there are substantial differences in outcomes among the trial sample and the representative observational sample. When comparing the proposed approach to competing borrowing approaches in a simulation study, we find that our approach borrows efficiently when the trial and RWD are consistent, while mitigating bias when the trial and external data differ on either measured or unmeasured covariates. We illustrate the proposed approach with an application to a randomized controlled trial investigating intravenous hyperimmune immunoglobulin in hospitalized patients with influenza, while leveraging data from an external observational study to supplement a subgroup analysis by influenza subtype.

传统的试验模式经常被批评为缓慢、低效和昂贵。利用外部试验数据的统计方法应运而生,通过扩大样本量来提高试验效率。然而,这些方法假定外部数据来自以前进行的试验,这就留下了尚未有效利用的丰富的真实世界数据(RWD)来源。我们提出了一种半监督混合(SS-MIX)多源可交换性模型(MEM);这是一种灵活的两步贝叶斯方法,可将 RWD 纳入随机对照试验分析。第一步是基于修正倾向得分的 SS-MIX 模型,第二步是 MEM。第一步以试验人群中具有代表性的个体子群为目标,第二步在试验样本与具有代表性的观察样本的结果存在实质性差异时避免借用。在一项模拟研究中,我们将所提出的方法与其他借用方法进行了比较,发现当试验数据与 RWD 数据一致时,我们的方法能有效地进行借用,而当试验数据与外部数据在测量或非测量协变量上存在差异时,我们的方法则能减轻偏差。我们将所提出的方法应用于一项随机对照试验,调查流感住院患者静脉注射超敏免疫球蛋白的情况,同时利用外部观察研究的数据来补充按流感亚型进行的亚组分析。
{"title":"Semi-supervised mixture multi-source exchangeability model for leveraging real-world data in clinical trials.","authors":"Lillian M F Haine, Thomas A Murry, Raquel Nahra, Giota Touloumi, Eduardo Fernández-Cruz, Kathy Petoumenos, Joseph S Koopmeiners","doi":"10.1093/biostatistics/kxad024","DOIUrl":"10.1093/biostatistics/kxad024","url":null,"abstract":"<p><p>The traditional trial paradigm is often criticized as being slow, inefficient, and costly. Statistical approaches that leverage external trial data have emerged to make trials more efficient by augmenting the sample size. However, these approaches assume that external data are from previously conducted trials, leaving a rich source of untapped real-world data (RWD) that cannot yet be effectively leveraged. We propose a semi-supervised mixture (SS-MIX) multisource exchangeability model (MEM); a flexible, two-step Bayesian approach for incorporating RWD into randomized controlled trial analyses. The first step is a SS-MIX model on a modified propensity score and the second step is a MEM. The first step targets a representative subgroup of individuals from the trial population and the second step avoids borrowing when there are substantial differences in outcomes among the trial sample and the representative observational sample. When comparing the proposed approach to competing borrowing approaches in a simulation study, we find that our approach borrows efficiently when the trial and RWD are consistent, while mitigating bias when the trial and external data differ on either measured or unmeasured covariates. We illustrate the proposed approach with an application to a randomized controlled trial investigating intravenous hyperimmune immunoglobulin in hospitalized patients with influenza, while leveraging data from an external observational study to supplement a subgroup analysis by influenza subtype.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"617-632"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11247180/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10268326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint modeling in presence of informative censoring on the retrospective time scale with application to palliative care research. 在回顾性时间尺度上存在信息审查的情况下进行联合建模,并应用于姑息治疗研究。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biostatistics/kxad028
Quran Wu, Michael Daniels, Areej El-Jawahri, Marie Bakitas, Zhigang Li

Joint modeling of longitudinal data such as quality of life data and survival data is important for palliative care researchers to draw efficient inferences because it can account for the associations between those two types of data. Modeling quality of life on a retrospective from death time scale is useful for investigators to interpret the analysis results of palliative care studies which have relatively short life expectancies. However, informative censoring remains a complex challenge for modeling quality of life on the retrospective time scale although it has been addressed for joint models on the prospective time scale. To fill this gap, we develop a novel joint modeling approach that can address the challenge by allowing informative censoring events to be dependent on patients' quality of life and survival through a random effect. There are two sub-models in our approach: a linear mixed effect model for the longitudinal quality of life and a competing-risk model for the death time and dropout time that share the same random effect as the longitudinal model. Our approach can provide unbiased estimates for parameters of interest by appropriately modeling the informative censoring time. Model performance is assessed with a simulation study and compared with existing approaches. A real-world study is presented to illustrate the application of the new approach.

对生活质量数据和生存率数据等纵向数据进行联合建模,对于姑息治疗研究人员进行有效推断很重要,因为它可以解释这两类数据之间的关联。对死亡时间尺度的回顾性生活质量建模有助于研究人员解释预期寿命相对较短的姑息治疗研究的分析结果。然而,信息审查仍然是在回顾性时间尺度上建模生活质量的一个复杂挑战,尽管它已经在前瞻性时间尺度的联合模型中得到了解决。为了填补这一空白,我们开发了一种新的联合建模方法,通过允许信息审查事件通过随机效应依赖于患者的生活质量和生存率来应对这一挑战。我们的方法中有两个子模型:纵向生活质量的线性混合效应模型和死亡时间和辍学时间的竞争风险模型,它们与纵向模型具有相同的随机效应。我们的方法可以通过对信息审查时间进行适当建模,为感兴趣的参数提供无偏估计。模型性能通过模拟研究进行评估,并与现有方法进行比较。给出了一个真实世界的研究来说明新方法的应用。
{"title":"Joint modeling in presence of informative censoring on the retrospective time scale with application to palliative care research.","authors":"Quran Wu, Michael Daniels, Areej El-Jawahri, Marie Bakitas, Zhigang Li","doi":"10.1093/biostatistics/kxad028","DOIUrl":"10.1093/biostatistics/kxad028","url":null,"abstract":"<p><p>Joint modeling of longitudinal data such as quality of life data and survival data is important for palliative care researchers to draw efficient inferences because it can account for the associations between those two types of data. Modeling quality of life on a retrospective from death time scale is useful for investigators to interpret the analysis results of palliative care studies which have relatively short life expectancies. However, informative censoring remains a complex challenge for modeling quality of life on the retrospective time scale although it has been addressed for joint models on the prospective time scale. To fill this gap, we develop a novel joint modeling approach that can address the challenge by allowing informative censoring events to be dependent on patients' quality of life and survival through a random effect. There are two sub-models in our approach: a linear mixed effect model for the longitudinal quality of life and a competing-risk model for the death time and dropout time that share the same random effect as the longitudinal model. Our approach can provide unbiased estimates for parameters of interest by appropriately modeling the informative censoring time. Model performance is assessed with a simulation study and compared with existing approaches. A real-world study is presented to illustrate the application of the new approach.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"754-768"},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11247190/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41161763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biostatistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1