首页 > 最新文献

Biostatistics最新文献

英文 中文
Identifying predictors of resilience to stressors in single-arm studies of pre-post change. 在前后变化的单臂研究中确定对压力的恢复能力的预测因素。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-10-01 DOI: 10.1093/biostatistics/kxad018
Ravi Varadhan, Jiafeng Zhu, Karen Bandeen-Roche

Many older adults experience a major stressor at some point in their lives. The ability to recover well after a major stressor is known as resilience. An important goal of geriatric research is to identify factors that influence resilience to stressors. Studies of resilience in older adults are typically conducted with a single-arm where everyone experiences the stressor. The simplistic approach of regressing change versus baseline yields biased estimates due to mathematical coupling and regression to the mean (RTM). We develop a method to correct the bias. We extend the method to include covariates. Our approach considers a counterfactual control group and involves sensitivity analyses to evaluate different settings of control group parameters. Only minimal distributional assumptions are required. Simulation studies demonstrate the validity of the method. We illustrate the method using a large, registry of older adults (N  =7239) who underwent total knee replacement (TKR). We demonstrate how external data can be utilized to constrain the sensitivity analysis. Naive analyses implicated several treatment effect modifiers including baseline function, age, body-mass index (BMI), gender, number of comorbidities, income, and race. Corrected analysis revealed that baseline (pre-stressor) function was not strongly linked to recovery after TKR and among the covariates, only age and number of comorbidities were consistently and negatively associated with post-stressor recovery in all functional domains. Correction of mathematical coupling and RTM is necessary for drawing valid inferences regarding the effect of covariates and baseline status on pre-post change. Our method provides a simple estimator to this end.

许多老年人在一生中都会遇到重大压力。在经历重大压力后能够很好地恢复的能力被称为恢复力。老年医学研究的一个重要目标就是找出影响压力恢复能力的因素。对老年人复原力的研究通常采用单臂法,即每个人都经历压力源。由于数学耦合和均值回归(RTM)的原因,将变化与基线进行回归的简单方法会产生有偏差的估计值。我们开发了一种方法来纠正这种偏差。我们将该方法扩展到包括协变量。我们的方法考虑了反事实对照组,并进行了敏感性分析,以评估对照组参数的不同设置。只需要最低限度的分布假设。模拟研究证明了该方法的有效性。我们使用一个接受全膝关节置换术(TKR)的大型老年人登记册(N = 7239)来说明该方法。我们展示了如何利用外部数据来限制敏感性分析。原始分析揭示了多个治疗效果调节因素,包括基线功能、年龄、体重指数 (BMI)、性别、合并症数量、收入和种族。校正分析表明,基线(应激前)功能与 TKR 术后恢复的关系不大,在协变量中,只有年龄和合并症数量与应激后所有功能领域的恢复持续负相关。为了有效推断协变量和基线状态对前后变化的影响,有必要对数学耦合和 RTM 进行校正。我们的方法为此提供了一个简单的估算器。
{"title":"Identifying predictors of resilience to stressors in single-arm studies of pre-post change.","authors":"Ravi Varadhan, Jiafeng Zhu, Karen Bandeen-Roche","doi":"10.1093/biostatistics/kxad018","DOIUrl":"10.1093/biostatistics/kxad018","url":null,"abstract":"<p><p>Many older adults experience a major stressor at some point in their lives. The ability to recover well after a major stressor is known as resilience. An important goal of geriatric research is to identify factors that influence resilience to stressors. Studies of resilience in older adults are typically conducted with a single-arm where everyone experiences the stressor. The simplistic approach of regressing change versus baseline yields biased estimates due to mathematical coupling and regression to the mean (RTM). We develop a method to correct the bias. We extend the method to include covariates. Our approach considers a counterfactual control group and involves sensitivity analyses to evaluate different settings of control group parameters. Only minimal distributional assumptions are required. Simulation studies demonstrate the validity of the method. We illustrate the method using a large, registry of older adults (N  =7239) who underwent total knee replacement (TKR). We demonstrate how external data can be utilized to constrain the sensitivity analysis. Naive analyses implicated several treatment effect modifiers including baseline function, age, body-mass index (BMI), gender, number of comorbidities, income, and race. Corrected analysis revealed that baseline (pre-stressor) function was not strongly linked to recovery after TKR and among the covariates, only age and number of comorbidities were consistently and negatively associated with post-stressor recovery in all functional domains. Correction of mathematical coupling and RTM is necessary for drawing valid inferences regarding the effect of covariates and baseline status on pre-post change. Our method provides a simple estimator to this end.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"1094-1111"},"PeriodicalIF":1.8,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10297247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating dynamic and predictive discrimination for recurrent event models: use of a time-dependent C-index. 评估动态和预测判别的反复事件模型:使用时间相关的c指数。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-10-01 DOI: 10.1093/biostatistics/kxad031
Jian Wang, Xinyang Jiang, Jing Ning

Interest in analyzing recurrent event data has increased over the past few decades. One essential aspect of a risk prediction model for recurrent event data is to accurately distinguish individuals with different risks of developing a recurrent event. Although the concordance index (C-index) effectively evaluates the overall discriminative ability of a regression model for recurrent event data, a local measure is also desirable to capture dynamic performance of the regression model over time. Therefore, in this study, we propose a time-dependent C-index measure for inferring the model's discriminative ability locally. We formulated the C-index as a function of time using a flexible parametric model and constructed a concordance-based likelihood for estimation and inference. We adapted a perturbation-resampling procedure for variance estimation. Extensive simulations were conducted to investigate the proposed time-dependent C-index's finite-sample performance and estimation procedure. We applied the time-dependent C-index to three regression models of a study of re-hospitalization in patients with colorectal cancer to evaluate the models' discriminative capability.

在过去的几十年里,人们对分析周期性事件数据的兴趣越来越大。复发事件数据风险预测模型的一个重要方面是准确区分具有不同复发事件风险的个体。虽然一致性指数(C-index)有效地评估了回归模型对周期性事件数据的整体判别能力,但也需要一个局部度量来捕捉回归模型随时间的动态性能。因此,在本研究中,我们提出了一个与时间相关的c指数测度来推断模型的局部判别能力。我们使用一个灵活的参数模型将c指数表述为时间的函数,并构建了一个基于一致性的似然估计和推断。我们采用了一种扰动重采样方法来估计方差。我们进行了大量的模拟,以研究所提出的时变c指数的有限样本性能和估计过程。我们将时间依赖的c指数应用于一项结直肠癌患者再住院研究的三个回归模型,以评估模型的判别能力。
{"title":"Evaluating dynamic and predictive discrimination for recurrent event models: use of a time-dependent C-index.","authors":"Jian Wang, Xinyang Jiang, Jing Ning","doi":"10.1093/biostatistics/kxad031","DOIUrl":"10.1093/biostatistics/kxad031","url":null,"abstract":"<p><p>Interest in analyzing recurrent event data has increased over the past few decades. One essential aspect of a risk prediction model for recurrent event data is to accurately distinguish individuals with different risks of developing a recurrent event. Although the concordance index (C-index) effectively evaluates the overall discriminative ability of a regression model for recurrent event data, a local measure is also desirable to capture dynamic performance of the regression model over time. Therefore, in this study, we propose a time-dependent C-index measure for inferring the model's discriminative ability locally. We formulated the C-index as a function of time using a flexible parametric model and constructed a concordance-based likelihood for estimation and inference. We adapted a perturbation-resampling procedure for variance estimation. Extensive simulations were conducted to investigate the proposed time-dependent C-index's finite-sample performance and estimation procedure. We applied the time-dependent C-index to three regression models of a study of re-hospitalization in patients with colorectal cancer to evaluate the models' discriminative capability.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"1140-1155"},"PeriodicalIF":1.8,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11471962/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89720651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Similarity-based multimodal regression. 基于相似性的多模态回归。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-10-01 DOI: 10.1093/biostatistics/kxad033
Andrew A Chen, Sarah M Weinstein, Azeez Adebimpe, Ruben C Gur, Raquel E Gur, Kathleen R Merikangas, Theodore D Satterthwaite, Russell T Shinohara, Haochang Shou

To better understand complex human phenotypes, large-scale studies have increasingly collected multiple data modalities across domains such as imaging, mobile health, and physical activity. The properties of each data type often differ substantially and require either separate analyses or extensive processing to obtain comparable features for a combined analysis. Multimodal data fusion enables certain analyses on matrix-valued and vector-valued data, but it generally cannot integrate modalities of different dimensions and data structures. For a single data modality, multivariate distance matrix regression provides a distance-based framework for regression accommodating a wide range of data types. However, no distance-based method exists to handle multiple complementary types of data. We propose a novel distance-based regression model, which we refer to as Similarity-based Multimodal Regression (SiMMR), that enables simultaneous regression of multiple modalities through their distance profiles. We demonstrate through simulation, imaging studies, and longitudinal mobile health analyses that our proposed method can detect associations between clinical variables and multimodal data of differing properties and dimensionalities, even with modest sample sizes. We perform experiments to evaluate several different test statistics and provide recommendations for applying our method across a broad range of scenarios.

为了更好地理解复杂的人类表型,大规模研究越来越多地收集了成像、移动健康和身体活动等领域的多种数据模式。每种数据类型的属性通常差别很大,需要单独分析或广泛处理才能获得可比较的特征,以便进行组合分析。多模态数据融合可以对矩阵值和向量值数据进行一定的分析,但通常不能将不同维数和数据结构的模态融合在一起。对于单一数据模式,多变量距离矩阵回归提供了一个基于距离的框架,用于容纳各种数据类型的回归。然而,目前还没有基于距离的方法来处理多种互补类型的数据。我们提出了一种新的基于距离的回归模型,我们称之为基于相似性的多模态回归(SiMMR),它可以通过它们的距离曲线同时回归多个模态。我们通过模拟、成像研究和纵向移动健康分析证明,即使样本量不大,我们提出的方法也可以检测临床变量与不同性质和维度的多模态数据之间的关联。我们执行实验来评估几个不同的测试统计数据,并为在广泛的场景中应用我们的方法提供建议。
{"title":"Similarity-based multimodal regression.","authors":"Andrew A Chen, Sarah M Weinstein, Azeez Adebimpe, Ruben C Gur, Raquel E Gur, Kathleen R Merikangas, Theodore D Satterthwaite, Russell T Shinohara, Haochang Shou","doi":"10.1093/biostatistics/kxad033","DOIUrl":"10.1093/biostatistics/kxad033","url":null,"abstract":"<p><p>To better understand complex human phenotypes, large-scale studies have increasingly collected multiple data modalities across domains such as imaging, mobile health, and physical activity. The properties of each data type often differ substantially and require either separate analyses or extensive processing to obtain comparable features for a combined analysis. Multimodal data fusion enables certain analyses on matrix-valued and vector-valued data, but it generally cannot integrate modalities of different dimensions and data structures. For a single data modality, multivariate distance matrix regression provides a distance-based framework for regression accommodating a wide range of data types. However, no distance-based method exists to handle multiple complementary types of data. We propose a novel distance-based regression model, which we refer to as Similarity-based Multimodal Regression (SiMMR), that enables simultaneous regression of multiple modalities through their distance profiles. We demonstrate through simulation, imaging studies, and longitudinal mobile health analyses that our proposed method can detect associations between clinical variables and multimodal data of differing properties and dimensionalities, even with modest sample sizes. We perform experiments to evaluate several different test statistics and provide recommendations for applying our method across a broad range of scenarios.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"1122-1139"},"PeriodicalIF":1.8,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11471965/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138500309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation of optimal treatment regimes with electronic medical record data using the residual life value estimator. 使用剩余生命值估算器,利用电子病历数据估算最佳治疗方案。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-10-01 DOI: 10.1093/biostatistics/kxae002
Grace Rhodes, Marie Davidian, Wenbin Lu

Clinicians and patients must make treatment decisions at a series of key decision points throughout disease progression. A dynamic treatment regime is a set of sequential decision rules that return treatment decisions based on accumulating patient information, like that commonly found in electronic medical record (EMR) data. When applied to a patient population, an optimal treatment regime leads to the most favorable outcome on average. Identifying optimal treatment regimes that maximize residual life is especially desirable for patients with life-threatening diseases such as sepsis, a complex medical condition that involves severe infections with organ dysfunction. We introduce the residual life value estimator (ReLiVE), an estimator for the expected value of cumulative restricted residual life under a fixed treatment regime. Building on ReLiVE, we present a method for estimating an optimal treatment regime that maximizes expected cumulative restricted residual life. Our proposed method, ReLiVE-Q, conducts estimation via the backward induction algorithm Q-learning. We illustrate the utility of ReLiVE-Q in simulation studies, and we apply ReLiVE-Q to estimate an optimal treatment regime for septic patients in the intensive care unit using EMR data from the Multiparameter Intelligent Monitoring Intensive Care database. Ultimately, we demonstrate that ReLiVE-Q leverages accumulating patient information to estimate personalized treatment regimes that optimize a clinically meaningful function of residual life.

临床医生和患者必须在疾病进展过程中的一系列关键决策点上做出治疗决策。动态治疗方案是一套连续的决策规则,根据不断积累的患者信息(如电子病历(EMR)数据中常见的信息)返回治疗决策。当应用于患者群体时,最佳治疗方案平均会带来最有利的结果。对于脓毒症等危及生命的疾病患者来说,找出能最大限度延长剩余生命的最佳治疗方案尤为重要,脓毒症是一种复杂的疾病,涉及严重感染和器官功能障碍。我们引入了残余生命值估算器(ReLiVE),这是一种在固定治疗方案下累积受限残余生命预期值的估算器。在 ReLiVE 的基础上,我们提出了一种估算最佳治疗方案的方法,该方案可使预期累积受限残余寿命最大化。我们提出的 ReLiVE-Q 方法通过后向归纳算法 Q-learning 进行估算。我们在模拟研究中说明了 ReLiVE-Q 的实用性,并利用多参数智能监测重症监护数据库中的 EMR 数据,应用 ReLiVE-Q 估算了重症监护病房脓毒症患者的最佳治疗方案。最终,我们证明了 ReLiVE-Q 能够利用不断积累的患者信息来估算个性化治疗方案,从而优化具有临床意义的剩余生命功能。
{"title":"Estimation of optimal treatment regimes with electronic medical record data using the residual life value estimator.","authors":"Grace Rhodes, Marie Davidian, Wenbin Lu","doi":"10.1093/biostatistics/kxae002","DOIUrl":"10.1093/biostatistics/kxae002","url":null,"abstract":"<p><p>Clinicians and patients must make treatment decisions at a series of key decision points throughout disease progression. A dynamic treatment regime is a set of sequential decision rules that return treatment decisions based on accumulating patient information, like that commonly found in electronic medical record (EMR) data. When applied to a patient population, an optimal treatment regime leads to the most favorable outcome on average. Identifying optimal treatment regimes that maximize residual life is especially desirable for patients with life-threatening diseases such as sepsis, a complex medical condition that involves severe infections with organ dysfunction. We introduce the residual life value estimator (ReLiVE), an estimator for the expected value of cumulative restricted residual life under a fixed treatment regime. Building on ReLiVE, we present a method for estimating an optimal treatment regime that maximizes expected cumulative restricted residual life. Our proposed method, ReLiVE-Q, conducts estimation via the backward induction algorithm Q-learning. We illustrate the utility of ReLiVE-Q in simulation studies, and we apply ReLiVE-Q to estimate an optimal treatment regime for septic patients in the intensive care unit using EMR data from the Multiparameter Intelligent Monitoring Intensive Care database. Ultimately, we demonstrate that ReLiVE-Q leverages accumulating patient information to estimate personalized treatment regimes that optimize a clinically meaningful function of residual life.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"933-946"},"PeriodicalIF":1.8,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11471959/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139708547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Signal detection statistics of adverse drug events in hierarchical structure for matched case-control data. 匹配病例对照数据的分级结构中药物不良事件的信号检测统计。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-10-01 DOI: 10.1093/biostatistics/kxad029
Seok-Jae Heo, Sohee Jeong, Dagyeom Jung, Inkyung Jung

The tree-based scan statistic is a data mining method used to identify signals of adverse drug reactions in a database of spontaneous reporting systems. It is particularly beneficial when dealing with hierarchical data structures. One may use a retrospective case-control study design from spontaneous reporting systems (SRS) to investigate whether a specific adverse event of interest is associated with certain drugs. However, the existing Bernoulli model of the tree-based scan statistic may not be suitable as it fails to adequately account for dependencies within matched pairs. In this article, we propose signal detection statistics for matched case-control data based on McNemar's test, Wald test for conditional logistic regression, and the likelihood ratio test for a multinomial distribution. Through simulation studies, we demonstrate that our proposed methods outperform the existing approach in terms of the type I error rate, power, sensitivity, and false detection rate. To illustrate our proposed approach, we applied the three methods and the existing method to detect drug signals for dizziness-related adverse events related to antihypertensive drugs using the database of the Korea Adverse Event Reporting System.

基于树的扫描统计是一种数据挖掘方法,用于在自发报告系统的数据库中识别药物不良反应的信号。它在处理分层数据结构时特别有益。可以使用自发报告系统(SRS)的回顾性病例对照研究设计来调查感兴趣的特定不良事件是否与某些药物有关。然而,现有的基于树的扫描统计的伯努利模型可能不合适,因为它不能充分考虑匹配对内的依赖性。在本文中,我们提出了基于McNemar检验、条件逻辑回归的Wald检验和多项式分布的似然比检验的匹配病例对照数据的信号检测统计。通过仿真研究,我们证明了我们提出的方法在I型错误率、功率、灵敏度和错误检测率方面优于现有方法。为了说明我们提出的方法,我们使用韩国不良事件报告系统的数据库,应用这三种方法和现有方法来检测与降压药相关的头晕相关不良事件的药物信号。
{"title":"Signal detection statistics of adverse drug events in hierarchical structure for matched case-control data.","authors":"Seok-Jae Heo, Sohee Jeong, Dagyeom Jung, Inkyung Jung","doi":"10.1093/biostatistics/kxad029","DOIUrl":"10.1093/biostatistics/kxad029","url":null,"abstract":"<p><p>The tree-based scan statistic is a data mining method used to identify signals of adverse drug reactions in a database of spontaneous reporting systems. It is particularly beneficial when dealing with hierarchical data structures. One may use a retrospective case-control study design from spontaneous reporting systems (SRS) to investigate whether a specific adverse event of interest is associated with certain drugs. However, the existing Bernoulli model of the tree-based scan statistic may not be suitable as it fails to adequately account for dependencies within matched pairs. In this article, we propose signal detection statistics for matched case-control data based on McNemar's test, Wald test for conditional logistic regression, and the likelihood ratio test for a multinomial distribution. Through simulation studies, we demonstrate that our proposed methods outperform the existing approach in terms of the type I error rate, power, sensitivity, and false detection rate. To illustrate our proposed approach, we applied the three methods and the existing method to detect drug signals for dizziness-related adverse events related to antihypertensive drugs using the database of the Korea Adverse Event Reporting System.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"1112-1121"},"PeriodicalIF":1.8,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"54232410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of complier and noncomplier average causal effects in the presence of latent missing-at-random (LMAR) outcomes: a unifying view and choices of assumptions. 在存在潜在随机遗漏(LMAR)结果的情况下,识别合意者和非合意者的平均因果效应:统一观点和假设选择。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-10-01 DOI: 10.1093/biostatistics/kxae011
Trang Quynh Nguyen, Michelle C Carlson, Elizabeth A Stuart

The study of treatment effects is often complicated by noncompliance and missing data. In the one-sided noncompliance setting where of interest are the complier and noncomplier average causal effects, we address outcome missingness of the latent missing at random type (LMAR, also known as latent ignorability). That is, conditional on covariates and treatment assigned, the missingness may depend on compliance type. Within the instrumental variable (IV) approach to noncompliance, methods have been proposed for handling LMAR outcome that additionally invoke an exclusion restriction-type assumption on missingness, but no solution has been proposed for when a non-IV approach is used. This article focuses on effect identification in the presence of LMAR outcomes, with a view to flexibly accommodate different principal identification approaches. We show that under treatment assignment ignorability and LMAR only, effect nonidentifiability boils down to a set of two connected mixture equations involving unidentified stratum-specific response probabilities and outcome means. This clarifies that (except for a special case) effect identification generally requires two additional assumptions: a specific missingness mechanism assumption and a principal identification assumption. This provides a template for identifying effects based on separate choices of these assumptions. We consider a range of specific missingness assumptions, including those that have appeared in the literature and some new ones. Incidentally, we find an issue in the existing assumptions, and propose a modification of the assumptions to avoid the issue. Results under different assumptions are illustrated using data from the Baltimore Experience Corps Trial.

对治疗效果的研究通常会因违规和数据缺失而变得复杂。在单侧不遵医嘱的情况下,我们关注的是遵医嘱者和非遵医嘱者的平均因果效应,我们处理的是潜在随机缺失类型的结果缺失(LMAR,也称为潜在无知)。也就是说,在协变量和治疗分配的条件下,遗漏率可能取决于遵从类型。在处理不遵从的工具变量(IV)方法中,已经提出了处理 LMAR 结果的方法,这些方法额外援引了关于缺失的排除限制型假设,但还没有提出使用非 IV 方法时的解决方案。本文重点讨论存在 LMAR 结果时的效应识别,以期灵活地适应不同的主要识别方法。我们的研究表明,仅在处理分配无知和 LMAR 的情况下,效应不可识别性可归结为涉及未识别的特定分层反应概率和结果均值的两个相连混合方程组。这就说明(除特殊情况外)效应识别一般需要两个额外的假设:特定的遗漏机制假设和主要识别假设。这为根据这些假设的不同选择来识别效应提供了一个模板。我们考虑了一系列特定的缺失假设,包括文献中出现过的假设和一些新的假设。顺便提一下,我们发现了现有假设中的一个问题,并提出了修改假设以避免该问题的建议。我们使用巴尔的摩体验团试验的数据说明了不同假设下的结果。
{"title":"Identification of complier and noncomplier average causal effects in the presence of latent missing-at-random (LMAR) outcomes: a unifying view and choices of assumptions.","authors":"Trang Quynh Nguyen, Michelle C Carlson, Elizabeth A Stuart","doi":"10.1093/biostatistics/kxae011","DOIUrl":"10.1093/biostatistics/kxae011","url":null,"abstract":"<p><p>The study of treatment effects is often complicated by noncompliance and missing data. In the one-sided noncompliance setting where of interest are the complier and noncomplier average causal effects, we address outcome missingness of the latent missing at random type (LMAR, also known as latent ignorability). That is, conditional on covariates and treatment assigned, the missingness may depend on compliance type. Within the instrumental variable (IV) approach to noncompliance, methods have been proposed for handling LMAR outcome that additionally invoke an exclusion restriction-type assumption on missingness, but no solution has been proposed for when a non-IV approach is used. This article focuses on effect identification in the presence of LMAR outcomes, with a view to flexibly accommodate different principal identification approaches. We show that under treatment assignment ignorability and LMAR only, effect nonidentifiability boils down to a set of two connected mixture equations involving unidentified stratum-specific response probabilities and outcome means. This clarifies that (except for a special case) effect identification generally requires two additional assumptions: a specific missingness mechanism assumption and a principal identification assumption. This provides a template for identifying effects based on separate choices of these assumptions. We consider a range of specific missingness assumptions, including those that have appeared in the literature and some new ones. Incidentally, we find an issue in the existing assumptions, and propose a modification of the assumptions to avoid the issue. Results under different assumptions are illustrated using data from the Baltimore Experience Corps Trial.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"978-996"},"PeriodicalIF":1.8,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11471963/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140861668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic and concordance-assisted learning for risk stratification with application to Alzheimer's disease. 应用于阿尔茨海默病风险分层的动态和一致性辅助学习。
IF 2.1 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-09-10 DOI: 10.1093/biostatistics/kxae036
Wen Li,Ruosha Li,Ziding Feng,Jing Ning,
Dynamic prediction models capable of retaining accuracy by evolving over time could play a significant role for monitoring disease progression in clinical practice. In biomedical studies with long-term follow up, participants are often monitored through periodic clinical visits with repeat measurements until an occurrence of the event of interest (e.g. disease onset) or the study end. Acknowledging the dynamic nature of disease risk and clinical information contained in the longitudinal markers, we propose an innovative concordance-assisted learning algorithm to derive a real-time risk stratification score. The proposed approach bypasses the need to fit regression models, such as joint models of the longitudinal markers and time-to-event outcome, and hence enjoys the desirable property of model robustness. Simulation studies confirmed that the proposed method has satisfactory performance in dynamically monitoring the risk of developing disease and differentiating high-risk and low-risk population over time. We apply the proposed method to the Alzheimer's Disease Neuroimaging Initiative data and develop a dynamic risk score of Alzheimer's Disease for patients with mild cognitive impairment using multiple longitudinal markers and baseline prognostic factors.
动态预测模型能够随着时间的推移而不断变化,从而保持准确性,这对于临床实践中监测疾病的进展具有重要作用。在长期随访的生物医学研究中,参与者通常通过定期临床访问和重复测量进行监测,直到相关事件(如疾病发作)发生或研究结束。考虑到纵向标记中包含的疾病风险和临床信息的动态性质,我们提出了一种创新的一致性辅助学习算法,以得出实时风险分层评分。所提出的方法无需拟合回归模型,如纵向指标和时间到事件结果的联合模型,因此具有理想的模型稳健性。模拟研究证实,所提出的方法在动态监测患病风险和区分高危和低危人群方面具有令人满意的性能。我们将提出的方法应用于阿尔茨海默病神经影像倡议数据,并利用多个纵向标记和基线预后因素为轻度认知障碍患者建立了阿尔茨海默病动态风险评分。
{"title":"Dynamic and concordance-assisted learning for risk stratification with application to Alzheimer's disease.","authors":"Wen Li,Ruosha Li,Ziding Feng,Jing Ning,","doi":"10.1093/biostatistics/kxae036","DOIUrl":"https://doi.org/10.1093/biostatistics/kxae036","url":null,"abstract":"Dynamic prediction models capable of retaining accuracy by evolving over time could play a significant role for monitoring disease progression in clinical practice. In biomedical studies with long-term follow up, participants are often monitored through periodic clinical visits with repeat measurements until an occurrence of the event of interest (e.g. disease onset) or the study end. Acknowledging the dynamic nature of disease risk and clinical information contained in the longitudinal markers, we propose an innovative concordance-assisted learning algorithm to derive a real-time risk stratification score. The proposed approach bypasses the need to fit regression models, such as joint models of the longitudinal markers and time-to-event outcome, and hence enjoys the desirable property of model robustness. Simulation studies confirmed that the proposed method has satisfactory performance in dynamically monitoring the risk of developing disease and differentiating high-risk and low-risk population over time. We apply the proposed method to the Alzheimer's Disease Neuroimaging Initiative data and develop a dynamic risk score of Alzheimer's Disease for patients with mild cognitive impairment using multiple longitudinal markers and baseline prognostic factors.","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"110 1","pages":""},"PeriodicalIF":2.1,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142195464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HMM for discovering decision-making dynamics using reinforcement learning experiments. 利用强化学习实验发现决策动态的 HMM。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-09-03 DOI: 10.1093/biostatistics/kxae033
Xingche Guo, Donglin Zeng, Yuanjia Wang

Major depressive disorder (MDD), a leading cause of years of life lived with disability, presents challenges in diagnosis and treatment due to its complex and heterogeneous nature. Emerging evidence indicates that reward processing abnormalities may serve as a behavioral marker for MDD. To measure reward processing, patients perform computer-based behavioral tasks that involve making choices or responding to stimulants that are associated with different outcomes, such as gains or losses in the laboratory. Reinforcement learning (RL) models are fitted to extract parameters that measure various aspects of reward processing (e.g. reward sensitivity) to characterize how patients make decisions in behavioral tasks. Recent findings suggest the inadequacy of characterizing reward learning solely based on a single RL model; instead, there may be a switching of decision-making processes between multiple strategies. An important scientific question is how the dynamics of strategies in decision-making affect the reward learning ability of individuals with MDD. Motivated by the probabilistic reward task within the Establishing Moderators and Biosignatures of Antidepressant Response in Clinical Care (EMBARC) study, we propose a novel RL-HMM (hidden Markov model) framework for analyzing reward-based decision-making. Our model accommodates decision-making strategy switching between two distinct approaches under an HMM: subjects making decisions based on the RL model or opting for random choices. We account for continuous RL state space and allow time-varying transition probabilities in the HMM. We introduce a computationally efficient Expectation-maximization (EM) algorithm for parameter estimation and use a nonparametric bootstrap for inference. Extensive simulation studies validate the finite-sample performance of our method. We apply our approach to the EMBARC study to show that MDD patients are less engaged in RL compared to the healthy controls, and engagement is associated with brain activities in the negative affect circuitry during an emotional conflict task.

重度抑郁障碍(MDD)是导致残疾生活年数的主要原因,由于其复杂性和异质性,给诊断和治疗带来了挑战。新出现的证据表明,奖赏处理异常可作为重度抑郁症的行为标记。为了测量奖赏加工,患者要完成基于计算机的行为任务,其中涉及做出选择或对兴奋剂做出反应,而这些选择或反应与不同的结果有关,例如在实验室中的收益或损失。对强化学习(RL)模型进行拟合,以提取衡量奖赏处理各方面(如奖赏敏感性)的参数,从而描述患者在行为任务中如何做出决策。最近的研究结果表明,仅根据单一的 RL 模型来描述奖赏学习是不够的;相反,决策过程可能会在多种策略之间切换。一个重要的科学问题是,决策策略的动态变化如何影响 MDD 患者的奖赏学习能力。受 "建立临床护理中抗抑郁剂反应的调节因子和生物特征"(EMBARC)研究中的概率奖励任务的启发,我们提出了一种新的 RL-HMM(隐马尔可夫模型)框架,用于分析基于奖励的决策。我们的模型允许在 HMM 下的两种不同方法之间切换决策策略:受试者根据 RL 模型做出决策或选择随机选择。我们考虑了连续的 RL 状态空间,并允许 HMM 中的过渡概率随时间变化。我们引入了一种计算高效的期望最大化(EM)算法来进行参数估计,并使用非参数自举法进行推断。广泛的模拟研究验证了我们方法的有限样本性能。我们将我们的方法应用于 EMBARC 研究,结果表明与健康对照组相比,MDD 患者在 RL 中的参与度较低,而参与度与情绪冲突任务中负面情绪回路的大脑活动有关。
{"title":"HMM for discovering decision-making dynamics using reinforcement learning experiments.","authors":"Xingche Guo, Donglin Zeng, Yuanjia Wang","doi":"10.1093/biostatistics/kxae033","DOIUrl":"https://doi.org/10.1093/biostatistics/kxae033","url":null,"abstract":"<p><p>Major depressive disorder (MDD), a leading cause of years of life lived with disability, presents challenges in diagnosis and treatment due to its complex and heterogeneous nature. Emerging evidence indicates that reward processing abnormalities may serve as a behavioral marker for MDD. To measure reward processing, patients perform computer-based behavioral tasks that involve making choices or responding to stimulants that are associated with different outcomes, such as gains or losses in the laboratory. Reinforcement learning (RL) models are fitted to extract parameters that measure various aspects of reward processing (e.g. reward sensitivity) to characterize how patients make decisions in behavioral tasks. Recent findings suggest the inadequacy of characterizing reward learning solely based on a single RL model; instead, there may be a switching of decision-making processes between multiple strategies. An important scientific question is how the dynamics of strategies in decision-making affect the reward learning ability of individuals with MDD. Motivated by the probabilistic reward task within the Establishing Moderators and Biosignatures of Antidepressant Response in Clinical Care (EMBARC) study, we propose a novel RL-HMM (hidden Markov model) framework for analyzing reward-based decision-making. Our model accommodates decision-making strategy switching between two distinct approaches under an HMM: subjects making decisions based on the RL model or opting for random choices. We account for continuous RL state space and allow time-varying transition probabilities in the HMM. We introduce a computationally efficient Expectation-maximization (EM) algorithm for parameter estimation and use a nonparametric bootstrap for inference. Extensive simulation studies validate the finite-sample performance of our method. We apply our approach to the EMBARC study to show that MDD patients are less engaged in RL compared to the healthy controls, and engagement is associated with brain activities in the negative affect circuitry during an emotional conflict task.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142127451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction. 更正。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-26 DOI: 10.1093/biostatistics/kxae029
{"title":"Correction.","authors":"","doi":"10.1093/biostatistics/kxae029","DOIUrl":"10.1093/biostatistics/kxae029","url":null,"abstract":"","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142074610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exposure proximal immune correlates analysis. 接触近端免疫相关性分析。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-14 DOI: 10.1093/biostatistics/kxae031
Ying Huang, Dean Follmann

Immune response decays over time, and vaccine-induced protection often wanes. Understanding how vaccine efficacy changes over time is critical to guiding the development and application of vaccines in preventing infectious diseases. The objective of this article is to develop statistical methods that assess the effect of decaying immune responses on the risk of disease and on vaccine efficacy, within the context of Cox regression with sparse sampling of immune responses, in a baseline-naive population. We aim to further disentangle the various aspects of the time-varying vaccine effect, whether direct on disease or mediated through immune responses. Based on time-to-event data from a vaccine efficacy trial and sparse sampling of longitudinal immune responses, we propose a weighted estimated induced likelihood approach that models the longitudinal immune response trajectory and the time to event separately. This approach assesses the effects of the decaying immune response, the peak immune response, and/or the waning vaccine effect on the risk of disease. The proposed method is applicable not only to standard randomized trial designs but also to augmented vaccine trial designs that re-vaccinate uninfected placebo recipients at the end of the standard trial period. We conducted simulation studies to evaluate the performance of our method and applied the method to analyze immune correlates from a phase III SARS-CoV-2 vaccine trial.

免疫反应会随着时间的推移而衰减,疫苗诱导的保护作用往往会减弱。了解疫苗效力如何随时间而变化,对于指导疫苗的开发和应用以预防传染病至关重要。本文旨在开发统计方法,在对基线免疫人群的免疫反应进行稀疏采样的考克斯回归背景下,评估衰减的免疫反应对疾病风险和疫苗效力的影响。我们的目标是进一步厘清疫苗时变效应的各个方面,无论是直接影响疾病还是通过免疫反应介导。基于疫苗疗效试验的事件发生时间数据和纵向免疫反应的稀疏采样,我们提出了一种加权估计诱导似然法,该方法对纵向免疫反应轨迹和事件发生时间分别建模。这种方法可评估免疫反应衰减、免疫反应高峰和/或疫苗效果减弱对疾病风险的影响。所提出的方法不仅适用于标准随机试验设计,也适用于在标准试验期结束时对未感染的安慰剂受试者进行再接种的增强疫苗试验设计。我们进行了模拟研究来评估我们的方法的性能,并将该方法应用于分析 SARS-CoV-2 疫苗 III 期试验的免疫相关性。
{"title":"Exposure proximal immune correlates analysis.","authors":"Ying Huang, Dean Follmann","doi":"10.1093/biostatistics/kxae031","DOIUrl":"10.1093/biostatistics/kxae031","url":null,"abstract":"<p><p>Immune response decays over time, and vaccine-induced protection often wanes. Understanding how vaccine efficacy changes over time is critical to guiding the development and application of vaccines in preventing infectious diseases. The objective of this article is to develop statistical methods that assess the effect of decaying immune responses on the risk of disease and on vaccine efficacy, within the context of Cox regression with sparse sampling of immune responses, in a baseline-naive population. We aim to further disentangle the various aspects of the time-varying vaccine effect, whether direct on disease or mediated through immune responses. Based on time-to-event data from a vaccine efficacy trial and sparse sampling of longitudinal immune responses, we propose a weighted estimated induced likelihood approach that models the longitudinal immune response trajectory and the time to event separately. This approach assesses the effects of the decaying immune response, the peak immune response, and/or the waning vaccine effect on the risk of disease. The proposed method is applicable not only to standard randomized trial designs but also to augmented vaccine trial designs that re-vaccinate uninfected placebo recipients at the end of the standard trial period. We conducted simulation studies to evaluate the performance of our method and applied the method to analyze immune correlates from a phase III SARS-CoV-2 vaccine trial.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141984050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biostatistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1