首页 > 最新文献

Biometrics最新文献

英文 中文
Bayesian covariate-dependent graph learning with a dual group spike-and-slab prior. 具有双群峰-板先验的贝叶斯协变量相关图学习。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2025-04-02 DOI: 10.1093/biomtc/ujaf053
Zijian Zeng, Meng Li, Marina Vannucci

Covariate-dependent graph learning has gained increasing interest in the graphical modeling literature for the analysis of heterogeneous data. This task, however, poses challenges to modeling, computational efficiency, and interpretability. The parameter of interest can be naturally represented as a 3-dimensional array with elements that can be grouped according to 2 directions, corresponding to node level and covariate level, respectively. In this article, we propose a novel dual group spike-and-slab prior that enables multi-level selection at covariate-level and node-level, as well as individual (local) level sparsity. We introduce a nested strategy with specific choices to address distinct challenges posed by the various grouping directions. For posterior inference, we develop a full Gibbs sampler for all parameters, which mitigates the difficulties of parameter tuning often encountered in high-dimensional graphical models and facilitates routine implementation. Through simulation studies, we demonstrate that the proposed model outperforms existing methods in its accuracy of graph recovery. We show the practical utility of our model via an application to microbiome data where we seek to better understand the interactions among microbes as well as how these are affected by relevant covariates.

协变量相关图学习在异构数据分析的图形建模文献中获得了越来越多的兴趣。然而,这项任务对建模、计算效率和可解释性提出了挑战。感兴趣的参数可以自然地表示为一个三维数组,其中的元素可以按照2个方向分组,分别对应于节点级别和协变量级别。在本文中,我们提出了一种新的双群尖峰-板先验,它可以在协变量水平和节点水平以及个体(局部)水平稀疏度上进行多级选择。我们引入了一个具有特定选择的嵌套策略,以解决各种分组方向带来的不同挑战。对于后验推理,我们为所有参数开发了一个完整的Gibbs采样器,这减轻了在高维图形模型中经常遇到的参数调整困难,并便于日常实现。通过仿真研究,我们证明了该模型在图恢复精度上优于现有方法。我们通过微生物组数据的应用程序展示了我们模型的实际效用,我们试图更好地了解微生物之间的相互作用以及这些相互作用如何受到相关协变量的影响。
{"title":"Bayesian covariate-dependent graph learning with a dual group spike-and-slab prior.","authors":"Zijian Zeng, Meng Li, Marina Vannucci","doi":"10.1093/biomtc/ujaf053","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf053","url":null,"abstract":"<p><p>Covariate-dependent graph learning has gained increasing interest in the graphical modeling literature for the analysis of heterogeneous data. This task, however, poses challenges to modeling, computational efficiency, and interpretability. The parameter of interest can be naturally represented as a 3-dimensional array with elements that can be grouped according to 2 directions, corresponding to node level and covariate level, respectively. In this article, we propose a novel dual group spike-and-slab prior that enables multi-level selection at covariate-level and node-level, as well as individual (local) level sparsity. We introduce a nested strategy with specific choices to address distinct challenges posed by the various grouping directions. For posterior inference, we develop a full Gibbs sampler for all parameters, which mitigates the difficulties of parameter tuning often encountered in high-dimensional graphical models and facilitates routine implementation. Through simulation studies, we demonstrate that the proposed model outperforms existing methods in its accuracy of graph recovery. We show the practical utility of our model via an application to microbiome data where we seek to better understand the interactions among microbes as well as how these are affected by relevant covariates.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 2","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143962246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
COCA: a randomized Bayesian design integrating dose optimization and component contribution assessment for combination therapies. COCA:一个随机贝叶斯设计,将剂量优化和成分贡献评估整合到联合治疗中。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2025-04-02 DOI: 10.1093/biomtc/ujaf077
Xiaohan Chi, Ruitao Lin, Ying Yuan

In cancer treatment, the development of combination therapies requires demonstrating the contribution of each individual drug and optimizing the dose during early-phase trials. This necessitates a large sample size, presenting formidable obstacles for drug developers. To address this issue, we propose a 2-stage randomized phase II design that seamlessly integrates combination dose optimization with component contribution assessment. In stage 1, the optimal combination dose is determined by maximizing the risk-benefit tradeoff across multiple candidate combination doses. In stage 2, a multi-arm randomized phase is initiated to evaluate the contribution of each component within the combination therapy. To increase trial efficiency and reduce the sample size, efficacy data from both stages are adaptively combined using a Bayesian logistic regression model with a spike-and-slab prior. The sample size and decision cutoffs of the proposed design are systematically determined based on a novel calibration procedure to achieve desired operating characteristics. Extensive simulation studies show that the proposed design achieves the dual goals of dose optimization and contribution assessment, while yielding substantial sample size savings compared to competing designs.

在癌症治疗中,联合疗法的发展需要证明每种药物的作用,并在早期试验中优化剂量。这需要大量的样本,这给药物开发人员带来了巨大的障碍。为了解决这一问题,我们提出了一种两阶段随机II期设计,将组合剂量优化与成分贡献评估无缝集成。在第一阶段,最佳联合剂量是通过最大化多个候选联合剂量的风险-收益权衡来确定的。在第2阶段,开始了一个多组随机阶段,以评估联合治疗中每个成分的贡献。为了提高试验效率和减少样本量,两个阶段的疗效数据使用具有尖刺-板先验的贝叶斯逻辑回归模型自适应地组合在一起。基于一种新的校准程序,系统地确定了所提出设计的样本量和决策截止点,以实现所需的操作特性。大量的模拟研究表明,所提出的设计实现了剂量优化和贡献评估的双重目标,同时与竞争设计相比,节省了大量的样本量。
{"title":"COCA: a randomized Bayesian design integrating dose optimization and component contribution assessment for combination therapies.","authors":"Xiaohan Chi, Ruitao Lin, Ying Yuan","doi":"10.1093/biomtc/ujaf077","DOIUrl":"10.1093/biomtc/ujaf077","url":null,"abstract":"<p><p>In cancer treatment, the development of combination therapies requires demonstrating the contribution of each individual drug and optimizing the dose during early-phase trials. This necessitates a large sample size, presenting formidable obstacles for drug developers. To address this issue, we propose a 2-stage randomized phase II design that seamlessly integrates combination dose optimization with component contribution assessment. In stage 1, the optimal combination dose is determined by maximizing the risk-benefit tradeoff across multiple candidate combination doses. In stage 2, a multi-arm randomized phase is initiated to evaluate the contribution of each component within the combination therapy. To increase trial efficiency and reduce the sample size, efficacy data from both stages are adaptively combined using a Bayesian logistic regression model with a spike-and-slab prior. The sample size and decision cutoffs of the proposed design are systematically determined based on a novel calibration procedure to achieve desired operating characteristics. Extensive simulation studies show that the proposed design achieves the dual goals of dose optimization and contribution assessment, while yielding substantial sample size savings compared to competing designs.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 2","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12206157/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144473981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Continuous-space occupancy models. 连续空间占用模型。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2025-04-02 DOI: 10.1093/biomtc/ujaf055
Wilson J Wright, Mevin B Hooten

Occupancy models are used to infer species distributions over large spatial extents while accounting for imperfect detection. Current approaches, however, are unable to model species occurrence over continuous spatial domains while accounting for the discrete spatial domain of the observed data. We develop a new class of spatial occupancy models that embeds a change of spatial support between the observed data and occurrence process. We use a clipped Gaussian process to represent species occurrence in continuous space, which can provide inferences at a finer resolution than the observed occupancy data. Our approach is beneficial because it allows for more realistic models of species occurrence, can account for species occurring in only a portion of a surveyed site, and can relate detection probabilities to these within-site occurrence proportions. We show how our model can be fit using Bayesian methods and develop a computationally efficient MCMC algorithm. In particular, we rely on a Vecchia approximation to implement the spatial Gaussian process describing species occurrence and develop a surrogate data approach for jointly updating the spatial terms and spatial covariance parameters. We demonstrate our model using simulated data and compare our approach to alternative spatial occupancy models. We also use our model to analyze ovenbird occurrence data collected in New Hampshire, USA.

占用模型用于推断物种在大空间范围内的分布,同时考虑到不完善的检测。然而,目前的方法在考虑观测数据的离散空间域时,无法在连续空间域上模拟物种的发生。我们开发了一类新的空间占用模型,该模型嵌入了观测数据和发生过程之间的空间支持变化。我们使用剪切高斯过程来表示连续空间中的物种发生,这可以提供比观测到的占用数据更精细的分辨率推断。我们的方法是有益的,因为它允许更现实的物种发生模型,可以解释物种只在调查地点的一部分发生,并且可以将检测概率与这些地点内的发生比例联系起来。我们展示了如何使用贝叶斯方法拟合我们的模型,并开发了一个计算效率高的MCMC算法。特别是,我们依靠Vecchia近似来实现描述物种发生的空间高斯过程,并开发了一种替代数据方法来共同更新空间项和空间协方差参数。我们使用模拟数据演示了我们的模型,并将我们的方法与其他空间占用模型进行了比较。我们还使用我们的模型来分析在美国新罕布什尔州收集的灶鸟发生数据。
{"title":"Continuous-space occupancy models.","authors":"Wilson J Wright, Mevin B Hooten","doi":"10.1093/biomtc/ujaf055","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf055","url":null,"abstract":"<p><p>Occupancy models are used to infer species distributions over large spatial extents while accounting for imperfect detection. Current approaches, however, are unable to model species occurrence over continuous spatial domains while accounting for the discrete spatial domain of the observed data. We develop a new class of spatial occupancy models that embeds a change of spatial support between the observed data and occurrence process. We use a clipped Gaussian process to represent species occurrence in continuous space, which can provide inferences at a finer resolution than the observed occupancy data. Our approach is beneficial because it allows for more realistic models of species occurrence, can account for species occurring in only a portion of a surveyed site, and can relate detection probabilities to these within-site occurrence proportions. We show how our model can be fit using Bayesian methods and develop a computationally efficient MCMC algorithm. In particular, we rely on a Vecchia approximation to implement the spatial Gaussian process describing species occurrence and develop a surrogate data approach for jointly updating the spatial terms and spatial covariance parameters. We demonstrate our model using simulated data and compare our approach to alternative spatial occupancy models. We also use our model to analyze ovenbird occurrence data collected in New Hampshire, USA.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 2","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143969710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving estimation efficiency for case-cohort studies with a cure fraction. 提高具有治愈率的病例队列研究的估计效率。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2025-04-02 DOI: 10.1093/biomtc/ujaf059
Qingning Zhou, Xu Cao

In the studies of time-to-event outcomes, it often happens that a fraction of subjects will never experience the event of interest, and these subjects are said to be cured. The studies with a cure fraction often yield a low event rate. To reduce cost and enhance study power, 2-phase sampling designs are often adopted, especially when the covariates of interest are expensive to measure or obtain. In this paper, we consider the generalized case-cohort design for studies with a cure fraction. Under this design, the expensive covariates are measured for a subset of the study cohort, called subcohort, and for all or a subset of the remaining subjects outside the subcohort who have experienced the event during the study, called cases. We propose a 2-step estimation procedure under a class of semiparametric transformation mixture cure models. We first develop a sieve maximum weighted likelihood method based only on the complete data and also devise an Expectation-Maximization (EM) algorithm for implementation. We then update the resulting estimator via a working model between the outcome and cheap covariates or auxiliary variables using the full data. We show that the proposed update estimator is consistent and asymptotically at least as efficient as the complete-data estimator, regardless of whether the working model is correctly specified or not. We also propose a weighted bootstrap procedure for variance estimation. Extensive simulation studies demonstrate the superior performance of the proposed method in finite-sample. An application to the National Wilms' Tumor Study is provided for illustration.

在对时间与事件结果的研究中,经常会有一小部分受试者永远不会经历感兴趣的事件,而这些受试者被认为是治愈的。使用治愈分数的研究通常产生较低的事件发生率。为了降低成本和提高研究能力,通常采用两相采样设计,特别是当感兴趣的协变量测量或获取成本很高时。在本文中,我们考虑了具有治愈率的研究的广义病例队列设计。在这种设计下,对研究队列的一个子集(称为亚队列)和在研究期间经历过该事件的子队列之外的所有或部分剩余受试者(称为病例)测量昂贵的协变量。提出了一类半参数变换混合模型下的两步估计方法。我们首先开发了一种仅基于完整数据的筛最大加权似然方法,并设计了一种期望最大化(EM)算法来实现。然后,我们通过使用完整数据的结果和廉价协变量或辅助变量之间的工作模型更新结果估计器。我们证明,无论工作模型是否正确指定,所提出的更新估计量是一致的,并且渐近地至少与完整数据估计量一样有效。我们还提出了一种加权自举法进行方差估计。大量的仿真研究证明了该方法在有限样本情况下的优越性能。为说明这一点,本文提供了国家威尔姆斯肿瘤研究的应用。
{"title":"Improving estimation efficiency for case-cohort studies with a cure fraction.","authors":"Qingning Zhou, Xu Cao","doi":"10.1093/biomtc/ujaf059","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf059","url":null,"abstract":"<p><p>In the studies of time-to-event outcomes, it often happens that a fraction of subjects will never experience the event of interest, and these subjects are said to be cured. The studies with a cure fraction often yield a low event rate. To reduce cost and enhance study power, 2-phase sampling designs are often adopted, especially when the covariates of interest are expensive to measure or obtain. In this paper, we consider the generalized case-cohort design for studies with a cure fraction. Under this design, the expensive covariates are measured for a subset of the study cohort, called subcohort, and for all or a subset of the remaining subjects outside the subcohort who have experienced the event during the study, called cases. We propose a 2-step estimation procedure under a class of semiparametric transformation mixture cure models. We first develop a sieve maximum weighted likelihood method based only on the complete data and also devise an Expectation-Maximization (EM) algorithm for implementation. We then update the resulting estimator via a working model between the outcome and cheap covariates or auxiliary variables using the full data. We show that the proposed update estimator is consistent and asymptotically at least as efficient as the complete-data estimator, regardless of whether the working model is correctly specified or not. We also propose a weighted bootstrap procedure for variance estimation. Extensive simulation studies demonstrate the superior performance of the proposed method in finite-sample. An application to the National Wilms' Tumor Study is provided for illustration.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 2","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143970761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variant specific treatment effects with applications in vaccine studies. 变异特异性治疗效果及其在疫苗研究中的应用。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2025-04-02 DOI: 10.1093/biomtc/ujaf068
Gellért Perényi, Mats Stensrud

Pathogens usually exist in heterogeneous variants, like subtypes and strains. Quantifying treatment effects on the different variants is important for guiding prevention policies and vaccine development. Here, we ground analyses of variant-specific effects on a formal framework for causal inference. This allows us to clarify the interpretation of existing methods and define new estimands. Unlike most of the existing literature, we explicitly consider the (realistic) setting with interference in the target population: even if individuals can be sensibly perceived as iid in randomized trial data, there will often be interference in the target population where treatments, such as vaccines, are rolled out. Thus, one of our contributions is to derive explicit conditions guaranteeing that commonly reported vaccine efficacy parameters quantify well-defined causal effects, also in the presence of interference. Furthermore, our results give alternative justifications for reporting estimands on the relative, rather than absolute, scale. We illustrate the findings with an analysis of a large HIV1 vaccine trial, where there is interest in distinguishing vaccine effects on viruses with different genome sequences.

病原体通常以异质变体存在,如亚型和菌株。量化不同变异的治疗效果对于指导预防政策和疫苗开发非常重要。在这里,我们将变异特异性效应的分析建立在因果推理的正式框架上。这使我们能够澄清对现有方法的解释并定义新的估计。与大多数现有文献不同,我们明确考虑了目标人群中存在干扰的(现实)设置:即使在随机试验数据中可以合理地将个体视为iid,但在推出疫苗等治疗方法的目标人群中,通常会存在干扰。因此,我们的贡献之一是推导出明确的条件,保证在存在干扰的情况下,通常报告的疫苗效力参数也能量化定义明确的因果效应。此外,我们的结果为报告相对规模而不是绝对规模的估计提供了另一种理由。我们通过对一项大型hiv - 1疫苗试验的分析来说明这些发现,该试验对区分疫苗对具有不同基因组序列的病毒的作用很感兴趣。
{"title":"Variant specific treatment effects with applications in vaccine studies.","authors":"Gellért Perényi, Mats Stensrud","doi":"10.1093/biomtc/ujaf068","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf068","url":null,"abstract":"<p><p>Pathogens usually exist in heterogeneous variants, like subtypes and strains. Quantifying treatment effects on the different variants is important for guiding prevention policies and vaccine development. Here, we ground analyses of variant-specific effects on a formal framework for causal inference. This allows us to clarify the interpretation of existing methods and define new estimands. Unlike most of the existing literature, we explicitly consider the (realistic) setting with interference in the target population: even if individuals can be sensibly perceived as iid in randomized trial data, there will often be interference in the target population where treatments, such as vaccines, are rolled out. Thus, one of our contributions is to derive explicit conditions guaranteeing that commonly reported vaccine efficacy parameters quantify well-defined causal effects, also in the presence of interference. Furthermore, our results give alternative justifications for reporting estimands on the relative, rather than absolute, scale. We illustrate the findings with an analysis of a large HIV1 vaccine trial, where there is interest in distinguishing vaccine effects on viruses with different genome sequences.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 2","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144198239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating weighted quantile treatment effects with missing outcome data by double sampling. 通过双重抽样估计缺少结果数据的加权分位数治疗效果。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2025-04-02 DOI: 10.1093/biomtc/ujaf038
Shuo Sun, Sebastien Haneuse, Alexander W Levis, Catherine Lee, David E Arterburn, Heidi Fischer, Susan Shortreed, Rajarshi Mukherjee

Causal weighted quantile treatment effects (WQTEs) complement standard mean-focused causal contrasts when interest lies at the tails of the counterfactual distribution. However, existing methods for estimating and inferring causal WQTEs assume complete data on all relevant factors, which is often not the case in practice, particularly when the data are not collected for research purposes, such as electronic health records (EHRs) and disease registries. Furthermore, these data may be particularly susceptible to the outcome data being missing-not-at-random (MNAR). This paper proposes to use double sampling, through which the otherwise missing data are ascertained on a sub-sample of study units, as a strategy to mitigate bias due to MNAR data in estimating causal WQTEs. With the additional data, we present identifying conditions that do not require missingness assumptions in the original data. We then propose a novel inverse-probability weighted estimator and derive its asymptotic properties, both pointwise at specific quantiles and uniformly across quantiles over some compact subset of (0,1), allowing the propensity score and double-sampling probabilities to be estimated. For practical inference, we develop a bootstrap method that can be used for both pointwise and uniform inference. A simulation study is conducted to examine the finite sample performance of the proposed estimators. We illustrate the proposed method using EHR data examining the relative effects of 2 bariatric surgery procedures on BMI loss 3 years post-surgery.

当关注点位于反事实分布的尾部时,因果加权量子治疗效应(WQTE)是对标准的以平均值为重点的因果对比的补充。然而,估计和推断因果加权量子治疗效应的现有方法假定所有相关因素的数据都是完整的,而实际情况往往并非如此,特别是当数据不是出于研究目的而收集时,如电子健康记录(EHR)和疾病登记。此外,这些数据可能特别容易造成结果数据的非随机遗漏(MNAR)。本文建议使用双重抽样,即从研究单位的子样本中确定原本缺失的数据,以此作为一种策略,在估算因果性 WQTE 时减少因 MNAR 数据造成的偏差。利用附加数据,我们提出了不需要原始数据中缺失假设的识别条件。然后,我们提出了一种新颖的反概率加权估计器,并推导出其渐近特性,包括在特定量化点上的渐近特性,以及在某个紧凑子集(0,1)上均匀跨量化点的渐近特性,从而可以估计倾向得分和双重抽样概率。为了进行实际推断,我们开发了一种自举法,既可用于点推断,也可用于均匀推断。我们进行了一项模拟研究,以检验所提出的估计器的有限样本性能。我们使用电子病历数据说明了所提出的方法,该数据检验了两种减肥手术对术后 3 年体重指数下降的相对影响。
{"title":"Estimating weighted quantile treatment effects with missing outcome data by double sampling.","authors":"Shuo Sun, Sebastien Haneuse, Alexander W Levis, Catherine Lee, David E Arterburn, Heidi Fischer, Susan Shortreed, Rajarshi Mukherjee","doi":"10.1093/biomtc/ujaf038","DOIUrl":"10.1093/biomtc/ujaf038","url":null,"abstract":"<p><p>Causal weighted quantile treatment effects (WQTEs) complement standard mean-focused causal contrasts when interest lies at the tails of the counterfactual distribution. However, existing methods for estimating and inferring causal WQTEs assume complete data on all relevant factors, which is often not the case in practice, particularly when the data are not collected for research purposes, such as electronic health records (EHRs) and disease registries. Furthermore, these data may be particularly susceptible to the outcome data being missing-not-at-random (MNAR). This paper proposes to use double sampling, through which the otherwise missing data are ascertained on a sub-sample of study units, as a strategy to mitigate bias due to MNAR data in estimating causal WQTEs. With the additional data, we present identifying conditions that do not require missingness assumptions in the original data. We then propose a novel inverse-probability weighted estimator and derive its asymptotic properties, both pointwise at specific quantiles and uniformly across quantiles over some compact subset of (0,1), allowing the propensity score and double-sampling probabilities to be estimated. For practical inference, we develop a bootstrap method that can be used for both pointwise and uniform inference. A simulation study is conducted to examine the finite sample performance of the proposed estimators. We illustrate the proposed method using EHR data examining the relative effects of 2 bariatric surgery procedures on BMI loss 3 years post-surgery.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 2","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11973573/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143794495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Continuous-time mediation analysis for repeatedly measured mediators and outcomes. 重复测量介质和结果的连续时间中介分析。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2025-04-02 DOI: 10.1093/biomtc/ujaf062
Le Bourdonnec Kateline, Valeri Linda, Proust-Lima Cécile

Mediation analysis aims to decipher the underlying causal mechanisms between an exposure, an outcome, and intermediate variables called mediators. Initially developed for fixed-time mediator and outcome, it has been extended to the framework of longitudinal data by discretizing the assessment times of mediator and outcome. Yet, processes in play in longitudinal studies are usually defined in continuous time and measured at irregular and subject-specific visits. This is the case in dementia research when cerebral and cognitive changes measured at planned visits in cohorts are of interest. We thus propose a methodology to estimate the causal mechanisms between a time-fixed exposure ($X$), a mediator process ($mathcal {M}_t$), and an outcome process ($mathcal {Y}_t$) both measured repeatedly over time in the presence of a time-dependent confounding process ($mathcal {L}_t$). We consider 2 types of causal estimands, the natural effects and path-specific effects. We provide identifiability assumptions, and we employ a multivariate mixed model based on differential equations for their estimation. The performances of the method are assessed in simulations, and the method is illustrated in 2 real-world examples motivated by the 3C cerebral aging study to assess (1) the effect of educational level on functional dependency through depressive symptomatology and cognitive functioning and (2) the effect of a genetic factor on cognitive functioning potentially mediated by vascular brain lesions and confounded by neurodegeneration.

中介分析旨在解读暴露、结果和中介变量之间潜在的因果机制。它最初是为固定时间的中介和结果而开发的,通过离散中介和结果的评估时间,它已扩展到纵向数据框架。然而,在纵向研究中发挥作用的过程通常是在连续时间内定义的,并在不定期和特定受试者的访问中进行测量。在痴呆症研究中,在有计划的队列访问中测量大脑和认知变化是有意义的。因此,我们提出了一种方法来估计时间固定暴露($X$),中介过程($mathcal {M}_t$)和结果过程($mathcal {Y}_t$)之间的因果机制,这两个过程都是在存在时间依赖的混淆过程($mathcal {L}_t$)的情况下随时间重复测量的。我们考虑两种类型的因果估计,自然效应和路径特定效应。我们提供了可辨识性假设,并采用基于微分方程的多元混合模型对其进行估计。该方法的性能在模拟中进行了评估,并在3C脑衰老研究的两个现实世界的例子中进行了验证,以评估(1)教育水平通过抑郁症状学和认知功能对功能依赖的影响;(2)遗传因素对认知功能的影响可能由血管性脑病变介导,并与神经变性混淆。
{"title":"Continuous-time mediation analysis for repeatedly measured mediators and outcomes.","authors":"Le Bourdonnec Kateline, Valeri Linda, Proust-Lima Cécile","doi":"10.1093/biomtc/ujaf062","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf062","url":null,"abstract":"<p><p>Mediation analysis aims to decipher the underlying causal mechanisms between an exposure, an outcome, and intermediate variables called mediators. Initially developed for fixed-time mediator and outcome, it has been extended to the framework of longitudinal data by discretizing the assessment times of mediator and outcome. Yet, processes in play in longitudinal studies are usually defined in continuous time and measured at irregular and subject-specific visits. This is the case in dementia research when cerebral and cognitive changes measured at planned visits in cohorts are of interest. We thus propose a methodology to estimate the causal mechanisms between a time-fixed exposure ($X$), a mediator process ($mathcal {M}_t$), and an outcome process ($mathcal {Y}_t$) both measured repeatedly over time in the presence of a time-dependent confounding process ($mathcal {L}_t$). We consider 2 types of causal estimands, the natural effects and path-specific effects. We provide identifiability assumptions, and we employ a multivariate mixed model based on differential equations for their estimation. The performances of the method are assessed in simulations, and the method is illustrated in 2 real-world examples motivated by the 3C cerebral aging study to assess (1) the effect of educational level on functional dependency through depressive symptomatology and cognitive functioning and (2) the effect of a genetic factor on cognitive functioning potentially mediated by vascular brain lesions and confounded by neurodegeneration.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 2","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Non-parametric estimators of hazard ratios for comparing two survival curves. 比较两条生存曲线的风险比的非参数估计。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2025-04-02 DOI: 10.1093/biomtc/ujaf072
Mihai Giurcanu, Theodore Karrison

We propose non-parametric estimators of the hazard ratio for comparing two survival curves using estimating equations defined in terms of group-specific cumulative hazard functions. We first describe the methods and their asymptotic properties in the case of a constant hazard ratio. We then extend the methods and the asymptotic results when the hazard ratio is time dependent and well approximated by a locally constant function. We propose a method to select the change points in the local hazard ratios. We extend the methods to stratified estimators and propose tests for heterogeneity of constant and time-dependent hazard ratios across strata. In a simulation study, we describe the finite sample properties of the proposed estimators and compare their performance with the Cox partial maximum likelihood estimator (MLE) in terms of efficiency and accuracy of coverage rates. An example is provided to illustrate an application of the proposed methods in practice.

我们提出了比较两个生存曲线的风险比的非参数估计,使用根据群体特定累积风险函数定义的估计方程。我们首先描述了这些方法及其在风险比为常数情况下的渐近性质。然后,我们推广了这些方法,并得到了风险比随时间变化且由局部常数函数近似时的渐近结果。我们提出了一种选择局部风险比变化点的方法。我们将方法扩展到分层估计器,并提出了跨地层恒定和时间相关风险比异质性的检验。在模拟研究中,我们描述了所提出的估计器的有限样本特性,并在覆盖率的效率和准确性方面将其与Cox偏极大似然估计器(MLE)的性能进行了比较。最后以实例说明了所提方法在实际中的应用。
{"title":"Non-parametric estimators of hazard ratios for comparing two survival curves.","authors":"Mihai Giurcanu, Theodore Karrison","doi":"10.1093/biomtc/ujaf072","DOIUrl":"10.1093/biomtc/ujaf072","url":null,"abstract":"<p><p>We propose non-parametric estimators of the hazard ratio for comparing two survival curves using estimating equations defined in terms of group-specific cumulative hazard functions. We first describe the methods and their asymptotic properties in the case of a constant hazard ratio. We then extend the methods and the asymptotic results when the hazard ratio is time dependent and well approximated by a locally constant function. We propose a method to select the change points in the local hazard ratios. We extend the methods to stratified estimators and propose tests for heterogeneity of constant and time-dependent hazard ratios across strata. In a simulation study, we describe the finite sample properties of the proposed estimators and compare their performance with the Cox partial maximum likelihood estimator (MLE) in terms of efficiency and accuracy of coverage rates. An example is provided to illustrate an application of the proposed methods in practice.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 2","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144332419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semiparametric joint modeling for biomarker trajectory before disease onset. 疾病发病前生物标志物轨迹的半参数联合建模。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2025-04-02 DOI: 10.1093/biomtc/ujaf064
Yifei Sun, Xiwen Zhao, Kwun Chuen Gary Chan, Wanwan Xu, Heather Allore, Yize Zhao

Understanding how biomarkers change in relation to disease pathogenesis is a key area in biomedical research. We propose a semiparametric joint model to analyze the temporal evolution of biomarkers prior to the onset of disease. The model allows for a flexible biomarker trajectory that depends on two time scales: a natural time scale such as age and time to disease onset. In practice, the natural time scale often differs from time-on-study, leading to analytical challenges such as left-truncation bias. We introduce a profile kernel estimating equation approach to estimate regression coefficients and unspecified baseline mean trajectory functions. We establish the large-sample properties of the proposed estimators and conduct simulation studies to evaluate their finite-sample performance. Our method is applied to investigate brain biomarker trajectories before the onset of preclinical Alzheimer's disease. We observed a decline in cortical thickness prior to disease onset across brain regions, with APOE4 carriers showing lower levels compared to non-carriers.

了解生物标志物变化与疾病发病机制的关系是生物医学研究的一个关键领域。我们提出了一个半参数联合模型来分析疾病发病前生物标志物的时间演变。该模型允许灵活的生物标志物轨迹取决于两个时间尺度:自然时间尺度,如年龄和疾病发病时间。在实践中,自然时间尺度往往不同于研究时间,导致分析上的挑战,如左截断偏差。我们引入了一种轮廓核估计方程方法来估计回归系数和未指定基线平均轨迹函数。我们建立了所提出的估计器的大样本特性,并进行模拟研究以评估其有限样本性能。我们的方法被用于研究临床前阿尔茨海默病发病前的大脑生物标志物轨迹。我们观察到,在疾病发作之前,整个大脑区域的皮质厚度下降,APOE4携带者比非携带者表现出更低的水平。
{"title":"Semiparametric joint modeling for biomarker trajectory before disease onset.","authors":"Yifei Sun, Xiwen Zhao, Kwun Chuen Gary Chan, Wanwan Xu, Heather Allore, Yize Zhao","doi":"10.1093/biomtc/ujaf064","DOIUrl":"10.1093/biomtc/ujaf064","url":null,"abstract":"<p><p>Understanding how biomarkers change in relation to disease pathogenesis is a key area in biomedical research. We propose a semiparametric joint model to analyze the temporal evolution of biomarkers prior to the onset of disease. The model allows for a flexible biomarker trajectory that depends on two time scales: a natural time scale such as age and time to disease onset. In practice, the natural time scale often differs from time-on-study, leading to analytical challenges such as left-truncation bias. We introduce a profile kernel estimating equation approach to estimate regression coefficients and unspecified baseline mean trajectory functions. We establish the large-sample properties of the proposed estimators and conduct simulation studies to evaluate their finite-sample performance. Our method is applied to investigate brain biomarker trajectories before the onset of preclinical Alzheimer's disease. We observed a decline in cortical thickness prior to disease onset across brain regions, with APOE4 carriers showing lower levels compared to non-carriers.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 2","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12117339/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144156246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Addressing confounding and continuous exposure measurement error using corrected score functions. 使用校正分数函数解决混淆和连续曝光测量误差。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-04-02 DOI: 10.1093/biomtc/ujaf045
Brian D Richardson, Bryan S Blette, Peter B Gilbert, Michael G Hudgens

Confounding and exposure measurement error can introduce bias when drawing inference about the marginal effect of an exposure on an outcome of interest. While there are broad methodologies for addressing each source of bias individually, confounding and exposure measurement error frequently co-occur, and there is a need for methods that address them simultaneously. In this paper, corrected score methods are derived under classical additive measurement error to draw inference about marginal exposure effects using only measured variables. Three estimators are proposed based on g-formula, inverse probability weighting, and doubly-robust estimation techniques. The estimators are shown to be consistent and asymptotically normal, and the doubly-robust estimator is shown to exhibit its namesake property. The methods, which are implemented in the R package mismex, perform well in finite samples under both confounding and measurement error as demonstrated by simulation studies. The proposed doubly-robust estimator is applied to study the effects of two biomarkers on HIV-1 infection using data from the HVTN 505 preventative vaccine trial.

混淆和曝光测量误差可以引入偏差,当得出关于曝光对感兴趣的结果的边际效应的推论。虽然有广泛的方法来单独解决每个偏差来源,但混淆和暴露测量误差经常同时发生,需要同时解决它们的方法。本文在经典的加性测量误差下推导了校正分数方法,仅使用测量变量来推断边际暴露效应。提出了基于g公式、逆概率加权和双鲁棒估计技术的三种估计方法。证明了估计量是相合的和渐近正态的,并证明了双鲁棒估计量具有相同的性质。在R包mismex中实现的方法,在有限的样本中,在混杂和测量误差下都表现良好,仿真研究表明。利用来自HVTN 505预防性疫苗试验的数据,将提出的双稳健估计量应用于研究两种生物标志物对HIV-1感染的影响。
{"title":"Addressing confounding and continuous exposure measurement error using corrected score functions.","authors":"Brian D Richardson, Bryan S Blette, Peter B Gilbert, Michael G Hudgens","doi":"10.1093/biomtc/ujaf045","DOIUrl":"10.1093/biomtc/ujaf045","url":null,"abstract":"<p><p>Confounding and exposure measurement error can introduce bias when drawing inference about the marginal effect of an exposure on an outcome of interest. While there are broad methodologies for addressing each source of bias individually, confounding and exposure measurement error frequently co-occur, and there is a need for methods that address them simultaneously. In this paper, corrected score methods are derived under classical additive measurement error to draw inference about marginal exposure effects using only measured variables. Three estimators are proposed based on g-formula, inverse probability weighting, and doubly-robust estimation techniques. The estimators are shown to be consistent and asymptotically normal, and the doubly-robust estimator is shown to exhibit its namesake property. The methods, which are implemented in the R package mismex, perform well in finite samples under both confounding and measurement error as demonstrated by simulation studies. The proposed doubly-robust estimator is applied to study the effects of two biomarkers on HIV-1 infection using data from the HVTN 505 preventative vaccine trial.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 2","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12038274/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143962773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1