首页 > 最新文献

Statistics in Medicine最新文献

英文 中文
A Tutorial on Optimal Dynamic Treatment Regimes. 最佳动态治疗方案教程。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-01 DOI: 10.1002/sim.70395
Chunyu Wang, Brian D M Tom

A dynamic treatment regime (DTR) is a sequence of treatment decision rules tailored to an individual's evolving status over time. In precision medicine, much focus has been placed on finding an optimal DTR which, if followed by everyone in the population, would yield the best outcome on average; and extensive investigations have been conducted from both methodological and applied standpoints. The purpose of this tutorial is to provide readers who are interested in optimal DTRs with a systematic, detailed, but accessible introduction, including the formal definition and formulation of this topic within the framework of causal inference, identification assumptions required to link the causal quantity of interest to the observed data, existing statistical models and estimation methods for learning the optimal regime from the data, and application of these methods to both simulated and real data.

动态治疗方案(DTR)是针对个体随时间变化的状态量身定制的一系列治疗决策规则。在精准医疗中,很多重点放在寻找最佳的DTR上,如果每个人都遵循这个DTR,平均而言会产生最好的结果;从方法和应用的角度进行了广泛的调查。本教程的目的是为对最优dtr感兴趣的读者提供一个系统的、详细的、但易于理解的介绍,包括在因果推理框架内该主题的正式定义和表述,将感兴趣的因果量与观测数据联系起来所需的识别假设,从数据中学习最优状态的现有统计模型和估计方法。并将这些方法应用于模拟数据和实际数据。
{"title":"A Tutorial on Optimal Dynamic Treatment Regimes.","authors":"Chunyu Wang, Brian D M Tom","doi":"10.1002/sim.70395","DOIUrl":"10.1002/sim.70395","url":null,"abstract":"<p><p>A dynamic treatment regime (DTR) is a sequence of treatment decision rules tailored to an individual's evolving status over time. In precision medicine, much focus has been placed on finding an optimal DTR which, if followed by everyone in the population, would yield the best outcome on average; and extensive investigations have been conducted from both methodological and applied standpoints. The purpose of this tutorial is to provide readers who are interested in optimal DTRs with a systematic, detailed, but accessible introduction, including the formal definition and formulation of this topic within the framework of causal inference, identification assumptions required to link the causal quantity of interest to the observed data, existing statistical models and estimation methods for learning the optimal regime from the data, and application of these methods to both simulated and real data.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"45 3-5","pages":"e70395"},"PeriodicalIF":1.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12872042/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial Individual-Level Models for Transmission Dynamics of Seasonal Infectious Diseases. 季节性传染病传播动力学的空间个体水平模型。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-01 DOI: 10.1002/sim.70384
Amin Abed, Mahmoud Torabi, Zeinab Mashreghi

Seasonality plays a crucial role in the transmission dynamics of many infectious diseases, contributing to periodic fluctuations in disease incidence. The previously developed geographically dependent individual-level model (GD-ILM) has been effective in modeling infectious diseases, but does not incorporate seasonal effects, limiting its ability to capture seasonal trends. In this study, we extend the GD-ILM by introducing a seasonally varying transmission component, allowing the model to account for periodic fluctuations in infection risk. Our approach integrates a seasonally forced infection kernel to model periodic changes in transmission rates over time, leading to a novel spatiotemporal kernel. To facilitate efficient and reliable parameter estimation in this high-dimensional setting, we employ the Monte Carlo expectation conditional maximization algorithm. We apply our model to individual-level influenza A data from Manitoba, Canada, examining spatial and seasonal infection patterns to identify high-risk regions and periods, and thus informing targeted intervention strategies. The proposed model's performance is further validated through comprehensive simulation studies. Simulation results confirm that models omitting seasonal components lead to biased spatial parameter estimates under various disease prevalence conditions. To support reproducibility and practical application, we developed the SeasEpi R package publicly available on the comprehensive R archive network (CRAN), which implements the seasonal GD-ILM framework and provides tools for model fitting, simulation, and evaluation. The seasonal GD-ILM offers a more accurate framework for modeling infectious disease transmission by integrating both spatial and seasonal dynamics. It supports more accurate risk assessment and enhances public health responses by enabling timely and location-specific interventions based on seasonal transmission patterns.

季节性在许多传染病的传播动态中起着至关重要的作用,造成疾病发病率的周期性波动。以前开发的地理依赖个体水平模型(GD-ILM)在传染病建模方面是有效的,但没有纳入季节性影响,限制了其捕捉季节性趋势的能力。在本研究中,我们通过引入季节性变化的传播成分来扩展GD-ILM,使模型能够考虑感染风险的周期性波动。我们的方法整合了季节性强迫感染内核来模拟传播率随时间的周期性变化,从而产生了一个新的时空内核。为了在这种高维环境下进行高效可靠的参数估计,我们采用了蒙特卡罗期望条件最大化算法。我们将我们的模型应用于加拿大马尼托巴省的个体水平甲型流感数据,检查空间和季节性感染模式,以确定高风险地区和时期,从而为有针对性的干预策略提供信息。通过综合仿真研究进一步验证了该模型的性能。模拟结果证实,在不同的疾病流行条件下,忽略季节因素的模型导致空间参数估计有偏倚。为了支持可重复性和实际应用,我们开发了SeasEpi R软件包,该软件包在综合R存档网络(CRAN)上公开提供,它实现了季节性GD-ILM框架,并提供了模型拟合、仿真和评估的工具。季节性GD-ILM通过整合空间和季节动态,为传染病传播建模提供了更准确的框架。它支持更准确的风险评估,并通过根据季节性传播模式采取及时和针对特定地点的干预措施,加强公共卫生应对。
{"title":"Spatial Individual-Level Models for Transmission Dynamics of Seasonal Infectious Diseases.","authors":"Amin Abed, Mahmoud Torabi, Zeinab Mashreghi","doi":"10.1002/sim.70384","DOIUrl":"10.1002/sim.70384","url":null,"abstract":"<p><p>Seasonality plays a crucial role in the transmission dynamics of many infectious diseases, contributing to periodic fluctuations in disease incidence. The previously developed geographically dependent individual-level model (GD-ILM) has been effective in modeling infectious diseases, but does not incorporate seasonal effects, limiting its ability to capture seasonal trends. In this study, we extend the GD-ILM by introducing a seasonally varying transmission component, allowing the model to account for periodic fluctuations in infection risk. Our approach integrates a seasonally forced infection kernel to model periodic changes in transmission rates over time, leading to a novel spatiotemporal kernel. To facilitate efficient and reliable parameter estimation in this high-dimensional setting, we employ the Monte Carlo expectation conditional maximization algorithm. We apply our model to individual-level influenza A data from Manitoba, Canada, examining spatial and seasonal infection patterns to identify high-risk regions and periods, and thus informing targeted intervention strategies. The proposed model's performance is further validated through comprehensive simulation studies. Simulation results confirm that models omitting seasonal components lead to biased spatial parameter estimates under various disease prevalence conditions. To support reproducibility and practical application, we developed the SeasEpi R package publicly available on the comprehensive R archive network (CRAN), which implements the seasonal GD-ILM framework and provides tools for model fitting, simulation, and evaluation. The seasonal GD-ILM offers a more accurate framework for modeling infectious disease transmission by integrating both spatial and seasonal dynamics. It supports more accurate risk assessment and enhances public health responses by enabling timely and location-specific interventions based on seasonal transmission patterns.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"45 3-5","pages":"e70384"},"PeriodicalIF":1.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12880204/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146133262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Statistical Framework for Measuring Reproducibility and Replicability of High-Throughput Experiments From Multiple Sources. 测量多源高通量实验再现性和可复制性的统计框架。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-01 DOI: 10.1002/sim.70354
Monia Ranalli, Yafei Lyu, Hillary Koch, Qunhua Li

Replication is essential to reliable and consistent scientific discovery in high-throughput experiments. Quantifying the replicability of scientific discoveries and identifying sources of irreproducibility have become important tasks for quality control and data integration. In this work we introduce a novel statistical model to measure the reproducibility and replicability of findings from replicate experiments in multi-source studies. Using a nested copula mixture model that characterizes the interdependence between replication experiments both across and within sources, our method quantifies reproducibility and replicability of each candidate simultaneously in a coherent framework. Through simulation studies, an ENCODE ChIP-seq dataset and a SEQC RNA-seq dataset, we demonstrate the effectiveness of our method in diagnosing the source of discordance and improving the reliability of scientific discoveries.

复制对于高通量实验中可靠和一致的科学发现至关重要。量化科学发现的可重复性和识别不可重复性的来源已经成为质量控制和数据集成的重要任务。在这项工作中,我们引入了一个新的统计模型来衡量多源研究中重复实验结果的再现性和可复制性。使用一个嵌套的copula混合模型来表征跨源和源内复制实验之间的相互依赖性,我们的方法在一个连贯的框架中同时量化每个候选实验的可重复性和可复制性。通过一个ENCODE ChIP-seq数据集和一个SEQC RNA-seq数据集的仿真研究,我们证明了我们的方法在诊断不一致的来源和提高科学发现的可靠性方面的有效性。
{"title":"A Statistical Framework for Measuring Reproducibility and Replicability of High-Throughput Experiments From Multiple Sources.","authors":"Monia Ranalli, Yafei Lyu, Hillary Koch, Qunhua Li","doi":"10.1002/sim.70354","DOIUrl":"10.1002/sim.70354","url":null,"abstract":"<p><p>Replication is essential to reliable and consistent scientific discovery in high-throughput experiments. Quantifying the replicability of scientific discoveries and identifying sources of irreproducibility have become important tasks for quality control and data integration. In this work we introduce a novel statistical model to measure the reproducibility and replicability of findings from replicate experiments in multi-source studies. Using a nested copula mixture model that characterizes the interdependence between replication experiments both across and within sources, our method quantifies reproducibility and replicability of each candidate simultaneously in a coherent framework. Through simulation studies, an ENCODE ChIP-seq dataset and a SEQC RNA-seq dataset, we demonstrate the effectiveness of our method in diagnosing the source of discordance and improving the reliability of scientific discoveries.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"45 3-5","pages":"e70354"},"PeriodicalIF":1.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12871010/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Seamless Phase II/III Randomization Test Considering Treatment Group Selection Based on Short-Term Binary Outcomes. 基于短期二元结果考虑治疗组选择的自适应无缝II/III期随机化试验
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-01 DOI: 10.1002/sim.70400
Funato Sato, Kohei Uemura, Junki Mizusawa, Yutaka Matsuyama, Yoshihiko Morikawa

Recently, adaptive seamless phase II/III designs (ASDs) have gained attention because they improve the efficiency of drug development. In an ASD, a phase II trial, which explores the dose-response relationships and identifies treatments for the phase III trial, is combined with a phase III trial, which aims to demonstrate the efficacy and safety of the selected treatment arms in a single trial. This study focused on ASD, which selects treatment groups based on the short-term outcomes observed early in the trial, and involves a confirmatory outcome as the long-term outcome. The method based on the combination test, which considers treatment group selection based on short-term outcomes, tends to be conservative. In other words, it controls the type I error rate more strictly than necessary as the correlation decreases among outcomes in phases II and III. To address this issue, we proposed an adaptive seamless phase II/III randomization test that can appropriately consider the correlation between outcomes based on a randomization distribution, where phase II has a binary outcome and phase III has overall survival. Based on the simulation study, the proposed method improved conservatism owing to the correlation among outcomes and controlled the type I error rate around the nominal level. In addition, the power of this method tended to be higher than that of the method based on the combination test in most scenarios. Overall, the proposed method can increase the probability of trial success compared with conventional phase III designs.

近年来,自适应无缝II/III期设计(asd)因其提高了药物开发效率而受到关注。在ASD中,探索剂量-反应关系并确定用于III期试验的治疗方法的II期试验与旨在在单个试验中证明所选治疗组的有效性和安全性的III期试验相结合。这项研究的重点是ASD,它根据试验早期观察到的短期结果选择治疗组,并将一个确定的结果作为长期结果。基于联合试验的方法,根据短期结果考虑治疗组的选择,倾向于保守。换句话说,由于II期和III期结果之间的相关性降低,它对I型错误率的控制比必要的更严格。为了解决这个问题,我们提出了一种自适应无缝II/III期随机化试验,该试验可以适当地考虑基于随机化分布的结果之间的相关性,其中II期具有二元结果,III期具有总生存期。在仿真研究的基础上,提出的方法由于结果之间的相关性提高了保守性,并将一类错误率控制在名义水平附近。此外,在大多数情况下,该方法的功效往往高于基于组合试验的方法。总的来说,与传统的三期设计相比,所提出的方法可以增加试验成功的概率。
{"title":"Adaptive Seamless Phase II/III Randomization Test Considering Treatment Group Selection Based on Short-Term Binary Outcomes.","authors":"Funato Sato, Kohei Uemura, Junki Mizusawa, Yutaka Matsuyama, Yoshihiko Morikawa","doi":"10.1002/sim.70400","DOIUrl":"10.1002/sim.70400","url":null,"abstract":"<p><p>Recently, adaptive seamless phase II/III designs (ASDs) have gained attention because they improve the efficiency of drug development. In an ASD, a phase II trial, which explores the dose-response relationships and identifies treatments for the phase III trial, is combined with a phase III trial, which aims to demonstrate the efficacy and safety of the selected treatment arms in a single trial. This study focused on ASD, which selects treatment groups based on the short-term outcomes observed early in the trial, and involves a confirmatory outcome as the long-term outcome. The method based on the combination test, which considers treatment group selection based on short-term outcomes, tends to be conservative. In other words, it controls the type I error rate more strictly than necessary as the correlation decreases among outcomes in phases II and III. To address this issue, we proposed an adaptive seamless phase II/III randomization test that can appropriately consider the correlation between outcomes based on a randomization distribution, where phase II has a binary outcome and phase III has overall survival. Based on the simulation study, the proposed method improved conservatism owing to the correlation among outcomes and controlled the type I error rate around the nominal level. In addition, the power of this method tended to be higher than that of the method based on the combination test in most scenarios. Overall, the proposed method can increase the probability of trial success compared with conventional phase III designs.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"45 3-5","pages":"e70400"},"PeriodicalIF":1.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12871014/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Neural Network With a Smooth Monotonic Output Layer for Dynamic Risk Prediction. 基于光滑单调输出层的深度神经网络动态风险预测。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-01 DOI: 10.1002/sim.70401
Zhiyang Zhou, Yu Deng, Lei Liu, Hongmei Jiang, Yifan Peng, Xiaoyun Yang, Yun Zhao, Hongyan Ning, Norrina B Allen, John T Wilkins, Kiang Liu, Donald M Lloyd-Jones, Lihui Zhao

Risk prediction is a key component of survival analysis across various fields, including medicine, public health, economics, engineering, and others. The fundamental concern of risk prediction lies in the joint distribution of risk factors and the time to event. The recent success of survival analysis has already been extended to dynamic risk prediction, which incorporates multiple longitudinal observations into predictive models. However, existing methods often rely on parametric model assumptions or discretely approximate survival functions, potentially introducing more bias in predictions. To address these limitations, we introduce a deep neural network featuring a novel output layer termed the Smooth Monotonic Output Layer (SMOL). This model avoids discretization as well as parametric model assumptions. At its core, SMOL takes a general vector as the input and constructs a monotonic, differentiable function via B-splines. Employing SMOL as the output layer allows for direct, nonparametric estimation of monotonic functions of interest, such as survival and cumulative distribution functions. We performed extensive experiments utilizing data from the Cardiovascular Disease Lifetime Risk Pooling Project (LRPP), which harmonized individual data from multiple longitudinal community-based cardiovascular disease (CVD) studies. Our results demonstrate that the proposed approach achieves state-of-the-art accuracy in predicting individual-level risk for atherosclerotic CVD.

风险预测是跨各个领域生存分析的关键组成部分,包括医学、公共卫生、经济学、工程学等。风险预测的根本问题在于风险因素与事件发生时间的联合分布。最近成功的生存分析已经扩展到动态风险预测,它将多个纵向观察纳入预测模型。然而,现有的方法往往依赖于参数模型假设或离散近似生存函数,这可能会在预测中引入更多的偏差。为了解决这些限制,我们引入了一个具有新颖输出层的深度神经网络,称为平滑单调输出层(SMOL)。该模型避免了离散化和参数化模型假设。SMOL的核心是以一个一般向量作为输入,通过b样条构造一个单调的可微函数。使用SMOL作为输出层允许对感兴趣的单调函数(如生存和累积分布函数)进行直接的非参数估计。我们利用心血管疾病终生风险汇集项目(LRPP)的数据进行了广泛的实验,该项目协调了来自多个纵向社区心血管疾病(CVD)研究的个人数据。我们的研究结果表明,所提出的方法在预测动脉粥样硬化性心血管疾病的个人水平风险方面达到了最先进的准确性。
{"title":"Deep Neural Network With a Smooth Monotonic Output Layer for Dynamic Risk Prediction.","authors":"Zhiyang Zhou, Yu Deng, Lei Liu, Hongmei Jiang, Yifan Peng, Xiaoyun Yang, Yun Zhao, Hongyan Ning, Norrina B Allen, John T Wilkins, Kiang Liu, Donald M Lloyd-Jones, Lihui Zhao","doi":"10.1002/sim.70401","DOIUrl":"10.1002/sim.70401","url":null,"abstract":"<p><p>Risk prediction is a key component of survival analysis across various fields, including medicine, public health, economics, engineering, and others. The fundamental concern of risk prediction lies in the joint distribution of risk factors and the time to event. The recent success of survival analysis has already been extended to dynamic risk prediction, which incorporates multiple longitudinal observations into predictive models. However, existing methods often rely on parametric model assumptions or discretely approximate survival functions, potentially introducing more bias in predictions. To address these limitations, we introduce a deep neural network featuring a novel output layer termed the Smooth Monotonic Output Layer (SMOL). This model avoids discretization as well as parametric model assumptions. At its core, SMOL takes a general vector as the input and constructs a monotonic, differentiable function via B-splines. Employing SMOL as the output layer allows for direct, nonparametric estimation of monotonic functions of interest, such as survival and cumulative distribution functions. We performed extensive experiments utilizing data from the Cardiovascular Disease Lifetime Risk Pooling Project (LRPP), which harmonized individual data from multiple longitudinal community-based cardiovascular disease (CVD) studies. Our results demonstrate that the proposed approach achieves state-of-the-art accuracy in predicting individual-level risk for atherosclerotic CVD.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"45 3-5","pages":"e70401"},"PeriodicalIF":1.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12873558/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reconciling Binary Replicates: Beyond the Average. 调和二元重复:超越平均值。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-01 DOI: 10.1002/sim.70416
H Lorenzo, P Pudlo, M Royer-Carenzi

Binary observations are often repeated to improve data quality, creating technical replicates. Several scoring methods are commonly used to infer the actual individual state and obtain a probability for each state. The common practice of averaging replicates has limitations, and alternative methods for scoring and classifying individuals are proposed. Additionally, an indecisive response might be wiser than classifying all individuals based on their replicates in the medical context, where 1 indicates a particular health condition. Building on the inherent limitations of the averaging approach, three alternative methods are examined: the median, maximum penalized likelihood estimation, and a Bayesian algorithm. The theoretical analysis suggests that the proposed alternatives outperform the averaging approach, especially the Bayesian method, which incorporates uncertainty and provides credible intervals. Simulations and real-world medical datasets are used to demonstrate the practical implications of these methods for improving diagnostic accuracy and disease prevalence estimation.

经常重复二元观察以提高数据质量,创建技术复制。通常使用几种评分方法来推断实际的个体状态并获得每种状态的概率。平均重复的常见做法有局限性,并提出了评分和分类个体的替代方法。此外,优柔寡断的反应可能比根据医学背景下的重复对所有个人进行分类更明智,其中1表示特定的健康状况。在平均方法固有局限性的基础上,研究了三种替代方法:中位数,最大惩罚似然估计和贝叶斯算法。理论分析表明,所提出的方案优于平均方法,特别是贝叶斯方法,该方法考虑了不确定性并提供了可信区间。模拟和真实世界的医疗数据集被用来证明这些方法对提高诊断准确性和疾病流行率估计的实际意义。
{"title":"Reconciling Binary Replicates: Beyond the Average.","authors":"H Lorenzo, P Pudlo, M Royer-Carenzi","doi":"10.1002/sim.70416","DOIUrl":"10.1002/sim.70416","url":null,"abstract":"<p><p>Binary observations are often repeated to improve data quality, creating technical replicates. Several scoring methods are commonly used to infer the actual individual state and obtain a probability for each state. The common practice of averaging replicates has limitations, and alternative methods for scoring and classifying individuals are proposed. Additionally, an indecisive response might be wiser than classifying all individuals based on their replicates in the medical context, where 1 indicates a particular health condition. Building on the inherent limitations of the averaging approach, three alternative methods are examined: the median, maximum penalized likelihood estimation, and a Bayesian algorithm. The theoretical analysis suggests that the proposed alternatives outperform the averaging approach, especially the Bayesian method, which incorporates uncertainty and provides credible intervals. Simulations and real-world medical datasets are used to demonstrate the practical implications of these methods for improving diagnostic accuracy and disease prevalence estimation.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"45 3-5","pages":"e70416"},"PeriodicalIF":1.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12874543/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146126467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Marginally Interpretable Spatial Logistic Regression With Bridge Processes. 具有桥过程的边际可解释空间逻辑回归。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-01 DOI: 10.1002/sim.70399
Changwoo J Lee, David B Dunson

In including random effects to account for dependent observations, the odds ratio interpretation of logistic regression coefficients is changed from population-averaged to subject-specific. This is unappealing in many applications, motivating a rich literature on methods that maintain the marginal logistic regression structure without random effects, such as generalized estimating equations. However, for spatial data, random effect approaches are appealing in providing a full probabilistic characterization of the data that can be used for prediction. We propose a new class of spatial logistic regression models that maintain both population-averaged and subject-specific interpretations through a novel class of bridge processes for spatial random effects. These processes are shown to have appealing computational and theoretical properties, including a scale mixture of normal representation. The new methodology is illustrated with simulations and an analysis of childhood malaria prevalence data in Gambia.

在包括随机效应来解释相关观察时,逻辑回归系数的比值比解释从群体平均变为特定主题。这在许多应用中是不吸引人的,激发了大量关于保持边际逻辑回归结构而没有随机效应的方法的文献,例如广义估计方程。然而,对于空间数据,随机效应方法在提供可用于预测的数据的完整概率特征方面很有吸引力。我们提出了一类新的空间逻辑回归模型,该模型通过一类新的空间随机效应桥接过程来维持人口平均和主题特定的解释。这些过程显示出具有吸引人的计算和理论性质,包括正常表示的比例混合。冈比亚儿童疟疾流行数据的模拟和分析说明了这种新方法。
{"title":"Marginally Interpretable Spatial Logistic Regression With Bridge Processes.","authors":"Changwoo J Lee, David B Dunson","doi":"10.1002/sim.70399","DOIUrl":"https://doi.org/10.1002/sim.70399","url":null,"abstract":"<p><p>In including random effects to account for dependent observations, the odds ratio interpretation of logistic regression coefficients is changed from population-averaged to subject-specific. This is unappealing in many applications, motivating a rich literature on methods that maintain the marginal logistic regression structure without random effects, such as generalized estimating equations. However, for spatial data, random effect approaches are appealing in providing a full probabilistic characterization of the data that can be used for prediction. We propose a new class of spatial logistic regression models that maintain both population-averaged and subject-specific interpretations through a novel class of bridge processes for spatial random effects. These processes are shown to have appealing computational and theoretical properties, including a scale mixture of normal representation. The new methodology is illustrated with simulations and an analysis of childhood malaria prevalence data in Gambia.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"45 3-5","pages":"e70399"},"PeriodicalIF":1.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recanting Twins: Addressing Intermediate Confounding in Mediation Analysis. 撤销双胞胎:解决中介分析中的中间混淆。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-01 DOI: 10.1002/sim.70432
Tat-Thang Vo, Nicholas Williams, Richard Liu, Kara E Rudolph, Iván Díaz

The presence of intermediate confounders, also called recanting witnesses, is a fundamental challenge to the investigation of causal mechanisms in mediation analysis, preventing the identification of natural path-specific effects. Common alternatives (such as randomizational interventional effects) are problematic because they can take non-null values even when there is no mediation for any individual in the population. A promising alternative to natural path-specific effects was outlined in a recent article based on replacing recanting witnesses by draws from their conditional distribution. In this manuscript we formally develop these parameters (which we call recanting twin effects) into a viable alternative to natural effects for mediation analysis in the presence of intermediate confounding. Our contributions include (i) proposing a falsification procedure to test whether the observed data are compatible with intermediate confounding by a given intermediate variable, (ii) showing that recanting twin effects are equal to natural effects at the individual level in the absence of intermediate confounding, (iii) showing that recanting twin effects can be interpreted in agential frameworks such as the recently proposed separable effects, in addition to the non-agential framework in which they were originally outlined, and (iv) developing non-parametric efficiency theory including deriving the efficiency bound and non-parametric efficient estimators that can accommodate high-dimensional confounders through the use of data-adaptive estimation methods. We present an application of the methods to evaluate the role of new-onset anxiety and depressive disorder in explaining the relationship between gabapentin/pregabalin prescription and incident opioid use disorder among Medicaid beneficiaries with chronic pain.

中间混杂因素的存在,也称为撤回证人,是调解分析中因果机制调查的根本挑战,阻碍了自然路径特异性效应的识别。常见的替代方法(如随机干预效应)是有问题的,因为它们可以取非空值,即使在人群中没有任何个体的中介时也是如此。最近的一篇文章概述了一种有希望的替代自然路径特定效应的方法,该方法是用条件分布中的抽签代替放弃证人。在这篇手稿中,我们正式发展这些参数(我们称之为撤销双胞胎效应)成为一个可行的替代自然效应的中介分析在中间混淆的存在。我们的贡献包括(i)提出了一种证伪程序,以测试观察到的数据是否与给定中间变量的中间混淆相兼容,(ii)表明,在没有中间混淆的情况下,退缩双胞胎效应与个人层面的自然效应相等,(iii)表明退缩双胞胎效应可以在代理框架中解释,例如最近提出的可分离效应。除了最初概述的非代理框架之外,(iv)发展非参数效率理论,包括推导效率界和非参数效率估计器,这些估计器可以通过使用数据自适应估计方法来适应高维混杂因素。我们提出了一项应用方法来评估新发焦虑和抑郁障碍的作用,以解释加巴喷丁/普瑞巴林处方与医疗补助受益人慢性疼痛中的阿片类药物使用障碍之间的关系。
{"title":"Recanting Twins: Addressing Intermediate Confounding in Mediation Analysis.","authors":"Tat-Thang Vo, Nicholas Williams, Richard Liu, Kara E Rudolph, Iván Díaz","doi":"10.1002/sim.70432","DOIUrl":"10.1002/sim.70432","url":null,"abstract":"<p><p>The presence of intermediate confounders, also called recanting witnesses, is a fundamental challenge to the investigation of causal mechanisms in mediation analysis, preventing the identification of natural path-specific effects. Common alternatives (such as randomizational interventional effects) are problematic because they can take non-null values even when there is no mediation for any individual in the population. A promising alternative to natural path-specific effects was outlined in a recent article based on replacing recanting witnesses by draws from their conditional distribution. In this manuscript we formally develop these parameters (which we call recanting twin effects) into a viable alternative to natural effects for mediation analysis in the presence of intermediate confounding. Our contributions include (i) proposing a falsification procedure to test whether the observed data are compatible with intermediate confounding by a given intermediate variable, (ii) showing that recanting twin effects are equal to natural effects at the individual level in the absence of intermediate confounding, (iii) showing that recanting twin effects can be interpreted in agential frameworks such as the recently proposed separable effects, in addition to the non-agential framework in which they were originally outlined, and (iv) developing non-parametric efficiency theory including deriving the efficiency bound and non-parametric efficient estimators that can accommodate high-dimensional confounders through the use of data-adaptive estimation methods. We present an application of the methods to evaluate the role of new-onset anxiety and depressive disorder in explaining the relationship between gabapentin/pregabalin prescription and incident opioid use disorder among Medicaid beneficiaries with chronic pain.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"45 3-5","pages":"e70432"},"PeriodicalIF":1.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12873466/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Concave Pairwise Fusion Approach to Heterogeneous Q-Learning for Dynamic Treatment Regimes. 基于凹对融合的动态治疗方案异构q -学习。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-01 DOI: 10.1002/sim.70415
Jubo Sun, Wensheng Zhu, Guozhe Sun

A dynamic treatment regime is a sequence of decision rules that map available history information to a treatment option at each decision point. The optimal dynamic treatment regime seeks to make these decisions to maximize the expected outcome of interest. Most existing methods assume population homogeneity. In many complex applications, ignoring latent heterogeneous structures may compromise estimation, highlighting the necessity of exploring heterogeneous structures during the estimation of optimal treatment regimes. We propose heterogeneous Q-learning that facilitates the estimation of optimal dynamic treatment regimes using a concave pairwise fusion penalized approach. The proposed method employs an alternating direction method of multipliers algorithm to solve the concave pairwise fusion penalized least squares problem in each stage. Simulation studies demonstrate that our proposed method outperforms the standard Q-learning method, and it is further illustrated through a real data analysis from the China Rural Hypertension Control Project (CRHCP) study group.

动态治疗方案是一系列决策规则,将可用的历史信息映射到每个决策点的治疗方案。最佳动态治疗方案寻求做出这些决定,以最大限度地提高预期结果的兴趣。大多数现有的方法都假定人口同质性。在许多复杂的应用中,忽略潜在的异质结构可能会损害估计,强调在估计最佳处理方案时探索异质结构的必要性。我们提出了异构q -学习,它有助于使用凹成对融合惩罚方法估计最优动态处理方案。该方法采用交替方向乘法器算法求解凹对融合惩罚最小二乘问题。仿真研究表明,我们提出的方法优于标准的q -学习方法,并通过中国农村高血压控制项目(CRHCP)研究组的真实数据分析进一步证明了这一点。
{"title":"A Concave Pairwise Fusion Approach to Heterogeneous Q-Learning for Dynamic Treatment Regimes.","authors":"Jubo Sun, Wensheng Zhu, Guozhe Sun","doi":"10.1002/sim.70415","DOIUrl":"https://doi.org/10.1002/sim.70415","url":null,"abstract":"<p><p>A dynamic treatment regime is a sequence of decision rules that map available history information to a treatment option at each decision point. The optimal dynamic treatment regime seeks to make these decisions to maximize the expected outcome of interest. Most existing methods assume population homogeneity. In many complex applications, ignoring latent heterogeneous structures may compromise estimation, highlighting the necessity of exploring heterogeneous structures during the estimation of optimal treatment regimes. We propose heterogeneous Q-learning that facilitates the estimation of optimal dynamic treatment regimes using a concave pairwise fusion penalized approach. The proposed method employs an alternating direction method of multipliers algorithm to solve the concave pairwise fusion penalized least squares problem in each stage. Simulation studies demonstrate that our proposed method outperforms the standard Q-learning method, and it is further illustrated through a real data analysis from the China Rural Hypertension Control Project (CRHCP) study group.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"45 3-5","pages":"e70415"},"PeriodicalIF":1.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Patient Retreat in Dose Escalation for Phase I Clinical Trials With Rare Diseases. 罕见病I期临床试验中剂量递增的患者撤退
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-01 DOI: 10.1002/sim.70409
Jialu Fang, Guosheng Yin

Phase I clinical trials aim to identify the maximum tolerated dose (MTD), a task that becomes challenging in rare disease due to limited patient recruitment. Traditional dose-finding designs, which assign one dose per patient, require a sufficient sample size that may be infeasible for rare disease trials. To address these limitations, we propose the patient retreat in dose escalation (PRIDE) scheme, which integrates intra-patient dose escalation and considers intra-patient correlations by incorporating random effects into a Bayesian hierarchical framework. We further introduce PRIDE-FA (flexible allocation), an extension of PRIDE with a flexible allocation strategy. By allowing retreated patients to be assigned to any dose level based on trial needs, PRIDE-FA improves resource efficiency, leading to greater reductions in required sample size and trial duration. This paper incorporates random effects into established dose-finding designs, including the calibration-free odds (CFO) design, the Bayesian optimal interval (BOIN) design, and the continual reassessment method (CRM) to account for intra-patient correlations when each patient may receive multiple doses. Simulation studies demonstrate that PRIDE and PRIDE-FA significantly improve the accuracy of MTD selection, reduce required sample size, and shorten trial duration compared to existing dose-finding methods. Together, PRIDE and PRIDE-FA provide a robust and efficient framework for phase I clinical trials with rare diseases.

I期临床试验旨在确定最大耐受剂量(MTD),由于患者招募有限,这一任务在罕见疾病中变得具有挑战性。传统的剂量发现设计,即为每个病人分配一个剂量,需要足够的样本量,这对于罕见病试验可能是不可行的。为了解决这些局限性,我们提出了患者剂量递增撤退(PRIDE)方案,该方案整合了患者内部剂量递增,并通过将随机效应纳入贝叶斯分层框架来考虑患者内部相关性。我们进一步介绍了PRIDE- fa(灵活分配),它是PRIDE的扩展,具有灵活的分配策略。PRIDE-FA允许根据试验需要将患者分配到任何剂量水平,从而提高了资源效率,从而大大减少了所需的样本量和试验时间。本文将随机效应纳入已建立的剂量发现设计,包括无校准几率(CFO)设计、贝叶斯最优区间(BOIN)设计和持续重新评估方法(CRM),以解释每个患者可能接受多个剂量时的患者内部相关性。仿真研究表明,与现有的剂量寻找方法相比,PRIDE和PRIDE- fa显著提高了MTD选择的准确性,减少了所需的样本量,缩短了试验时间。PRIDE和PRIDE- fa共同为罕见病的I期临床试验提供了一个强大而有效的框架。
{"title":"Patient Retreat in Dose Escalation for Phase I Clinical Trials With Rare Diseases.","authors":"Jialu Fang, Guosheng Yin","doi":"10.1002/sim.70409","DOIUrl":"10.1002/sim.70409","url":null,"abstract":"<p><p>Phase I clinical trials aim to identify the maximum tolerated dose (MTD), a task that becomes challenging in rare disease due to limited patient recruitment. Traditional dose-finding designs, which assign one dose per patient, require a sufficient sample size that may be infeasible for rare disease trials. To address these limitations, we propose the patient retreat in dose escalation (PRIDE) scheme, which integrates intra-patient dose escalation and considers intra-patient correlations by incorporating random effects into a Bayesian hierarchical framework. We further introduce PRIDE-FA (flexible allocation), an extension of PRIDE with a flexible allocation strategy. By allowing retreated patients to be assigned to any dose level based on trial needs, PRIDE-FA improves resource efficiency, leading to greater reductions in required sample size and trial duration. This paper incorporates random effects into established dose-finding designs, including the calibration-free odds (CFO) design, the Bayesian optimal interval (BOIN) design, and the continual reassessment method (CRM) to account for intra-patient correlations when each patient may receive multiple doses. Simulation studies demonstrate that PRIDE and PRIDE-FA significantly improve the accuracy of MTD selection, reduce required sample size, and shorten trial duration compared to existing dose-finding methods. Together, PRIDE and PRIDE-FA provide a robust and efficient framework for phase I clinical trials with rare diseases.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"45 3-5","pages":"e70409"},"PeriodicalIF":1.8,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12873649/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistics in Medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1