首页 > 最新文献

Biometrics最新文献

英文 中文
Stable survival extrapolation using mortality projections. 使用死亡率预测的稳定生存推断。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf159
Anastasios Apsemidis, Nikolaos Demiris

The mean survival is the key ingredient of the decision process in several applications, notably in health economic evaluations. It is defined as the area under the complete survival curve, thus necessitating extrapolation of the observed data. This may be achieved in a more stable manner by borrowing long term evidence from registry and demographic data. In this article, we employ a Bayesian mortality model and transfer its projections in order to construct the baseline population that acts as an anchor of the survival model. We then propose extrapolation methods based on flexible parametric poly-hazard models which can naturally accommodate diverse shapes, including non-proportional hazards and crossing survival curves, while typically maintaining a natural interpretation as a data generating mechanism. We estimate the mean survival and related estimands in 3 cases, namely breast cancer, advanced melanoma, and cardiac arrhythmia. Specifically, we evaluate the survival disadvantage of triple-negative breast cancer cases, the efficacy of combining immunotherapy with mRNA cancer therapeutics for melanoma treatment and the suitability of implantable cardioverter defibrilators for cardiac arrhythmia. The last is conducted in a competing risks context illustrating how working on the cause-specific hazard alone minimizes potential instability. The results suggest that the proposed approach offers a flexible, interpretable, and robust approach when survival extrapolation is required.

在若干应用中,特别是在卫生经济评价中,平均存活率是决策过程的关键因素。它被定义为完整生存曲线下的面积,因此需要对观察到的数据进行外推。这可以通过借用登记和人口数据的长期证据以更稳定的方式实现。在本文中,我们采用贝叶斯死亡率模型并转移其预测,以构建作为生存模型锚点的基线人口。然后,我们提出了基于灵活参数多风险模型的外推方法,该模型可以自然地适应各种形状,包括非比例风险和交叉生存曲线,同时通常保持自然解释作为数据生成机制。我们估计了3例乳腺癌、晚期黑色素瘤和心律失常的平均生存期和相关估计。具体而言,我们评估了三阴性乳腺癌病例的生存劣势,免疫疗法联合mRNA癌症疗法治疗黑色素瘤的疗效,以及植入式心律转复除颤器治疗心律失常的适用性。最后一个是在竞争风险的背景下进行的,说明如何单独处理特定原因的危险,以最大限度地减少潜在的不稳定性。结果表明,当需要生存外推时,所提出的方法提供了一种灵活、可解释和可靠的方法。
{"title":"Stable survival extrapolation using mortality projections.","authors":"Anastasios Apsemidis, Nikolaos Demiris","doi":"10.1093/biomtc/ujaf159","DOIUrl":"10.1093/biomtc/ujaf159","url":null,"abstract":"<p><p>The mean survival is the key ingredient of the decision process in several applications, notably in health economic evaluations. It is defined as the area under the complete survival curve, thus necessitating extrapolation of the observed data. This may be achieved in a more stable manner by borrowing long term evidence from registry and demographic data. In this article, we employ a Bayesian mortality model and transfer its projections in order to construct the baseline population that acts as an anchor of the survival model. We then propose extrapolation methods based on flexible parametric poly-hazard models which can naturally accommodate diverse shapes, including non-proportional hazards and crossing survival curves, while typically maintaining a natural interpretation as a data generating mechanism. We estimate the mean survival and related estimands in 3 cases, namely breast cancer, advanced melanoma, and cardiac arrhythmia. Specifically, we evaluate the survival disadvantage of triple-negative breast cancer cases, the efficacy of combining immunotherapy with mRNA cancer therapeutics for melanoma treatment and the suitability of implantable cardioverter defibrilators for cardiac arrhythmia. The last is conducted in a competing risks context illustrating how working on the cause-specific hazard alone minimizes potential instability. The results suggest that the proposed approach offers a flexible, interpretable, and robust approach when survival extrapolation is required.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145720509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical inference for heterogeneous treatment effect with right-censored data from synthesizing randomized clinical trials and real-world data. 综合随机临床试验和真实世界数据的右删减数据对异质性治疗效果的统计推断。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf131
Guangcai Mao, Shu Yang, Xiaofei Wang

The heterogeneous treatment effect plays a crucial role in precision medicine. There is evidence that real-world data, even subject to biases, can be employed as supplementary evidence for randomized clinical trials to improve the statistical efficiency of the heterogeneous treatment effect estimation. In this paper, for survival data with right censoring, we consider estimating the heterogeneous treatment effect, defined as the difference of the treatment-specific conditional restricted mean survival times given covariates, by synthesizing evidence from randomized clinical trials and the real-world data with possible biases. We define an omnibus bias function to characterize the effect of biases caused by unmeasured confounders, censoring, and outcome heterogeneity, and further, identify it by combining the trial and real-world data. We propose a penalized sieve method to estimate the heterogeneous treatment effect and the bias function. We further study the theoretical properties of the proposed integrative estimators based on the theory of reproducing kernel Hilbert space and empirical process. The proposed methodology outperforms the approach solely based on the trial data through simulation studies and an integrative analysis of the data from a randomized trial and a real-world registry on early-stage non-small-cell lung cancer.

治疗效果的异质性在精准医疗中起着至关重要的作用。有证据表明,即使存在偏倚,现实世界的数据也可以作为随机临床试验的补充证据,以提高异质性治疗效果估计的统计效率。在本文中,我们考虑通过综合随机临床试验和可能存在偏差的真实数据的证据来估计具有正确审查的生存数据的异质性治疗效果,其定义为给定协变量的治疗特异性条件限制平均生存时间的差异。我们定义了一个综合偏倚函数来描述由未测量的混杂因素、审查和结果异质性引起的偏倚的影响,并进一步通过结合试验和实际数据来识别它。我们提出了一种惩罚筛法来估计非均质处理效果和偏差函数。基于核希尔伯特空间再现理论和经验过程,进一步研究了所提综合估计量的理论性质。所提出的方法优于仅基于试验数据的方法,该方法通过模拟研究和对来自早期非小细胞肺癌的随机试验和真实世界登记的数据进行综合分析。
{"title":"Statistical inference for heterogeneous treatment effect with right-censored data from synthesizing randomized clinical trials and real-world data.","authors":"Guangcai Mao, Shu Yang, Xiaofei Wang","doi":"10.1093/biomtc/ujaf131","DOIUrl":"10.1093/biomtc/ujaf131","url":null,"abstract":"<p><p>The heterogeneous treatment effect plays a crucial role in precision medicine. There is evidence that real-world data, even subject to biases, can be employed as supplementary evidence for randomized clinical trials to improve the statistical efficiency of the heterogeneous treatment effect estimation. In this paper, for survival data with right censoring, we consider estimating the heterogeneous treatment effect, defined as the difference of the treatment-specific conditional restricted mean survival times given covariates, by synthesizing evidence from randomized clinical trials and the real-world data with possible biases. We define an omnibus bias function to characterize the effect of biases caused by unmeasured confounders, censoring, and outcome heterogeneity, and further, identify it by combining the trial and real-world data. We propose a penalized sieve method to estimate the heterogeneous treatment effect and the bias function. We further study the theoretical properties of the proposed integrative estimators based on the theory of reproducing kernel Hilbert space and empirical process. The proposed methodology outperforms the approach solely based on the trial data through simulation studies and an integrative analysis of the data from a randomized trial and a real-world registry on early-stage non-small-cell lung cancer.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12505326/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145249501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian collocation integral method for system identification of ordinary differential equations. 常微分方程系统辨识的贝叶斯搭配积分法。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf141
Mingwei Xu, Samuel W K Wong, Peijun Sang

Ordinary differential equations (ODEs) are widely considered for modeling the dynamics of complex systems across various scientific areas. To identify the structure of high-dimensional sparse ODEs from noisy time-course data, most existing methods adopt a frequentist perspective, while uncertainty quantification in parameter estimation remains challenging. Under an additive ODE model assumption, we present a Bayesian hierarchical collocation method to provide better quantification of uncertainty. Our framework unifies the likelihood, integrated ODE constraints and a group-wise sparse penalty, allowing for simultaneous system identification and trajectory estimation. We demonstrate the favorable performance of the proposed method through simulation studies, where the recovered system trajectories and estimated additive components are compared with other recent methods. A real data example of gene regulatory networks is provided to illustrate the methodology.

常微分方程(ode)被广泛认为是各种科学领域复杂系统动力学的建模方法。为了从噪声时程数据中识别高维稀疏ode的结构,现有的方法大多采用频率论的视角,而参数估计中的不确定性量化仍然是一个挑战。在可加ODE模型假设下,提出了贝叶斯分层配置方法,以更好地量化不确定性。我们的框架统一了可能性、集成的ODE约束和组明智的稀疏惩罚,允许同时进行系统识别和轨迹估计。我们通过仿真研究证明了所提出方法的良好性能,其中恢复的系统轨迹和估计的附加成分与其他最近的方法进行了比较。提供了一个基因调控网络的真实数据示例来说明该方法。
{"title":"A Bayesian collocation integral method for system identification of ordinary differential equations.","authors":"Mingwei Xu, Samuel W K Wong, Peijun Sang","doi":"10.1093/biomtc/ujaf141","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf141","url":null,"abstract":"<p><p>Ordinary differential equations (ODEs) are widely considered for modeling the dynamics of complex systems across various scientific areas. To identify the structure of high-dimensional sparse ODEs from noisy time-course data, most existing methods adopt a frequentist perspective, while uncertainty quantification in parameter estimation remains challenging. Under an additive ODE model assumption, we present a Bayesian hierarchical collocation method to provide better quantification of uncertainty. Our framework unifies the likelihood, integrated ODE constraints and a group-wise sparse penalty, allowing for simultaneous system identification and trajectory estimation. We demonstrate the favorable performance of the proposed method through simulation studies, where the recovered system trajectories and estimated additive components are compared with other recent methods. A real data example of gene regulatory networks is provided to illustrate the methodology.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145372174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of cross-platform health communication with a network approach. 基于网络的跨平台健康通信分析。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf154
Xinyan Fan, Mengque Liu, Shuangge Ma

Online health communities (OHCs) provide a platform for patients and those related to share and communicate, making complex medical information more digestible and actionable. Health communication within OHCs can be impacted by other information sources. This study examines cross-platform health communication by mining Breastcancer.org (the largest online breast cancer community) and Twitter (now X). Early analyses of OHCs, Twitter, and other online platforms often adopt simple measures like word frequency, and more recent research has shifted towards word co-occurrence network analysis. Relatively, cross-platform communication analysis is limited, and the adopted techniques have drawbacks. We propose a new cross-platform communication model that collectively analyzes word co-occurrence networks and word frequency vectors. Here, the former describe the structural contents of health communication, and the latter describe the volumes. This model offers a nuanced perspective, accommodates temporal variations, and is examined for its theoretical and numerical properties. Collected from January 2010 to December 2020, the analyzed data contains over 1 395 000 tweets and 517 000 posts. Our analysis suggests that the Twitter's topics on breast cancer significantly impact the contents and volumes in the OHC. Distinct time phases are observed, with notable peaks during 2012-2013 and 2015-2018. This study can provide a venue for better understanding health communication and new insights into two highly important online platforms.

在线健康社区(ohc)为患者和相关人员提供了一个共享和交流的平台,使复杂的医疗信息更易于消化和操作。OHCs内部的卫生交流可能受到其他信息来源的影响。这项研究通过挖掘Breastcancer.org(最大的在线乳腺癌社区)和Twitter(现在的X)来检验跨平台的健康交流。早期对ohc、Twitter和其他在线平台的分析通常采用词频等简单的测量方法,最近的研究转向了词共现网络分析。相对而言,跨平台通信分析是有限的,所采用的技术也有缺点。我们提出了一种新的跨平台通信模型,该模型综合分析了词共现网络和词频向量。在这里,前者描述了健康传播的结构内容,后者描述了体量。该模型提供了一个细致入微的视角,适应时间变化,并对其理论和数值特性进行了检验。从2010年1月到2020年12月,分析的数据包含超过139.5万条推文和51.7万条帖子。我们的分析表明,Twitter上关于乳腺癌的话题对OHC的内容和数量有显著影响。在2012-2013年和2015-2018年有明显的高峰。这项研究可以为更好地理解健康传播提供一个场所,并为两个非常重要的在线平台提供新的见解。
{"title":"Analysis of cross-platform health communication with a network approach.","authors":"Xinyan Fan, Mengque Liu, Shuangge Ma","doi":"10.1093/biomtc/ujaf154","DOIUrl":"10.1093/biomtc/ujaf154","url":null,"abstract":"<p><p>Online health communities (OHCs) provide a platform for patients and those related to share and communicate, making complex medical information more digestible and actionable. Health communication within OHCs can be impacted by other information sources. This study examines cross-platform health communication by mining Breastcancer.org (the largest online breast cancer community) and Twitter (now X). Early analyses of OHCs, Twitter, and other online platforms often adopt simple measures like word frequency, and more recent research has shifted towards word co-occurrence network analysis. Relatively, cross-platform communication analysis is limited, and the adopted techniques have drawbacks. We propose a new cross-platform communication model that collectively analyzes word co-occurrence networks and word frequency vectors. Here, the former describe the structural contents of health communication, and the latter describe the volumes. This model offers a nuanced perspective, accommodates temporal variations, and is examined for its theoretical and numerical properties. Collected from January 2010 to December 2020, the analyzed data contains over 1 395 000 tweets and 517 000 posts. Our analysis suggests that the Twitter's topics on breast cancer significantly impact the contents and volumes in the OHC. Distinct time phases are observed, with notable peaks during 2012-2013 and 2015-2018. This study can provide a venue for better understanding health communication and new insights into two highly important online platforms.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145572906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A regularized continuous-time hidden Markov model for identifying latent state transition patterns of poly-tobacco use. 用正则化连续时间隐马尔可夫模型识别多烟草使用的潜在状态转变模式。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf138
Xinyu Yan, Ji-Hyun Lee, Xiang-Yang Lou

Hidden Markov models (HMMs) are widely used to characterize latent state transition patterns in substance use. However, traditional HMM frameworks are incompetent when dealing with the complexities introduced by high-dimensional risk factors and varying time intervals, particularly in determining the number of hidden states and selecting variables for state transition parameters. To tackle the analytical challenges in the Population Assessment of Tobacco and Health (PATH) Study, a nationally representative longitudinal cohort study on tobacco use, we propose a continuous-time HMM framework with a regularization algorithm to identify multi-dimensional risk factors underlying complex poly-tobacco use transitions. We develop an elastic-net regularization on the transition covariates to identify informative covariates and improve model estimation accuracy. The inclusion of key covariates enables accurate determination of the number of hidden states. We incorporate survey weights and information on strata and clustering throughout the modeling framework. We demonstrate the validity of our approach in determining state numbers, identifying informative covariates, and estimating model parameters through a series of simulations. Application of the proposed approach to PATH data analysis revealed several demographic, behavioral, and psychosocial factors that contribute to the differential risks of transition between tobacco-use states among youth and young adults. The model's capacity in identifying high-dimensional risk factors for underlying hidden variables substantiates its potential for enhancing public health research and informing interventions.

隐马尔可夫模型(hmm)被广泛用于描述物质使用中的潜在状态转移模式。然而,传统HMM框架在处理由高维风险因素和变时间间隔引入的复杂性时,特别是在确定隐藏状态数量和选择状态转移参数变量方面表现不佳。为了解决烟草与健康人口评估(PATH)研究(一项具有全国代表性的烟草使用纵向队列研究)中的分析挑战,我们提出了一个带有正则化算法的连续时间HMM框架,以识别复杂多烟草使用转变背后的多维风险因素。我们对过渡协变量进行弹性网络正则化,以识别信息协变量,提高模型估计精度。关键协变量的包含可以准确地确定隐藏状态的数量。我们在整个建模框架中结合了调查权重和关于分层和聚类的信息。通过一系列的模拟,我们证明了我们的方法在确定状态数、识别信息协变量和估计模型参数方面的有效性。将提出的方法应用于适宜卫生技术方案数据分析,揭示了若干人口统计学、行为和社会心理因素,这些因素导致了青少年和年轻人在烟草使用状态之间的转变风险差异。该模型在确定潜在隐藏变量的高维风险因素方面的能力证实了它在加强公共卫生研究和为干预措施提供信息方面的潜力。
{"title":"A regularized continuous-time hidden Markov model for identifying latent state transition patterns of poly-tobacco use.","authors":"Xinyu Yan, Ji-Hyun Lee, Xiang-Yang Lou","doi":"10.1093/biomtc/ujaf138","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf138","url":null,"abstract":"<p><p>Hidden Markov models (HMMs) are widely used to characterize latent state transition patterns in substance use. However, traditional HMM frameworks are incompetent when dealing with the complexities introduced by high-dimensional risk factors and varying time intervals, particularly in determining the number of hidden states and selecting variables for state transition parameters. To tackle the analytical challenges in the Population Assessment of Tobacco and Health (PATH) Study, a nationally representative longitudinal cohort study on tobacco use, we propose a continuous-time HMM framework with a regularization algorithm to identify multi-dimensional risk factors underlying complex poly-tobacco use transitions. We develop an elastic-net regularization on the transition covariates to identify informative covariates and improve model estimation accuracy. The inclusion of key covariates enables accurate determination of the number of hidden states. We incorporate survey weights and information on strata and clustering throughout the modeling framework. We demonstrate the validity of our approach in determining state numbers, identifying informative covariates, and estimating model parameters through a series of simulations. Application of the proposed approach to PATH data analysis revealed several demographic, behavioral, and psychosocial factors that contribute to the differential risks of transition between tobacco-use states among youth and young adults. The model's capacity in identifying high-dimensional risk factors for underlying hidden variables substantiates its potential for enhancing public health research and informing interventions.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145298373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Double robust conditional independence test for novel biomarkers given established risk factors with survival data. 双鲁棒条件独立测试新的生物标志物给定的风险因素与生存数据。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf133
Baoying Yang, Jing Qin, Jing Ning, Yukun Liu

Conditional independence is a foundational concept for understanding probabilistic relationships among variables, with broad applications in fields such as causal inference and machine learning. This study focuses on testing conditional independence, $Tperp X|Z$, where T represents survival data possibly subject to right censoring, Z represents established risk factors for T, and X represents potential novel biomarkers. The goal is to identify novel biomarkers that offer additional merits for further risk assessment and prediction. This can be achieved by using either the partial or parametric likelihood ratio statistic to evaluate whether the coefficient vector of X in the conditional model of T given $(X^{ mathrm{scriptscriptstyle top } }, Z^{ mathrm{scriptscriptstyle top } })^{ mathrm{scriptscriptstyle top } }$ is equal to zero. Traditional tests such as directly comparing likelihood ratios to chi-squared distributions may produce erroneous type-I error rates under model misspecification. As an alternative, we propose a resampling-based method to approximate the distribution of the likelihood ratios. A key advantage of the proposed test is its double robustness: it achieves approximately correct type-I error rates when either the conditional outcome model or the working model of ${rm pr} (X|Z)$ is correctly specified. Additionally, machine learning techniques can be incorporated to improve test performance. Simulation studies and the application to the Alzheimer's Disease Neuroimaging Initiative (ADNI) data demonstrate the finite-sample performance of the proposed tests.

条件独立是理解变量间概率关系的基本概念,在因果推理和机器学习等领域有着广泛的应用。这项研究的重点是测试条件独立性,$Tperp X|Z$,其中T代表可能受到正确审查的生存数据,Z代表T的既定风险因素,X代表潜在的新生物标志物。目标是确定新的生物标志物,为进一步的风险评估和预测提供额外的优点。这可以通过使用偏似然比或参数似然比统计量来评估给定$(X^{mathrm{scriptscriptstyle top}}, Z^{mathrm{scriptscriptstyle top}})^{mathrm{scriptscriptstyle top}}$的条件模型中X的系数向量是否等于零来实现。传统的检验,如直接将似然比与卡方分布进行比较,可能会在模型错误规范下产生错误的i型错误率。作为替代方案,我们提出了一种基于重采样的方法来近似似然比的分布。所提出的测试的一个关键优势是它的双重鲁棒性:当条件结果模型或${rm pr} (X|Z)$的工作模型被正确指定时,它实现了近似正确的i型错误率。此外,可以结合机器学习技术来提高测试性能。模拟研究和对阿尔茨海默病神经成像倡议(ADNI)数据的应用证明了所提出的测试的有限样本性能。
{"title":"Double robust conditional independence test for novel biomarkers given established risk factors with survival data.","authors":"Baoying Yang, Jing Qin, Jing Ning, Yukun Liu","doi":"10.1093/biomtc/ujaf133","DOIUrl":"10.1093/biomtc/ujaf133","url":null,"abstract":"<p><p>Conditional independence is a foundational concept for understanding probabilistic relationships among variables, with broad applications in fields such as causal inference and machine learning. This study focuses on testing conditional independence, $Tperp X|Z$, where T represents survival data possibly subject to right censoring, Z represents established risk factors for T, and X represents potential novel biomarkers. The goal is to identify novel biomarkers that offer additional merits for further risk assessment and prediction. This can be achieved by using either the partial or parametric likelihood ratio statistic to evaluate whether the coefficient vector of X in the conditional model of T given $(X^{ mathrm{scriptscriptstyle top } }, Z^{ mathrm{scriptscriptstyle top } })^{ mathrm{scriptscriptstyle top } }$ is equal to zero. Traditional tests such as directly comparing likelihood ratios to chi-squared distributions may produce erroneous type-I error rates under model misspecification. As an alternative, we propose a resampling-based method to approximate the distribution of the likelihood ratios. A key advantage of the proposed test is its double robustness: it achieves approximately correct type-I error rates when either the conditional outcome model or the working model of ${rm pr} (X|Z)$ is correctly specified. Additionally, machine learning techniques can be incorporated to improve test performance. Simulation studies and the application to the Alzheimer's Disease Neuroimaging Initiative (ADNI) data demonstrate the finite-sample performance of the proposed tests.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145336170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalized nonparametric temporal modeling of recurrent events with application to a malaria vaccine trial. 复发事件的广义非参数时间模型及其在疟疾疫苗试验中的应用
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf146
Fei Heng, Yanqing Sun, Jing Xu, Peter B Gilbert

Motivated by a malaria vaccine efficacy trial, this paper investigates generalized nonparametric temporal models of intensity processes with multiple time scales. Through the choice of link functions, the proposed models encompass a wide range of models such as the multiplicative temporal intensity model and the additive temporal intensity model. A maximum likelihood estimation procedure is developed to estimate the effects of two time-scales via the local linear smoothing with double kernels. Computational algorithms are developed to facilitate applications of the proposed method. An adaptive algorithm is developed to overcome the challenges of overlapping covariates. A cross-validation bandwidth selection procedure based on the logarithm of likelihood criteria is discussed. The asymptotic properties of the proposed estimators are investigated. Our simulation study shows that the proposed methods have satisfactory finite sample performance for both the multiplicative temporal intensity model and additive temporal intensity model. The proposed methods are applied to analyze the MAL-094/MAL-095 malaria vaccine efficacy trial data to investigate how the new malaria infection risk changes over time and how a prior infection or vaccination changes the future infection risk. The proposed method provides new insight into the protective effects of the malaria vaccine against new malaria infections and how the vaccine efficacy is modified by the history of prior malaria infection over time.

受疟疾疫苗疗效试验的启发,本文研究了多时间尺度强度过程的广义非参数时间模型。通过对链接函数的选择,所提出的模型涵盖了广泛的模型,如乘法时间强度模型和加性时间强度模型。提出了一种最大似然估计方法,通过双核局部线性平滑来估计两个时间尺度的影响。开发了计算算法以促进所提出方法的应用。为了克服协变量重叠的问题,提出了一种自适应算法。讨论了基于对数似然准则的交叉验证带宽选择程序。研究了所提估计量的渐近性质。仿真研究表明,所提出的方法对乘法时间强度模型和加性时间强度模型都具有满意的有限样本性能。本文采用上述方法对MAL-094/MAL-095疟疾疫苗疗效试验数据进行分析,探讨新的疟疾感染风险如何随时间变化,以及既往感染或接种疫苗如何改变未来感染风险。所提出的方法为疟疾疫苗对新的疟疾感染的保护作用以及疫苗效力如何随着时间的推移而被既往疟疾感染史所改变提供了新的见解。
{"title":"Generalized nonparametric temporal modeling of recurrent events with application to a malaria vaccine trial.","authors":"Fei Heng, Yanqing Sun, Jing Xu, Peter B Gilbert","doi":"10.1093/biomtc/ujaf146","DOIUrl":"10.1093/biomtc/ujaf146","url":null,"abstract":"<p><p>Motivated by a malaria vaccine efficacy trial, this paper investigates generalized nonparametric temporal models of intensity processes with multiple time scales. Through the choice of link functions, the proposed models encompass a wide range of models such as the multiplicative temporal intensity model and the additive temporal intensity model. A maximum likelihood estimation procedure is developed to estimate the effects of two time-scales via the local linear smoothing with double kernels. Computational algorithms are developed to facilitate applications of the proposed method. An adaptive algorithm is developed to overcome the challenges of overlapping covariates. A cross-validation bandwidth selection procedure based on the logarithm of likelihood criteria is discussed. The asymptotic properties of the proposed estimators are investigated. Our simulation study shows that the proposed methods have satisfactory finite sample performance for both the multiplicative temporal intensity model and additive temporal intensity model. The proposed methods are applied to analyze the MAL-094/MAL-095 malaria vaccine efficacy trial data to investigate how the new malaria infection risk changes over time and how a prior infection or vaccination changes the future infection risk. The proposed method provides new insight into the protective effects of the malaria vaccine against new malaria infections and how the vaccine efficacy is modified by the history of prior malaria infection over time.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12635532/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145562433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep partially linear transformation model for right-censored survival data. 右截尾生存数据的深度部分线性变换模型。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf126
Junkai Yin, Yue Zhang, Zhangsheng Yu

Although the Cox proportional hazards (PH) model is well established and extensively used in the analysis of survival data, the PH assumption may not always hold in practical scenarios. The class of semiparametric transformation models extends the Cox model and also includes many other survival models as special cases. This paper introduces a deep partially linear transformation model as a general and flexible regression framework for right-censored data. The proposed method is capable of avoiding the curse of dimensionality while still retaining the interpretability of some covariates of interest. We derive the overall convergence rate of the maximum likelihood estimators, the minimax lower bound of the nonparametric deep neural network estimator, and the asymptotic normality and the semiparametric efficiency of the parametric estimator. Comprehensive simulation studies demonstrate the impressive performance of the proposed estimation procedure in terms of both the estimation accuracy and the predictive power, which is further validated by an application to a real-world dataset.

虽然Cox比例风险(PH)模型已经建立并广泛用于生存数据的分析,但PH假设在实际情况下并不总是成立。半参数变换模型是对Cox模型的扩展,并包含了许多其他的生存模型作为特例。本文介绍了一种深度部分线性变换模型作为右截尾数据的通用、灵活的回归框架。提出的方法能够避免维数的诅咒,同时仍然保留一些感兴趣的协变量的可解释性。我们得到了极大似然估计的总体收敛速率,非参数深度神经网络估计的极小极大下界,以及参数估计的渐近正态性和半参数效率。综合仿真研究表明,所提出的估计方法在估计精度和预测能力方面具有令人印象深刻的性能,并通过实际数据集的应用进一步验证了这一点。
{"title":"Deep partially linear transformation model for right-censored survival data.","authors":"Junkai Yin, Yue Zhang, Zhangsheng Yu","doi":"10.1093/biomtc/ujaf126","DOIUrl":"10.1093/biomtc/ujaf126","url":null,"abstract":"<p><p>Although the Cox proportional hazards (PH) model is well established and extensively used in the analysis of survival data, the PH assumption may not always hold in practical scenarios. The class of semiparametric transformation models extends the Cox model and also includes many other survival models as special cases. This paper introduces a deep partially linear transformation model as a general and flexible regression framework for right-censored data. The proposed method is capable of avoiding the curse of dimensionality while still retaining the interpretability of some covariates of interest. We derive the overall convergence rate of the maximum likelihood estimators, the minimax lower bound of the nonparametric deep neural network estimator, and the asymptotic normality and the semiparametric efficiency of the parametric estimator. Comprehensive simulation studies demonstrate the impressive performance of the proposed estimation procedure in terms of both the estimation accuracy and the predictive power, which is further validated by an application to a real-world dataset.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145399782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Randomized optimal selection design for dose optimization. 剂量优化的随机优化选择设计。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf124
Shuqi Wang, Ying Yuan, Suyu Liu

The US Food and Drug Administration (FDA) launched Project Optimus to shift the objective of dose selection from the maximum tolerated dose to the optimal biological dose (OBD), optimizing the benefit-risk tradeoff. One approach recommended by the FDA's guidance is to conduct randomized trials comparing multiple doses. In this paper, using the selection design framework, we propose a Randomized Optimal SElection (ROSE) design, which minimizes sample size while ensuring the probability of correct selection of the OBD at pre-specified accuracy levels. The ROSE design is simple to implement, involving a straightforward comparison of the difference in response rates between two dose arms against a predetermined decision boundary. We further consider a two-stage ROSE design that allows for early selection of the OBD at the interim when there is sufficient evidence, further reducing the sample size. Simulation studies demonstrate that the ROSE design exhibits desirable operating characteristics in correctly identifying the OBD. A sample size of 15-40 patients per dosage arm typically results in a percentage of correct selection of the optimal dose ranging from 60% to 70%.

美国食品和药物管理局(FDA)启动了Optimus项目,将剂量选择的目标从最大耐受剂量转移到最佳生物剂量(OBD),优化收益-风险权衡。FDA指南推荐的一种方法是进行随机试验,比较多种剂量。在本文中,我们使用选择设计框架,提出了一种随机最优选择(ROSE)设计,该设计最小化样本量,同时确保在预先指定的精度水平下正确选择OBD的概率。ROSE的设计很容易实现,它直接比较了两个剂量臂对预定决策边界的反应率差异。我们进一步考虑了两阶段ROSE设计,允许在有足够证据的中间阶段早期选择OBD,进一步减少样本量。仿真研究表明,ROSE设计在正确识别OBD方面具有良好的工作特性。每个剂量组15-40例患者的样本量通常导致正确选择最佳剂量的百分比在60%至70%之间。
{"title":"Randomized optimal selection design for dose optimization.","authors":"Shuqi Wang, Ying Yuan, Suyu Liu","doi":"10.1093/biomtc/ujaf124","DOIUrl":"10.1093/biomtc/ujaf124","url":null,"abstract":"<p><p>The US Food and Drug Administration (FDA) launched Project Optimus to shift the objective of dose selection from the maximum tolerated dose to the optimal biological dose (OBD), optimizing the benefit-risk tradeoff. One approach recommended by the FDA's guidance is to conduct randomized trials comparing multiple doses. In this paper, using the selection design framework, we propose a Randomized Optimal SElection (ROSE) design, which minimizes sample size while ensuring the probability of correct selection of the OBD at pre-specified accuracy levels. The ROSE design is simple to implement, involving a straightforward comparison of the difference in response rates between two dose arms against a predetermined decision boundary. We further consider a two-stage ROSE design that allows for early selection of the OBD at the interim when there is sufficient evidence, further reducing the sample size. Simulation studies demonstrate that the ROSE design exhibits desirable operating characteristics in correctly identifying the OBD. A sample size of 15-40 patients per dosage arm typically results in a percentage of correct selection of the optimal dose ranging from 60% to 70%.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12505323/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145249541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: Nonparametric assessment of regimen response curve estimators. 修正:方案反应曲线估计器的非参数评估。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf137
{"title":"Correction to: Nonparametric assessment of regimen response curve estimators.","authors":"","doi":"10.1093/biomtc/ujaf137","DOIUrl":"10.1093/biomtc/ujaf137","url":null,"abstract":"","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12525390/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145298458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1