首页 > 最新文献

Biometrics最新文献

英文 中文
Smooth and shape-constrained quantile distributed lag models. 光滑和形状约束的分位数分布滞后模型。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf101
Yisen Jin, Aaron J Molstad, Ander Wilson, Joseph Antonelli

Exposure to environmental pollutants during the gestational period can significantly impact infant health outcomes, such as birth weight and neurological development. Identifying critical windows of susceptibility, which are specific periods during pregnancy when exposure has the most profound effects, is essential for developing targeted interventions. Distributed lag models (DLMs) are widely used in environmental epidemiology to analyze the temporal patterns of exposure and their impact on health outcomes. However, traditional DLMs focus on modeling the conditional mean, which may fail to capture heterogeneity in the relationship between predictors and the outcome. Moreover, when modeling the distribution of health outcomes like gestational birth weight, it is the extreme quantiles that are of most clinical relevance. We introduce 2 new quantile distributed lag model (QDLM) estimators designed to address the limitations of existing methods by leveraging smoothness and shape constraints, such as unimodality and concavity, to enhance interpretability and efficiency. We apply our QDLM estimators to the Colorado birth cohort data, demonstrating their effectiveness in identifying critical windows of susceptibility and informing public health interventions.

妊娠期暴露于环境污染物可显著影响婴儿的健康结果,如出生体重和神经发育。确定易感性的关键窗口期,即怀孕期间暴露影响最深远的特定时期,对于制定有针对性的干预措施至关重要。分布滞后模型(DLMs)在环境流行病学中被广泛用于分析暴露的时间模式及其对健康结果的影响。然而,传统的dlm侧重于对条件均值进行建模,这可能无法捕捉预测因子与结果之间关系的异质性。此外,当对诸如妊娠出生体重等健康结果的分布进行建模时,最具临床相关性的是极端分位数。我们引入了两个新的分位数分布滞后模型(QDLM)估计器,旨在通过利用平滑性和形状约束(如单峰性和凹性)来解决现有方法的局限性,以提高可解释性和效率。我们将我们的QDLM估计器应用于科罗拉多州出生队列数据,证明了它们在识别易感性关键窗口和告知公共卫生干预措施方面的有效性。
{"title":"Smooth and shape-constrained quantile distributed lag models.","authors":"Yisen Jin, Aaron J Molstad, Ander Wilson, Joseph Antonelli","doi":"10.1093/biomtc/ujaf101","DOIUrl":"10.1093/biomtc/ujaf101","url":null,"abstract":"<p><p>Exposure to environmental pollutants during the gestational period can significantly impact infant health outcomes, such as birth weight and neurological development. Identifying critical windows of susceptibility, which are specific periods during pregnancy when exposure has the most profound effects, is essential for developing targeted interventions. Distributed lag models (DLMs) are widely used in environmental epidemiology to analyze the temporal patterns of exposure and their impact on health outcomes. However, traditional DLMs focus on modeling the conditional mean, which may fail to capture heterogeneity in the relationship between predictors and the outcome. Moreover, when modeling the distribution of health outcomes like gestational birth weight, it is the extreme quantiles that are of most clinical relevance. We introduce 2 new quantile distributed lag model (QDLM) estimators designed to address the limitations of existing methods by leveraging smoothness and shape constraints, such as unimodality and concavity, to enhance interpretability and efficiency. We apply our QDLM estimators to the Colorado birth cohort data, demonstrating their effectiveness in identifying critical windows of susceptibility and informing public health interventions.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12381565/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144941091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Valid and efficient inference for nonparametric variable importance in two-phase studies. 两阶段研究中非参数变量重要性的有效推断。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf095
Guorong Dai, Raymond J Carroll, Jinbo Chen

We consider a common nonparametric regression setting, where the data consist of a response variable Y, some easily obtainable covariates $mathbf {X}$, and a set of costly covariates $mathbf {Z}$. Before establishing predictive models for Y, a natural question arises: Is it worthwhile to include $mathbf {Z}$ as predictors, given the additional cost of collecting data on $mathbf {Z}$ for both training the models and predicting Y for future individuals? Therefore, we aim to conduct preliminary investigations to infer importance of $mathbf {Z}$ in predicting Y in the presence of $mathbf {X}$. To achieve this goal, we propose a nonparametric variable importance measure for $mathbf {Z}$. It is defined as a parameter that aggregates maximum potential contributions of $mathbf {Z}$ in single or multiple predictive models, with contributions quantified by general loss functions. Considering two-phase data that provide a large number of observations for $(Y,mathbf {X})$ with the expensive $mathbf {Z}$ measured only in a small subsample, we develop a novel approach to infer the proposed importance measure, accommodating missingness of $mathbf {Z}$ in the sample by substituting functions of $(Y,mathbf {X})$ for each individual's contribution to the predictive loss of models involving $mathbf {Z}$. Our approach attains unified and efficient inference regardless of whether $mathbf {Z}$ makes zero or positive contribution to predicting Y, a desirable yet surprising property owing to data incompleteness. As intermediate steps of our theoretical development, we establish novel results in two relevant research areas, semi-supervised inference and two-phase nonparametric estimation. Numerical results from both simulated and real data demonstrate superior performance of our approach.

我们考虑一个常见的非参数回归设置,其中数据由响应变量Y,一些容易获得的协变量$mathbf {X}$和一组昂贵的协变量$mathbf {Z}$组成。在为Y建立预测模型之前,一个自然的问题出现了:考虑到在$mathbf {Z}$上收集数据以训练模型和预测未来个体的Y的额外成本,是否值得将$mathbf {Z}$作为预测器?因此,我们的目标是进行初步调查,以推断$mathbf {Z}$在$mathbf {X}$存在时预测Y的重要性。为了实现这一目标,我们提出了$mathbf {Z}$的非参数变量重要性度量。它被定义为在单个或多个预测模型中聚合$mathbf {Z}$的最大潜在贡献的参数,其贡献由一般损失函数量化。考虑到两阶段数据为$(Y,mathbf {X})$提供了大量的观测值,而$mathbf {Z}$仅在一个小的子样本中测量,我们开发了一种新的方法来推断所提出的重要性度量,通过将$(Y,mathbf {X})$的函数替换为每个个体对涉及$mathbf {Z}$的模型的预测损失的贡献来适应样本中$mathbf {Z}$的缺失。无论$mathbf {Z}$对预测Y的贡献是零还是正,我们的方法都实现了统一和有效的推理,这是由于数据不完整而令人期望但令人惊讶的性质。作为我们理论发展的中间步骤,我们在半监督推理和两相非参数估计两个相关的研究领域建立了新的结果。仿真和实际数据的数值结果都证明了该方法的优越性。
{"title":"Valid and efficient inference for nonparametric variable importance in two-phase studies.","authors":"Guorong Dai, Raymond J Carroll, Jinbo Chen","doi":"10.1093/biomtc/ujaf095","DOIUrl":"10.1093/biomtc/ujaf095","url":null,"abstract":"<p><p>We consider a common nonparametric regression setting, where the data consist of a response variable Y, some easily obtainable covariates $mathbf {X}$, and a set of costly covariates $mathbf {Z}$. Before establishing predictive models for Y, a natural question arises: Is it worthwhile to include $mathbf {Z}$ as predictors, given the additional cost of collecting data on $mathbf {Z}$ for both training the models and predicting Y for future individuals? Therefore, we aim to conduct preliminary investigations to infer importance of $mathbf {Z}$ in predicting Y in the presence of $mathbf {X}$. To achieve this goal, we propose a nonparametric variable importance measure for $mathbf {Z}$. It is defined as a parameter that aggregates maximum potential contributions of $mathbf {Z}$ in single or multiple predictive models, with contributions quantified by general loss functions. Considering two-phase data that provide a large number of observations for $(Y,mathbf {X})$ with the expensive $mathbf {Z}$ measured only in a small subsample, we develop a novel approach to infer the proposed importance measure, accommodating missingness of $mathbf {Z}$ in the sample by substituting functions of $(Y,mathbf {X})$ for each individual's contribution to the predictive loss of models involving $mathbf {Z}$. Our approach attains unified and efficient inference regardless of whether $mathbf {Z}$ makes zero or positive contribution to predicting Y, a desirable yet surprising property owing to data incompleteness. As intermediate steps of our theoretical development, we establish novel results in two relevant research areas, semi-supervised inference and two-phase nonparametric estimation. Numerical results from both simulated and real data demonstrate superior performance of our approach.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12312401/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144752245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tree-based additive noise directed acyclic graphical models for nonlinear causal discovery with interactions. 具有相互作用的非线性因果发现的树型加性噪声有向无环图模型。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf089
Fangting Zhou, Kejun He, Yang Ni

Directed acyclic graphical models with additive noises are essential in nonlinear causal discovery and have numerous applications in various domains, such as social science and systems biology. Most such models further assume that structural causal functions are additive to ensure causal identifiability and computational feasibility, which may be too restrictive in the presence of causal interactions. Some methods consider general nonlinear causal functions represented by, for example, Gaussian processes and neural networks, to accommodate interactions. However, they are either computationally intensive or lack interpretability. We propose a highly interpretable and computationally feasible approach using trees to incorporate interactions in nonlinear causal discovery, termed tree-based additive noise models. The nature of the tree construction leads to piecewise constant causal functions, making existing causal identifiability results of additive noise models with continuous and smooth causal functions inapplicable. Therefore, we provide new conditions under which the proposed model is identifiable. We develop a recursive algorithm for source node identification and a score-based ordering search algorithm. Through extensive simulations, we demonstrate the utility of the proposed model and algorithms benchmarking against existing additive noise models, especially when there are strong causal interactions. Our method is applied to infer a protein-protein interaction network for breast cancer, where proteins may form protein complexes to perform their functions.

具有加性噪声的有向无环图模型在非线性因果发现中是必不可少的,在社会科学和系统生物学等各个领域都有广泛的应用。大多数这样的模型进一步假设结构因果函数是相加的,以确保因果可识别性和计算可行性,这在因果相互作用的存在下可能过于限制。一些方法考虑一般的非线性因果函数,例如,高斯过程和神经网络,以适应相互作用。然而,它们要么是计算密集型的,要么缺乏可解释性。我们提出了一种高度可解释和计算上可行的方法,使用树来结合非线性因果发现中的相互作用,称为基于树的加性噪声模型。由于树型结构的性质导致因果函数为分段常数,使得具有连续光滑因果函数的加性噪声模型的现有因果可辨识性结果不适用。因此,我们提供了新的条件,在这些条件下,所提议的模型是可识别的。我们开发了一种用于源节点识别的递归算法和一种基于分数的排序搜索算法。通过广泛的模拟,我们证明了所提出的模型和算法对现有的加性噪声模型的基准测试的实用性,特别是当存在强烈的因果相互作用时。我们的方法被应用于推断乳腺癌的蛋白质-蛋白质相互作用网络,其中蛋白质可能形成蛋白质复合物来执行其功能。
{"title":"Tree-based additive noise directed acyclic graphical models for nonlinear causal discovery with interactions.","authors":"Fangting Zhou, Kejun He, Yang Ni","doi":"10.1093/biomtc/ujaf089","DOIUrl":"10.1093/biomtc/ujaf089","url":null,"abstract":"<p><p>Directed acyclic graphical models with additive noises are essential in nonlinear causal discovery and have numerous applications in various domains, such as social science and systems biology. Most such models further assume that structural causal functions are additive to ensure causal identifiability and computational feasibility, which may be too restrictive in the presence of causal interactions. Some methods consider general nonlinear causal functions represented by, for example, Gaussian processes and neural networks, to accommodate interactions. However, they are either computationally intensive or lack interpretability. We propose a highly interpretable and computationally feasible approach using trees to incorporate interactions in nonlinear causal discovery, termed tree-based additive noise models. The nature of the tree construction leads to piecewise constant causal functions, making existing causal identifiability results of additive noise models with continuous and smooth causal functions inapplicable. Therefore, we provide new conditions under which the proposed model is identifiable. We develop a recursive algorithm for source node identification and a score-based ordering search algorithm. Through extensive simulations, we demonstrate the utility of the proposed model and algorithms benchmarking against existing additive noise models, especially when there are strong causal interactions. Our method is applied to infer a protein-protein interaction network for breast cancer, where proteins may form protein complexes to perform their functions.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12288665/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144706199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonparametric Bayesian approach for dynamic borrowing of historical control data. 历史控制数据动态借用的非参数贝叶斯方法。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf118
Tomohiro Ohigashi, Kazushi Maruo, Takashi Sozu, Masahiko Gosho

When incorporating historical control data into the analysis of current randomized controlled trial data, it is critical to account for differences between the datasets. When the cause of difference is an unmeasured factor and adjustment for only observed covariates is insufficient, it is desirable to use a dynamic borrowing method that reduces the impact of heterogeneous historical controls. We propose a nonparametric Bayesian approach that addresses between-trial heterogeneity and allows borrowing historical controls homogeneous with the current control. Additionally, to emphasize conflict resolution between historical controls and the current control, we introduce a method based on the dependent Dirichlet process (DP) mixture. The proposed methods can be implemented using the same procedure, regardless of whether the outcome data comprise aggregated study-level data or individual participant data. We also develop a novel index of similarity between the historical and current control data, based on the posterior distribution of the parameter of interest. We conduct a simulation study and analyze clinical trial examples to evaluate the performance of the proposed methods compared to existing methods. The proposed method, based on the dependent DP mixture, can accurately borrow from homogeneous historical controls while reducing the impact of heterogeneous historical controls compared to the typical DP mixture. The proposed methods outperform existing methods in scenarios with heterogeneous historical controls, in which the meta-analytic approach is ineffective.

当将历史对照数据纳入当前随机对照试验数据分析时,考虑数据集之间的差异是至关重要的。当差异的原因是一个无法测量的因素,并且仅对观察到的协变量进行调整是不够的,需要使用动态借用方法来减少异质历史控制的影响。我们提出了一种非参数贝叶斯方法,该方法解决了试验之间的异质性,并允许借用与当前控制相同的历史控制。此外,为了强调历史控制和当前控制之间的冲突解决,我们引入了一种基于相关狄利克雷过程(DP)混合的方法。无论结果数据是包括总体研究水平数据还是个体参与者数据,所建议的方法都可以使用相同的程序来实施。我们还基于感兴趣参数的后验分布,开发了一种新的历史和当前控制数据之间的相似性指数。我们进行了模拟研究并分析了临床试验实例,以评估所提出的方法与现有方法的性能。该方法基于依赖的DP混合,与典型的DP混合相比,可以准确地借鉴同质历史控制,同时减少异质历史控制的影响。在具有异构历史控制的情况下,所提出的方法优于现有方法,在这种情况下,元分析方法是无效的。
{"title":"Nonparametric Bayesian approach for dynamic borrowing of historical control data.","authors":"Tomohiro Ohigashi, Kazushi Maruo, Takashi Sozu, Masahiko Gosho","doi":"10.1093/biomtc/ujaf118","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf118","url":null,"abstract":"<p><p>When incorporating historical control data into the analysis of current randomized controlled trial data, it is critical to account for differences between the datasets. When the cause of difference is an unmeasured factor and adjustment for only observed covariates is insufficient, it is desirable to use a dynamic borrowing method that reduces the impact of heterogeneous historical controls. We propose a nonparametric Bayesian approach that addresses between-trial heterogeneity and allows borrowing historical controls homogeneous with the current control. Additionally, to emphasize conflict resolution between historical controls and the current control, we introduce a method based on the dependent Dirichlet process (DP) mixture. The proposed methods can be implemented using the same procedure, regardless of whether the outcome data comprise aggregated study-level data or individual participant data. We also develop a novel index of similarity between the historical and current control data, based on the posterior distribution of the parameter of interest. We conduct a simulation study and analyze clinical trial examples to evaluate the performance of the proposed methods compared to existing methods. The proposed method, based on the dependent DP mixture, can accurately borrow from homogeneous historical controls while reducing the impact of heterogeneous historical controls compared to the typical DP mixture. The proposed methods outperform existing methods in scenarios with heterogeneous historical controls, in which the meta-analytic approach is ineffective.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144941173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inference on age-specific fertility in ecology and evolution. Learning from other disciplines and improving the state of the art. 生态学和进化中年龄生育率的推论。向其他学科学习,提高技术水平。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf081
Fernando Colchero

Despite the importance of age-specific fertility for ecology and evolution, the methods for modeling and inference have proven considerably limited. However, other disciplines have long focused on exploring and developing a vast number of models. Here, I provide an overview of the different models proposed since the 1940s by formal demographers, statisticians, and social scientists, most of which are unknown to the ecological and evolutionary communities. I describe how these fall into 2 main categories, namely polynomials and those based on probability density functions. I discuss their merits in terms of their overall behavior and how well they represent the different stages of fertility. Despite many alternative models, inference on age-specific fertility has usually been limited to simple least squares. Although this might be sufficient for human data, I hope to demonstrate that inference requires more sophisticated approaches for ecological and evolutionary datasets. To illustrate how inference and model choice can be achieved on different types of typical ecological and evolutionary data, I present the new R package Bayesian Fertility Trajectory Analysis, which I apply to published aggregated data for lions and baboons. I then conduct a simulation study to test its performance on individual-level data. I show that appropriate inference and model selection can be achieved even when a small number of parents are followed.

尽管特定年龄的生育能力对生态学和进化具有重要意义,但建模和推理的方法已被证明相当有限。然而,其他学科长期以来一直专注于探索和开发大量的模型。在这里,我概述了自20世纪40年代以来由正式的人口统计学家、统计学家和社会科学家提出的不同模型,其中大多数模型尚不为生态和进化社区所知。我描述了它们如何分为两大类,即多项式和基于概率密度函数的多项式。我从它们的整体行为以及它们如何很好地代表生育力的不同阶段来讨论它们的优点。尽管有许多可供选择的模型,但对特定年龄生育率的推断通常仅限于简单的最小二乘。虽然这对于人类数据来说可能已经足够了,但我希望证明,对于生态和进化数据集,推理需要更复杂的方法。为了说明如何在不同类型的典型生态和进化数据上实现推理和模型选择,我介绍了新的R包贝叶斯生育轨迹分析,我将其应用于已发表的狮子和狒狒的汇总数据。然后,我进行了模拟研究,以测试其在个人层面数据上的性能。我表明,适当的推理和模型选择可以实现,即使少数父母被跟踪。
{"title":"Inference on age-specific fertility in ecology and evolution. Learning from other disciplines and improving the state of the art.","authors":"Fernando Colchero","doi":"10.1093/biomtc/ujaf081","DOIUrl":"10.1093/biomtc/ujaf081","url":null,"abstract":"<p><p>Despite the importance of age-specific fertility for ecology and evolution, the methods for modeling and inference have proven considerably limited. However, other disciplines have long focused on exploring and developing a vast number of models. Here, I provide an overview of the different models proposed since the 1940s by formal demographers, statisticians, and social scientists, most of which are unknown to the ecological and evolutionary communities. I describe how these fall into 2 main categories, namely polynomials and those based on probability density functions. I discuss their merits in terms of their overall behavior and how well they represent the different stages of fertility. Despite many alternative models, inference on age-specific fertility has usually been limited to simple least squares. Although this might be sufficient for human data, I hope to demonstrate that inference requires more sophisticated approaches for ecological and evolutionary datasets. To illustrate how inference and model choice can be achieved on different types of typical ecological and evolutionary data, I present the new R package Bayesian Fertility Trajectory Analysis, which I apply to published aggregated data for lions and baboons. I then conduct a simulation study to test its performance on individual-level data. I show that appropriate inference and model selection can be achieved even when a small number of parents are followed.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144706183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Precision generalized phase I-II designs. 精密广义I-II期设计。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf043
Saijun Zhao, Peter F Thall, Ying Yuan, Juhee Lee, Pavlos Msaouel, Yong Zang

A new family of precision Bayesian dose optimization designs, PGen I-II, based on early efficacy, early toxicity, and long-term time to treatment failure is proposed. A PGen I-II design refines a Gen I-II design by accounting for patient heterogeneity characterized by subgroups that may be defined by prognostic levels, disease subtypes, or biomarker categories. The design makes subgroup-specific decisions, which may be to drop an unacceptably toxic or inefficacious dose, randomize patients among acceptable doses, or identify a best dose in terms of treatment success defined in terms of time to failure over long-term follow-up. A piecewise exponential distribution for failure time is assumed, including subgroup-specific effects of dose, response, and toxicity. Latent variables are used to adaptively cluster subgroups found to have similar dose-outcome distributions, with the model simplified to borrow strength between subgroups in the same cluster. Guidelines and user-friendly computer software for implementing the design are provided. A simulation study is reported that shows the PGen I-II design is superior to similarly structured designs that either assume patient homogeneity or conduct separate trials within subgroups.

提出了一种新的精确贝叶斯剂量优化设计,PGen I-II,基于早期疗效,早期毒性和治疗失败的长期时间。PGen I-II设计通过考虑以预后水平、疾病亚型或生物标志物类别定义的亚组为特征的患者异质性,完善了Gen I-II设计。该设计针对亚组做出特定的决定,可能是减少不可接受的毒性或无效剂量,将患者随机分配到可接受的剂量中,或根据长期随访中从失败到治疗成功的时间确定最佳剂量。假设失效时间呈分段指数分布,包括剂量、反应和毒性的亚组特异性效应。潜在变量用于自适应地聚类具有相似剂量-结果分布的子组,并简化模型以借用同一聚类中子组之间的强度。提供了实施设计的指引和用户友好的计算机软件。据报道,一项模拟研究表明,PGen I-II设计优于类似的结构设计,即假设患者同质性或在亚组内进行单独的试验。
{"title":"Precision generalized phase I-II designs.","authors":"Saijun Zhao, Peter F Thall, Ying Yuan, Juhee Lee, Pavlos Msaouel, Yong Zang","doi":"10.1093/biomtc/ujaf043","DOIUrl":"10.1093/biomtc/ujaf043","url":null,"abstract":"<p><p>A new family of precision Bayesian dose optimization designs, PGen I-II, based on early efficacy, early toxicity, and long-term time to treatment failure is proposed. A PGen I-II design refines a Gen I-II design by accounting for patient heterogeneity characterized by subgroups that may be defined by prognostic levels, disease subtypes, or biomarker categories. The design makes subgroup-specific decisions, which may be to drop an unacceptably toxic or inefficacious dose, randomize patients among acceptable doses, or identify a best dose in terms of treatment success defined in terms of time to failure over long-term follow-up. A piecewise exponential distribution for failure time is assumed, including subgroup-specific effects of dose, response, and toxicity. Latent variables are used to adaptively cluster subgroups found to have similar dose-outcome distributions, with the model simplified to borrow strength between subgroups in the same cluster. Guidelines and user-friendly computer software for implementing the design are provided. A simulation study is reported that shows the PGen I-II design is superior to similarly structured designs that either assume patient homogeneity or conduct separate trials within subgroups.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12288667/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144706184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Doubly robust nonparametric estimators of the predictive value of covariates for survival data. 生存数据协变量预测值的双鲁棒非参数估计。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf084
Torben Martinussen, Mark J van der Laan

The predictive value of a covariate is often of interest in studies with a survival endpoint. A common situation is that there are some well established predictors and a potential valuable new marker. The challenge is how to judge the potentially added predictive value of this new marker. We propose to use the positive predictive value (PPV) curve based on a nonparametric scoring rule. The estimand of interest is viewed as a single transformation of the underlying data generating probability measure, which allows us to develop a robust nonparametric estimator of the PPV by first calculating the corresponding efficient influence function. We provide asymptotic results and illustrate the approach with numerical studies and with 2 cancer data studies.

在有生存终点的研究中,协变量的预测值通常很重要。一种常见的情况是,有一些很好的预测因素和一个潜在的有价值的新标记。挑战在于如何判断这种新标记物潜在的附加预测价值。我们建议使用基于非参数评分规则的正预测值(PPV)曲线。感兴趣的估计被视为生成概率度量的底层数据的单一变换,这使我们能够通过首先计算相应的有效影响函数来开发PPV的鲁棒非参数估计。我们提供了渐近的结果,并用数值研究和2个癌症数据研究说明了该方法。
{"title":"Doubly robust nonparametric estimators of the predictive value of covariates for survival data.","authors":"Torben Martinussen, Mark J van der Laan","doi":"10.1093/biomtc/ujaf084","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf084","url":null,"abstract":"<p><p>The predictive value of a covariate is often of interest in studies with a survival endpoint. A common situation is that there are some well established predictors and a potential valuable new marker. The challenge is how to judge the potentially added predictive value of this new marker. We propose to use the positive predictive value (PPV) curve based on a nonparametric scoring rule. The estimand of interest is viewed as a single transformation of the underlying data generating probability measure, which allows us to develop a robust nonparametric estimator of the PPV by first calculating the corresponding efficient influence function. We provide asymptotic results and illustrate the approach with numerical studies and with 2 cancer data studies.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144697476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: Evaluating the effects of high-throughput structural neuroimaging predictors on whole-brain functional connectome outcomes via network-based matrix-on-vector regression. 修正:通过基于网络的矩阵向量回归评估高通量结构神经成像预测因子对全脑功能连接组结果的影响。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf111
{"title":"Correction to: Evaluating the effects of high-throughput structural neuroimaging predictors on whole-brain functional connectome outcomes via network-based matrix-on-vector regression.","authors":"","doi":"10.1093/biomtc/ujaf111","DOIUrl":"10.1093/biomtc/ujaf111","url":null,"abstract":"","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12341978/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144833843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regression analysis of interval-censored failure time data with change points and a cured subgroup. 具有变化点和修复子群的间隔截尾失效时间数据的回归分析。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf100
Yichen Lou, Mingyue Du, Xinyuan Song

There exists a substantial body of literature that discusses regression analysis of interval-censored failure time data and also many methods have been proposed for handling the presence of a cured subgroup. However, only limited research exists on the problems incorporating change points, with or without a cured subgroup, which can occur in various contexts such as clinical trials where disease risks may shift dramatically when certain biological indicators exceed specific thresholds. To fill this gap, we consider a class of partly linear transformation models within the mixture cure model framework and propose a sieve maximum likelihood estimation approach using Bernstein polynomials and piecewise linear functions for inference. Additionally, we provide a data-driven adaptive procedure to identify the number and locations of change points and establish the asymptotic properties of the proposed method. Extensive simulation studies demonstrate the effectiveness and practical utility of the proposed methods, which are applied to the real data from a breast cancer study that motivated this work.

有大量的文献讨论了间隔截短失效时间数据的回归分析,也提出了许多方法来处理治愈亚群的存在。然而,只有有限的研究存在纳入变化点的问题,有或没有治愈亚组,这可能发生在各种情况下,如临床试验中,当某些生物指标超过特定阈值时,疾病风险可能会发生巨大变化。为了填补这一空白,我们考虑了混合模型框架内的一类部分线性变换模型,并提出了一种使用Bernstein多项式和分段线性函数进行推理的筛极大似然估计方法。此外,我们提供了一个数据驱动的自适应过程来识别变化点的数量和位置,并建立了所提出方法的渐近性质。大量的模拟研究证明了所提出方法的有效性和实用性,并将其应用于激发这项工作的乳腺癌研究的真实数据。
{"title":"Regression analysis of interval-censored failure time data with change points and a cured subgroup.","authors":"Yichen Lou, Mingyue Du, Xinyuan Song","doi":"10.1093/biomtc/ujaf100","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf100","url":null,"abstract":"<p><p>There exists a substantial body of literature that discusses regression analysis of interval-censored failure time data and also many methods have been proposed for handling the presence of a cured subgroup. However, only limited research exists on the problems incorporating change points, with or without a cured subgroup, which can occur in various contexts such as clinical trials where disease risks may shift dramatically when certain biological indicators exceed specific thresholds. To fill this gap, we consider a class of partly linear transformation models within the mixture cure model framework and propose a sieve maximum likelihood estimation approach using Bernstein polynomials and piecewise linear functions for inference. Additionally, we provide a data-driven adaptive procedure to identify the number and locations of change points and establish the asymptotic properties of the proposed method. Extensive simulation studies demonstrate the effectiveness and practical utility of the proposed methods, which are applied to the real data from a breast cancer study that motivated this work.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144833845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Negative binomial mixed effects location-scale models for intensive longitudinal count-type physical activity data provided by wearable devices. 可穿戴设备提供的密集纵向计数型体力活动数据的负二项混合效应位置尺度模型。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf099
Qianheng Ma, Genevieve F Dunton, Donald Hedeker

In recent years, the use of wearable devices, for example, accelerometers, have become increasingly prevalent. Wearable devices enable more accurate real-time tracking of a subject's physical activity (PA) level, such as steps, number of activity bouts, or time in moderate-to-vigorous intensity PA (MVPA), which are important general health markers and can often be represented as counts. These intensive within-subject count data provided by wearable devices, for example, minutes in MVPA summarized per hour across days and even months, allow the possibility for modeling not only the mean PA level, but also the dispersion level for each subject. Especially in the context of daily PA, subjects' dispersion levels are potentially informative in reflecting their exercise patterns: some subjects might exhibit consistent PA across time and can be considered "less dispersed" subjects; while others might have a large amount of PA at a particular time point, while being sedentary for most of the day, and can be considered "more dispersed" subjects. Thus, we propose a negative binomial mixed effects location-scale model to model these intensive longitudinal PA counts and to account for the heterogeneity in both the mean and dispersion level across subjects. Further, to handle the issue of inflated numbers of zeros in the PA data, we also propose a hurdle/zero-inflated version which additionally includes the modeling of the probability of having $>$0 PA levels.

近年来,使用可穿戴设备,例如加速度计,变得越来越普遍。可穿戴设备可以更准确地实时跟踪受试者的身体活动(PA)水平,如步数、活动次数或中高强度PA (MVPA)的时间,这些都是重要的一般健康指标,通常可以用计数表示。这些由可穿戴设备提供的密集的受试者内部计数数据,例如,在几天甚至几个月的时间里,每小时总结的MVPA分钟数,不仅可以模拟平均PA水平,还可以模拟每个受试者的分散水平。特别是在日常PA的背景下,受试者的分散水平在反映他们的运动模式方面具有潜在的信息:一些受试者可能在一段时间内表现出一致的PA,可以被认为是“较少分散”的受试者;而另一些人可能在一个特定的时间点有大量的PA,而一天中的大部分时间都是久坐不动的,可以被认为是“更分散”的受试者。因此,我们提出了一个负二项混合效应位置尺度模型来模拟这些密集的纵向PA计数,并解释受试者之间均值和分散水平的异质性。此外,为了处理PA数据中虚增的零数问题,我们还提出了一个障碍/零虚增的版本,该版本还包括具有$>$0 PA水平的概率建模。
{"title":"Negative binomial mixed effects location-scale models for intensive longitudinal count-type physical activity data provided by wearable devices.","authors":"Qianheng Ma, Genevieve F Dunton, Donald Hedeker","doi":"10.1093/biomtc/ujaf099","DOIUrl":"10.1093/biomtc/ujaf099","url":null,"abstract":"<p><p>In recent years, the use of wearable devices, for example, accelerometers, have become increasingly prevalent. Wearable devices enable more accurate real-time tracking of a subject's physical activity (PA) level, such as steps, number of activity bouts, or time in moderate-to-vigorous intensity PA (MVPA), which are important general health markers and can often be represented as counts. These intensive within-subject count data provided by wearable devices, for example, minutes in MVPA summarized per hour across days and even months, allow the possibility for modeling not only the mean PA level, but also the dispersion level for each subject. Especially in the context of daily PA, subjects' dispersion levels are potentially informative in reflecting their exercise patterns: some subjects might exhibit consistent PA across time and can be considered \"less dispersed\" subjects; while others might have a large amount of PA at a particular time point, while being sedentary for most of the day, and can be considered \"more dispersed\" subjects. Thus, we propose a negative binomial mixed effects location-scale model to model these intensive longitudinal PA counts and to account for the heterogeneity in both the mean and dispersion level across subjects. Further, to handle the issue of inflated numbers of zeros in the PA data, we also propose a hurdle/zero-inflated version which additionally includes the modeling of the probability of having $>$0 PA levels.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12312402/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144752244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1