首页 > 最新文献

Biometrics最新文献

英文 中文
Bayesian inference for Cox regression models using catalytic prior distributions. 使用催化先验分布的Cox回归模型的贝叶斯推断。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2026-01-06 DOI: 10.1093/biomtc/ujag004
Weihao Li, Dongming Huang

The Cox proportional hazards model (Cox model) is a popular model for survival data analysis. When the sample size is small relative to the dimension of the model, the standard maximum partial likelihood inference is often problematic. In this work, we propose the Cox catalytic prior distribution for Bayesian inference on Cox models, which extends a general class of prior distributions originally designed to stabilize complex parametric models. The Cox catalytic prior is formulated as a weighted likelihood of the regression coefficients derived from synthetic data and a surrogate baseline hazard constant. This surrogate hazard can be either provided by the user or estimated from the data, and the synthetic data are generated from the predictive distribution of a fitted simpler model. For point estimation, we derive an approximation of the marginal posterior mode, which can be computed conveniently as a regularized log partial likelihood estimator. We prove that our prior distribution is proper and the resulting estimator is consistent under mild conditions. In simulation studies, our proposed method outperforms standard maximum partial likelihood inference and is on par with existing shrinkage methods. We further illustrate the application of our method to a real dataset.

Cox比例风险模型(Cox model)是一种常用的生存数据分析模型。当样本量相对于模型的维数较小时,标准的最大部分似然推理往往存在问题。在这项工作中,我们提出了Cox催化先验分布用于Cox模型上的贝叶斯推断,它扩展了最初设计用于稳定复杂参数模型的一般先验分布。Cox催化先验被表述为从合成数据和替代基线危险常数中得出的回归系数的加权似然。这种替代风险既可以由用户提供,也可以从数据中估计,合成数据是由拟合的更简单模型的预测分布生成的。对于点估计,我们导出了一个边际后验模的近似,它可以方便地计算为正则化对数偏似然估计量。在温和条件下,证明了先验分布是适当的,估计量是一致的。在模拟研究中,我们提出的方法优于标准的最大部分似然推理,与现有的收缩方法相当。我们进一步说明了我们的方法在真实数据集上的应用。
{"title":"Bayesian inference for Cox regression models using catalytic prior distributions.","authors":"Weihao Li, Dongming Huang","doi":"10.1093/biomtc/ujag004","DOIUrl":"https://doi.org/10.1093/biomtc/ujag004","url":null,"abstract":"<p><p>The Cox proportional hazards model (Cox model) is a popular model for survival data analysis. When the sample size is small relative to the dimension of the model, the standard maximum partial likelihood inference is often problematic. In this work, we propose the Cox catalytic prior distribution for Bayesian inference on Cox models, which extends a general class of prior distributions originally designed to stabilize complex parametric models. The Cox catalytic prior is formulated as a weighted likelihood of the regression coefficients derived from synthetic data and a surrogate baseline hazard constant. This surrogate hazard can be either provided by the user or estimated from the data, and the synthetic data are generated from the predictive distribution of a fitted simpler model. For point estimation, we derive an approximation of the marginal posterior mode, which can be computed conveniently as a regularized log partial likelihood estimator. We prove that our prior distribution is proper and the resulting estimator is consistent under mild conditions. In simulation studies, our proposed method outperforms standard maximum partial likelihood inference and is on par with existing shrinkage methods. We further illustrate the application of our method to a real dataset.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"82 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Long-term memory effects of an incremental blood pressure intervention in a mortal cohort. 增加血压干预对人类长期记忆的影响。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2026-01-06 DOI: 10.1093/biomtc/ujaf176
Maria Josefsson, Nina Karalija, Michael J Daniels

In the present study, we examine long-term population-level effects on episodic memory of an intervention over 15 years that reduces systolic blood pressure in individuals with hypertension. A limitation with previous research on the potential risk reduction of such interventions is that they do not properly account for the reduction of mortality rates. Hence, one can only speculate whether the effect is due to changes in memory or changes in mortality. Therefore, we extend previous research by providing both an etiological and a prognostic effect estimate. To do this, we propose a Bayesian semi-parametric estimation approach for an incremental threshold intervention, using the extended G-formula. Additionally, we introduce a novel sparsity-inducing Dirichlet prior for longitudinal data, that exploits the longitudinal structure of the data. We demonstrate the usefulness of our approach in simulations, and compare its performance to other Bayesian decision tree ensemble approaches. In our analysis of the data from the Betula cohort, we found no significant prognostic or etiological effects across all ages. This suggests that systolic blood pressure interventions likely do not strongly affect memory, either at the overall population level or among individuals who would remain alive under both the natural course and the intervention (the always survivor stratum).

在本研究中,我们研究了15年以上降低高血压患者收缩压的干预对情景记忆的长期影响。以往关于此类干预措施可能降低风险的研究的一个局限性是,它们没有适当地考虑到死亡率的降低。因此,人们只能推测这种影响是由于记忆的变化还是由于死亡率的变化。因此,我们通过提供病因学和预后影响评估来扩展先前的研究。为此,我们提出了一种贝叶斯半参数估计方法,用于增量阈值干预,使用扩展的g公式。此外,我们还为纵向数据引入了一种新的稀疏诱导Dirichlet先验,该先验利用了数据的纵向结构。我们在模拟中证明了我们的方法的实用性,并将其性能与其他贝叶斯决策树集成方法进行了比较。在我们对桦树队列数据的分析中,我们发现在所有年龄段都没有明显的预后或病因影响。这表明收缩压干预可能不会强烈影响记忆,无论是在总体水平上,还是在自然过程和干预(始终存活的阶层)下仍然存活的个体中。
{"title":"Long-term memory effects of an incremental blood pressure intervention in a mortal cohort.","authors":"Maria Josefsson, Nina Karalija, Michael J Daniels","doi":"10.1093/biomtc/ujaf176","DOIUrl":"10.1093/biomtc/ujaf176","url":null,"abstract":"<p><p>In the present study, we examine long-term population-level effects on episodic memory of an intervention over 15 years that reduces systolic blood pressure in individuals with hypertension. A limitation with previous research on the potential risk reduction of such interventions is that they do not properly account for the reduction of mortality rates. Hence, one can only speculate whether the effect is due to changes in memory or changes in mortality. Therefore, we extend previous research by providing both an etiological and a prognostic effect estimate. To do this, we propose a Bayesian semi-parametric estimation approach for an incremental threshold intervention, using the extended G-formula. Additionally, we introduce a novel sparsity-inducing Dirichlet prior for longitudinal data, that exploits the longitudinal structure of the data. We demonstrate the usefulness of our approach in simulations, and compare its performance to other Bayesian decision tree ensemble approaches. In our analysis of the data from the Betula cohort, we found no significant prognostic or etiological effects across all ages. This suggests that systolic blood pressure interventions likely do not strongly affect memory, either at the overall population level or among individuals who would remain alive under both the natural course and the intervention (the always survivor stratum).</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"82 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12865380/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal design of dynamic experiments for scalar-on-function linear models with application to a biopharmaceutical study. 函数上标度线性模型的动态实验优化设计及其在生物制药研究中的应用。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2026-01-06 DOI: 10.1093/biomtc/ujaf169
Damianos Michaelides, Maria Adamou, David C Woods, Antony M Overstall

A Bayesian optimal experimental design framework is developed for experiments where settings of one or more variables, referred to as profile variables, can be functions. For this type of experiment, a design consists of combinations of functions for each run of the experiment. Within a scalar-on-function linear model, profile variables are represented through basis expansions. This allows finite-dimensional representation of the profile variables and optimal designs to be found. The approach enables control over the complexity of the profile variables and model. The method is illustrated on a real application involving dynamic feeding strategies in an Ambr250 modular bioreactor system.

一个贝叶斯最优实验设计框架开发的实验,其中一个或多个变量的设置,称为轮廓变量,可以是函数。对于这种类型的实验,设计由每次实验运行的功能组合组成。在函数上标度线性模型中,轮廓变量通过基展开表示。这允许有限维表示的轮廓变量和最佳设计被发现。该方法可以控制概要文件变量和模型的复杂性。该方法在Ambr250模块化生物反应器系统中的动态投料策略的实际应用中得到了说明。
{"title":"Optimal design of dynamic experiments for scalar-on-function linear models with application to a biopharmaceutical study.","authors":"Damianos Michaelides, Maria Adamou, David C Woods, Antony M Overstall","doi":"10.1093/biomtc/ujaf169","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf169","url":null,"abstract":"<p><p>A Bayesian optimal experimental design framework is developed for experiments where settings of one or more variables, referred to as profile variables, can be functions. For this type of experiment, a design consists of combinations of functions for each run of the experiment. Within a scalar-on-function linear model, profile variables are represented through basis expansions. This allows finite-dimensional representation of the profile variables and optimal designs to be found. The approach enables control over the complexity of the profile variables and model. The method is illustrated on a real application involving dynamic feeding strategies in an Ambr250 modular bioreactor system.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"82 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145932042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An adaptive design for optimizing treatment assignment in randomized clinical trials. 随机临床试验中优化治疗分配的自适应设计。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2026-01-06 DOI: 10.1093/biomtc/ujaf168
Wei Zhang, Zhiwei Zhang, Aiyi Liu

The treatment assignment mechanism in a randomized clinical trial can be optimized for statistical efficiency within a specified class of randomization mechanisms. Optimal designs of this type have been characterized in terms of the variances of potential outcomes conditional on baseline covariates. Approximating these optimal designs requires information about the conditional variance functions, which is often unavailable or unreliable at the design stage. As a practical solution to this dilemma, we propose a multi-stage adaptive design that allows the treatment assignment mechanism to be modified at interim analyses based on accruing information about the conditional variance functions. This adaptation has profound implications on the distribution of trial data, which need to be accounted for in treatment effect estimation. We consider a class of treatment effect estimators that are consistent and asymptotically normal, identify the most efficient estimator within this class, and approximate the most efficient estimator by substituting estimates of unknown quantities. Simulation results indicate that, when there is little or no prior information available, the proposed design can bring substantial efficiency gains over conventional one-stage designs based on the same prior information. The methodology is illustrated with real data from a completed trial in stroke.

随机临床试验中的治疗分配机制可以在特定类别的随机化机制中优化统计效率。这种类型的优化设计的特点是潜在结果的方差取决于基线协变量。近似这些最优设计需要条件方差函数的信息,这些信息在设计阶段通常是不可用或不可靠的。作为解决这一困境的实际方案,我们提出了一种多阶段自适应设计,允许根据有关条件方差函数的累积信息在中期分析时修改处理分配机制。这种适应对试验数据的分布有深远的影响,需要在治疗效果估计中加以考虑。我们考虑了一类一致且渐近正态的治疗效果估计量,确定了该类中最有效的估计量,并通过替换未知量的估计来近似最有效的估计量。仿真结果表明,在先验信息很少或没有先验信息的情况下,与基于相同先验信息的常规单级设计相比,所提设计能带来显著的效率提升。该方法用一个已完成的中风试验的真实数据加以说明。
{"title":"An adaptive design for optimizing treatment assignment in randomized clinical trials.","authors":"Wei Zhang, Zhiwei Zhang, Aiyi Liu","doi":"10.1093/biomtc/ujaf168","DOIUrl":"10.1093/biomtc/ujaf168","url":null,"abstract":"<p><p>The treatment assignment mechanism in a randomized clinical trial can be optimized for statistical efficiency within a specified class of randomization mechanisms. Optimal designs of this type have been characterized in terms of the variances of potential outcomes conditional on baseline covariates. Approximating these optimal designs requires information about the conditional variance functions, which is often unavailable or unreliable at the design stage. As a practical solution to this dilemma, we propose a multi-stage adaptive design that allows the treatment assignment mechanism to be modified at interim analyses based on accruing information about the conditional variance functions. This adaptation has profound implications on the distribution of trial data, which need to be accounted for in treatment effect estimation. We consider a class of treatment effect estimators that are consistent and asymptotically normal, identify the most efficient estimator within this class, and approximate the most efficient estimator by substituting estimates of unknown quantities. Simulation results indicate that, when there is little or no prior information available, the proposed design can bring substantial efficiency gains over conventional one-stage designs based on the same prior information. The methodology is illustrated with real data from a completed trial in stroke.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"82 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145916646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating the causal effect of redlining on present-day air pollution. 估计红线对当今空气污染的因果影响。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2026-01-06 DOI: 10.1093/biomtc/ujaf173
Xiaodan Zhou, Shu Yang, Brian J Reich

Recent studies have shown associations between redlining policies (1935-1974) and present-day fine particulate matter (PM$_{2.5}$) and nitrogen dioxide (NO$_2$) air pollution concentrations. In this paper, we move beyond associations and investigate the causal effects of redlining using spatial causal inference. Redlining policies were enacted in the 1930s, so there is very limited documentation of pre-treatment covariates. Consequently, traditional methods failed to sufficiently account for unmeasured confounders, potentially biasing causal interpretations. By integrating historical redlining data with 2010 PM$_{2.5}$ and NO$_2$ concentrations, our study seeks to estimate the long-term causal impact. Our study addresses challenges with a novel spatial and non-spatial latent factor framework, using the unemployment rate, house rent and percentage of Black population in 1940 US Census as proxies to reconstruct pre-treatment latent socio-economic status. We establish identification of a causal effect under broad assumptions, and use Bayesian Markov Chain Monte Carlo to quantify uncertainty. Our causal analysis provides evidence that historically redlined neighborhoods are exposed to notably higher NO$_2$ concentration. In contrast, the disparities in PM$_{2.5}$ between these neighborhoods are less pronounced. Among the cities analyzed, Los Angeles, CA, and Atlanta, GA, demonstrate the most significant effects for both NO$_2$ and PM$_{2.5}$.

最近的研究表明,红线政策(1935-1974年)与当今细颗粒物(PM${2.5}$)和二氧化氮(NO$_2$)空气污染浓度之间存在关联。在本文中,我们超越了关联,并使用空间因果推理来研究红线的因果效应。红线政策是在20世纪30年代制定的,所以关于治疗前协变量的文献非常有限。因此,传统方法未能充分考虑未测量的混杂因素,可能会使因果解释产生偏差。通过将历史红线数据与2010年PM$_{2.5}$和NO$_2$浓度相结合,我们的研究试图估计长期因果影响。我们的研究通过一个新的空间和非空间潜在因素框架来解决这些挑战,使用失业率、房屋租金和1940年美国人口普查中的黑人人口百分比作为代理来重建治疗前潜在的社会经济地位。我们在广泛的假设下建立了因果效应的识别,并使用贝叶斯马尔可夫链蒙特卡罗来量化不确定性。我们的因果分析提供了历史上红线社区暴露于明显较高的NO$_2$浓度的证据。相比之下,这些社区之间的PM$_{2.5}$差异不太明显。在分析的城市中,加州洛杉矶和乔治亚州亚特兰大对NO$_2$和PM$_{2.5}$的影响最为显著。
{"title":"Estimating the causal effect of redlining on present-day air pollution.","authors":"Xiaodan Zhou, Shu Yang, Brian J Reich","doi":"10.1093/biomtc/ujaf173","DOIUrl":"10.1093/biomtc/ujaf173","url":null,"abstract":"<p><p>Recent studies have shown associations between redlining policies (1935-1974) and present-day fine particulate matter (PM$_{2.5}$) and nitrogen dioxide (NO$_2$) air pollution concentrations. In this paper, we move beyond associations and investigate the causal effects of redlining using spatial causal inference. Redlining policies were enacted in the 1930s, so there is very limited documentation of pre-treatment covariates. Consequently, traditional methods failed to sufficiently account for unmeasured confounders, potentially biasing causal interpretations. By integrating historical redlining data with 2010 PM$_{2.5}$ and NO$_2$ concentrations, our study seeks to estimate the long-term causal impact. Our study addresses challenges with a novel spatial and non-spatial latent factor framework, using the unemployment rate, house rent and percentage of Black population in 1940 US Census as proxies to reconstruct pre-treatment latent socio-economic status. We establish identification of a causal effect under broad assumptions, and use Bayesian Markov Chain Monte Carlo to quantify uncertainty. Our causal analysis provides evidence that historically redlined neighborhoods are exposed to notably higher NO$_2$ concentration. In contrast, the disparities in PM$_{2.5}$ between these neighborhoods are less pronounced. Among the cities analyzed, Los Angeles, CA, and Atlanta, GA, demonstrate the most significant effects for both NO$_2$ and PM$_{2.5}$.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"82 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805554/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145984363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semiparametric piecewise accelerated failure time model for the analysis of immune-oncology clinical trials. 用于免疫肿瘤临床试验分析的半参数分段加速失效时间模型。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2026-01-06 DOI: 10.1093/biomtc/ujaf171
Hisato Sunami, Satoshi Hattori

Effectiveness of immune-oncology chemotherapies has been presented in recent clinical trials. The Kaplan-Meier estimates of the survival functions of the immune therapy and the control often suggested the presence of the lag-time until the immune therapy began to act. It implies the use of hazard ratio under the proportional hazards assumption would not be appealing, and many alternatives have been investigated such as the restricted mean survival time. In addition to such overall summary of the treatment contrast, the lag-time is also an important feature of the treatment effect. Identical survival functions up to the lag-time implies patients who are likely to die before the lag-time would not benefit the treatment and identifying such patients would be very important. We propose the semiparametric piecewise accelerated failure time model and its inference procedure based on the semiparametric maximum likelihood method. It provides not only an overall treatment summary, but also a framework to identify patients who have less benefit from the immune-therapy in a unified way. Numerical experiments confirm that each parameter can be estimated with minimal bias. Through a real data analysis, we illustrate the evaluation of the effect of immune-oncology therapy and the characterization of covariates in which patients are unlikely to receive the benefit of treatment.

免疫肿瘤化疗的有效性已在最近的临床试验中得到证实。Kaplan-Meier对免疫治疗和对照组的生存功能的估计通常表明在免疫治疗开始起作用之前存在滞后时间。这意味着在比例风险假设下使用风险比并不具有吸引力,并且已经研究了许多替代方案,例如限制平均生存时间。除了对治疗对比进行这样的总体总结外,滞后时间也是治疗效果的一个重要特征。相同的生存功能直到滞后时间意味着可能在滞后时间之前死亡的患者将不会受益于治疗,识别此类患者将非常重要。提出了基于半参数极大似然法的半参数分段加速失效时间模型及其推理过程。它不仅提供了一个整体的治疗总结,而且还提供了一个框架,以统一的方式识别从免疫治疗中获益较少的患者。数值实验证明,每个参数都能以最小的偏差进行估计。通过真实的数据分析,我们说明了免疫肿瘤治疗效果的评估和协变量的特征,其中患者不太可能获得治疗的好处。
{"title":"Semiparametric piecewise accelerated failure time model for the analysis of immune-oncology clinical trials.","authors":"Hisato Sunami, Satoshi Hattori","doi":"10.1093/biomtc/ujaf171","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf171","url":null,"abstract":"<p><p>Effectiveness of immune-oncology chemotherapies has been presented in recent clinical trials. The Kaplan-Meier estimates of the survival functions of the immune therapy and the control often suggested the presence of the lag-time until the immune therapy began to act. It implies the use of hazard ratio under the proportional hazards assumption would not be appealing, and many alternatives have been investigated such as the restricted mean survival time. In addition to such overall summary of the treatment contrast, the lag-time is also an important feature of the treatment effect. Identical survival functions up to the lag-time implies patients who are likely to die before the lag-time would not benefit the treatment and identifying such patients would be very important. We propose the semiparametric piecewise accelerated failure time model and its inference procedure based on the semiparametric maximum likelihood method. It provides not only an overall treatment summary, but also a framework to identify patients who have less benefit from the immune-therapy in a unified way. Numerical experiments confirm that each parameter can be estimated with minimal bias. Through a real data analysis, we illustrate the evaluation of the effect of immune-oncology therapy and the characterization of covariates in which patients are unlikely to receive the benefit of treatment.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"82 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145916649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quasi-likelihood estimation for semiparametric circular regression models. 半参数圆形回归模型的拟似然估计。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2026-01-06 DOI: 10.1093/biomtc/ujag002
Anna Gottard, Andrea Meilán-Vila, Agnese Panzera

Motivated by the need for flexible and interpretable models to handle circular data, this paper introduces a semiparametric regression model for a circular response that can include both linear and circular covariates in its parametric and nonparametric components. Rather than imposing a particular parametric distribution on the error term, we adopt a circular quasi-likelihood function, which is useful when the underlying distribution is unknown. We discuss the asymptotic properties of the resulting estimators and a backfitting algorithm for model fitting. We evaluate the finite-sample performance of our proposal through simulations and illustrate its advantages for assessing the genetic effect on the migratory patterns of willow warblers. This offers new insights into how specific genomic elements can influence migratory behaviour.

由于需要灵活和可解释的模型来处理循环数据,本文介绍了一个循环响应的半参数回归模型,该模型可以在其参数和非参数分量中包含线性和循环协变量。我们没有在误差项上强加一个特定的参数分布,而是采用了一个圆形的拟似然函数,这在底层分布未知时很有用。讨论了所得估计量的渐近性质和模型拟合的反拟合算法。我们通过模拟评估了我们的建议的有限样本性能,并说明了它在评估柳莺迁徙模式的遗传影响方面的优势。这为特定的基因组元素如何影响迁徙行为提供了新的见解。
{"title":"Quasi-likelihood estimation for semiparametric circular regression models.","authors":"Anna Gottard, Andrea Meilán-Vila, Agnese Panzera","doi":"10.1093/biomtc/ujag002","DOIUrl":"https://doi.org/10.1093/biomtc/ujag002","url":null,"abstract":"<p><p>Motivated by the need for flexible and interpretable models to handle circular data, this paper introduces a semiparametric regression model for a circular response that can include both linear and circular covariates in its parametric and nonparametric components. Rather than imposing a particular parametric distribution on the error term, we adopt a circular quasi-likelihood function, which is useful when the underlying distribution is unknown. We discuss the asymptotic properties of the resulting estimators and a backfitting algorithm for model fitting. We evaluate the finite-sample performance of our proposal through simulations and illustrate its advantages for assessing the genetic effect on the migratory patterns of willow warblers. This offers new insights into how specific genomic elements can influence migratory behaviour.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"82 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inhomogeneous mark correlation functions for general marked point processes. 一般标记点过程的非齐次标记相关函数。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2026-01-06 DOI: 10.1093/biomtc/ujaf177
Mehdi Moradi, Matthias Eckardt

Spatial phenomena in environmental and biological contexts often involve events that are unevenly distributed across space, carrying attributes whose associations/variations are space-dependent. In this paper, we introduce the class of inhomogeneous mark correlation functions, which capture mark associations/variations while explicitly accounting for spatial inhomogeneity. The proposed functions quantify how, on average, marks vary or associate with one another as a function of pairwise spatial distances. We develop nonparametric estimators and evaluate their performance through simulation studies, covering a range of scenarios with mark association or variation, spanning from nonstationary point patterns without spatial interaction to patterns with clustering tendencies and sparse regions. Our simulations reveal the shortcomings of traditional methods under spatial inhomogeneity, underscoring the necessity of our approach. The results show that our estimators accurately identify both the positivity/negativity and the effective spatial range for detected mark associations/variations. Furthermore, we show that differences in how intensity is estimated generally have only a negligible influence on the empirical bias/variance of our proposed inhomogeneous mark correlation functions. The proposed inhomogeneous mark correlation functions are then applied to two distinct forest ecosystems: Longleaf pine trees in southern Georgia, USA, marked by their diameter at breast height, and Scots pine trees in Pfynwald, Switzerland, marked by their height. Our findings reveal that the inhomogeneous mark correlation functions provide more detailed insights into tree growth patterns compared to traditional methods.

环境和生物背景下的空间现象通常涉及在空间上不均匀分布的事件,其关联/变化具有空间依赖性。在本文中,我们引入了一类非齐次标记相关函数,它可以捕获标记关联/变化,同时显式地考虑空间非同质性。所提出的函数量化了标记作为成对空间距离的函数平均如何变化或相互关联。我们开发了非参数估计器,并通过模拟研究评估了它们的性能,涵盖了一系列具有显著关联或变化的场景,从没有空间相互作用的非平稳点模式到具有聚类趋势和稀疏区域的模式。我们的模拟揭示了传统方法在空间非均匀性下的不足,强调了我们的方法的必要性。结果表明,我们的估计器可以准确地识别出检测到的标记关联/变化的正/负性和有效空间范围。此外,我们表明,如何估计强度的差异通常对我们提出的非均匀标记相关函数的经验偏差/方差的影响可以忽略不计。然后将提出的非均匀标记相关函数应用于两个不同的森林生态系统:美国佐治亚州南部的长叶松,以其胸径高度标记,以及瑞士Pfynwald的苏格兰松,以其高度标记。我们的研究结果表明,与传统方法相比,非均匀标记相关函数可以更详细地了解树木的生长模式。
{"title":"Inhomogeneous mark correlation functions for general marked point processes.","authors":"Mehdi Moradi, Matthias Eckardt","doi":"10.1093/biomtc/ujaf177","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf177","url":null,"abstract":"<p><p>Spatial phenomena in environmental and biological contexts often involve events that are unevenly distributed across space, carrying attributes whose associations/variations are space-dependent. In this paper, we introduce the class of inhomogeneous mark correlation functions, which capture mark associations/variations while explicitly accounting for spatial inhomogeneity. The proposed functions quantify how, on average, marks vary or associate with one another as a function of pairwise spatial distances. We develop nonparametric estimators and evaluate their performance through simulation studies, covering a range of scenarios with mark association or variation, spanning from nonstationary point patterns without spatial interaction to patterns with clustering tendencies and sparse regions. Our simulations reveal the shortcomings of traditional methods under spatial inhomogeneity, underscoring the necessity of our approach. The results show that our estimators accurately identify both the positivity/negativity and the effective spatial range for detected mark associations/variations. Furthermore, we show that differences in how intensity is estimated generally have only a negligible influence on the empirical bias/variance of our proposed inhomogeneous mark correlation functions. The proposed inhomogeneous mark correlation functions are then applied to two distinct forest ecosystems: Longleaf pine trees in southern Georgia, USA, marked by their diameter at breast height, and Scots pine trees in Pfynwald, Switzerland, marked by their height. Our findings reveal that the inhomogeneous mark correlation functions provide more detailed insights into tree growth patterns compared to traditional methods.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"82 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146008434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating optimal dynamic treatment regimes with Gaussian process emulation. 用高斯过程仿真估计最优动态处理方案。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2026-01-06 DOI: 10.1093/biomtc/ujaf174
Daniel Rodriguez Duque, David A Stephens, Erica E M Moodie

Identifying dynamic treatment regimes (DTRs) is a key objective in precision medicine. Value search approaches, including (Bayesian) dynamic marginal structural models offer an attractive approach to estimation by mapping candidate regimes to their expected outcome. As parametric models for the expected outcomes may be mis-specified and lead to incorrect conclusions, a grid search over candidate DTRs has been proposed, but this may be computationally prohibitive and also subject to high uncertainty in the estimated value function. These inferential challenges can be addressed using Gaussian process ($mathcal {GP}$) optimization methods with estimators for the causal effect of adherence to a specified DTR. We demonstrate how to identify optimal DTRs using this approach in a variety of settings, including when the value function is multi-modal and show that the $mathcal {GP}$ modeling approach that recognizes noise in the estimated response surface leads to improved results as compared to a grid search approach. Further, we show that a grid search may not yield a robust solution and that it often utilizes information less efficiently than a $mathcal {GP}$ approach. The proposed approach is used to understand tailoring of HIV therapy to optimize CD4 cell counts.

确定动态治疗方案(DTRs)是精准医学的关键目标。价值搜索方法,包括(贝叶斯)动态边际结构模型,通过将候选制度映射到其预期结果,提供了一种有吸引力的估计方法。由于预期结果的参数模型可能被错误指定并导致不正确的结论,已经提出了对候选dtr的网格搜索,但这可能在计算上是禁止的,并且在估计的值函数中也受到高度不确定性的影响。这些推理挑战可以使用高斯过程($mathcal {GP}$)优化方法来解决,该方法具有遵守指定DTR的因果效应的估计器。我们演示了如何在各种设置中使用这种方法识别最佳dtr,包括当值函数是多模态时,并表明与网格搜索方法相比,识别估计响应面中的噪声的$mathcal {GP}$建模方法可以改善结果。此外,我们表明网格搜索可能不会产生健壮的解决方案,并且它通常比$mathcal {GP}$方法更有效地利用信息。所提出的方法是用来了解剪裁艾滋病毒治疗,以优化CD4细胞计数。
{"title":"Estimating optimal dynamic treatment regimes with Gaussian process emulation.","authors":"Daniel Rodriguez Duque, David A Stephens, Erica E M Moodie","doi":"10.1093/biomtc/ujaf174","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf174","url":null,"abstract":"<p><p>Identifying dynamic treatment regimes (DTRs) is a key objective in precision medicine. Value search approaches, including (Bayesian) dynamic marginal structural models offer an attractive approach to estimation by mapping candidate regimes to their expected outcome. As parametric models for the expected outcomes may be mis-specified and lead to incorrect conclusions, a grid search over candidate DTRs has been proposed, but this may be computationally prohibitive and also subject to high uncertainty in the estimated value function. These inferential challenges can be addressed using Gaussian process ($mathcal {GP}$) optimization methods with estimators for the causal effect of adherence to a specified DTR. We demonstrate how to identify optimal DTRs using this approach in a variety of settings, including when the value function is multi-modal and show that the $mathcal {GP}$ modeling approach that recognizes noise in the estimated response surface leads to improved results as compared to a grid search approach. Further, we show that a grid search may not yield a robust solution and that it often utilizes information less efficiently than a $mathcal {GP}$ approach. The proposed approach is used to understand tailoring of HIV therapy to optimize CD4 cell counts.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"82 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145984366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Non-boundary covariance matrix estimation in generalized linear mixed effects models using data augmentation priors. 基于数据增广先验的广义线性混合效应模型非边界协方差矩阵估计。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2026-01-06 DOI: 10.1093/biomtc/ujag013
Tina Košuta, Erik Langerholc, Rok Blagus

Boundary estimates of random effects covariance matrices commonly arise when using maximum likelihood (ML) estimation in generalized linear mixed effects models, leading to numerical challenges and affecting statistical inference. To mitigate this, we introduce penalties to the likelihood function derived from conditionally conjugate priors for the covariance or precision matrices of the random effects. Our choice of penalties (priors) allows representation through pseudo-observations, enabling implementation of the proposed penalized estimator within the existing ML software by augmenting the original data. We derive a procedure for constructing these pseudo-observations, a non-trivial task because their likelihood contribution must match the functional form of the penalty and depend only on the covariance or precision matrix of the random effects. Our method includes penalty parameters that can be set using existing prior knowledge or, when no reliable prior information is available, via a novel fully data-driven procedure that eliminates the need for prior specification. Through simulation studies under realistic scenarios, we illustrate that the proposed approach can provide improved estimates of random-effects covariance matrices compared with competing methods in the settings considered. The approach is further illustrated on real-world data.

在广义线性混合效应模型中使用最大似然(ML)估计时,通常会出现随机效应协方差矩阵的边界估计,导致数值挑战并影响统计推断。为了减轻这一点,我们引入惩罚的似然函数从条件共轭先验的协方差或精度矩阵的随机效应。我们选择的惩罚(先验)允许通过伪观察来表示,通过增加原始数据,可以在现有ML软件中实现所建议的惩罚估计器。我们推导了构造这些伪观测的程序,这是一项重要的任务,因为它们的似然贡献必须与惩罚的函数形式相匹配,并且仅依赖于随机效应的协方差或精度矩阵。我们的方法包括惩罚参数,可以使用现有的先验知识设置,或者,当没有可靠的先验信息可用时,通过一种新颖的完全数据驱动的程序,消除了对先验规范的需要。通过在实际场景下的模拟研究,我们表明,与竞争方法相比,所提出的方法可以提供更好的随机效应协方差矩阵估计。该方法在实际数据中得到进一步说明。
{"title":"Non-boundary covariance matrix estimation in generalized linear mixed effects models using data augmentation priors.","authors":"Tina Košuta, Erik Langerholc, Rok Blagus","doi":"10.1093/biomtc/ujag013","DOIUrl":"https://doi.org/10.1093/biomtc/ujag013","url":null,"abstract":"<p><p>Boundary estimates of random effects covariance matrices commonly arise when using maximum likelihood (ML) estimation in generalized linear mixed effects models, leading to numerical challenges and affecting statistical inference. To mitigate this, we introduce penalties to the likelihood function derived from conditionally conjugate priors for the covariance or precision matrices of the random effects. Our choice of penalties (priors) allows representation through pseudo-observations, enabling implementation of the proposed penalized estimator within the existing ML software by augmenting the original data. We derive a procedure for constructing these pseudo-observations, a non-trivial task because their likelihood contribution must match the functional form of the penalty and depend only on the covariance or precision matrix of the random effects. Our method includes penalty parameters that can be set using existing prior knowledge or, when no reliable prior information is available, via a novel fully data-driven procedure that eliminates the need for prior specification. Through simulation studies under realistic scenarios, we illustrate that the proposed approach can provide improved estimates of random-effects covariance matrices compared with competing methods in the settings considered. The approach is further illustrated on real-world data.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"82 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146140881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1