首页 > 最新文献

Biometrics最新文献

英文 中文
Inference on age-specific fertility in ecology and evolution. Learning from other disciplines and improving the state of the art. 生态学和进化中年龄生育率的推论。向其他学科学习,提高技术水平。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf081
Fernando Colchero

Despite the importance of age-specific fertility for ecology and evolution, the methods for modeling and inference have proven considerably limited. However, other disciplines have long focused on exploring and developing a vast number of models. Here, I provide an overview of the different models proposed since the 1940s by formal demographers, statisticians, and social scientists, most of which are unknown to the ecological and evolutionary communities. I describe how these fall into 2 main categories, namely polynomials and those based on probability density functions. I discuss their merits in terms of their overall behavior and how well they represent the different stages of fertility. Despite many alternative models, inference on age-specific fertility has usually been limited to simple least squares. Although this might be sufficient for human data, I hope to demonstrate that inference requires more sophisticated approaches for ecological and evolutionary datasets. To illustrate how inference and model choice can be achieved on different types of typical ecological and evolutionary data, I present the new R package Bayesian Fertility Trajectory Analysis, which I apply to published aggregated data for lions and baboons. I then conduct a simulation study to test its performance on individual-level data. I show that appropriate inference and model selection can be achieved even when a small number of parents are followed.

尽管特定年龄的生育能力对生态学和进化具有重要意义,但建模和推理的方法已被证明相当有限。然而,其他学科长期以来一直专注于探索和开发大量的模型。在这里,我概述了自20世纪40年代以来由正式的人口统计学家、统计学家和社会科学家提出的不同模型,其中大多数模型尚不为生态和进化社区所知。我描述了它们如何分为两大类,即多项式和基于概率密度函数的多项式。我从它们的整体行为以及它们如何很好地代表生育力的不同阶段来讨论它们的优点。尽管有许多可供选择的模型,但对特定年龄生育率的推断通常仅限于简单的最小二乘。虽然这对于人类数据来说可能已经足够了,但我希望证明,对于生态和进化数据集,推理需要更复杂的方法。为了说明如何在不同类型的典型生态和进化数据上实现推理和模型选择,我介绍了新的R包贝叶斯生育轨迹分析,我将其应用于已发表的狮子和狒狒的汇总数据。然后,我进行了模拟研究,以测试其在个人层面数据上的性能。我表明,适当的推理和模型选择可以实现,即使少数父母被跟踪。
{"title":"Inference on age-specific fertility in ecology and evolution. Learning from other disciplines and improving the state of the art.","authors":"Fernando Colchero","doi":"10.1093/biomtc/ujaf081","DOIUrl":"10.1093/biomtc/ujaf081","url":null,"abstract":"<p><p>Despite the importance of age-specific fertility for ecology and evolution, the methods for modeling and inference have proven considerably limited. However, other disciplines have long focused on exploring and developing a vast number of models. Here, I provide an overview of the different models proposed since the 1940s by formal demographers, statisticians, and social scientists, most of which are unknown to the ecological and evolutionary communities. I describe how these fall into 2 main categories, namely polynomials and those based on probability density functions. I discuss their merits in terms of their overall behavior and how well they represent the different stages of fertility. Despite many alternative models, inference on age-specific fertility has usually been limited to simple least squares. Although this might be sufficient for human data, I hope to demonstrate that inference requires more sophisticated approaches for ecological and evolutionary datasets. To illustrate how inference and model choice can be achieved on different types of typical ecological and evolutionary data, I present the new R package Bayesian Fertility Trajectory Analysis, which I apply to published aggregated data for lions and baboons. I then conduct a simulation study to test its performance on individual-level data. I show that appropriate inference and model selection can be achieved even when a small number of parents are followed.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144706183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Precision generalized phase I-II designs. 精密广义I-II期设计。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf043
Saijun Zhao, Peter F Thall, Ying Yuan, Juhee Lee, Pavlos Msaouel, Yong Zang

A new family of precision Bayesian dose optimization designs, PGen I-II, based on early efficacy, early toxicity, and long-term time to treatment failure is proposed. A PGen I-II design refines a Gen I-II design by accounting for patient heterogeneity characterized by subgroups that may be defined by prognostic levels, disease subtypes, or biomarker categories. The design makes subgroup-specific decisions, which may be to drop an unacceptably toxic or inefficacious dose, randomize patients among acceptable doses, or identify a best dose in terms of treatment success defined in terms of time to failure over long-term follow-up. A piecewise exponential distribution for failure time is assumed, including subgroup-specific effects of dose, response, and toxicity. Latent variables are used to adaptively cluster subgroups found to have similar dose-outcome distributions, with the model simplified to borrow strength between subgroups in the same cluster. Guidelines and user-friendly computer software for implementing the design are provided. A simulation study is reported that shows the PGen I-II design is superior to similarly structured designs that either assume patient homogeneity or conduct separate trials within subgroups.

提出了一种新的精确贝叶斯剂量优化设计,PGen I-II,基于早期疗效,早期毒性和治疗失败的长期时间。PGen I-II设计通过考虑以预后水平、疾病亚型或生物标志物类别定义的亚组为特征的患者异质性,完善了Gen I-II设计。该设计针对亚组做出特定的决定,可能是减少不可接受的毒性或无效剂量,将患者随机分配到可接受的剂量中,或根据长期随访中从失败到治疗成功的时间确定最佳剂量。假设失效时间呈分段指数分布,包括剂量、反应和毒性的亚组特异性效应。潜在变量用于自适应地聚类具有相似剂量-结果分布的子组,并简化模型以借用同一聚类中子组之间的强度。提供了实施设计的指引和用户友好的计算机软件。据报道,一项模拟研究表明,PGen I-II设计优于类似的结构设计,即假设患者同质性或在亚组内进行单独的试验。
{"title":"Precision generalized phase I-II designs.","authors":"Saijun Zhao, Peter F Thall, Ying Yuan, Juhee Lee, Pavlos Msaouel, Yong Zang","doi":"10.1093/biomtc/ujaf043","DOIUrl":"10.1093/biomtc/ujaf043","url":null,"abstract":"<p><p>A new family of precision Bayesian dose optimization designs, PGen I-II, based on early efficacy, early toxicity, and long-term time to treatment failure is proposed. A PGen I-II design refines a Gen I-II design by accounting for patient heterogeneity characterized by subgroups that may be defined by prognostic levels, disease subtypes, or biomarker categories. The design makes subgroup-specific decisions, which may be to drop an unacceptably toxic or inefficacious dose, randomize patients among acceptable doses, or identify a best dose in terms of treatment success defined in terms of time to failure over long-term follow-up. A piecewise exponential distribution for failure time is assumed, including subgroup-specific effects of dose, response, and toxicity. Latent variables are used to adaptively cluster subgroups found to have similar dose-outcome distributions, with the model simplified to borrow strength between subgroups in the same cluster. Guidelines and user-friendly computer software for implementing the design are provided. A simulation study is reported that shows the PGen I-II design is superior to similarly structured designs that either assume patient homogeneity or conduct separate trials within subgroups.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12288667/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144706184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Doubly robust nonparametric estimators of the predictive value of covariates for survival data. 生存数据协变量预测值的双鲁棒非参数估计。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf084
Torben Martinussen, Mark J van der Laan

The predictive value of a covariate is often of interest in studies with a survival endpoint. A common situation is that there are some well established predictors and a potential valuable new marker. The challenge is how to judge the potentially added predictive value of this new marker. We propose to use the positive predictive value (PPV) curve based on a nonparametric scoring rule. The estimand of interest is viewed as a single transformation of the underlying data generating probability measure, which allows us to develop a robust nonparametric estimator of the PPV by first calculating the corresponding efficient influence function. We provide asymptotic results and illustrate the approach with numerical studies and with 2 cancer data studies.

在有生存终点的研究中,协变量的预测值通常很重要。一种常见的情况是,有一些很好的预测因素和一个潜在的有价值的新标记。挑战在于如何判断这种新标记物潜在的附加预测价值。我们建议使用基于非参数评分规则的正预测值(PPV)曲线。感兴趣的估计被视为生成概率度量的底层数据的单一变换,这使我们能够通过首先计算相应的有效影响函数来开发PPV的鲁棒非参数估计。我们提供了渐近的结果,并用数值研究和2个癌症数据研究说明了该方法。
{"title":"Doubly robust nonparametric estimators of the predictive value of covariates for survival data.","authors":"Torben Martinussen, Mark J van der Laan","doi":"10.1093/biomtc/ujaf084","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf084","url":null,"abstract":"<p><p>The predictive value of a covariate is often of interest in studies with a survival endpoint. A common situation is that there are some well established predictors and a potential valuable new marker. The challenge is how to judge the potentially added predictive value of this new marker. We propose to use the positive predictive value (PPV) curve based on a nonparametric scoring rule. The estimand of interest is viewed as a single transformation of the underlying data generating probability measure, which allows us to develop a robust nonparametric estimator of the PPV by first calculating the corresponding efficient influence function. We provide asymptotic results and illustrate the approach with numerical studies and with 2 cancer data studies.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144697476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: Evaluating the effects of high-throughput structural neuroimaging predictors on whole-brain functional connectome outcomes via network-based matrix-on-vector regression. 修正:通过基于网络的矩阵向量回归评估高通量结构神经成像预测因子对全脑功能连接组结果的影响。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf111
{"title":"Correction to: Evaluating the effects of high-throughput structural neuroimaging predictors on whole-brain functional connectome outcomes via network-based matrix-on-vector regression.","authors":"","doi":"10.1093/biomtc/ujaf111","DOIUrl":"10.1093/biomtc/ujaf111","url":null,"abstract":"","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12341978/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144833843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regression analysis of interval-censored failure time data with change points and a cured subgroup. 具有变化点和修复子群的间隔截尾失效时间数据的回归分析。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf100
Yichen Lou, Mingyue Du, Xinyuan Song

There exists a substantial body of literature that discusses regression analysis of interval-censored failure time data and also many methods have been proposed for handling the presence of a cured subgroup. However, only limited research exists on the problems incorporating change points, with or without a cured subgroup, which can occur in various contexts such as clinical trials where disease risks may shift dramatically when certain biological indicators exceed specific thresholds. To fill this gap, we consider a class of partly linear transformation models within the mixture cure model framework and propose a sieve maximum likelihood estimation approach using Bernstein polynomials and piecewise linear functions for inference. Additionally, we provide a data-driven adaptive procedure to identify the number and locations of change points and establish the asymptotic properties of the proposed method. Extensive simulation studies demonstrate the effectiveness and practical utility of the proposed methods, which are applied to the real data from a breast cancer study that motivated this work.

有大量的文献讨论了间隔截短失效时间数据的回归分析,也提出了许多方法来处理治愈亚群的存在。然而,只有有限的研究存在纳入变化点的问题,有或没有治愈亚组,这可能发生在各种情况下,如临床试验中,当某些生物指标超过特定阈值时,疾病风险可能会发生巨大变化。为了填补这一空白,我们考虑了混合模型框架内的一类部分线性变换模型,并提出了一种使用Bernstein多项式和分段线性函数进行推理的筛极大似然估计方法。此外,我们提供了一个数据驱动的自适应过程来识别变化点的数量和位置,并建立了所提出方法的渐近性质。大量的模拟研究证明了所提出方法的有效性和实用性,并将其应用于激发这项工作的乳腺癌研究的真实数据。
{"title":"Regression analysis of interval-censored failure time data with change points and a cured subgroup.","authors":"Yichen Lou, Mingyue Du, Xinyuan Song","doi":"10.1093/biomtc/ujaf100","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf100","url":null,"abstract":"<p><p>There exists a substantial body of literature that discusses regression analysis of interval-censored failure time data and also many methods have been proposed for handling the presence of a cured subgroup. However, only limited research exists on the problems incorporating change points, with or without a cured subgroup, which can occur in various contexts such as clinical trials where disease risks may shift dramatically when certain biological indicators exceed specific thresholds. To fill this gap, we consider a class of partly linear transformation models within the mixture cure model framework and propose a sieve maximum likelihood estimation approach using Bernstein polynomials and piecewise linear functions for inference. Additionally, we provide a data-driven adaptive procedure to identify the number and locations of change points and establish the asymptotic properties of the proposed method. Extensive simulation studies demonstrate the effectiveness and practical utility of the proposed methods, which are applied to the real data from a breast cancer study that motivated this work.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144833845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatially aware adjusted Rand index for evaluating spatial transcriptomics clustering. 空间感知调整Rand指数评估空间转录组聚类。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf127
Yinqiao Yan, Xiangnan Feng, Xiangyu Luo

The spatial transcriptomics (ST) clustering plays a crucial role in elucidating the tissue spatial heterogeneity. An accurate ST clustering result can greatly benefit downstream biological analyses. As various ST clustering approaches are proposed in recent years, comparing their clustering accuracy becomes important in benchmarking studies. However, the widely used metric, adjusted Rand index (ARI), totally ignores the spatial information in ST data, which prevents ARI from fully evaluating spatial ST clustering methods. We propose a spatially aware Rand index (spRI) as well as spatially aware adjusted Rand index (spARI) that incorporate the spatial distance information. Specifically, when comparing two partitions, spRI provides a disagreement object pair with a weight relying on the distance of the two objects, whereas Rand index assigns a zero weight to it. This spatially aware feature of spRI adaptively differentiates disagreement object pairs based on their distinct distances, providing a useful evaluation metric that favors spatial coherence of clustering. The spARI is obtained by adjusting spRI for random chances such that its expectation takes zero under an appropriate null model. Statistical properties of spRI and spARI are discussed. The applications to simulation study and two ST datasets demonstrate the improved utilities of spARI compared to ARI in evaluating ST clustering methods.

空间转录组学(ST)聚类在阐明组织空间异质性中起着至关重要的作用。准确的ST聚类结果可以极大地有利于下游生物分析。由于近年来提出了各种ST聚类方法,比较它们的聚类精度在基准测试研究中变得非常重要。然而,目前广泛使用的指标调整Rand指数(ARI)完全忽略了ST数据中的空间信息,这使得ARI无法充分评价空间ST聚类方法。本文提出了包含空间距离信息的空间感知兰德指数(spRI)和空间感知调整兰德指数(spARI)。具体来说,在比较两个分区时,spRI提供了一个不一致的对象对,其权重依赖于两个对象的距离,而Rand索引为其分配了一个零权重。spRI的这种空间感知特征基于不同的距离自适应区分不同的目标对,提供了一个有用的评价指标,有利于聚类的空间一致性。通过根据随机机会调整spRI,使其期望在适当的零模型下为零,从而获得spARI。讨论了spRI和spARI的统计性质。模拟研究和两个ST数据集的应用表明,与ARI相比,spARI在评估ST聚类方法方面的效用有所提高。
{"title":"Spatially aware adjusted Rand index for evaluating spatial transcriptomics clustering.","authors":"Yinqiao Yan, Xiangnan Feng, Xiangyu Luo","doi":"10.1093/biomtc/ujaf127","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf127","url":null,"abstract":"<p><p>The spatial transcriptomics (ST) clustering plays a crucial role in elucidating the tissue spatial heterogeneity. An accurate ST clustering result can greatly benefit downstream biological analyses. As various ST clustering approaches are proposed in recent years, comparing their clustering accuracy becomes important in benchmarking studies. However, the widely used metric, adjusted Rand index (ARI), totally ignores the spatial information in ST data, which prevents ARI from fully evaluating spatial ST clustering methods. We propose a spatially aware Rand index (spRI) as well as spatially aware adjusted Rand index (spARI) that incorporate the spatial distance information. Specifically, when comparing two partitions, spRI provides a disagreement object pair with a weight relying on the distance of the two objects, whereas Rand index assigns a zero weight to it. This spatially aware feature of spRI adaptively differentiates disagreement object pairs based on their distinct distances, providing a useful evaluation metric that favors spatial coherence of clustering. The spARI is obtained by adjusting spRI for random chances such that its expectation takes zero under an appropriate null model. Statistical properties of spRI and spARI are discussed. The applications to simulation study and two ST datasets demonstrate the improved utilities of spARI compared to ARI in evaluating ST clustering methods.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145147550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Covariance-on-covariance regression. Covariance-on-covariance回归。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf097
Yi Zhao, Yize Zhao

A covariance-on-covariance regression model is introduced in this manuscript. It is assumed that there exists (at least) a pair of linear projections on outcome covariance matrices and predictor covariance matrices such that a log-linear model links the variances in the projection spaces, as well as additional covariates of interest. An ordinary least square type of estimator is proposed to simultaneously identify the projections and estimate model coefficients. Under regularity conditions, the proposed estimator is asymptotically consistent. The superior performance of the proposed approach over existing methods is demonstrated via simulation studies. Applying to data collected in the Human Connectome Project Aging study, the proposed approach identifies 3 pairs of brain networks, where functional connectivity within the resting-state network predicts functional connectivity within the corresponding task-state network. The 3 networks correspond to a global signal network, a task-related network, and a task-unrelated network. The findings are consistent with existing knowledge about brain function.

本文介绍了协方差-对协方差回归模型。假设在结果协方差矩阵和预测协方差矩阵上存在(至少)一对线性投影,使得对数线性模型将投影空间中的方差以及其他感兴趣的协变量联系起来。提出了一种普通最小二乘估计量,用于同时识别投影和估计模型系数。在正则性条件下,所提出的估计量是渐近一致的。通过仿真研究证明了该方法优于现有方法的性能。应用于人类连接组项目衰老研究中收集的数据,提出的方法确定了3对大脑网络,其中静息状态网络中的功能连接预测了相应任务状态网络中的功能连接。这3个网络分别对应一个全局信号网络、一个任务相关网络和一个任务无关网络。这些发现与现有的大脑功能知识是一致的。
{"title":"Covariance-on-covariance regression.","authors":"Yi Zhao, Yize Zhao","doi":"10.1093/biomtc/ujaf097","DOIUrl":"10.1093/biomtc/ujaf097","url":null,"abstract":"<p><p>A covariance-on-covariance regression model is introduced in this manuscript. It is assumed that there exists (at least) a pair of linear projections on outcome covariance matrices and predictor covariance matrices such that a log-linear model links the variances in the projection spaces, as well as additional covariates of interest. An ordinary least square type of estimator is proposed to simultaneously identify the projections and estimate model coefficients. Under regularity conditions, the proposed estimator is asymptotically consistent. The superior performance of the proposed approach over existing methods is demonstrated via simulation studies. Applying to data collected in the Human Connectome Project Aging study, the proposed approach identifies 3 pairs of brain networks, where functional connectivity within the resting-state network predicts functional connectivity within the corresponding task-state network. The 3 networks correspond to a global signal network, a task-related network, and a task-unrelated network. The findings are consistent with existing knowledge about brain function.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12312406/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144752243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse 2-stage Bayesian meta-analysis for individualized treatments. 个性化治疗的稀疏2阶段贝叶斯荟萃分析。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf082
Junwei Shen, Erica E M Moodie, Shirin Golchi

Individualized treatment rules tailor treatments to patients based on clinical, demographic, and other characteristics. Estimation of individualized treatment rules requires the identification of individuals who benefit most from the particular treatments and thus the detection of variability in treatment effects. To develop an effective individualized treatment rule, data from multisite studies may be required due to the low power provided by smaller datasets for detecting the often small treatment-covariate interactions. However, sharing of individual-level data is sometimes constrained. Furthermore, sparsity may arise in 2 senses: different data sites may recruit from different populations, making it infeasible to estimate identical models or all parameters of interest at all sites, and the number of non-zero parameters in the model for the treatment rule may be small. To address these issues, we adopt a 2-stage Bayesian meta-analysis approach to estimate individualized treatment rules which optimize expected patient outcomes using multisite data without disclosing individual-level data beyond the sites. Simulation results demonstrate that our approach can provide consistent estimates of the parameters which fully characterize the optimal individualized treatment rule. We estimate the optimal Warfarin dose strategy using data from the International Warfarin Pharmacogenetics Consortium, where data sparsity and small treatment-covariate interaction effects pose additional statistical challenges.

个体化治疗规则根据临床、人口统计学和其他特征为患者量身定制治疗方案。估计个体化治疗规则需要确定从特定治疗中获益最多的个体,从而检测治疗效果的可变性。为了制定有效的个体化治疗规则,可能需要来自多地点研究的数据,因为较小的数据集用于检测通常较小的治疗-协变量相互作用的能力较低。然而,个人层面数据的共享有时受到限制。此外,稀疏性可能在两种意义上产生:不同的数据点可能从不同的种群中招募,使得在所有地点估计相同的模型或所有感兴趣的参数是不可行的,并且模型中用于处理规则的非零参数的数量可能很小。为了解决这些问题,我们采用两阶段贝叶斯荟萃分析方法来估计个性化治疗规则,该规则使用多站点数据优化患者预期结果,而不会泄露超出站点的个人水平数据。仿真结果表明,我们的方法可以提供一致的参数估计,充分表征了最优的个性化治疗规则。我们使用来自国际华法林药物遗传学协会的数据来估计最佳华法林剂量策略,其中数据稀疏和小治疗-协变量相互作用效应带来了额外的统计挑战。
{"title":"Sparse 2-stage Bayesian meta-analysis for individualized treatments.","authors":"Junwei Shen, Erica E M Moodie, Shirin Golchi","doi":"10.1093/biomtc/ujaf082","DOIUrl":"10.1093/biomtc/ujaf082","url":null,"abstract":"<p><p>Individualized treatment rules tailor treatments to patients based on clinical, demographic, and other characteristics. Estimation of individualized treatment rules requires the identification of individuals who benefit most from the particular treatments and thus the detection of variability in treatment effects. To develop an effective individualized treatment rule, data from multisite studies may be required due to the low power provided by smaller datasets for detecting the often small treatment-covariate interactions. However, sharing of individual-level data is sometimes constrained. Furthermore, sparsity may arise in 2 senses: different data sites may recruit from different populations, making it infeasible to estimate identical models or all parameters of interest at all sites, and the number of non-zero parameters in the model for the treatment rule may be small. To address these issues, we adopt a 2-stage Bayesian meta-analysis approach to estimate individualized treatment rules which optimize expected patient outcomes using multisite data without disclosing individual-level data beyond the sites. Simulation results demonstrate that our approach can provide consistent estimates of the parameters which fully characterize the optimal individualized treatment rule. We estimate the optimal Warfarin dose strategy using data from the International Warfarin Pharmacogenetics Consortium, where data sparsity and small treatment-covariate interaction effects pose additional statistical challenges.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12288668/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144706198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating associations between cumulative exposure and health via generalized distributed lag non-linear models using penalized splines. 利用惩罚样条的广义分布滞后非线性模型估计累积暴露与健康之间的关系。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf116
Tianyi Pan, Hwashin Hyun Shin, Glen McGee, Alex Stringer

Quantifying associations between short-term exposure to ambient air pollution and health outcomes is an important public health priority. Many studies have investigated the association considering delayed effects within the past few days. Adaptive cumulative exposure distributed lag non-linear models (ACE-DLNMs) quantify associations between health outcomes and cumulative exposure that is specified in a data-adaptive way. While the ACE-DLNM framework is highly interpretable, it is limited to continuous outcomes and does not scale well to large datasets. Motivated by a large analysis of daily pollution and respiratory hospitalization counts in Canada between 2001 and 2018, we propose a generalized ACE-DLNM incorporating penalized splines, improving upon existing ACE-DLNM methods to accommodate general response types. We then develop a computationally efficient estimation strategy based on profile likelihood and Laplace approximate marginal likelihood with Newton-type methods. We demonstrate the performance and practical advantages of the proposed method through simulations. In application to the motivating analysis, the proposed method yields more stable inferences compared to generalized additive models with fixed exposures, while retaining interpretability.

量化短期暴露于环境空气污染与健康结果之间的关系是一项重要的公共卫生优先事项。许多研究调查了过去几天内考虑延迟效应的关联。自适应累积暴露分布滞后非线性模型(ACE-DLNMs)量化健康结果与以数据自适应方式指定的累积暴露之间的关联。虽然ACE-DLNM框架具有高度的可解释性,但它仅限于连续的结果,不能很好地扩展到大型数据集。受2001年至2018年间加拿大每日污染和呼吸住院数的大量分析的启发,我们提出了一种包含惩罚样条的广义ACE-DLNM方法,改进了现有的ACE-DLNM方法,以适应一般的反应类型。然后,我们利用牛顿型方法开发了基于轮廓似然和拉普拉斯近似边际似然的计算效率估计策略。通过仿真验证了该方法的性能和实用优势。在应用于激励分析时,与固定暴露的广义加性模型相比,该方法产生更稳定的推断,同时保持可解释性。
{"title":"Estimating associations between cumulative exposure and health via generalized distributed lag non-linear models using penalized splines.","authors":"Tianyi Pan, Hwashin Hyun Shin, Glen McGee, Alex Stringer","doi":"10.1093/biomtc/ujaf116","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf116","url":null,"abstract":"<p><p>Quantifying associations between short-term exposure to ambient air pollution and health outcomes is an important public health priority. Many studies have investigated the association considering delayed effects within the past few days. Adaptive cumulative exposure distributed lag non-linear models (ACE-DLNMs) quantify associations between health outcomes and cumulative exposure that is specified in a data-adaptive way. While the ACE-DLNM framework is highly interpretable, it is limited to continuous outcomes and does not scale well to large datasets. Motivated by a large analysis of daily pollution and respiratory hospitalization counts in Canada between 2001 and 2018, we propose a generalized ACE-DLNM incorporating penalized splines, improving upon existing ACE-DLNM methods to accommodate general response types. We then develop a computationally efficient estimation strategy based on profile likelihood and Laplace approximate marginal likelihood with Newton-type methods. We demonstrate the performance and practical advantages of the proposed method through simulations. In application to the motivating analysis, the proposed method yields more stable inferences compared to generalized additive models with fixed exposures, while retaining interpretability.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145091086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian inference for copy number intra-tumoral heterogeneity from single-cell RNA-sequencing data. 单细胞rna测序数据对拷贝数肿瘤内异质性的贝叶斯推断。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-07-03 DOI: 10.1093/biomtc/ujaf115
PuXue Qiao, Chun Fung Kwok, Guoqi Qian, Davis J McCarthy

Copy number alterations (CNA) are important drivers and markers of clonal structures within tumors. Understanding these structures at single-cell resolution is crucial to advancing cancer treatments. The objective is to cluster single cells into clones and identify CNA events in each clone. Early attempts often sacrifice the intrinsic link between cell clustering and clonal CNA detection for simplicity and rely heavily on human input for critical parameters such as the number of clones. Here, we develop a Bayesian model to utilize single-cell RNA sequencing (scRNA-seq) data for automatic analysis of intra-tumoral clonal structure concerning CNAs, without reliance on prior knowledge. The model clusters cells into sub-tumoral clones, identifies the number of clones, and simultaneously infers the clonal CNA profiles. It synergistically incorporates input from gene expression and germline single-nucleotide polymorphisms. A Gibbs sampling algorithm has been implemented and is available as an R package Chloris. We demonstrate that our new method compares strongly against existing software tools in terms of both cell clustering and CNA profile identification accuracy. Application to human metastatic melanoma and anaplastic thyroid tumor data demonstrates accurate clustering of tumor and non-tumor cells and reveals clonal CNA profiles that highlight functional gene expression differences between clones from the same tumor.

拷贝数改变(Copy number change, CNA)是肿瘤克隆结构的重要驱动因素和标志。在单细胞分辨率上理解这些结构对于推进癌症治疗至关重要。目的是将单个细胞聚类成克隆,并确定每个克隆中的CNA事件。早期的尝试往往为了简单而牺牲细胞聚类和克隆CNA检测之间的内在联系,并且严重依赖于人类输入关键参数,如克隆数量。在这里,我们开发了一个贝叶斯模型,利用单细胞RNA测序(scRNA-seq)数据自动分析肿瘤内克隆结构,而不依赖于先验知识。该模型将细胞聚集成亚肿瘤克隆,识别克隆数量,同时推断克隆CNA谱。它协同整合了基因表达和种系单核苷酸多态性的输入。吉布斯采样算法已经实现,并可作为一个R包氯气。我们证明了我们的新方法在细胞聚类和CNA轮廓识别精度方面与现有的软件工具有很强的对比。应用于人类转移性黑色素瘤和间变性甲状腺肿瘤数据证实了肿瘤和非肿瘤细胞的准确聚类,揭示了克隆CNA谱,突出了来自同一肿瘤的克隆之间功能基因表达的差异。
{"title":"Bayesian inference for copy number intra-tumoral heterogeneity from single-cell RNA-sequencing data.","authors":"PuXue Qiao, Chun Fung Kwok, Guoqi Qian, Davis J McCarthy","doi":"10.1093/biomtc/ujaf115","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf115","url":null,"abstract":"<p><p>Copy number alterations (CNA) are important drivers and markers of clonal structures within tumors. Understanding these structures at single-cell resolution is crucial to advancing cancer treatments. The objective is to cluster single cells into clones and identify CNA events in each clone. Early attempts often sacrifice the intrinsic link between cell clustering and clonal CNA detection for simplicity and rely heavily on human input for critical parameters such as the number of clones. Here, we develop a Bayesian model to utilize single-cell RNA sequencing (scRNA-seq) data for automatic analysis of intra-tumoral clonal structure concerning CNAs, without reliance on prior knowledge. The model clusters cells into sub-tumoral clones, identifies the number of clones, and simultaneously infers the clonal CNA profiles. It synergistically incorporates input from gene expression and germline single-nucleotide polymorphisms. A Gibbs sampling algorithm has been implemented and is available as an R package Chloris. We demonstrate that our new method compares strongly against existing software tools in terms of both cell clustering and CNA profile identification accuracy. Application to human metastatic melanoma and anaplastic thyroid tumor data demonstrates accurate clustering of tumor and non-tumor cells and reveals clonal CNA profiles that highlight functional gene expression differences between clones from the same tumor.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144940997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1