首页 > 最新文献

Biometrics最新文献

英文 中文
Large row-constrained supersaturated designs for high-throughput screening. 用于高通量筛选的大行约束过饱和设计。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf160
Byran J Smucker, Stephen E Wright, Isaac Williams, Richard C Page, Andor J Kiss, Surendra Bikram Silwal, Maria Weese, David J Edwards

High-throughput screening, in which large numbers of compounds are traditionally studied one-at-a-time in multiwell plates against specific targets, is widely used across many areas of the biological sciences, including drug discovery. To improve the effectiveness of these screens, we propose a new class of supersaturated designs that guide the construction of pools of compounds in each well. Because the size of the pools is typically limited by the particular application, the new designs accommodate this constraint and are part of a larger procedure that we call Constrained Row Screening or CRowS. We develop an efficient computational procedure to construct the CRowS designs, provide some initial lower bounds on the average squared off-diagonal values of their main-effects information matrix, and study the impact of the constraint on design quality. We also show via simulation that CRowS is statistically superior to the traditional one-compound-one-well approach as well as an existing pooling method, and demonstrate the use of the new methodology on a Verona Integron-encoded Metallo-$beta$-lactamase-2 assay.

传统的高通量筛选方法是在多孔板上针对特定靶点一次对大量化合物进行研究,这种方法被广泛应用于生物科学的许多领域,包括药物发现。为了提高这些筛管的有效性,我们提出了一类新的过饱和设计,可以指导每口井中化合物池的构建。由于池的大小通常受到特定应用程序的限制,因此新的设计适应了这一约束,并且是我们称为约束行筛选(Constrained Row Screening, CRowS)的更大过程的一部分。我们开发了一种高效的计算程序来构建CRowS设计,给出了其主效应信息矩阵的非对角线平均平方值的初始下界,并研究了约束对设计质量的影响。我们还通过模拟表明,CRowS在统计上优于传统的一化合物一井方法以及现有的池化方法,并演示了新方法在维罗纳整合子编码的金属- β -内酰胺酶-2分析中的应用。
{"title":"Large row-constrained supersaturated designs for high-throughput screening.","authors":"Byran J Smucker, Stephen E Wright, Isaac Williams, Richard C Page, Andor J Kiss, Surendra Bikram Silwal, Maria Weese, David J Edwards","doi":"10.1093/biomtc/ujaf160","DOIUrl":"10.1093/biomtc/ujaf160","url":null,"abstract":"<p><p>High-throughput screening, in which large numbers of compounds are traditionally studied one-at-a-time in multiwell plates against specific targets, is widely used across many areas of the biological sciences, including drug discovery. To improve the effectiveness of these screens, we propose a new class of supersaturated designs that guide the construction of pools of compounds in each well. Because the size of the pools is typically limited by the particular application, the new designs accommodate this constraint and are part of a larger procedure that we call Constrained Row Screening or CRowS. We develop an efficient computational procedure to construct the CRowS designs, provide some initial lower bounds on the average squared off-diagonal values of their main-effects information matrix, and study the impact of the constraint on design quality. We also show via simulation that CRowS is statistically superior to the traditional one-compound-one-well approach as well as an existing pooling method, and demonstrate the use of the new methodology on a Verona Integron-encoded Metallo-$beta$-lactamase-2 assay.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12696866/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145720530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical inference on high-dimensional covariate-dependent Gaussian graphical regressions. 高维协变量相关高斯图形回归的统计推断。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf165
Xuran Meng, Jingfei Zhang, Yi Li

In many genomic studies, gene co-expression graphs are influenced by subject-level covariates like single nucleotide polymorphisms. Traditional Gaussian graphical models ignore these covariates and estimate only population-level networks, potentially masking important heterogeneity. Covariate-dependent Gaussian graphical regressions address this limitation by regressing the precision matrix on covariates, thereby modeling how graph structures vary with high-dimensional subject-specific covariates. To fit the model, we adopt a multi-task learning approach that achieves lower error rates than node-wise regressions. Yet, the important problem of statistical inference in this setting remains largely unexplored. We propose a class of debiased estimators based on multi-task learners, which can be computed quickly and separately. In a key step, we introduce a novel projection technique for estimating the inverse covariance matrix, reducing optimization costs to scale with the sample size n. Our debiased estimators achieve fast convergence and asymptotic normality, enabling valid inference. Simulations demonstrate the utility of the method, and an application to a brain cancer gene-expression dataset reveals meaningful biological relationships.

在许多基因组研究中,基因共表达图受到受试者水平协变量(如单核苷酸多态性)的影响。传统的高斯图形模型忽略了这些协变量,只估计人口水平的网络,潜在地掩盖了重要的异质性。协变量相关的高斯图形回归通过在协变量上回归精度矩阵来解决这一限制,从而建模图结构如何随高维特定主题的协变量而变化。为了拟合模型,我们采用了一种多任务学习方法,它比节点智能回归的错误率更低。然而,在这种情况下,统计推断的重要问题在很大程度上仍未得到探索。我们提出了一种基于多任务学习器的去偏估计器,它可以快速且独立地计算。在关键步骤中,我们引入了一种新的投影技术来估计逆协方差矩阵,减少了随样本量n缩放的优化成本。我们的去偏估计器实现了快速收敛和渐近正态性,从而实现了有效的推理。模拟证明了该方法的实用性,并将其应用于脑癌基因表达数据集,揭示了有意义的生物学关系。
{"title":"Statistical inference on high-dimensional covariate-dependent Gaussian graphical regressions.","authors":"Xuran Meng, Jingfei Zhang, Yi Li","doi":"10.1093/biomtc/ujaf165","DOIUrl":"10.1093/biomtc/ujaf165","url":null,"abstract":"<p><p>In many genomic studies, gene co-expression graphs are influenced by subject-level covariates like single nucleotide polymorphisms. Traditional Gaussian graphical models ignore these covariates and estimate only population-level networks, potentially masking important heterogeneity. Covariate-dependent Gaussian graphical regressions address this limitation by regressing the precision matrix on covariates, thereby modeling how graph structures vary with high-dimensional subject-specific covariates. To fit the model, we adopt a multi-task learning approach that achieves lower error rates than node-wise regressions. Yet, the important problem of statistical inference in this setting remains largely unexplored. We propose a class of debiased estimators based on multi-task learners, which can be computed quickly and separately. In a key step, we introduce a novel projection technique for estimating the inverse covariance matrix, reducing optimization costs to scale with the sample size n. Our debiased estimators achieve fast convergence and asymptotic normality, enabling valid inference. Simulations demonstrate the utility of the method, and an application to a brain cancer gene-expression dataset reveals meaningful biological relationships.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12720500/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145802935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rejoinder to Letter to the Editors "Comments on 'Statistical inference on change points in generalized semiparametric segmented models' by Yang et al. (2025)" by Vito M.R. Muggeo. Vito M.R. Muggeo的《致编辑的信》“对Yang等人(2025)的‘广义半参数分段模型中变化点的统计推断’的评论”的回复。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf148
Guangyu Yang, Min Zhang
{"title":"Rejoinder to Letter to the Editors \"Comments on 'Statistical inference on change points in generalized semiparametric segmented models' by Yang et al. (2025)\" by Vito M.R. Muggeo.","authors":"Guangyu Yang, Min Zhang","doi":"10.1093/biomtc/ujaf148","DOIUrl":"10.1093/biomtc/ujaf148","url":null,"abstract":"","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":" ","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145602083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A meta-learning method for estimation of causal excursion effects to assess time-varying moderation. 用元学习方法估计因果偏移效应以评估时变适度。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf129
Jieru Shi, Walter Dempsey

Advances in wearable technologies and health interventions delivered by smartphones have greatly increased the accessibility of mobile health (mHealth) interventions. Micro-randomized trials (MRTs) are designed to assess the effectiveness of the mHealth intervention and introduce a novel class of causal estimands called "causal excursion effects." These estimands enable the evaluation of how intervention effects change over time and are influenced by individual characteristics or context. Existing methods for analyzing causal excursion effects assume known randomization probabilities, complete observations, and a linear nuisance function with prespecified features of the high-dimensional observed history. However, in complex mobile systems, these assumptions often fall short: randomization probabilities can be uncertain, observations may be incomplete, and the granularity of mHealth data makes linear modeling difficult. To address this issue, we propose a flexible and doubly robust inferential procedure, called "DR-WCLS," for estimating causal excursion effects from a meta-learner perspective. We present the bidirectional asymptotic properties of the proposed estimators and compare them with existing methods both theoretically and through extensive simulations. The results show a consistent and more efficient estimate, even with missing observations or uncertain treatment randomization probabilities. Finally, the practical utility of the proposed methods is demonstrated by analyzing data from a multi-institution cohort of first-year medical residents in the United States.

可穿戴技术的进步和智能手机提供的卫生干预措施大大提高了移动卫生干预措施的可及性。微随机试验(MRTs)旨在评估移动医疗干预的有效性,并引入一类称为“因果偏移效应”的新型因果估计。这些估计能够评估干预效果如何随时间变化,以及如何受到个体特征或环境的影响。现有的分析因果偏移效应的方法假设已知的随机化概率、完整的观测值和具有高维观测历史的预先指定特征的线性干扰函数。然而,在复杂的移动系统中,这些假设往往不足:随机概率可能是不确定的,观察可能是不完整的,移动健康数据的粒度使得线性建模变得困难。为了解决这个问题,我们提出了一种灵活且双重稳健的推理程序,称为“DR-WCLS”,用于从元学习者的角度估计因果偏移效应。我们给出了所提出的估计量的双向渐近性质,并从理论上和通过广泛的仿真将它们与现有方法进行了比较。结果显示了一个一致的和更有效的估计,即使有缺失的观察或不确定的治疗随机化概率。最后,通过分析美国多机构一年级住院医师队列的数据,证明了所提出方法的实际效用。
{"title":"A meta-learning method for estimation of causal excursion effects to assess time-varying moderation.","authors":"Jieru Shi, Walter Dempsey","doi":"10.1093/biomtc/ujaf129","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf129","url":null,"abstract":"<p><p>Advances in wearable technologies and health interventions delivered by smartphones have greatly increased the accessibility of mobile health (mHealth) interventions. Micro-randomized trials (MRTs) are designed to assess the effectiveness of the mHealth intervention and introduce a novel class of causal estimands called \"causal excursion effects.\" These estimands enable the evaluation of how intervention effects change over time and are influenced by individual characteristics or context. Existing methods for analyzing causal excursion effects assume known randomization probabilities, complete observations, and a linear nuisance function with prespecified features of the high-dimensional observed history. However, in complex mobile systems, these assumptions often fall short: randomization probabilities can be uncertain, observations may be incomplete, and the granularity of mHealth data makes linear modeling difficult. To address this issue, we propose a flexible and doubly robust inferential procedure, called \"DR-WCLS,\" for estimating causal excursion effects from a meta-learner perspective. We present the bidirectional asymptotic properties of the proposed estimators and compare them with existing methods both theoretically and through extensive simulations. The results show a consistent and more efficient estimate, even with missing observations or uncertain treatment randomization probabilities. Finally, the practical utility of the proposed methods is demonstrated by analyzing data from a multi-institution cohort of first-year medical residents in the United States.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145249539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of transition probabilities in multi-state models with nested case-control data. 嵌套病例控制数据的多状态模型转移概率预测。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf164
Yen Chang, Anastasia Ivanova, Demetrius Albanes, Jason P Fine, Yei Eun Shin

Multi-state models are widely used to study complex interrelated life events. In resource-limited settings, nested case-control (NCC) sampling may be employed to extract subsamples from a cohort for an event of interest, followed by a conditional likelihood analysis. However, conditioning restricts the reuse of NCC data for studying additional events. An alternative approach constructs pseudolikelihoods using inverse probability weighting (IPW) for inference with NCC data. Existing IPW-based pseudolikelihood methods focus primarily on estimating relative risks for multiple outcomes or secondary endpoints. In this work, we extend these methods to predict transition probabilities under general multi-state models and evaluate their efficiency. As the standard IPW methods for the prediction of transition probabilities may suffer from inefficiency, we propose two novel approaches for more efficient prediction and derive explicit variance estimates for these methods. The first approach calibrates the design weights using cohort-level information, while the second jointly models transitions originating from the same state. A simulation study demonstrates that either approach substantially improves efficiency and that their combined application yields further gains. We illustrate these methods with real data from the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial.

多状态模型被广泛用于研究复杂的相互关联的生活事件。在资源有限的情况下,可以采用嵌套病例对照(NCC)抽样,从队列中提取感兴趣事件的子样本,然后进行条件似然分析。然而,条件限制了NCC数据在研究其他事件时的重用。另一种方法使用逆概率加权(IPW)构建伪似然,用于NCC数据的推理。现有的基于ipw的伪似然方法主要侧重于估计多个结果或次要终点的相对风险。在这项工作中,我们将这些方法扩展到一般多状态模型下的转移概率预测并评估其效率。针对传统的IPW方法在预测转移概率方面存在效率低下的问题,本文提出了两种新的方法来提高转移概率的预测效率,并对这些方法进行了显式方差估计。第一种方法使用队列级信息校准设计权重,而第二种方法联合建模源自同一状态的转换。仿真研究表明,这两种方法都可以大大提高效率,并且它们的组合应用可以产生进一步的收益。我们用前列腺癌、肺癌、结直肠癌和卵巢癌筛查试验的真实数据来说明这些方法。
{"title":"Prediction of transition probabilities in multi-state models with nested case-control data.","authors":"Yen Chang, Anastasia Ivanova, Demetrius Albanes, Jason P Fine, Yei Eun Shin","doi":"10.1093/biomtc/ujaf164","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf164","url":null,"abstract":"<p><p>Multi-state models are widely used to study complex interrelated life events. In resource-limited settings, nested case-control (NCC) sampling may be employed to extract subsamples from a cohort for an event of interest, followed by a conditional likelihood analysis. However, conditioning restricts the reuse of NCC data for studying additional events. An alternative approach constructs pseudolikelihoods using inverse probability weighting (IPW) for inference with NCC data. Existing IPW-based pseudolikelihood methods focus primarily on estimating relative risks for multiple outcomes or secondary endpoints. In this work, we extend these methods to predict transition probabilities under general multi-state models and evaluate their efficiency. As the standard IPW methods for the prediction of transition probabilities may suffer from inefficiency, we propose two novel approaches for more efficient prediction and derive explicit variance estimates for these methods. The first approach calibrates the design weights using cohort-level information, while the second jointly models transitions originating from the same state. A simulation study demonstrates that either approach substantially improves efficiency and that their combined application yields further gains. We illustrate these methods with real data from the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145802938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stable survival extrapolation using mortality projections. 使用死亡率预测的稳定生存推断。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf159
Anastasios Apsemidis, Nikolaos Demiris

The mean survival is the key ingredient of the decision process in several applications, notably in health economic evaluations. It is defined as the area under the complete survival curve, thus necessitating extrapolation of the observed data. This may be achieved in a more stable manner by borrowing long term evidence from registry and demographic data. In this article, we employ a Bayesian mortality model and transfer its projections in order to construct the baseline population that acts as an anchor of the survival model. We then propose extrapolation methods based on flexible parametric poly-hazard models which can naturally accommodate diverse shapes, including non-proportional hazards and crossing survival curves, while typically maintaining a natural interpretation as a data generating mechanism. We estimate the mean survival and related estimands in 3 cases, namely breast cancer, advanced melanoma, and cardiac arrhythmia. Specifically, we evaluate the survival disadvantage of triple-negative breast cancer cases, the efficacy of combining immunotherapy with mRNA cancer therapeutics for melanoma treatment and the suitability of implantable cardioverter defibrilators for cardiac arrhythmia. The last is conducted in a competing risks context illustrating how working on the cause-specific hazard alone minimizes potential instability. The results suggest that the proposed approach offers a flexible, interpretable, and robust approach when survival extrapolation is required.

在若干应用中,特别是在卫生经济评价中,平均存活率是决策过程的关键因素。它被定义为完整生存曲线下的面积,因此需要对观察到的数据进行外推。这可以通过借用登记和人口数据的长期证据以更稳定的方式实现。在本文中,我们采用贝叶斯死亡率模型并转移其预测,以构建作为生存模型锚点的基线人口。然后,我们提出了基于灵活参数多风险模型的外推方法,该模型可以自然地适应各种形状,包括非比例风险和交叉生存曲线,同时通常保持自然解释作为数据生成机制。我们估计了3例乳腺癌、晚期黑色素瘤和心律失常的平均生存期和相关估计。具体而言,我们评估了三阴性乳腺癌病例的生存劣势,免疫疗法联合mRNA癌症疗法治疗黑色素瘤的疗效,以及植入式心律转复除颤器治疗心律失常的适用性。最后一个是在竞争风险的背景下进行的,说明如何单独处理特定原因的危险,以最大限度地减少潜在的不稳定性。结果表明,当需要生存外推时,所提出的方法提供了一种灵活、可解释和可靠的方法。
{"title":"Stable survival extrapolation using mortality projections.","authors":"Anastasios Apsemidis, Nikolaos Demiris","doi":"10.1093/biomtc/ujaf159","DOIUrl":"10.1093/biomtc/ujaf159","url":null,"abstract":"<p><p>The mean survival is the key ingredient of the decision process in several applications, notably in health economic evaluations. It is defined as the area under the complete survival curve, thus necessitating extrapolation of the observed data. This may be achieved in a more stable manner by borrowing long term evidence from registry and demographic data. In this article, we employ a Bayesian mortality model and transfer its projections in order to construct the baseline population that acts as an anchor of the survival model. We then propose extrapolation methods based on flexible parametric poly-hazard models which can naturally accommodate diverse shapes, including non-proportional hazards and crossing survival curves, while typically maintaining a natural interpretation as a data generating mechanism. We estimate the mean survival and related estimands in 3 cases, namely breast cancer, advanced melanoma, and cardiac arrhythmia. Specifically, we evaluate the survival disadvantage of triple-negative breast cancer cases, the efficacy of combining immunotherapy with mRNA cancer therapeutics for melanoma treatment and the suitability of implantable cardioverter defibrilators for cardiac arrhythmia. The last is conducted in a competing risks context illustrating how working on the cause-specific hazard alone minimizes potential instability. The results suggest that the proposed approach offers a flexible, interpretable, and robust approach when survival extrapolation is required.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145720509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical inference for heterogeneous treatment effect with right-censored data from synthesizing randomized clinical trials and real-world data. 综合随机临床试验和真实世界数据的右删减数据对异质性治疗效果的统计推断。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf131
Guangcai Mao, Shu Yang, Xiaofei Wang

The heterogeneous treatment effect plays a crucial role in precision medicine. There is evidence that real-world data, even subject to biases, can be employed as supplementary evidence for randomized clinical trials to improve the statistical efficiency of the heterogeneous treatment effect estimation. In this paper, for survival data with right censoring, we consider estimating the heterogeneous treatment effect, defined as the difference of the treatment-specific conditional restricted mean survival times given covariates, by synthesizing evidence from randomized clinical trials and the real-world data with possible biases. We define an omnibus bias function to characterize the effect of biases caused by unmeasured confounders, censoring, and outcome heterogeneity, and further, identify it by combining the trial and real-world data. We propose a penalized sieve method to estimate the heterogeneous treatment effect and the bias function. We further study the theoretical properties of the proposed integrative estimators based on the theory of reproducing kernel Hilbert space and empirical process. The proposed methodology outperforms the approach solely based on the trial data through simulation studies and an integrative analysis of the data from a randomized trial and a real-world registry on early-stage non-small-cell lung cancer.

治疗效果的异质性在精准医疗中起着至关重要的作用。有证据表明,即使存在偏倚,现实世界的数据也可以作为随机临床试验的补充证据,以提高异质性治疗效果估计的统计效率。在本文中,我们考虑通过综合随机临床试验和可能存在偏差的真实数据的证据来估计具有正确审查的生存数据的异质性治疗效果,其定义为给定协变量的治疗特异性条件限制平均生存时间的差异。我们定义了一个综合偏倚函数来描述由未测量的混杂因素、审查和结果异质性引起的偏倚的影响,并进一步通过结合试验和实际数据来识别它。我们提出了一种惩罚筛法来估计非均质处理效果和偏差函数。基于核希尔伯特空间再现理论和经验过程,进一步研究了所提综合估计量的理论性质。所提出的方法优于仅基于试验数据的方法,该方法通过模拟研究和对来自早期非小细胞肺癌的随机试验和真实世界登记的数据进行综合分析。
{"title":"Statistical inference for heterogeneous treatment effect with right-censored data from synthesizing randomized clinical trials and real-world data.","authors":"Guangcai Mao, Shu Yang, Xiaofei Wang","doi":"10.1093/biomtc/ujaf131","DOIUrl":"10.1093/biomtc/ujaf131","url":null,"abstract":"<p><p>The heterogeneous treatment effect plays a crucial role in precision medicine. There is evidence that real-world data, even subject to biases, can be employed as supplementary evidence for randomized clinical trials to improve the statistical efficiency of the heterogeneous treatment effect estimation. In this paper, for survival data with right censoring, we consider estimating the heterogeneous treatment effect, defined as the difference of the treatment-specific conditional restricted mean survival times given covariates, by synthesizing evidence from randomized clinical trials and the real-world data with possible biases. We define an omnibus bias function to characterize the effect of biases caused by unmeasured confounders, censoring, and outcome heterogeneity, and further, identify it by combining the trial and real-world data. We propose a penalized sieve method to estimate the heterogeneous treatment effect and the bias function. We further study the theoretical properties of the proposed integrative estimators based on the theory of reproducing kernel Hilbert space and empirical process. The proposed methodology outperforms the approach solely based on the trial data through simulation studies and an integrative analysis of the data from a randomized trial and a real-world registry on early-stage non-small-cell lung cancer.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12505326/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145249501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian collocation integral method for system identification of ordinary differential equations. 常微分方程系统辨识的贝叶斯搭配积分法。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf141
Mingwei Xu, Samuel W K Wong, Peijun Sang

Ordinary differential equations (ODEs) are widely considered for modeling the dynamics of complex systems across various scientific areas. To identify the structure of high-dimensional sparse ODEs from noisy time-course data, most existing methods adopt a frequentist perspective, while uncertainty quantification in parameter estimation remains challenging. Under an additive ODE model assumption, we present a Bayesian hierarchical collocation method to provide better quantification of uncertainty. Our framework unifies the likelihood, integrated ODE constraints and a group-wise sparse penalty, allowing for simultaneous system identification and trajectory estimation. We demonstrate the favorable performance of the proposed method through simulation studies, where the recovered system trajectories and estimated additive components are compared with other recent methods. A real data example of gene regulatory networks is provided to illustrate the methodology.

常微分方程(ode)被广泛认为是各种科学领域复杂系统动力学的建模方法。为了从噪声时程数据中识别高维稀疏ode的结构,现有的方法大多采用频率论的视角,而参数估计中的不确定性量化仍然是一个挑战。在可加ODE模型假设下,提出了贝叶斯分层配置方法,以更好地量化不确定性。我们的框架统一了可能性、集成的ODE约束和组明智的稀疏惩罚,允许同时进行系统识别和轨迹估计。我们通过仿真研究证明了所提出方法的良好性能,其中恢复的系统轨迹和估计的附加成分与其他最近的方法进行了比较。提供了一个基因调控网络的真实数据示例来说明该方法。
{"title":"A Bayesian collocation integral method for system identification of ordinary differential equations.","authors":"Mingwei Xu, Samuel W K Wong, Peijun Sang","doi":"10.1093/biomtc/ujaf141","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf141","url":null,"abstract":"<p><p>Ordinary differential equations (ODEs) are widely considered for modeling the dynamics of complex systems across various scientific areas. To identify the structure of high-dimensional sparse ODEs from noisy time-course data, most existing methods adopt a frequentist perspective, while uncertainty quantification in parameter estimation remains challenging. Under an additive ODE model assumption, we present a Bayesian hierarchical collocation method to provide better quantification of uncertainty. Our framework unifies the likelihood, integrated ODE constraints and a group-wise sparse penalty, allowing for simultaneous system identification and trajectory estimation. We demonstrate the favorable performance of the proposed method through simulation studies, where the recovered system trajectories and estimated additive components are compared with other recent methods. A real data example of gene regulatory networks is provided to illustrate the methodology.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145372174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of cross-platform health communication with a network approach. 基于网络的跨平台健康通信分析。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf154
Xinyan Fan, Mengque Liu, Shuangge Ma

Online health communities (OHCs) provide a platform for patients and those related to share and communicate, making complex medical information more digestible and actionable. Health communication within OHCs can be impacted by other information sources. This study examines cross-platform health communication by mining Breastcancer.org (the largest online breast cancer community) and Twitter (now X). Early analyses of OHCs, Twitter, and other online platforms often adopt simple measures like word frequency, and more recent research has shifted towards word co-occurrence network analysis. Relatively, cross-platform communication analysis is limited, and the adopted techniques have drawbacks. We propose a new cross-platform communication model that collectively analyzes word co-occurrence networks and word frequency vectors. Here, the former describe the structural contents of health communication, and the latter describe the volumes. This model offers a nuanced perspective, accommodates temporal variations, and is examined for its theoretical and numerical properties. Collected from January 2010 to December 2020, the analyzed data contains over 1 395 000 tweets and 517 000 posts. Our analysis suggests that the Twitter's topics on breast cancer significantly impact the contents and volumes in the OHC. Distinct time phases are observed, with notable peaks during 2012-2013 and 2015-2018. This study can provide a venue for better understanding health communication and new insights into two highly important online platforms.

在线健康社区(ohc)为患者和相关人员提供了一个共享和交流的平台,使复杂的医疗信息更易于消化和操作。OHCs内部的卫生交流可能受到其他信息来源的影响。这项研究通过挖掘Breastcancer.org(最大的在线乳腺癌社区)和Twitter(现在的X)来检验跨平台的健康交流。早期对ohc、Twitter和其他在线平台的分析通常采用词频等简单的测量方法,最近的研究转向了词共现网络分析。相对而言,跨平台通信分析是有限的,所采用的技术也有缺点。我们提出了一种新的跨平台通信模型,该模型综合分析了词共现网络和词频向量。在这里,前者描述了健康传播的结构内容,后者描述了体量。该模型提供了一个细致入微的视角,适应时间变化,并对其理论和数值特性进行了检验。从2010年1月到2020年12月,分析的数据包含超过139.5万条推文和51.7万条帖子。我们的分析表明,Twitter上关于乳腺癌的话题对OHC的内容和数量有显著影响。在2012-2013年和2015-2018年有明显的高峰。这项研究可以为更好地理解健康传播提供一个场所,并为两个非常重要的在线平台提供新的见解。
{"title":"Analysis of cross-platform health communication with a network approach.","authors":"Xinyan Fan, Mengque Liu, Shuangge Ma","doi":"10.1093/biomtc/ujaf154","DOIUrl":"10.1093/biomtc/ujaf154","url":null,"abstract":"<p><p>Online health communities (OHCs) provide a platform for patients and those related to share and communicate, making complex medical information more digestible and actionable. Health communication within OHCs can be impacted by other information sources. This study examines cross-platform health communication by mining Breastcancer.org (the largest online breast cancer community) and Twitter (now X). Early analyses of OHCs, Twitter, and other online platforms often adopt simple measures like word frequency, and more recent research has shifted towards word co-occurrence network analysis. Relatively, cross-platform communication analysis is limited, and the adopted techniques have drawbacks. We propose a new cross-platform communication model that collectively analyzes word co-occurrence networks and word frequency vectors. Here, the former describe the structural contents of health communication, and the latter describe the volumes. This model offers a nuanced perspective, accommodates temporal variations, and is examined for its theoretical and numerical properties. Collected from January 2010 to December 2020, the analyzed data contains over 1 395 000 tweets and 517 000 posts. Our analysis suggests that the Twitter's topics on breast cancer significantly impact the contents and volumes in the OHC. Distinct time phases are observed, with notable peaks during 2012-2013 and 2015-2018. This study can provide a venue for better understanding health communication and new insights into two highly important online platforms.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145572906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A regularized continuous-time hidden Markov model for identifying latent state transition patterns of poly-tobacco use. 用正则化连续时间隐马尔可夫模型识别多烟草使用的潜在状态转变模式。
IF 1.7 4区 数学 Q3 BIOLOGY Pub Date : 2025-10-08 DOI: 10.1093/biomtc/ujaf138
Xinyu Yan, Ji-Hyun Lee, Xiang-Yang Lou

Hidden Markov models (HMMs) are widely used to characterize latent state transition patterns in substance use. However, traditional HMM frameworks are incompetent when dealing with the complexities introduced by high-dimensional risk factors and varying time intervals, particularly in determining the number of hidden states and selecting variables for state transition parameters. To tackle the analytical challenges in the Population Assessment of Tobacco and Health (PATH) Study, a nationally representative longitudinal cohort study on tobacco use, we propose a continuous-time HMM framework with a regularization algorithm to identify multi-dimensional risk factors underlying complex poly-tobacco use transitions. We develop an elastic-net regularization on the transition covariates to identify informative covariates and improve model estimation accuracy. The inclusion of key covariates enables accurate determination of the number of hidden states. We incorporate survey weights and information on strata and clustering throughout the modeling framework. We demonstrate the validity of our approach in determining state numbers, identifying informative covariates, and estimating model parameters through a series of simulations. Application of the proposed approach to PATH data analysis revealed several demographic, behavioral, and psychosocial factors that contribute to the differential risks of transition between tobacco-use states among youth and young adults. The model's capacity in identifying high-dimensional risk factors for underlying hidden variables substantiates its potential for enhancing public health research and informing interventions.

隐马尔可夫模型(hmm)被广泛用于描述物质使用中的潜在状态转移模式。然而,传统HMM框架在处理由高维风险因素和变时间间隔引入的复杂性时,特别是在确定隐藏状态数量和选择状态转移参数变量方面表现不佳。为了解决烟草与健康人口评估(PATH)研究(一项具有全国代表性的烟草使用纵向队列研究)中的分析挑战,我们提出了一个带有正则化算法的连续时间HMM框架,以识别复杂多烟草使用转变背后的多维风险因素。我们对过渡协变量进行弹性网络正则化,以识别信息协变量,提高模型估计精度。关键协变量的包含可以准确地确定隐藏状态的数量。我们在整个建模框架中结合了调查权重和关于分层和聚类的信息。通过一系列的模拟,我们证明了我们的方法在确定状态数、识别信息协变量和估计模型参数方面的有效性。将提出的方法应用于适宜卫生技术方案数据分析,揭示了若干人口统计学、行为和社会心理因素,这些因素导致了青少年和年轻人在烟草使用状态之间的转变风险差异。该模型在确定潜在隐藏变量的高维风险因素方面的能力证实了它在加强公共卫生研究和为干预措施提供信息方面的潜力。
{"title":"A regularized continuous-time hidden Markov model for identifying latent state transition patterns of poly-tobacco use.","authors":"Xinyu Yan, Ji-Hyun Lee, Xiang-Yang Lou","doi":"10.1093/biomtc/ujaf138","DOIUrl":"https://doi.org/10.1093/biomtc/ujaf138","url":null,"abstract":"<p><p>Hidden Markov models (HMMs) are widely used to characterize latent state transition patterns in substance use. However, traditional HMM frameworks are incompetent when dealing with the complexities introduced by high-dimensional risk factors and varying time intervals, particularly in determining the number of hidden states and selecting variables for state transition parameters. To tackle the analytical challenges in the Population Assessment of Tobacco and Health (PATH) Study, a nationally representative longitudinal cohort study on tobacco use, we propose a continuous-time HMM framework with a regularization algorithm to identify multi-dimensional risk factors underlying complex poly-tobacco use transitions. We develop an elastic-net regularization on the transition covariates to identify informative covariates and improve model estimation accuracy. The inclusion of key covariates enables accurate determination of the number of hidden states. We incorporate survey weights and information on strata and clustering throughout the modeling framework. We demonstrate the validity of our approach in determining state numbers, identifying informative covariates, and estimating model parameters through a series of simulations. Application of the proposed approach to PATH data analysis revealed several demographic, behavioral, and psychosocial factors that contribute to the differential risks of transition between tobacco-use states among youth and young adults. The model's capacity in identifying high-dimensional risk factors for underlying hidden variables substantiates its potential for enhancing public health research and informing interventions.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145298373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1