首页 > 最新文献

Biometrics最新文献

英文 中文
PathGPS: discover shared genetic architecture using GWAS summary data. PathGPS:利用 GWAS 摘要数据发现共享遗传结构。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biomtc/ujae060
Zijun Gao, Qingyuan Zhao, Trevor Hastie

The increasing availability and scale of biobanks and "omic" datasets bring new horizons for understanding biological mechanisms. PathGPS is an exploratory data analysis tool to discover genetic architectures using Genome Wide Association Studies (GWAS) summary data. PathGPS is based on a linear structural equation model where traits are regulated by both genetic and environmental pathways. PathGPS decouples the genetic and environmental components by contrasting the GWAS associations of "signal" genes with those of "noise" genes. From the estimated genetic component, PathGPS then extracts genetic pathways via principal component and factor analysis, leveraging the low-rank and sparse properties. In addition, we provide a bootstrap aggregating ("bagging") algorithm to improve stability under data perturbation and hyperparameter tuning. When applied to a metabolomics dataset and the UK Biobank, PathGPS confirms several known gene-trait clusters and suggests multiple new hypotheses for future investigations.

生物库和 "omic "数据集的可用性和规模不断扩大,为了解生物机制带来了新的视野。PathGPS 是一种探索性数据分析工具,用于利用全基因组关联研究(GWAS)汇总数据发现遗传结构。PathGPS 基于线性结构方程模型,在该模型中,性状同时受遗传和环境途径的调节。PathGPS 通过对比 "信号 "基因与 "噪音 "基因在 GWAS 中的关联,将遗传和环境因素分离开来。然后,PathGPS 利用低秩和稀疏特性,通过主成分和因子分析,从估计的遗传成分中提取遗传途径。此外,我们还提供了一种自举聚合("bagging")算法,以提高数据扰动和超参数调整下的稳定性。当应用到代谢组学数据集和英国生物库时,PathGPS 证实了几个已知的基因性状群,并为未来的研究提出了多个新的假设。
{"title":"PathGPS: discover shared genetic architecture using GWAS summary data.","authors":"Zijun Gao, Qingyuan Zhao, Trevor Hastie","doi":"10.1093/biomtc/ujae060","DOIUrl":"10.1093/biomtc/ujae060","url":null,"abstract":"<p><p>The increasing availability and scale of biobanks and \"omic\" datasets bring new horizons for understanding biological mechanisms. PathGPS is an exploratory data analysis tool to discover genetic architectures using Genome Wide Association Studies (GWAS) summary data. PathGPS is based on a linear structural equation model where traits are regulated by both genetic and environmental pathways. PathGPS decouples the genetic and environmental components by contrasting the GWAS associations of \"signal\" genes with those of \"noise\" genes. From the estimated genetic component, PathGPS then extracts genetic pathways via principal component and factor analysis, leveraging the low-rank and sparse properties. In addition, we provide a bootstrap aggregating (\"bagging\") algorithm to improve stability under data perturbation and hyperparameter tuning. When applied to a metabolomics dataset and the UK Biobank, PathGPS confirms several known gene-trait clusters and suggests multiple new hypotheses for future investigations.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 3","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11247175/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141615885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adjusting for incomplete baseline covariates in randomized controlled trials: a cross-world imputation framework. 调整随机对照试验中不完整的基线协变量:跨世界估算框架。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biomtc/ujae094
Yilin Song, James P Hughes, Ting Ye

In randomized controlled trials, adjusting for baseline covariates is commonly used to improve the precision of treatment effect estimation. However, covariates often have missing values. Recently, Zhao and Ding studied two simple strategies, the single imputation method and missingness-indicator method (MIM), to handle missing covariates and showed that both methods can provide an efficiency gain compared to not adjusting for covariates. To better understand and compare these two strategies, we propose and investigate a novel theoretical imputation framework termed cross-world imputation (CWI). This framework includes both single imputation and MIM as special cases, facilitating the comparison of their efficiency. Through the lens of CWI, we show that MIM implicitly searches for the optimal CWI values and thus achieves optimal efficiency. We also derive conditions under which the single imputation method, by searching for the optimal single imputation values, can achieve the same efficiency as the MIM. We illustrate our findings through simulation studies and a real data analysis based on the Childhood Adenotonsillectomy Trial. We conclude by discussing the practical implications of our findings.

在随机对照试验中,调整基线协变量通常用于提高治疗效果估计的精确度。然而,协变量往往存在缺失值。最近,赵(Zhao)和丁(Ding)研究了处理缺失协变量的两种简单策略,即单一估算法和缺失指示器法(MIM),结果表明,与不调整协变量相比,这两种方法都能提高效率。为了更好地理解和比较这两种策略,我们提出并研究了一种新的理论估算框架,称为跨世界估算(CWI)。该框架将单一估算和 MIM 作为特例,便于比较它们的效率。通过 CWI 的视角,我们表明 MIM 会隐含地搜索最佳 CWI 值,从而实现最佳效率。我们还推导出了单一估算方法通过寻找最佳单一估算值而达到与 MIM 相同效率的条件。我们通过模拟研究和基于儿童腺样体切除术试验的真实数据分析来说明我们的发现。最后,我们将讨论我们的发现的实际意义。
{"title":"Adjusting for incomplete baseline covariates in randomized controlled trials: a cross-world imputation framework.","authors":"Yilin Song, James P Hughes, Ting Ye","doi":"10.1093/biomtc/ujae094","DOIUrl":"https://doi.org/10.1093/biomtc/ujae094","url":null,"abstract":"<p><p>In randomized controlled trials, adjusting for baseline covariates is commonly used to improve the precision of treatment effect estimation. However, covariates often have missing values. Recently, Zhao and Ding studied two simple strategies, the single imputation method and missingness-indicator method (MIM), to handle missing covariates and showed that both methods can provide an efficiency gain compared to not adjusting for covariates. To better understand and compare these two strategies, we propose and investigate a novel theoretical imputation framework termed cross-world imputation (CWI). This framework includes both single imputation and MIM as special cases, facilitating the comparison of their efficiency. Through the lens of CWI, we show that MIM implicitly searches for the optimal CWI values and thus achieves optimal efficiency. We also derive conditions under which the single imputation method, by searching for the optimal single imputation values, can achieve the same efficiency as the MIM. We illustrate our findings through simulation studies and a real data analysis based on the Childhood Adenotonsillectomy Trial. We conclude by discussing the practical implications of our findings.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 3","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11398886/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142280079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Causal inference using multivariate generalized linear mixed-effects models. 利用多变量广义线性混合效应模型进行因果推断。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biomtc/ujae100
Yizhen Xu, Ji Soo Kim, Laura K Hummers, Ami A Shah, Scott L Zeger

Dynamic prediction of causal effects under different treatment regimens is an essential problem in precision medicine. It is challenging because the actual mechanisms of treatment assignment and effects are unknown in observational studies. We propose a multivariate generalized linear mixed-effects model and a Bayesian g-computation algorithm to calculate the posterior distribution of subgroup-specific intervention benefits of dynamic treatment regimes. Unmeasured time-invariant factors are included as subject-specific random effects in the assumed joint distribution of outcomes, time-varying confounders, and treatment assignments. We identify a sequential ignorability assumption conditional on treatment assignment heterogeneity, that is, analogous to balancing the latent treatment preference due to unmeasured time-invariant factors. We present a simulation study to assess the proposed method's performance. The method is applied to observational clinical data to investigate the efficacy of continuously using mycophenolate in different subgroups of scleroderma patients.

对不同治疗方案的因果效应进行动态预测是精准医学的一个基本问题。由于在观察性研究中,治疗分配和效果的实际机制尚不清楚,因此这项工作极具挑战性。我们提出了一种多变量广义线性混合效应模型和贝叶斯 g 计算算法,用于计算动态治疗方案的亚组特异性干预效益的后验分布。在假定的结果、时变混杂因素和治疗分配的联合分布中,未测量的时变因素作为特定受试者的随机效应被包含在内。我们确定了一个以治疗分配异质性为条件的连续无知假设,即类似于平衡未测量时变因素导致的潜在治疗偏好。我们通过模拟研究来评估所提出方法的性能。我们将该方法应用于观察性临床数据,以研究在硬皮病患者的不同亚组中持续使用霉酚酸酯的疗效。
{"title":"Causal inference using multivariate generalized linear mixed-effects models.","authors":"Yizhen Xu, Ji Soo Kim, Laura K Hummers, Ami A Shah, Scott L Zeger","doi":"10.1093/biomtc/ujae100","DOIUrl":"https://doi.org/10.1093/biomtc/ujae100","url":null,"abstract":"<p><p>Dynamic prediction of causal effects under different treatment regimens is an essential problem in precision medicine. It is challenging because the actual mechanisms of treatment assignment and effects are unknown in observational studies. We propose a multivariate generalized linear mixed-effects model and a Bayesian g-computation algorithm to calculate the posterior distribution of subgroup-specific intervention benefits of dynamic treatment regimes. Unmeasured time-invariant factors are included as subject-specific random effects in the assumed joint distribution of outcomes, time-varying confounders, and treatment assignments. We identify a sequential ignorability assumption conditional on treatment assignment heterogeneity, that is, analogous to balancing the latent treatment preference due to unmeasured time-invariant factors. We present a simulation study to assess the proposed method's performance. The method is applied to observational clinical data to investigate the efficacy of continuously using mycophenolate in different subgroups of scleroderma patients.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 3","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11422711/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142340678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discussion on "LEAP: the latent exchangeability prior for borrowing information from historical data" by Ethan M. Alt, Xiuya Chang, Xun Jiang, Qing Liu, May Mo, H. Amy Xia, and Joseph G. Ibrahim. 关于 Ethan M. Alt、Xiuya Chang、Xun Jiang、Qing Liu、May Mo、H. Amy Xia 和 Joseph G. Ibrahim 所著《LEAP:从历史数据中借用信息的潜在可交换性先验》的讨论。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biomtc/ujae086
Shannon D Thomas, Alexander M Kaizer

This discussion provides commentary on the paper by Ethan M. Alt, Xiuya Chang, Xun Jiang, Qing Liu, May Mo, H. Amy Xia, and Joseph G. Ibrahim entitled "LEAP: the latent exchangeability prior for borrowing information from historical data". The authors propose a novel method to bridge the incorporation of supplemental information into a study while also identifying potentially exchangeable subgroups to better facilitate information sharing. In this discussion, we highlight the potential relationship with other Bayesian model averaging approaches, such as multisource exchangeability modeling, and provide a brief numeric case study to illustrate how the concepts behind latent exchangeability prior may also improve the performance of existing methods. The results provided by Alt et al. are exciting, and we believe that the method represents a meaningful approach to more efficient information sharing.

本讨论对 Ethan M. Alt、Xiuya Chang、Xun Jiang、Qing Liu、May Mo、H. Amy Xia 和 Joseph G. Ibrahim 题为 "LEAP:从历史数据中借用信息的潜在可交换性先验 "的论文进行了评论。作者提出了一种新方法,在将补充信息纳入研究的同时,还能识别潜在的可交换子群,从而更好地促进信息共享。在讨论中,我们强调了与其他贝叶斯模型平均方法(如多源可交换性建模)的潜在关系,并提供了一个简短的数字案例研究,以说明潜在可交换性先验背后的概念如何也能提高现有方法的性能。Alt 等人提供的结果令人振奋,我们相信该方法是实现更高效信息共享的一种有意义的方法。
{"title":"Discussion on \"LEAP: the latent exchangeability prior for borrowing information from historical data\" by Ethan M. Alt, Xiuya Chang, Xun Jiang, Qing Liu, May Mo, H. Amy Xia, and Joseph G. Ibrahim.","authors":"Shannon D Thomas, Alexander M Kaizer","doi":"10.1093/biomtc/ujae086","DOIUrl":"10.1093/biomtc/ujae086","url":null,"abstract":"<p><p>This discussion provides commentary on the paper by Ethan M. Alt, Xiuya Chang, Xun Jiang, Qing Liu, May Mo, H. Amy Xia, and Joseph G. Ibrahim entitled \"LEAP: the latent exchangeability prior for borrowing information from historical data\". The authors propose a novel method to bridge the incorporation of supplemental information into a study while also identifying potentially exchangeable subgroups to better facilitate information sharing. In this discussion, we highlight the potential relationship with other Bayesian model averaging approaches, such as multisource exchangeability modeling, and provide a brief numeric case study to illustrate how the concepts behind latent exchangeability prior may also improve the performance of existing methods. The results provided by Alt et al. are exciting, and we believe that the method represents a meaningful approach to more efficient information sharing.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 3","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11427888/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142340681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rejoinder to the discussion on "LEAP: the latent exchangeability prior for borrowing information from historical data". 关于 "LEAP:从历史数据中借用信息的潜在可交换性先验 "讨论的再评论。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biomtc/ujae087
Ethan M Alt, Xiuya Chang, Xun Jiang, Qing Liu, May Mo, Hong Amy Xia, Joseph G Ibrahim

The discussions of our paper provide insights into the practical considerations of the latent exchangeability prior while also highlighting further extensions. In this rejoinder, we briefly summarize the discussions and provide comments.

我们论文的讨论为潜在可交换性先验的实际考虑提供了见解,同时也强调了进一步的扩展。在这篇反驳文章中,我们将简要总结讨论内容并提出评论。
{"title":"Rejoinder to the discussion on \"LEAP: the latent exchangeability prior for borrowing information from historical data\".","authors":"Ethan M Alt, Xiuya Chang, Xun Jiang, Qing Liu, May Mo, Hong Amy Xia, Joseph G Ibrahim","doi":"10.1093/biomtc/ujae087","DOIUrl":"https://doi.org/10.1093/biomtc/ujae087","url":null,"abstract":"<p><p>The discussions of our paper provide insights into the practical considerations of the latent exchangeability prior while also highlighting further extensions. In this rejoinder, we briefly summarize the discussions and provide comments.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 3","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142340684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hypothesis tests in ordinal predictive models with optimal accuracy. 具有最佳准确性的序数预测模型中的假设检验。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biomtc/ujae079
Yuyang Liu, Shan Luo, Jialiang Li

In real-world applications involving multi-class ordinal discrimination, a common approach is to aggregate multiple predictive variables into a linear combination, aiming to develop a classifier with high prediction accuracy. Assessment of such multi-class classifiers often utilizes the hypervolume under ROC manifolds (HUM). When dealing with a substantial pool of potential predictors and achieving optimal HUM, it becomes imperative to conduct appropriate statistical inference. However, prevalent methodologies in existing literature are computationally expensive. We propose to use the jackknife empirical likelihood method to address this issue. The Wilks' theorem under moderate conditions is established and the power analysis under the Pitman alternative is provided. We also introduce a novel network-based rapid computation algorithm specifically designed for computing a general multi-sample $U$-statistic in our test procedure. To compare our approach against existing approaches, we conduct extensive simulations. Results demonstrate the superior performance of our method in terms of test size, power, and implementation time. Furthermore, we apply our method to analyze a real medical dataset and obtain some new findings.

在涉及多类序数判别的实际应用中,一种常见的方法是将多个预测变量聚合成一个线性组合,从而开发出一种预测精度高的分类器。对这种多类分类器的评估通常使用 ROC 流形下的超体积(HUM)。在处理大量潜在预测因子并实现最佳 HUM 时,必须进行适当的统计推断。然而,现有文献中普遍采用的方法计算成本高昂。我们建议使用杰克刀经验似然法(jackknife empirical likelihood method)来解决这一问题。我们建立了温和条件下的 Wilks' 定理,并提供了 Pitman 备选方案下的幂次分析。我们还引入了一种基于网络的新型快速计算算法,专门用于计算测试程序中的一般多样本 U$ 统计量。为了将我们的方法与现有方法进行比较,我们进行了大量模拟。结果表明,我们的方法在测试规模、功率和实施时间方面都具有卓越的性能。此外,我们还应用我们的方法分析了一个真实的医疗数据集,并获得了一些新的发现。
{"title":"Hypothesis tests in ordinal predictive models with optimal accuracy.","authors":"Yuyang Liu, Shan Luo, Jialiang Li","doi":"10.1093/biomtc/ujae079","DOIUrl":"https://doi.org/10.1093/biomtc/ujae079","url":null,"abstract":"<p><p>In real-world applications involving multi-class ordinal discrimination, a common approach is to aggregate multiple predictive variables into a linear combination, aiming to develop a classifier with high prediction accuracy. Assessment of such multi-class classifiers often utilizes the hypervolume under ROC manifolds (HUM). When dealing with a substantial pool of potential predictors and achieving optimal HUM, it becomes imperative to conduct appropriate statistical inference. However, prevalent methodologies in existing literature are computationally expensive. We propose to use the jackknife empirical likelihood method to address this issue. The Wilks' theorem under moderate conditions is established and the power analysis under the Pitman alternative is provided. We also introduce a novel network-based rapid computation algorithm specifically designed for computing a general multi-sample $U$-statistic in our test procedure. To compare our approach against existing approaches, we conduct extensive simulations. Results demonstrate the superior performance of our method in terms of test size, power, and implementation time. Furthermore, we apply our method to analyze a real medical dataset and obtain some new findings.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 3","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142016282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-dimensional multivariate analysis of variance via geometric median and bootstrapping. 通过几何中值和引导进行高维多变量方差分析。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biomtc/ujae088
Guanghui Cheng, Ruitao Lin, Liuhua Peng

The geometric median, which is applicable to high-dimensional data, can be viewed as a generalization of the univariate median used in 1-dimensional data. It can be used as a robust estimator for identifying the location of multi-dimensional data and has a wide range of applications in real-world scenarios. This paper explores the problem of high-dimensional multivariate analysis of variance (MANOVA) using the geometric median. A maximum-type statistic that relies on the differences between the geometric medians among various groups is introduced. The distribution of the new test statistic is derived under the null hypothesis using Gaussian approximations, and its consistency under the alternative hypothesis is established. To approximate the distribution of the new statistic in high dimensions, a wild bootstrap algorithm is proposed and theoretically justified. Through simulation studies conducted across a variety of dimensions, sample sizes, and data-generating models, we demonstrate the finite-sample performance of our geometric median-based MANOVA method. Additionally, we implement the proposed approach to analyze a breast cancer gene expression dataset.

几何中值适用于高维数据,可视为用于一维数据的单变量中值的一般化。它可以作为一种稳健的估计器来识别多维数据的位置,在现实世界中有着广泛的应用。本文探讨了使用几何中值进行高维多变量方差分析(MANOVA)的问题。本文介绍了一种依赖于各组间几何中值差异的最大值型统计量。利用高斯近似法得出了新检验统计量在零假设下的分布,并确定了其在备择假设下的一致性。为了逼近新统计量在高维度下的分布,提出了一种野生引导算法,并从理论上证明了该算法的合理性。通过对各种维度、样本大小和数据生成模型进行模拟研究,我们证明了基于几何中值的 MANOVA 方法的有限样本性能。此外,我们还利用提出的方法分析了乳腺癌基因表达数据集。
{"title":"High-dimensional multivariate analysis of variance via geometric median and bootstrapping.","authors":"Guanghui Cheng, Ruitao Lin, Liuhua Peng","doi":"10.1093/biomtc/ujae088","DOIUrl":"10.1093/biomtc/ujae088","url":null,"abstract":"<p><p>The geometric median, which is applicable to high-dimensional data, can be viewed as a generalization of the univariate median used in 1-dimensional data. It can be used as a robust estimator for identifying the location of multi-dimensional data and has a wide range of applications in real-world scenarios. This paper explores the problem of high-dimensional multivariate analysis of variance (MANOVA) using the geometric median. A maximum-type statistic that relies on the differences between the geometric medians among various groups is introduced. The distribution of the new test statistic is derived under the null hypothesis using Gaussian approximations, and its consistency under the alternative hypothesis is established. To approximate the distribution of the new statistic in high dimensions, a wild bootstrap algorithm is proposed and theoretically justified. Through simulation studies conducted across a variety of dimensions, sample sizes, and data-generating models, we demonstrate the finite-sample performance of our geometric median-based MANOVA method. Additionally, we implement the proposed approach to analyze a breast cancer gene expression dataset.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 3","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11381952/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142153109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian nonparametric approach for causal mediation with a post-treatment confounder. 贝叶斯非参数方法,用于处理后混杂因素的因果中介。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biomtc/ujae099
Woojung Bae, Michael J Daniels, Michael G Perri

We propose a new Bayesian nonparametric method for estimating the causal effects of mediation in the presence of a post-treatment confounder. The methodology is motivated by the Rural Lifestyle Intervention Treatment Effectiveness Trial (Rural LITE) for which there is interest in estimating causal mediation effects but is complicated by the presence of a post-treatment confounder. We specify an enriched Dirichlet process mixture (EDPM) to model the joint distribution of the observed data (outcome, mediator, post-treatment confounder, treatment, and baseline confounders). For identifiability, we use the extended version of the standard sequential ignorability (SI) as introduced in Hong et al. along with a Gaussian copula model assumption. The observed data model and causal identification assumptions enable us to estimate and identify the causal effects of mediation, that is, the natural direct effects (NDE) and natural indirect effects (NIE). Our method enables easy computation of NIE and NDE for a subset of confounding variables and addresses missing data through data augmentation under the assumption of ignorable missingness. We conduct simulation studies to assess the performance of our proposed method. Furthermore, we apply this approach to evaluate the causal mediation effect in the Rural LITE trial, finding that there was not strong evidence for the potential mediator.

我们提出了一种新的贝叶斯非参数方法,用于在存在治疗后混杂因素的情况下估计中介的因果效应。该方法受农村生活方式干预治疗效果试验(Rural Lifestyle Intervention Treatment Effectiveness Trial,RITE)的启发,该试验对因果中介效应的估计很感兴趣,但由于存在治疗后混杂因素而变得复杂。我们指定了一个丰富的 Dirichlet 过程混合物(EDPM)来模拟观察数据(结果、中介因素、治疗后混杂因素、治疗和基线混杂因素)的联合分布。在可识别性方面,我们使用了 Hong 等人引入的标准序列无知(SI)的扩展版本,以及高斯共轭模型假设。观察数据模型和因果识别假设使我们能够估计和识别中介的因果效应,即自然直接效应(NDE)和自然间接效应(NIE)。我们的方法可以轻松计算混杂变量子集的自然直接效应(NIE)和自然间接效应(NDE),并在可忽略缺失的假设下通过数据扩增解决缺失数据问题。我们进行了模拟研究,以评估我们提出的方法的性能。此外,我们还应用这种方法评估了农村 LITE 试验中的因果中介效应,发现并没有强有力的证据证明潜在的中介效应。
{"title":"A Bayesian nonparametric approach for causal mediation with a post-treatment confounder.","authors":"Woojung Bae, Michael J Daniels, Michael G Perri","doi":"10.1093/biomtc/ujae099","DOIUrl":"10.1093/biomtc/ujae099","url":null,"abstract":"<p><p>We propose a new Bayesian nonparametric method for estimating the causal effects of mediation in the presence of a post-treatment confounder. The methodology is motivated by the Rural Lifestyle Intervention Treatment Effectiveness Trial (Rural LITE) for which there is interest in estimating causal mediation effects but is complicated by the presence of a post-treatment confounder. We specify an enriched Dirichlet process mixture (EDPM) to model the joint distribution of the observed data (outcome, mediator, post-treatment confounder, treatment, and baseline confounders). For identifiability, we use the extended version of the standard sequential ignorability (SI) as introduced in Hong et al. along with a Gaussian copula model assumption. The observed data model and causal identification assumptions enable us to estimate and identify the causal effects of mediation, that is, the natural direct effects (NDE) and natural indirect effects (NIE). Our method enables easy computation of NIE and NDE for a subset of confounding variables and addresses missing data through data augmentation under the assumption of ignorable missingness. We conduct simulation studies to assess the performance of our proposed method. Furthermore, we apply this approach to evaluate the causal mediation effect in the Rural LITE trial, finding that there was not strong evidence for the potential mediator.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 3","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11418020/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142280078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Heterogeneous latent transfer learning in Gaussian graphical models. 高斯图形模型中的异质潜移默化学习。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biomtc/ujae096
Qiong Wu, Chi Wang, Yong Chen

Gaussian graphical models (GGMs) are useful for understanding the complex relationships between biological entities. Transfer learning can improve the estimation of GGMs in a target dataset by incorporating relevant information from related source studies. However, biomedical research often involves intrinsic and latent heterogeneity within a study, such as heterogeneous subpopulations. This heterogeneity can make it difficult to identify informative source studies or lead to negative transfer if the source study is improperly used. To address this challenge, we developed a heterogeneous latent transfer learning (Latent-TL) approach that accounts for both within-sample and between-sample heterogeneity. The idea behind this approach is to "learn from the alike" by leveraging the similarities between source and target GGMs within each subpopulation. The Latent-TL algorithm simultaneously identifies common subpopulation structures among samples and facilitates the learning of target GGMs using source samples from the same subpopulation. Through extensive simulations and real data application, we have shown that the proposed method outperforms single-site learning and standard transfer learning that ignores the latent structures. We have also demonstrated the applicability of the proposed algorithm in characterizing gene co-expression networks in breast cancer patients, where the inferred genetic networks identified many biologically meaningful gene-gene interactions.

高斯图形模型(GGM)有助于理解生物实体之间的复杂关系。迁移学习可以通过纳入相关源研究的相关信息来改进目标数据集中高斯图模型的估计。然而,生物医学研究往往涉及研究中的内在和潜在异质性,如异质性亚群。这种异质性可能会导致难以识别信息来源研究,或者在来源研究使用不当的情况下导致负迁移。为了应对这一挑战,我们开发了一种异质性潜移默化迁移学习(Latent-TL)方法,它同时考虑了样本内和样本间的异质性。这种方法背后的理念是利用每个子群中源和目标 GGM 之间的相似性来 "从相似中学习"。Latent-TL 算法能同时识别样本间共同的亚群结构,并利用来自同一亚群的源样本促进目标 GGM 的学习。通过大量的模拟和实际数据应用,我们证明了所提出的方法优于忽略潜在结构的单点学习和标准迁移学习。我们还证明了所提算法在表征乳腺癌患者基因共表达网络中的适用性,推断出的基因网络发现了许多具有生物学意义的基因-基因相互作用。
{"title":"Heterogeneous latent transfer learning in Gaussian graphical models.","authors":"Qiong Wu, Chi Wang, Yong Chen","doi":"10.1093/biomtc/ujae096","DOIUrl":"10.1093/biomtc/ujae096","url":null,"abstract":"<p><p>Gaussian graphical models (GGMs) are useful for understanding the complex relationships between biological entities. Transfer learning can improve the estimation of GGMs in a target dataset by incorporating relevant information from related source studies. However, biomedical research often involves intrinsic and latent heterogeneity within a study, such as heterogeneous subpopulations. This heterogeneity can make it difficult to identify informative source studies or lead to negative transfer if the source study is improperly used. To address this challenge, we developed a heterogeneous latent transfer learning (Latent-TL) approach that accounts for both within-sample and between-sample heterogeneity. The idea behind this approach is to \"learn from the alike\" by leveraging the similarities between source and target GGMs within each subpopulation. The Latent-TL algorithm simultaneously identifies common subpopulation structures among samples and facilitates the learning of target GGMs using source samples from the same subpopulation. Through extensive simulations and real data application, we have shown that the proposed method outperforms single-site learning and standard transfer learning that ignores the latent structures. We have also demonstrated the applicability of the proposed algorithm in characterizing gene co-expression networks in breast cancer patients, where the inferred genetic networks identified many biologically meaningful gene-gene interactions.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 3","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11413907/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142280081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semi-parametric benchmark dose analysis with monotone additive models. 使用单调相加模型进行半参数基准剂量分析
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2024-07-01 DOI: 10.1093/biomtc/ujae098
Alex Stringer, Tugba Akkaya Hocagil, Richard J Cook, Louise M Ryan, Sandra W Jacobson, Joseph L Jacobson

Benchmark dose analysis aims to estimate the level of exposure to a toxin associated with a clinically significant adverse outcome and quantifies uncertainty using the lower limit of a confidence interval for this level. We develop a novel framework for benchmark dose analysis based on monotone additive dose-response models. We first introduce a flexible approach for fitting monotone additive models via penalized B-splines and Laplace-approximate marginal likelihood. A reflective Newton method is then developed that employs de Boor's algorithm for computing splines and their derivatives for efficient estimation of the benchmark dose. Finally, we develop a novel approach for calculating benchmark dose lower limits based on an approximate pivot for the nonlinear equation solved by the estimated benchmark dose. The favorable properties of this approach compared to the Delta method and a parameteric bootstrap are discussed. We apply the new methods to make inferences about the level of prenatal alcohol exposure associated with clinically significant cognitive defects in children using data from six NIH-funded longitudinal cohort studies. Software to reproduce the results in this paper is available online and makes use of the novel semibmd  R package, which implements the methods in this paper.

基准剂量分析旨在估算与临床显著不良结果相关的毒素暴露水平,并利用该水平置信区间的下限来量化不确定性。我们基于单调相加剂量反应模型开发了一种新的基准剂量分析框架。我们首先介绍了一种灵活的方法,通过受惩罚的 B-样条曲线和拉普拉斯近似边际似然法拟合单调相加模型。然后,我们开发了一种反射牛顿方法,该方法采用 de Boor 算法计算样条及其导数,从而高效地估算基准剂量。最后,我们根据估计基准剂量所求解的非线性方程的近似支点,开发了一种计算基准剂量下限的新方法。我们讨论了这种方法与德尔塔法和参数自举法相比的有利特性。我们利用美国国立卫生研究院(NIH)资助的六项纵向队列研究数据,运用新方法推断了与临床上重大儿童认知缺陷相关的产前酒精暴露水平。重现本文结果的软件可在线获取,该软件使用了新颖的 semibmd R 软件包,该软件包实现了本文的方法。
{"title":"Semi-parametric benchmark dose analysis with monotone additive models.","authors":"Alex Stringer, Tugba Akkaya Hocagil, Richard J Cook, Louise M Ryan, Sandra W Jacobson, Joseph L Jacobson","doi":"10.1093/biomtc/ujae098","DOIUrl":"https://doi.org/10.1093/biomtc/ujae098","url":null,"abstract":"<p><p>Benchmark dose analysis aims to estimate the level of exposure to a toxin associated with a clinically significant adverse outcome and quantifies uncertainty using the lower limit of a confidence interval for this level. We develop a novel framework for benchmark dose analysis based on monotone additive dose-response models. We first introduce a flexible approach for fitting monotone additive models via penalized B-splines and Laplace-approximate marginal likelihood. A reflective Newton method is then developed that employs de Boor's algorithm for computing splines and their derivatives for efficient estimation of the benchmark dose. Finally, we develop a novel approach for calculating benchmark dose lower limits based on an approximate pivot for the nonlinear equation solved by the estimated benchmark dose. The favorable properties of this approach compared to the Delta method and a parameteric bootstrap are discussed. We apply the new methods to make inferences about the level of prenatal alcohol exposure associated with clinically significant cognitive defects in children using data from six NIH-funded longitudinal cohort studies. Software to reproduce the results in this paper is available online and makes use of the novel semibmd  R package, which implements the methods in this paper.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 3","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11403299/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142280083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1