Biostatistics最新文献_第7页

A Bayesian semi-parametric approach to causal mediation for longitudinal mediators and time-to-event outcomes with application to a cardiovascular disease cohort study. 贝叶斯半参数方法对纵向介质和事件发生时间结果的因果中介的应用于心血管疾病队列研究。

IF 2 3区数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biostatistics

Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf027

Saurabh Bhandari, Michael J Daniels, Maria Josefsson, Donald M Lloyd-Jones, Juned Siddique

Causal mediation analysis of observational data is an important tool for investigating the potential causal effects of medications on disease-related risk factors, and on time-to-death (or disease progression) through these risk factors. However, when analyzing data from a cohort study, such analyses are complicated by the longitudinal structure of the risk factors and the presence of time-varying confounders. Leveraging data from the Atherosclerosis Risk in Communities (ARIC) cohort study, we develop a causal mediation approach, using (semi-parametric) Bayesian Additive Regression Tree (BART) models for the longitudinal and survival data. Our framework is developed using static longitudinal exposure regimes and allows for time-varying confounders and mediators, both of which can be either continuous or binary. We also identify and estimate direct and indirect causal effects in the presence of a competing event. We apply our methods to assess how medication, prescribed to target cardiovascular disease (CVD) risk factors, affects the time-to-CVD death.

观察数据的因果中介分析是研究药物对疾病相关危险因素以及通过这些危险因素对死亡时间（或疾病进展）的潜在因果影响的重要工具。然而，当分析来自队列研究的数据时，这种分析由于风险因素的纵向结构和时变混杂因素的存在而变得复杂。利用社区动脉粥样硬化风险（ARIC）队列研究的数据，我们开发了一种因果中介方法，使用（半参数）贝叶斯加性回归树（BART）模型处理纵向和生存数据。我们的框架是使用静态纵向暴露机制开发的，并允许时变混杂因素和中介因素，这两者都可以是连续的或二进制的。我们还识别和估计在竞争事件存在的直接和间接因果效应。我们应用我们的方法来评估针对心血管疾病（CVD）危险因素的药物治疗如何影响心血管疾病死亡时间。

{"title":"A Bayesian semi-parametric approach to causal mediation for longitudinal mediators and time-to-event outcomes with application to a cardiovascular disease cohort study.","authors":"Saurabh Bhandari, Michael J Daniels, Maria Josefsson, Donald M Lloyd-Jones, Juned Siddique","doi":"10.1093/biostatistics/kxaf027","DOIUrl":"10.1093/biostatistics/kxaf027","url":null,"abstract":"Causal mediation analysis of observational data is an important tool for investigating the potential causal effects of medications on disease-related risk factors, and on time-to-death (or disease progression) through these risk factors. However, when analyzing data from a cohort study, such analyses are complicated by the longitudinal structure of the risk factors and the presence of time-varying confounders. Leveraging data from the Atherosclerosis Risk in Communities (ARIC) cohort study, we develop a causal mediation approach, using (semi-parametric) Bayesian Additive Regression Tree (BART) models for the longitudinal and survival data. Our framework is developed using static longitudinal exposure regimes and allows for time-varying confounders and mediators, both of which can be either continuous or binary. We also identify and estimate direct and indirect causal effects in the presence of a competing event. We apply our methods to assess how medication, prescribed to target cardiovascular disease (CVD) risk factors, affects the time-to-CVD death.","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12479244/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145193833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Bayesian subtyping for multi-state brain functional connectome with application on preadolescent brain cognition. 多状态脑功能连接体贝叶斯分型及其在青春期前脑认知中的应用。

IF 2 3区数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biostatistics

Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae045

Tianqi Chen, Hongyu Zhao, Chichun Tan, Todd Constable, Sarah Yip, Yize Zhao

Converging evidence indicates that the heterogeneity of cognitive profiles may arise through detectable alternations in brain functional connectivity. Despite an unprecedented opportunity to uncover neurobiological subtypes through clustering or subtyping analyses on multi-state functional connectivity, few existing approaches are applicable to accommodate the network topology and unique biological architecture. To address this issue, we propose an innovative Bayesian nonparametric network-variate clustering analysis to uncover subgroups of individuals with homogeneous brain functional network patterns under multiple cognitive states. In light of the existing neuroscience literature, we assume there are unknown state-specific modular structures within functional connectivity. Concurrently, we identify informative network features essential for defining subtypes. To further facilitate practical use, we develop a computationally efficient variational inference algorithm to approximate posterior inference with satisfactory estimation accuracy. Extensive simulations show the superiority of our method. We apply the method to the Adolescent Brain Cognitive Development (ABCD) study, and identify neurodevelopmental subtypes and brain sub-network phenotypes under each state to signal neurobiological heterogeneity, suggesting promising directions for further exploration and investigation in neuroscience.

越来越多的证据表明，认知特征的异质性可能是由于大脑功能连接的可检测变化而产生的。尽管通过对多状态功能连接的聚类或分型分析来揭示神经生物学亚型的机会前所未有，但现有的方法很少适用于适应网络拓扑结构和独特的生物结构。为了解决这一问题，我们提出了一种创新的贝叶斯非参数网络-变量聚类分析，以揭示在多种认知状态下具有同质脑功能网络模式的个体亚群。根据现有的神经科学文献，我们假设在功能连接中存在未知的特定状态模块结构。同时，我们确定了定义子类型所必需的信息网络特征。为了进一步方便实际应用，我们开发了一种计算效率高的变分推理算法，以令人满意的估计精度近似后验推理。大量的仿真表明了该方法的优越性。我们将该方法应用于青少年脑认知发展（ABCD）研究，并确定了每种状态下的神经发育亚型和脑亚网络表型，以表明神经生物学的异质性，为神经科学的进一步探索和研究提供了有希望的方向。

{"title":"Bayesian subtyping for multi-state brain functional connectome with application on preadolescent brain cognition.","authors":"Tianqi Chen, Hongyu Zhao, Chichun Tan, Todd Constable, Sarah Yip, Yize Zhao","doi":"10.1093/biostatistics/kxae045","DOIUrl":"10.1093/biostatistics/kxae045","url":null,"abstract":"Converging evidence indicates that the heterogeneity of cognitive profiles may arise through detectable alternations in brain functional connectivity. Despite an unprecedented opportunity to uncover neurobiological subtypes through clustering or subtyping analyses on multi-state functional connectivity, few existing approaches are applicable to accommodate the network topology and unique biological architecture. To address this issue, we propose an innovative Bayesian nonparametric network-variate clustering analysis to uncover subgroups of individuals with homogeneous brain functional network patterns under multiple cognitive states. In light of the existing neuroscience literature, we assume there are unknown state-specific modular structures within functional connectivity. Concurrently, we identify informative network features essential for defining subtypes. To further facilitate practical use, we develop a computationally efficient variational inference algorithm to approximate posterior inference with satisfactory estimation accuracy. Extensive simulations show the superiority of our method. We apply the method to the Adolescent Brain Cognitive Development (ABCD) study, and identify neurodevelopmental subtypes and brain sub-network phenotypes under each state to signal neurobiological heterogeneity, suggesting promising directions for further exploration and investigation in neuroscience.","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823269/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142830270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A modeling framework for detecting and leveraging node-level information in Bayesian network inference. 在贝叶斯网络推理中检测和利用节点级信息的建模框架。

IF 1.8 3区数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biostatistics

Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae021

Xiaoyue Xi, Hélène Ruffieux

Bayesian graphical models are powerful tools to infer complex relationships in high dimension, yet are often fraught with computational and statistical challenges. If exploited in a principled way, the increasing information collected alongside the data of primary interest constitutes an opportunity to mitigate these difficulties by guiding the detection of dependence structures. For instance, gene network inference may be informed by the use of publicly available summary statistics on the regulation of genes by genetic variants. Here we present a novel Gaussian graphical modeling framework to identify and leverage information on the centrality of nodes in conditional independence graphs. Specifically, we consider a fully joint hierarchical model to simultaneously infer (i) sparse precision matrices and (ii) the relevance of node-level information for uncovering the sought-after network structure. We encode such information as candidate auxiliary variables using a spike-and-slab submodel on the propensity of nodes to be hubs, which allows hypothesis-free selection and interpretation of a sparse subset of relevant variables. As efficient exploration of large posterior spaces is needed for real-world applications, we develop a variational expectation conditional maximization algorithm that scales inference to hundreds of samples, nodes and auxiliary variables. We illustrate and exploit the advantages of our approach in simulations and in a gene network study which identifies hub genes involved in biological pathways relevant to immune-mediated diseases.

贝叶斯图模型是推断高维度复杂关系的强大工具，但在计算和统计方面往往充满挑战。如果以有原则的方式加以利用，那么随着主要兴趣数据的收集而不断增加的信息，就有机会通过指导依赖结构的检测来减轻这些困难。例如，基因网络推断可以利用公开的基因变异调控汇总统计数据。在这里，我们提出了一种新颖的高斯图建模框架，用于识别和利用条件独立图中节点的中心性信息。具体来说，我们考虑了一个完全联合的分层模型，以同时推断 (i) 稀疏精度矩阵和 (ii) 节点级信息对揭示所需的网络结构的相关性。我们使用一个关于节点成为枢纽的倾向的尖峰-板块子模型，将这些信息编码为候选辅助变量，从而可以无假设地选择和解释相关变量的稀疏子集。由于现实世界的应用需要对大型后验空间进行有效探索，我们开发了一种变分期望条件最大化算法，可将推理扩展到数百个样本、节点和辅助变量。我们在模拟和基因网络研究中说明并利用了我们方法的优势，该研究确定了与免疫介导疾病相关的生物通路中的枢纽基因。

{"title":"A modeling framework for detecting and leveraging node-level information in Bayesian network inference.","authors":"Xiaoyue Xi, Hélène Ruffieux","doi":"10.1093/biostatistics/kxae021","DOIUrl":"10.1093/biostatistics/kxae021","url":null,"abstract":"Bayesian graphical models are powerful tools to infer complex relationships in high dimension, yet are often fraught with computational and statistical challenges. If exploited in a principled way, the increasing information collected alongside the data of primary interest constitutes an opportunity to mitigate these difficulties by guiding the detection of dependence structures. For instance, gene network inference may be informed by the use of publicly available summary statistics on the regulation of genes by genetic variants. Here we present a novel Gaussian graphical modeling framework to identify and leverage information on the centrality of nodes in conditional independence graphs. Specifically, we consider a fully joint hierarchical model to simultaneously infer (i) sparse precision matrices and (ii) the relevance of node-level information for uncovering the sought-after network structure. We encode such information as candidate auxiliary variables using a spike-and-slab submodel on the propensity of nodes to be hubs, which allows hypothesis-free selection and interpretation of a sparse subset of relevant variables. As efficient exploration of large posterior spaces is needed for real-world applications, we develop a variational expectation conditional maximization algorithm that scales inference to hundreds of samples, nodes and auxiliary variables. We illustrate and exploit the advantages of our approach in simulations and in a gene network study which identifies hub genes involved in biological pathways relevant to immune-mediated diseases.","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823055/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141452158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Connectivity Regression. 连接回归。

IF 1.8 3区数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biostatistics

Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf002

Neel Desai, Veera Baladandayuthapani, Russell T Shinohara, Jeffrey S Morris

Assessing how brain functional connectivity networks vary across individuals promises to uncover important scientific questions such as patterns of healthy brain aging through the lifespan or dysconnectivity associated with disease. In this article, we introduce a general regression framework, Connectivity Regression (ConnReg), for regressing subject-specific functional connectivity networks on covariates while accounting for within-network inter-edge dependence. ConnReg utilizes a multivariate generalization of Fisher's transformation to project network objects into an alternative space where Gaussian assumptions are justified and positive semidefinite constraints are automatically satisfied. Penalized multivariate regression is fit in the transformed space to simultaneously induce sparsity in regression coefficients and in covariance elements, which capture within network inter-edge dependence. We use permutation tests to perform multiplicity-adjusted inference to identify covariates associated with connectivity, and stability selection scores to identify network edges that vary with selected covariates. Simulation studies validate the inferential properties of our proposed method and demonstrate how estimating and accounting for within-network inter-edge dependence leads to more efficient estimation, more powerful inference, and more accurate selection of covariate-dependent network edges. We apply ConnReg to the Human Connectome Project Young Adult study, revealing insights into how connectivity varies with language processing covariates and structural brain features.

评估大脑功能连接网络在个体之间的差异有望揭示重要的科学问题，例如健康大脑在整个生命周期中的衰老模式或与疾病相关的连接障碍。在本文中，我们介绍了一个通用的回归框架，Connectivity regression (ConnReg)，用于在考虑网络内边缘依赖的情况下，在协变量上回归特定主题的功能连接网络。ConnReg利用Fisher变换的多元泛化将网络对象投射到一个替代空间中，在这个空间中高斯假设被证明是正确的，并且正的半确定约束被自动满足。在变换后的空间中拟合惩罚多元回归，同时诱导回归系数和协方差元素的稀疏性，从而捕获网络边缘间的依赖关系。我们使用置换测试来执行多重调整推理，以识别与连通性相关的协变量，并使用稳定性选择分数来识别随所选协变量变化的网络边缘。仿真研究验证了我们提出的方法的推理特性，并展示了如何估计和计算网络内边缘间的依赖，从而更有效地估计，更强大的推理，更准确地选择协变量相关的网络边缘。我们将ConnReg应用于人类连接组项目年轻人研究，揭示了连接如何随语言处理协变量和大脑结构特征而变化的见解。

{"title":"Connectivity Regression.","authors":"Neel Desai, Veera Baladandayuthapani, Russell T Shinohara, Jeffrey S Morris","doi":"10.1093/biostatistics/kxaf002","DOIUrl":"10.1093/biostatistics/kxaf002","url":null,"abstract":"Assessing how brain functional connectivity networks vary across individuals promises to uncover important scientific questions such as patterns of healthy brain aging through the lifespan or dysconnectivity associated with disease. In this article, we introduce a general regression framework, Connectivity Regression (ConnReg), for regressing subject-specific functional connectivity networks on covariates while accounting for within-network inter-edge dependence. ConnReg utilizes a multivariate generalization of Fisher's transformation to project network objects into an alternative space where Gaussian assumptions are justified and positive semidefinite constraints are automatically satisfied. Penalized multivariate regression is fit in the transformed space to simultaneously induce sparsity in regression coefficients and in covariance elements, which capture within network inter-edge dependence. We use permutation tests to perform multiplicity-adjusted inference to identify covariates associated with connectivity, and stability selection scores to identify network edges that vary with selected covariates. Simulation studies validate the inferential properties of our proposed method and demonstrate how estimating and accounting for within-network inter-edge dependence leads to more efficient estimation, more powerful inference, and more accurate selection of covariate-dependent network edges. We apply ConnReg to the Human Connectome Project Young Adult study, revealing insights into how connectivity varies with language processing covariates and structural brain features.","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12020475/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143996159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Identification and estimation of causal effects with confounders missing not at random. 非随机缺失混杂因素的因果效应识别和估计。

IF 1.8 3区数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biostatistics

Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf015

Jian Sun, Bo Fu

Making causal inferences from observational studies can be challenging when confounders are missing not at random. In such cases, identifying causal effects is often not guaranteed. Motivated by a real example, we consider a treatment-independent missingness assumption under which we establish the identification of causal effects when confounders are missing not at random. We propose a weighted estimating equation approach for estimating model parameters and introduce three estimators for the average causal effect, based on regression, propensity score weighting, and doubly robust estimation. We evaluate the performance of these estimators through simulations, and provide a real data analysis to illustrate our proposed method.

当混杂因素不是随机缺失时，从观察性研究中做出因果推断可能是具有挑战性的。在这种情况下，往往不能保证确定因果关系。在一个真实例子的激励下，我们考虑了一个与治疗无关的缺失假设，在这个假设下，我们建立了混杂因素非随机缺失时因果效应的识别。我们提出了一种加权估计方程方法来估计模型参数，并引入了三种基于回归、倾向得分加权和双重稳健估计的平均因果效应估计器。我们通过模拟来评估这些估计器的性能，并提供一个真实的数据分析来说明我们提出的方法。

引用次数: 0

Estimating causal effects for binary outcomes using per-decision inverse probability weighting. 使用每次决定的反概率加权法估算二元结果的因果效应。

IF 1.8 3区数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biostatistics

Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae025

Yihan Bao, Lauren Bell, Elizabeth Williamson, Claire Garnett, Tianchen Qian

Micro-randomized trials are commonly conducted for optimizing mobile health interventions such as push notifications for behavior change. In analyzing such trials, causal excursion effects are often of primary interest, and their estimation typically involves inverse probability weighting (IPW). However, in a micro-randomized trial, additional treatments can often occur during the time window over which an outcome is defined, and this can greatly inflate the variance of the causal effect estimator because IPW would involve a product of numerous weights. To reduce variance and improve estimation efficiency, we propose two new estimators using a modified version of IPW, which we call "per-decision IPW." The second estimator further improves efficiency using the projection idea from the semiparametric efficiency theory. These estimators are applicable when the outcome is binary and can be expressed as the maximum of a series of sub-outcomes defined over sub-intervals of time. We establish the estimators' consistency and asymptotic normality. Through simulation studies and real data applications, we demonstrate substantial efficiency improvement of the proposed estimator over existing estimators. The new estimators can be used to improve the precision of primary and secondary analyses for micro-randomized trials with binary outcomes.

微随机试验通常用于优化移动健康干预措施，如推送行为改变通知。在分析此类试验时，因果偏移效应通常是主要关注点，其估算通常涉及反概率加权（IPW）。然而，在微观随机试验中，在确定结果的时间窗口内经常会出现额外的治疗，这会大大增加因果效应估计值的方差，因为 IPW 会涉及众多权重的乘积。为了减少方差并提高估计效率，我们提出了两个使用改进版 IPW 的新估计器，我们称之为 "每次决定 IPW"。第二个估计器利用半参数效率理论中的投影思想进一步提高了效率。这些估计器适用于结果为二进制的情况，并可表示为一系列子结果的最大值，这些子结果定义在时间的子区间内。我们确定了估计值的一致性和渐近正态性。通过模拟研究和实际数据应用，我们证明了与现有的估计器相比，所提出的估计器在效率上有了很大的提高。新估计器可用于提高二元结果微型随机试验的一级和二级分析精度。

{"title":"Estimating causal effects for binary outcomes using per-decision inverse probability weighting.","authors":"Yihan Bao, Lauren Bell, Elizabeth Williamson, Claire Garnett, Tianchen Qian","doi":"10.1093/biostatistics/kxae025","DOIUrl":"10.1093/biostatistics/kxae025","url":null,"abstract":"Micro-randomized trials are commonly conducted for optimizing mobile health interventions such as push notifications for behavior change. In analyzing such trials, causal excursion effects are often of primary interest, and their estimation typically involves inverse probability weighting (IPW). However, in a micro-randomized trial, additional treatments can often occur during the time window over which an outcome is defined, and this can greatly inflate the variance of the causal effect estimator because IPW would involve a product of numerous weights. To reduce variance and improve estimation efficiency, we propose two new estimators using a modified version of IPW, which we call \"per-decision IPW.\" The second estimator further improves efficiency using the projection idea from the semiparametric efficiency theory. These estimators are applicable when the outcome is binary and can be expressed as the maximum of a series of sub-outcomes defined over sub-intervals of time. We establish the estimators' consistency and asymptotic normality. Through simulation studies and real data applications, we demonstrate substantial efficiency improvement of the proposed estimator over existing estimators. The new estimators can be used to improve the precision of primary and secondary analyses for micro-randomized trials with binary outcomes.","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141794123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Leveraging population information in brain connectivity via Bayesian ICA with a novel informative prior for correlation matrices. 利用贝叶斯独立成分分析在大脑连接中的人口信息，并为相关矩阵提供新的信息先验。

IF 2 3区数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biostatistics

Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf022

Amanda F Mejia, David Bolin, Daniel A Spencer, Ani Eloyan

Brain functional connectivity (FC), the temporal synchrony between brain networks, is essential to understand the functional organization of the brain and to identify changes due to neurological disorders, development, treatment, and other phenomena. Independent component analysis (ICA) is a matrix decomposition method used extensively for simultaneous estimation of functional brain topography and connectivity. However, estimation of FC via ICA is often sub-optimal due to the use of ad hoc estimation methods or temporal dimension reduction prior to ICA. Bayesian ICA can avoid dimension reduction, estimate latent variables and model parameters more accurately, and facilitate posterior inference. In this article, we develop a novel, computationally feasible Bayesian ICA method with population-derived priors on both the spatial ICs and their temporal correlation (that is, their FC). For the latter, we consider two priors: the inverse-Wishart, which is conjugate but is not ideally suited for modeling correlation matrices; and a novel informative prior for correlation matrices. For each prior, we derive a variational Bayes algorithm to estimate the model variables and facilitate posterior inference. Through extensive simulation studies, we evaluate the performance of the proposed methods and benchmark against existing approaches. We also analyze fMRI data from over 400 healthy adults in the Human Connectome Project. We find that our Bayesian ICA model and algorithms result in more accurate measures of functional connectivity and spatial brain features. Our novel prior for correlation matrices is more computationally intensive than the inverse-Wishart but provides improved accuracy and inference. The proposed framework is applicable to single-subject analysis, making it potentially clinically viable.

脑功能连通性（FC），即大脑网络之间的时间同步，对于理解大脑的功能组织以及识别由于神经系统疾病、发育、治疗和其他现象而引起的变化至关重要。独立分量分析（ICA）是一种矩阵分解方法，广泛应用于脑功能形貌和连通性的同时估计。然而，由于在ICA之前使用临时估计方法或时间降维，通过ICA对FC的估计通常不是最优的。贝叶斯ICA可以避免降维，更准确地估计潜在变量和模型参数，并便于后验推理。在本文中，我们开发了一种新的，计算上可行的贝叶斯ICA方法，该方法具有空间ic及其时间相关性（即FC）的种群衍生先验。对于后者，我们考虑两个先验：逆wishart，它是共轭的，但不适合建模相关矩阵；并提出了一种新的相关矩阵信息先验。对于每个先验，我们推导了一个变分贝叶斯算法来估计模型变量并促进后验推理。通过广泛的仿真研究，我们评估了所提出方法的性能，并对现有方法进行了基准测试。我们还在人类连接组项目中分析了400多名健康成年人的功能磁共振成像数据。我们发现我们的贝叶斯ICA模型和算法可以更准确地测量功能连接和空间大脑特征。我们对相关矩阵的新先验比逆wishart的计算量更大，但提供了更高的精度和推理。提出的框架适用于单受试者分析，使其具有潜在的临床可行性。

{"title":"Leveraging population information in brain connectivity via Bayesian ICA with a novel informative prior for correlation matrices.","authors":"Amanda F Mejia, David Bolin, Daniel A Spencer, Ani Eloyan","doi":"10.1093/biostatistics/kxaf022","DOIUrl":"https://doi.org/10.1093/biostatistics/kxaf022","url":null,"abstract":"Brain functional connectivity (FC), the temporal synchrony between brain networks, is essential to understand the functional organization of the brain and to identify changes due to neurological disorders, development, treatment, and other phenomena. Independent component analysis (ICA) is a matrix decomposition method used extensively for simultaneous estimation of functional brain topography and connectivity. However, estimation of FC via ICA is often sub-optimal due to the use of ad hoc estimation methods or temporal dimension reduction prior to ICA. Bayesian ICA can avoid dimension reduction, estimate latent variables and model parameters more accurately, and facilitate posterior inference. In this article, we develop a novel, computationally feasible Bayesian ICA method with population-derived priors on both the spatial ICs and their temporal correlation (that is, their FC). For the latter, we consider two priors: the inverse-Wishart, which is conjugate but is not ideally suited for modeling correlation matrices; and a novel informative prior for correlation matrices. For each prior, we derive a variational Bayes algorithm to estimate the model variables and facilitate posterior inference. Through extensive simulation studies, we evaluate the performance of the proposed methods and benchmark against existing approaches. We also analyze fMRI data from over 400 healthy adults in the Human Connectome Project. We find that our Bayesian ICA model and algorithms result in more accurate measures of functional connectivity and spatial brain features. Our novel prior for correlation matrices is more computationally intensive than the inverse-Wishart but provides improved accuracy and inference. The proposed framework is applicable to single-subject analysis, making it potentially clinically viable.","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12372588/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144979622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A semiparametric Gaussian mixture model for chest CT-based 3D blood vessel reconstruction. 基于胸部 CT 的三维血管重建半参数高斯混合物模型

IF 1.8 3区数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biostatistics

Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae013

Qianhan Zeng, Jing Zhou, Ying Ji, Hansheng Wang

Computed tomography (CT) has been a powerful diagnostic tool since its emergence in the 1970s. Using CT data, 3D structures of human internal organs and tissues, such as blood vessels, can be reconstructed using professional software. This 3D reconstruction is crucial for surgical operations and can serve as a vivid medical teaching example. However, traditional 3D reconstruction heavily relies on manual operations, which are time-consuming, subjective, and require substantial experience. To address this problem, we develop a novel semiparametric Gaussian mixture model tailored for the 3D reconstruction of blood vessels. This model extends the classical Gaussian mixture model by enabling nonparametric variations in the component-wise parameters of interest according to voxel positions. We develop a kernel-based expectation-maximization algorithm for estimating the model parameters, accompanied by a supporting asymptotic theory. Furthermore, we propose a novel regression method for optimal bandwidth selection. Compared to the conventional cross-validation-based (CV) method, the regression method outperforms the CV method in terms of computational and statistical efficiency. In application, this methodology facilitates the fully automated reconstruction of 3D blood vessel structures with remarkable accuracy.

计算机断层扫描（CT）自 20 世纪 70 年代问世以来，一直是一种强大的诊断工具。利用 CT 数据，可以通过专业软件重建血管等人体内部器官和组织的三维结构。这种三维重建对外科手术至关重要，并可作为生动的医学教学范例。然而，传统的三维重建严重依赖人工操作，耗时长、主观性强，而且需要丰富的经验。为解决这一问题，我们开发了一种专为血管三维重建量身定制的新型半参数高斯混合模型。该模型扩展了经典的高斯混合模型，可根据体素位置对相关分量参数进行非参数变化。我们开发了一种基于核的期望最大化算法来估计模型参数，并辅以渐近理论。此外，我们还提出了一种优化带宽选择的新型回归方法。与传统的基于交叉验证（CV）的方法相比，回归方法在计算和统计效率方面都优于 CV 方法。在应用中，该方法有助于全自动重建三维血管结构，且精确度极高。

{"title":"A semiparametric Gaussian mixture model for chest CT-based 3D blood vessel reconstruction.","authors":"Qianhan Zeng, Jing Zhou, Ying Ji, Hansheng Wang","doi":"10.1093/biostatistics/kxae013","DOIUrl":"10.1093/biostatistics/kxae013","url":null,"abstract":"Computed tomography (CT) has been a powerful diagnostic tool since its emergence in the 1970s. Using CT data, 3D structures of human internal organs and tissues, such as blood vessels, can be reconstructed using professional software. This 3D reconstruction is crucial for surgical operations and can serve as a vivid medical teaching example. However, traditional 3D reconstruction heavily relies on manual operations, which are time-consuming, subjective, and require substantial experience. To address this problem, we develop a novel semiparametric Gaussian mixture model tailored for the 3D reconstruction of blood vessels. This model extends the classical Gaussian mixture model by enabling nonparametric variations in the component-wise parameters of interest according to voxel positions. We develop a kernel-based expectation-maximization algorithm for estimating the model parameters, accompanied by a supporting asymptotic theory. Furthermore, we propose a novel regression method for optimal bandwidth selection. Compared to the conventional cross-validation-based (CV) method, the regression method outperforms the CV method in terms of computational and statistical efficiency. In application, this methodology facilitates the fully automated reconstruction of 3D blood vessel structures with remarkable accuracy.","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140869271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Control arm augmentation and hierarchical modeling in time-to-event trials: advantages and pitfalls. 时间-事件试验中的控制臂增强和分层建模：优势与缺陷。

IF 2 3区数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biostatistics

Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf021

Ethan M Alt, Xiuya Chang, Qing Liu, Xun Jiang, May Mo, H Amy Xia, Joseph G Ibrahim

In clinical trials, it is often valuable to borrow information from external data sources. Unfortunately, when the external data are fully or partially incompatible with the current trial data, type I error rates can be highly inflated under traditional blanket discounting schemes such as power priors, commensurate priors, and meta-analytic predictive priors. However, such inflation of the probability of a false positive can be necessary, as the alternative is to have an underpowered study. For clinical trials with time-to-event (TTE) outcomes, this problem is exacerbated since many observations are censored. In this paper, we develop the latent exchangeability prior for TTE data. We also present a novel framework to borrow information about the treatment effect between groups as well as incorporate information from external controls. Simulation results suggest that, although efficiency gains can be achieved by borrowing information among external controls, operating characteristics in general can be quite poor under a lack of exchangeability. We apply our approach to a real clinical trial in second-line metastatic colorectal cancer.

在临床试验中，从外部数据源中借用信息通常是有价值的。不幸的是，当外部数据与当前试验数据完全或部分不相容时，在传统的一揽子折扣方案（如功率先验、相称先验和元分析预测先验）下，I型错误率可能会被高度夸大。然而，这种假阳性概率的膨胀可能是必要的，因为另一种选择是有一个不足的研究。对于具有事件发生时间（TTE）结果的临床试验，由于许多观察结果被审查，这个问题更加严重。在本文中，我们发展了TTE数据的潜在交换性先验。我们还提出了一个新的框架来借鉴关于组间治疗效果的信息，以及从外部控制中吸收信息。仿真结果表明，尽管可以通过借用外部控制之间的信息来获得效率提高，但在缺乏可交换性的情况下，操作特性通常会相当差。我们将我们的方法应用于二线转移性结直肠癌的实际临床试验。

{"title":"Control arm augmentation and hierarchical modeling in time-to-event trials: advantages and pitfalls.","authors":"Ethan M Alt, Xiuya Chang, Qing Liu, Xun Jiang, May Mo, H Amy Xia, Joseph G Ibrahim","doi":"10.1093/biostatistics/kxaf021","DOIUrl":"10.1093/biostatistics/kxaf021","url":null,"abstract":"In clinical trials, it is often valuable to borrow information from external data sources. Unfortunately, when the external data are fully or partially incompatible with the current trial data, type I error rates can be highly inflated under traditional blanket discounting schemes such as power priors, commensurate priors, and meta-analytic predictive priors. However, such inflation of the probability of a false positive can be necessary, as the alternative is to have an underpowered study. For clinical trials with time-to-event (TTE) outcomes, this problem is exacerbated since many observations are censored. In this paper, we develop the latent exchangeability prior for TTE data. We also present a novel framework to borrow information about the treatment effect between groups as well as incorporate information from external controls. Simulation results suggest that, although efficiency gains can be achieved by borrowing information among external controls, operating characteristics in general can be quite poor under a lack of exchangeability. We apply our approach to a real clinical trial in second-line metastatic colorectal cancer.","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144838617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Bayesian thresholded modeling for integrating brain node and network predictors. 脑节点和网络预测器集成的贝叶斯阈值建模。

IF 2 3区数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biostatistics

Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae048

Zhe Sun, Wanwan Xu, Tianxi Li, Jian Kang, Gregorio Alanis-Lobato, Yize Zhao

Progress in neuroscience has provided unprecedented opportunities to advance our understanding of brain alterations and their correspondence to phenotypic profiles. With data collected from various imaging techniques, studies have integrated different types of information ranging from brain structure, function, or metabolism. More recently, an emerging way to categorize imaging traits is through a metric hierarchy, including localized node-level measurements and interactive network-level metrics. However, limited research has been conducted to integrate these different hierarchies and achieve a better understanding of the neurobiological mechanisms and communications. In this work, we address this literature gap by proposing a Bayesian regression model under both vector-variate and matrix-variate predictors. To characterize the interplay between different predicting components, we propose a set of biologically plausible prior models centered on an innovative joint thresholded prior. This captures the coupling and grouping effect of signal patterns, as well as their spatial contiguity across brain anatomy. By developing a posterior inference, we can identify and quantify the uncertainty of signaling node- and network-level neuromarkers, as well as their predictive mechanism for phenotypic outcomes. Through extensive simulations, we demonstrate that our proposed method outperforms the alternative approaches substantially in both out-of-sample prediction and feature selection. By implementing the model to study children's general mental abilities, we establish a powerful predictive mechanism based on the identified task contrast traits and resting-state sub-networks.

神经科学的进步提供了前所未有的机会来推进我们对大脑变化及其与表型特征的对应关系的理解。利用各种成像技术收集的数据，研究整合了从大脑结构、功能或新陈代谢等不同类型的信息。最近，一种新兴的成像特征分类方法是通过度量层次，包括局部节点级测量和交互式网络级度量。然而，有限的研究已经进行了整合这些不同的层次和实现更好的理解神经生物学机制和通信。在这项工作中，我们通过在向量变量和矩阵变量预测因子下提出贝叶斯回归模型来解决这一文献空白。为了描述不同预测成分之间的相互作用，我们提出了一套以创新的联合阈值先验为中心的生物学上合理的先验模型。这捕获了信号模式的耦合和分组效应，以及它们在大脑解剖结构中的空间连续性。通过发展后验推理，我们可以识别和量化信号传导节点和网络水平的神经标志物的不确定性，以及它们对表型结果的预测机制。通过大量的模拟，我们证明了我们提出的方法在样本外预测和特征选择方面都大大优于其他方法。将该模型应用于儿童一般心理能力的研究，建立了一种基于任务对比特征和静息状态子网络的预测机制。

{"title":"Bayesian thresholded modeling for integrating brain node and network predictors.","authors":"Zhe Sun, Wanwan Xu, Tianxi Li, Jian Kang, Gregorio Alanis-Lobato, Yize Zhao","doi":"10.1093/biostatistics/kxae048","DOIUrl":"10.1093/biostatistics/kxae048","url":null,"abstract":"Progress in neuroscience has provided unprecedented opportunities to advance our understanding of brain alterations and their correspondence to phenotypic profiles. With data collected from various imaging techniques, studies have integrated different types of information ranging from brain structure, function, or metabolism. More recently, an emerging way to categorize imaging traits is through a metric hierarchy, including localized node-level measurements and interactive network-level metrics. However, limited research has been conducted to integrate these different hierarchies and achieve a better understanding of the neurobiological mechanisms and communications. In this work, we address this literature gap by proposing a Bayesian regression model under both vector-variate and matrix-variate predictors. To characterize the interplay between different predicting components, we propose a set of biologically plausible prior models centered on an innovative joint thresholded prior. This captures the coupling and grouping effect of signal patterns, as well as their spatial contiguity across brain anatomy. By developing a posterior inference, we can identify and quantify the uncertainty of signaling node- and network-level neuromarkers, as well as their predictive mechanism for phenotypic outcomes. Through extensive simulations, we demonstrate that our proposed method outperforms the alternative approaches substantially in both out-of-sample prediction and feature selection. By implementing the model to study children's general mental abilities, we establish a powerful predictive mechanism based on the identified task contrast traits and resting-state sub-networks.","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823287/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142959273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0