首页 > 最新文献

Biostatistics最新文献

英文 中文
Bayesian subtyping for multi-state brain functional connectome with application on preadolescent brain cognition. 多状态脑功能连接体贝叶斯分型及其在青春期前脑认知中的应用。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae045
Tianqi Chen, Hongyu Zhao, Chichun Tan, Todd Constable, Sarah Yip, Yize Zhao

Converging evidence indicates that the heterogeneity of cognitive profiles may arise through detectable alternations in brain functional connectivity. Despite an unprecedented opportunity to uncover neurobiological subtypes through clustering or subtyping analyses on multi-state functional connectivity, few existing approaches are applicable to accommodate the network topology and unique biological architecture. To address this issue, we propose an innovative Bayesian nonparametric network-variate clustering analysis to uncover subgroups of individuals with homogeneous brain functional network patterns under multiple cognitive states. In light of the existing neuroscience literature, we assume there are unknown state-specific modular structures within functional connectivity. Concurrently, we identify informative network features essential for defining subtypes. To further facilitate practical use, we develop a computationally efficient variational inference algorithm to approximate posterior inference with satisfactory estimation accuracy. Extensive simulations show the superiority of our method. We apply the method to the Adolescent Brain Cognitive Development (ABCD) study, and identify neurodevelopmental subtypes and brain sub-network phenotypes under each state to signal neurobiological heterogeneity, suggesting promising directions for further exploration and investigation in neuroscience.

越来越多的证据表明,认知特征的异质性可能是由于大脑功能连接的可检测变化而产生的。尽管通过对多状态功能连接的聚类或分型分析来揭示神经生物学亚型的机会前所未有,但现有的方法很少适用于适应网络拓扑结构和独特的生物结构。为了解决这一问题,我们提出了一种创新的贝叶斯非参数网络-变量聚类分析,以揭示在多种认知状态下具有同质脑功能网络模式的个体亚群。根据现有的神经科学文献,我们假设在功能连接中存在未知的特定状态模块结构。同时,我们确定了定义子类型所必需的信息网络特征。为了进一步方便实际应用,我们开发了一种计算效率高的变分推理算法,以令人满意的估计精度近似后验推理。大量的仿真表明了该方法的优越性。我们将该方法应用于青少年脑认知发展(ABCD)研究,并确定了每种状态下的神经发育亚型和脑亚网络表型,以表明神经生物学的异质性,为神经科学的进一步探索和研究提供了有希望的方向。
{"title":"Bayesian subtyping for multi-state brain functional connectome with application on preadolescent brain cognition.","authors":"Tianqi Chen, Hongyu Zhao, Chichun Tan, Todd Constable, Sarah Yip, Yize Zhao","doi":"10.1093/biostatistics/kxae045","DOIUrl":"10.1093/biostatistics/kxae045","url":null,"abstract":"<p><p>Converging evidence indicates that the heterogeneity of cognitive profiles may arise through detectable alternations in brain functional connectivity. Despite an unprecedented opportunity to uncover neurobiological subtypes through clustering or subtyping analyses on multi-state functional connectivity, few existing approaches are applicable to accommodate the network topology and unique biological architecture. To address this issue, we propose an innovative Bayesian nonparametric network-variate clustering analysis to uncover subgroups of individuals with homogeneous brain functional network patterns under multiple cognitive states. In light of the existing neuroscience literature, we assume there are unknown state-specific modular structures within functional connectivity. Concurrently, we identify informative network features essential for defining subtypes. To further facilitate practical use, we develop a computationally efficient variational inference algorithm to approximate posterior inference with satisfactory estimation accuracy. Extensive simulations show the superiority of our method. We apply the method to the Adolescent Brain Cognitive Development (ABCD) study, and identify neurodevelopmental subtypes and brain sub-network phenotypes under each state to signal neurobiological heterogeneity, suggesting promising directions for further exploration and investigation in neuroscience.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823269/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142830270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating causal effects for binary outcomes using per-decision inverse probability weighting. 使用每次决定的反概率加权法估算二元结果的因果效应。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae025
Yihan Bao, Lauren Bell, Elizabeth Williamson, Claire Garnett, Tianchen Qian

Micro-randomized trials are commonly conducted for optimizing mobile health interventions such as push notifications for behavior change. In analyzing such trials, causal excursion effects are often of primary interest, and their estimation typically involves inverse probability weighting (IPW). However, in a micro-randomized trial, additional treatments can often occur during the time window over which an outcome is defined, and this can greatly inflate the variance of the causal effect estimator because IPW would involve a product of numerous weights. To reduce variance and improve estimation efficiency, we propose two new estimators using a modified version of IPW, which we call "per-decision IPW." The second estimator further improves efficiency using the projection idea from the semiparametric efficiency theory. These estimators are applicable when the outcome is binary and can be expressed as the maximum of a series of sub-outcomes defined over sub-intervals of time. We establish the estimators' consistency and asymptotic normality. Through simulation studies and real data applications, we demonstrate substantial efficiency improvement of the proposed estimator over existing estimators. The new estimators can be used to improve the precision of primary and secondary analyses for micro-randomized trials with binary outcomes.

微随机试验通常用于优化移动健康干预措施,如推送行为改变通知。在分析此类试验时,因果偏移效应通常是主要关注点,其估算通常涉及反概率加权(IPW)。然而,在微观随机试验中,在确定结果的时间窗口内经常会出现额外的治疗,这会大大增加因果效应估计值的方差,因为 IPW 会涉及众多权重的乘积。为了减少方差并提高估计效率,我们提出了两个使用改进版 IPW 的新估计器,我们称之为 "每次决定 IPW"。第二个估计器利用半参数效率理论中的投影思想进一步提高了效率。这些估计器适用于结果为二进制的情况,并可表示为一系列子结果的最大值,这些子结果定义在时间的子区间内。我们确定了估计值的一致性和渐近正态性。通过模拟研究和实际数据应用,我们证明了与现有的估计器相比,所提出的估计器在效率上有了很大的提高。新估计器可用于提高二元结果微型随机试验的一级和二级分析精度。
{"title":"Estimating causal effects for binary outcomes using per-decision inverse probability weighting.","authors":"Yihan Bao, Lauren Bell, Elizabeth Williamson, Claire Garnett, Tianchen Qian","doi":"10.1093/biostatistics/kxae025","DOIUrl":"10.1093/biostatistics/kxae025","url":null,"abstract":"<p><p>Micro-randomized trials are commonly conducted for optimizing mobile health interventions such as push notifications for behavior change. In analyzing such trials, causal excursion effects are often of primary interest, and their estimation typically involves inverse probability weighting (IPW). However, in a micro-randomized trial, additional treatments can often occur during the time window over which an outcome is defined, and this can greatly inflate the variance of the causal effect estimator because IPW would involve a product of numerous weights. To reduce variance and improve estimation efficiency, we propose two new estimators using a modified version of IPW, which we call \"per-decision IPW.\" The second estimator further improves efficiency using the projection idea from the semiparametric efficiency theory. These estimators are applicable when the outcome is binary and can be expressed as the maximum of a series of sub-outcomes defined over sub-intervals of time. We establish the estimators' consistency and asymptotic normality. Through simulation studies and real data applications, we demonstrate substantial efficiency improvement of the proposed estimator over existing estimators. The new estimators can be used to improve the precision of primary and secondary analyses for micro-randomized trials with binary outcomes.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141794123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging population information in brain connectivity via Bayesian ICA with a novel informative prior for correlation matrices. 利用贝叶斯独立成分分析在大脑连接中的人口信息,并为相关矩阵提供新的信息先验。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf022
Amanda F Mejia, David Bolin, Daniel A Spencer, Ani Eloyan

Brain functional connectivity (FC), the temporal synchrony between brain networks, is essential to understand the functional organization of the brain and to identify changes due to neurological disorders, development, treatment, and other phenomena. Independent component analysis (ICA) is a matrix decomposition method used extensively for simultaneous estimation of functional brain topography and connectivity. However, estimation of FC via ICA is often sub-optimal due to the use of ad hoc estimation methods or temporal dimension reduction prior to ICA. Bayesian ICA can avoid dimension reduction, estimate latent variables and model parameters more accurately, and facilitate posterior inference. In this article, we develop a novel, computationally feasible Bayesian ICA method with population-derived priors on both the spatial ICs and their temporal correlation (that is, their FC). For the latter, we consider two priors: the inverse-Wishart, which is conjugate but is not ideally suited for modeling correlation matrices; and a novel informative prior for correlation matrices. For each prior, we derive a variational Bayes algorithm to estimate the model variables and facilitate posterior inference. Through extensive simulation studies, we evaluate the performance of the proposed methods and benchmark against existing approaches. We also analyze fMRI data from over 400 healthy adults in the Human Connectome Project. We find that our Bayesian ICA model and algorithms result in more accurate measures of functional connectivity and spatial brain features. Our novel prior for correlation matrices is more computationally intensive than the inverse-Wishart but provides improved accuracy and inference. The proposed framework is applicable to single-subject analysis, making it potentially clinically viable.

脑功能连通性(FC),即大脑网络之间的时间同步,对于理解大脑的功能组织以及识别由于神经系统疾病、发育、治疗和其他现象而引起的变化至关重要。独立分量分析(ICA)是一种矩阵分解方法,广泛应用于脑功能形貌和连通性的同时估计。然而,由于在ICA之前使用临时估计方法或时间降维,通过ICA对FC的估计通常不是最优的。贝叶斯ICA可以避免降维,更准确地估计潜在变量和模型参数,并便于后验推理。在本文中,我们开发了一种新的,计算上可行的贝叶斯ICA方法,该方法具有空间ic及其时间相关性(即FC)的种群衍生先验。对于后者,我们考虑两个先验:逆wishart,它是共轭的,但不适合建模相关矩阵;并提出了一种新的相关矩阵信息先验。对于每个先验,我们推导了一个变分贝叶斯算法来估计模型变量并促进后验推理。通过广泛的仿真研究,我们评估了所提出方法的性能,并对现有方法进行了基准测试。我们还在人类连接组项目中分析了400多名健康成年人的功能磁共振成像数据。我们发现我们的贝叶斯ICA模型和算法可以更准确地测量功能连接和空间大脑特征。我们对相关矩阵的新先验比逆wishart的计算量更大,但提供了更高的精度和推理。提出的框架适用于单受试者分析,使其具有潜在的临床可行性。
{"title":"Leveraging population information in brain connectivity via Bayesian ICA with a novel informative prior for correlation matrices.","authors":"Amanda F Mejia, David Bolin, Daniel A Spencer, Ani Eloyan","doi":"10.1093/biostatistics/kxaf022","DOIUrl":"https://doi.org/10.1093/biostatistics/kxaf022","url":null,"abstract":"<p><p>Brain functional connectivity (FC), the temporal synchrony between brain networks, is essential to understand the functional organization of the brain and to identify changes due to neurological disorders, development, treatment, and other phenomena. Independent component analysis (ICA) is a matrix decomposition method used extensively for simultaneous estimation of functional brain topography and connectivity. However, estimation of FC via ICA is often sub-optimal due to the use of ad hoc estimation methods or temporal dimension reduction prior to ICA. Bayesian ICA can avoid dimension reduction, estimate latent variables and model parameters more accurately, and facilitate posterior inference. In this article, we develop a novel, computationally feasible Bayesian ICA method with population-derived priors on both the spatial ICs and their temporal correlation (that is, their FC). For the latter, we consider two priors: the inverse-Wishart, which is conjugate but is not ideally suited for modeling correlation matrices; and a novel informative prior for correlation matrices. For each prior, we derive a variational Bayes algorithm to estimate the model variables and facilitate posterior inference. Through extensive simulation studies, we evaluate the performance of the proposed methods and benchmark against existing approaches. We also analyze fMRI data from over 400 healthy adults in the Human Connectome Project. We find that our Bayesian ICA model and algorithms result in more accurate measures of functional connectivity and spatial brain features. Our novel prior for correlation matrices is more computationally intensive than the inverse-Wishart but provides improved accuracy and inference. The proposed framework is applicable to single-subject analysis, making it potentially clinically viable.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12372588/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144979622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A semiparametric Gaussian mixture model for chest CT-based 3D blood vessel reconstruction. 基于胸部 CT 的三维血管重建半参数高斯混合物模型
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae013
Qianhan Zeng, Jing Zhou, Ying Ji, Hansheng Wang

Computed tomography (CT) has been a powerful diagnostic tool since its emergence in the 1970s. Using CT data, 3D structures of human internal organs and tissues, such as blood vessels, can be reconstructed using professional software. This 3D reconstruction is crucial for surgical operations and can serve as a vivid medical teaching example. However, traditional 3D reconstruction heavily relies on manual operations, which are time-consuming, subjective, and require substantial experience. To address this problem, we develop a novel semiparametric Gaussian mixture model tailored for the 3D reconstruction of blood vessels. This model extends the classical Gaussian mixture model by enabling nonparametric variations in the component-wise parameters of interest according to voxel positions. We develop a kernel-based expectation-maximization algorithm for estimating the model parameters, accompanied by a supporting asymptotic theory. Furthermore, we propose a novel regression method for optimal bandwidth selection. Compared to the conventional cross-validation-based (CV) method, the regression method outperforms the CV method in terms of computational and statistical efficiency. In application, this methodology facilitates the fully automated reconstruction of 3D blood vessel structures with remarkable accuracy.

计算机断层扫描(CT)自 20 世纪 70 年代问世以来,一直是一种强大的诊断工具。利用 CT 数据,可以通过专业软件重建血管等人体内部器官和组织的三维结构。这种三维重建对外科手术至关重要,并可作为生动的医学教学范例。然而,传统的三维重建严重依赖人工操作,耗时长、主观性强,而且需要丰富的经验。为解决这一问题,我们开发了一种专为血管三维重建量身定制的新型半参数高斯混合模型。该模型扩展了经典的高斯混合模型,可根据体素位置对相关分量参数进行非参数变化。我们开发了一种基于核的期望最大化算法来估计模型参数,并辅以渐近理论。此外,我们还提出了一种优化带宽选择的新型回归方法。与传统的基于交叉验证(CV)的方法相比,回归方法在计算和统计效率方面都优于 CV 方法。在应用中,该方法有助于全自动重建三维血管结构,且精确度极高。
{"title":"A semiparametric Gaussian mixture model for chest CT-based 3D blood vessel reconstruction.","authors":"Qianhan Zeng, Jing Zhou, Ying Ji, Hansheng Wang","doi":"10.1093/biostatistics/kxae013","DOIUrl":"10.1093/biostatistics/kxae013","url":null,"abstract":"<p><p>Computed tomography (CT) has been a powerful diagnostic tool since its emergence in the 1970s. Using CT data, 3D structures of human internal organs and tissues, such as blood vessels, can be reconstructed using professional software. This 3D reconstruction is crucial for surgical operations and can serve as a vivid medical teaching example. However, traditional 3D reconstruction heavily relies on manual operations, which are time-consuming, subjective, and require substantial experience. To address this problem, we develop a novel semiparametric Gaussian mixture model tailored for the 3D reconstruction of blood vessels. This model extends the classical Gaussian mixture model by enabling nonparametric variations in the component-wise parameters of interest according to voxel positions. We develop a kernel-based expectation-maximization algorithm for estimating the model parameters, accompanied by a supporting asymptotic theory. Furthermore, we propose a novel regression method for optimal bandwidth selection. Compared to the conventional cross-validation-based (CV) method, the regression method outperforms the CV method in terms of computational and statistical efficiency. In application, this methodology facilitates the fully automated reconstruction of 3D blood vessel structures with remarkable accuracy.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140869271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian thresholded modeling for integrating brain node and network predictors. 脑节点和网络预测器集成的贝叶斯阈值建模。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae048
Zhe Sun, Wanwan Xu, Tianxi Li, Jian Kang, Gregorio Alanis-Lobato, Yize Zhao

Progress in neuroscience has provided unprecedented opportunities to advance our understanding of brain alterations and their correspondence to phenotypic profiles. With data collected from various imaging techniques, studies have integrated different types of information ranging from brain structure, function, or metabolism. More recently, an emerging way to categorize imaging traits is through a metric hierarchy, including localized node-level measurements and interactive network-level metrics. However, limited research has been conducted to integrate these different hierarchies and achieve a better understanding of the neurobiological mechanisms and communications. In this work, we address this literature gap by proposing a Bayesian regression model under both vector-variate and matrix-variate predictors. To characterize the interplay between different predicting components, we propose a set of biologically plausible prior models centered on an innovative joint thresholded prior. This captures the coupling and grouping effect of signal patterns, as well as their spatial contiguity across brain anatomy. By developing a posterior inference, we can identify and quantify the uncertainty of signaling node- and network-level neuromarkers, as well as their predictive mechanism for phenotypic outcomes. Through extensive simulations, we demonstrate that our proposed method outperforms the alternative approaches substantially in both out-of-sample prediction and feature selection. By implementing the model to study children's general mental abilities, we establish a powerful predictive mechanism based on the identified task contrast traits and resting-state sub-networks.

神经科学的进步提供了前所未有的机会来推进我们对大脑变化及其与表型特征的对应关系的理解。利用各种成像技术收集的数据,研究整合了从大脑结构、功能或新陈代谢等不同类型的信息。最近,一种新兴的成像特征分类方法是通过度量层次,包括局部节点级测量和交互式网络级度量。然而,有限的研究已经进行了整合这些不同的层次和实现更好的理解神经生物学机制和通信。在这项工作中,我们通过在向量变量和矩阵变量预测因子下提出贝叶斯回归模型来解决这一文献空白。为了描述不同预测成分之间的相互作用,我们提出了一套以创新的联合阈值先验为中心的生物学上合理的先验模型。这捕获了信号模式的耦合和分组效应,以及它们在大脑解剖结构中的空间连续性。通过发展后验推理,我们可以识别和量化信号传导节点和网络水平的神经标志物的不确定性,以及它们对表型结果的预测机制。通过大量的模拟,我们证明了我们提出的方法在样本外预测和特征选择方面都大大优于其他方法。将该模型应用于儿童一般心理能力的研究,建立了一种基于任务对比特征和静息状态子网络的预测机制。
{"title":"Bayesian thresholded modeling for integrating brain node and network predictors.","authors":"Zhe Sun, Wanwan Xu, Tianxi Li, Jian Kang, Gregorio Alanis-Lobato, Yize Zhao","doi":"10.1093/biostatistics/kxae048","DOIUrl":"10.1093/biostatistics/kxae048","url":null,"abstract":"<p><p>Progress in neuroscience has provided unprecedented opportunities to advance our understanding of brain alterations and their correspondence to phenotypic profiles. With data collected from various imaging techniques, studies have integrated different types of information ranging from brain structure, function, or metabolism. More recently, an emerging way to categorize imaging traits is through a metric hierarchy, including localized node-level measurements and interactive network-level metrics. However, limited research has been conducted to integrate these different hierarchies and achieve a better understanding of the neurobiological mechanisms and communications. In this work, we address this literature gap by proposing a Bayesian regression model under both vector-variate and matrix-variate predictors. To characterize the interplay between different predicting components, we propose a set of biologically plausible prior models centered on an innovative joint thresholded prior. This captures the coupling and grouping effect of signal patterns, as well as their spatial contiguity across brain anatomy. By developing a posterior inference, we can identify and quantify the uncertainty of signaling node- and network-level neuromarkers, as well as their predictive mechanism for phenotypic outcomes. Through extensive simulations, we demonstrate that our proposed method outperforms the alternative approaches substantially in both out-of-sample prediction and feature selection. By implementing the model to study children's general mental abilities, we establish a powerful predictive mechanism based on the identified task contrast traits and resting-state sub-networks.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823287/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142959273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Control arm augmentation and hierarchical modeling in time-to-event trials: advantages and pitfalls. 时间-事件试验中的控制臂增强和分层建模:优势与缺陷。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf021
Ethan M Alt, Xiuya Chang, Qing Liu, Xun Jiang, May Mo, H Amy Xia, Joseph G Ibrahim

In clinical trials, it is often valuable to borrow information from external data sources. Unfortunately, when the external data are fully or partially incompatible with the current trial data, type I error rates can be highly inflated under traditional blanket discounting schemes such as power priors, commensurate priors, and meta-analytic predictive priors. However, such inflation of the probability of a false positive can be necessary, as the alternative is to have an underpowered study. For clinical trials with time-to-event (TTE) outcomes, this problem is exacerbated since many observations are censored. In this paper, we develop the latent exchangeability prior for TTE data. We also present a novel framework to borrow information about the treatment effect between groups as well as incorporate information from external controls. Simulation results suggest that, although efficiency gains can be achieved by borrowing information among external controls, operating characteristics in general can be quite poor under a lack of exchangeability. We apply our approach to a real clinical trial in second-line metastatic colorectal cancer.

在临床试验中,从外部数据源中借用信息通常是有价值的。不幸的是,当外部数据与当前试验数据完全或部分不相容时,在传统的一揽子折扣方案(如功率先验、相称先验和元分析预测先验)下,I型错误率可能会被高度夸大。然而,这种假阳性概率的膨胀可能是必要的,因为另一种选择是有一个不足的研究。对于具有事件发生时间(TTE)结果的临床试验,由于许多观察结果被审查,这个问题更加严重。在本文中,我们发展了TTE数据的潜在交换性先验。我们还提出了一个新的框架来借鉴关于组间治疗效果的信息,以及从外部控制中吸收信息。仿真结果表明,尽管可以通过借用外部控制之间的信息来获得效率提高,但在缺乏可交换性的情况下,操作特性通常会相当差。我们将我们的方法应用于二线转移性结直肠癌的实际临床试验。
{"title":"Control arm augmentation and hierarchical modeling in time-to-event trials: advantages and pitfalls.","authors":"Ethan M Alt, Xiuya Chang, Qing Liu, Xun Jiang, May Mo, H Amy Xia, Joseph G Ibrahim","doi":"10.1093/biostatistics/kxaf021","DOIUrl":"10.1093/biostatistics/kxaf021","url":null,"abstract":"<p><p>In clinical trials, it is often valuable to borrow information from external data sources. Unfortunately, when the external data are fully or partially incompatible with the current trial data, type I error rates can be highly inflated under traditional blanket discounting schemes such as power priors, commensurate priors, and meta-analytic predictive priors. However, such inflation of the probability of a false positive can be necessary, as the alternative is to have an underpowered study. For clinical trials with time-to-event (TTE) outcomes, this problem is exacerbated since many observations are censored. In this paper, we develop the latent exchangeability prior for TTE data. We also present a novel framework to borrow information about the treatment effect between groups as well as incorporate information from external controls. Simulation results suggest that, although efficiency gains can be achieved by borrowing information among external controls, operating characteristics in general can be quite poor under a lack of exchangeability. We apply our approach to a real clinical trial in second-line metastatic colorectal cancer.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144838617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stratification-based instrumental variable analysis framework for nonlinear effect analysis. 基于分层的非线性效应分析工具变量分析框架。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf043
Haodong Tian, Ashish Patel, Stephen Burgess

Nonlinear causal effects are prevalent in many research scenarios involving continuous exposures, and instrumental variables (IVs) can be employed to investigate such effects, particularly in the presence of unmeasured confounders. However, common IV methods for nonlinear effect analysis, such as IV regression or the control-function method, have inherent limitations, leading to either low statistical power or potentially misleading conclusions. In this work, we propose an alternative IV framework for nonlinear effect analysis, which has recently emerged in genetic epidemiology and addresses many of the drawbacks of existing IV methods. The proposed IV framework consists of up to three key "S" elements: (i) the Stratification approach, which constructs multiple strata that are sub-samples of the population in which the IV core assumptions remain valid, (ii) the Scalar-on-function model and Scalar-on-scalar model, which connect local stratum-specific information to global effect estimation, and (iii) the Sum-of-single-effects method for effect estimation. This framework enables study of the effect function while avoiding unnecessary model assumptions. In particular, it facilitates the identification of change points or threshold values in causal effects. Through a wide variety of simulations, we demonstrate that our framework outperforms other representative nonlinear IV methods in predicting the effect shape when the instrument is weak and can accurately estimate the effect function as well as identify the change point and predict its value under various structural model and effect shape scenarios. We further apply our framework to assess the nonlinear effect of alcohol consumption on systolic blood pressure using a genetic instrument (ie Mendelian randomization) with UK Biobank data. Our analysis detects a threshold beyond which alcohol intake exhibits a clear causal effect on the outcome. Our results are consistent with published medical guidelines.

非线性因果效应在许多涉及连续暴露的研究场景中很普遍,可以使用工具变量(IVs)来研究这种效应,特别是在存在未测量混杂因素的情况下。然而,用于非线性效应分析的常用IV方法,如IV回归或控制函数法,具有固有的局限性,导致统计能力低或可能导致误导性结论。在这项工作中,我们提出了一种非线性效应分析的替代IV框架,该框架最近出现在遗传流行病学中,并解决了现有IV方法的许多缺点。拟议的IV框架由多达三个关键的“S”元素组成:(i)分层方法,该方法构建了多个分层,这些分层是IV核心假设仍然有效的人口的子样本;(ii)函数上标量模型和标量上标量模型,将局部特定层的信息与全局效应估计联系起来;(iii)单一效应求和方法用于效果估计。这个框架可以在研究效果函数的同时避免不必要的模型假设。特别是,它有助于识别因果效应中的变化点或阈值。通过各种各样的模拟,我们证明了我们的框架在预测仪器弱时的效应形状方面优于其他代表性的非线性IV方法,并且可以准确地估计效应函数,并在各种结构模型和效应形状场景下识别变化点并预测其值。我们进一步应用我们的框架,利用遗传工具(即孟德尔随机化)和英国生物银行的数据,评估酒精消耗对收缩压的非线性影响。我们的分析发现了一个阈值,超过这个阈值,酒精摄入对结果表现出明显的因果影响。我们的结果与出版的医学指南一致。
{"title":"Stratification-based instrumental variable analysis framework for nonlinear effect analysis.","authors":"Haodong Tian, Ashish Patel, Stephen Burgess","doi":"10.1093/biostatistics/kxaf043","DOIUrl":"10.1093/biostatistics/kxaf043","url":null,"abstract":"<p><p>Nonlinear causal effects are prevalent in many research scenarios involving continuous exposures, and instrumental variables (IVs) can be employed to investigate such effects, particularly in the presence of unmeasured confounders. However, common IV methods for nonlinear effect analysis, such as IV regression or the control-function method, have inherent limitations, leading to either low statistical power or potentially misleading conclusions. In this work, we propose an alternative IV framework for nonlinear effect analysis, which has recently emerged in genetic epidemiology and addresses many of the drawbacks of existing IV methods. The proposed IV framework consists of up to three key \"S\" elements: (i) the Stratification approach, which constructs multiple strata that are sub-samples of the population in which the IV core assumptions remain valid, (ii) the Scalar-on-function model and Scalar-on-scalar model, which connect local stratum-specific information to global effect estimation, and (iii) the Sum-of-single-effects method for effect estimation. This framework enables study of the effect function while avoiding unnecessary model assumptions. In particular, it facilitates the identification of change points or threshold values in causal effects. Through a wide variety of simulations, we demonstrate that our framework outperforms other representative nonlinear IV methods in predicting the effect shape when the instrument is weak and can accurately estimate the effect function as well as identify the change point and predict its value under various structural model and effect shape scenarios. We further apply our framework to assess the nonlinear effect of alcohol consumption on systolic blood pressure using a genetic instrument (ie Mendelian randomization) with UK Biobank data. Our analysis detects a threshold beyond which alcohol intake exhibits a clear causal effect on the outcome. Our results are consistent with published medical guidelines.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12665183/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145643011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-dimensional inference for functional regression with an application to the Alzheimer's disease magnetoencephalography study. 功能回归的高维推断与阿尔茨海默病脑磁图研究的应用。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf050
Huaqing Jin, Fei Jiang

Alzheimer's disease (AD) is a progressive, chronic neurodegenerative disorder affecting millions worldwide. A new clinical magnetoencephalography (MEG) study was conducted to identify neural activity biomarkers and key brain regions in AD. Traditional methods for analyzing MEG data, which typically extract features from power spectral density, suffer from information loss. Furthermore, functional regression with variable selection tends to produce non-robust results, making it less ideal for drawing reliable scientific conclusions. To address these challenges, we propose a high-dimensional hypothesis testing (HDHT) framework for functional covariates and introduce a rigorous inference process to support scientific conclusions. We establish the theoretical properties of the HDHT framework and validate its performance through simulation studies. Applying the HDHT framework to the AD MEG data, we identify 19 important regions associated with cognitive functions that align with established AD pathophysiology. These findings suggest that the non-invasive MEG can be a potential low-risk and low-toxicity modality for monitoring neurodegenerative progression.

阿尔茨海默病(AD)是一种进行性慢性神经退行性疾病,影响全球数百万人。一项新的临床脑磁图(MEG)研究用于识别AD患者的神经活动生物标志物和关键脑区。传统的脑磁图数据分析方法通常是从功率谱密度中提取特征,存在信息丢失的问题。此外,具有变量选择的函数回归往往产生非鲁棒性结果,使其不太适合得出可靠的科学结论。为了解决这些挑战,我们提出了一个功能协变量的高维假设检验(HDHT)框架,并引入了一个严格的推理过程来支持科学结论。建立了HDHT框架的理论特性,并通过仿真研究验证了其性能。将HDHT框架应用于AD MEG数据,我们确定了19个与认知功能相关的重要区域,这些区域与已建立的AD病理生理学相一致。这些发现表明,无创脑磁图可能是一种潜在的低风险和低毒性监测神经退行性进展的方式。
{"title":"High-dimensional inference for functional regression with an application to the Alzheimer's disease magnetoencephalography study.","authors":"Huaqing Jin, Fei Jiang","doi":"10.1093/biostatistics/kxaf050","DOIUrl":"10.1093/biostatistics/kxaf050","url":null,"abstract":"<p><p>Alzheimer's disease (AD) is a progressive, chronic neurodegenerative disorder affecting millions worldwide. A new clinical magnetoencephalography (MEG) study was conducted to identify neural activity biomarkers and key brain regions in AD. Traditional methods for analyzing MEG data, which typically extract features from power spectral density, suffer from information loss. Furthermore, functional regression with variable selection tends to produce non-robust results, making it less ideal for drawing reliable scientific conclusions. To address these challenges, we propose a high-dimensional hypothesis testing (HDHT) framework for functional covariates and introduce a rigorous inference process to support scientific conclusions. We establish the theoretical properties of the HDHT framework and validate its performance through simulation studies. Applying the HDHT framework to the AD MEG data, we identify 19 important regions associated with cognitive functions that align with established AD pathophysiology. These findings suggest that the non-invasive MEG can be a potential low-risk and low-toxicity modality for monitoring neurodegenerative progression.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12728160/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145822185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction. 更正。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae029
{"title":"Correction.","authors":"","doi":"10.1093/biostatistics/kxae029","DOIUrl":"10.1093/biostatistics/kxae029","url":null,"abstract":"","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823215/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142074610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simultaneous clustering and estimation of networks in multiple graphical models. 在多个图形模型中同时对网络进行聚类和估算。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae015
Gen Li, Miaoyan Wang

Gaussian graphical models are widely used to study the dependence structure among variables. When samples are obtained from multiple conditions or populations, joint analysis of multiple graphical models are desired due to their capacity to borrow strength across populations. Nonetheless, existing methods often overlook the varying levels of similarity between populations, leading to unsatisfactory results. Moreover, in many applications, learning the population-level clustering structure itself is of particular interest. In this article, we develop a novel method, called Simultaneous Clustering and Estimation of Networks via Tensor decomposition (SCENT), that simultaneously clusters and estimates graphical models from multiple populations. Precision matrices from different populations are uniquely organized as a three-way tensor array, and a low-rank sparse model is proposed for joint population clustering and network estimation. We develop a penalized likelihood method and an augmented Lagrangian algorithm for model fitting. We also establish the clustering accuracy and norm consistency of the estimated precision matrices. We demonstrate the efficacy of the proposed method with comprehensive simulation studies. The application to the Genotype-Tissue Expression multi-tissue gene expression data provides important insights into tissue clustering and gene coexpression patterns in multiple brain tissues.

高斯图形模型被广泛用于研究变量之间的依赖结构。当样本来自多个条件或群体时,由于多个图形模型具有跨群体借力的能力,因此需要对其进行联合分析。然而,现有的方法往往忽略了群体间不同程度的相似性,导致结果不尽人意。此外,在许多应用中,学习种群级聚类结构本身也是特别令人感兴趣的。在本文中,我们开发了一种名为 "通过张量分解同时聚类和估计网络"(SCENT)的新方法,可同时对多个种群的图形模型进行聚类和估计。来自不同种群的精确度矩阵被独特地组织成一个三向张量阵列,并提出了一个低秩稀疏模型,用于联合种群聚类和网络估计。我们开发了用于模型拟合的惩罚似然法和增强拉格朗日算法。我们还确定了聚类精度和估计精度矩阵的规范一致性。我们通过全面的模拟研究证明了所提方法的有效性。该方法在基因型-组织表达多组织基因表达数据中的应用,为我们了解多脑组织的组织聚类和基因共表达模式提供了重要的启示。
{"title":"Simultaneous clustering and estimation of networks in multiple graphical models.","authors":"Gen Li, Miaoyan Wang","doi":"10.1093/biostatistics/kxae015","DOIUrl":"10.1093/biostatistics/kxae015","url":null,"abstract":"<p><p>Gaussian graphical models are widely used to study the dependence structure among variables. When samples are obtained from multiple conditions or populations, joint analysis of multiple graphical models are desired due to their capacity to borrow strength across populations. Nonetheless, existing methods often overlook the varying levels of similarity between populations, leading to unsatisfactory results. Moreover, in many applications, learning the population-level clustering structure itself is of particular interest. In this article, we develop a novel method, called Simultaneous Clustering and Estimation of Networks via Tensor decomposition (SCENT), that simultaneously clusters and estimates graphical models from multiple populations. Precision matrices from different populations are uniquely organized as a three-way tensor array, and a low-rank sparse model is proposed for joint population clustering and network estimation. We develop a penalized likelihood method and an augmented Lagrangian algorithm for model fitting. We also establish the clustering accuracy and norm consistency of the estimated precision matrices. We demonstrate the efficacy of the proposed method with comprehensive simulation studies. The application to the Genotype-Tissue Expression multi-tissue gene expression data provides important insights into tissue clustering and gene coexpression patterns in multiple brain tissues.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11826093/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141263584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biostatistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1