首页 > 最新文献

Bayesian Analysis最新文献

英文 中文
Bayesian Semiparametric Hidden Markov Tensor Models for Time Varying Random Partitions with Local Variable Selection 局部变量选择时变随机分区的贝叶斯半参数隐马尔可夫张量模型
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-01 DOI: 10.1214/23-ba1383
Giorgio Paulon, P. Müller, A. Sarkar
{"title":"Bayesian Semiparametric Hidden Markov Tensor Models for Time Varying Random Partitions with Local Variable Selection","authors":"Giorgio Paulon, P. Müller, A. Sarkar","doi":"10.1214/23-ba1383","DOIUrl":"https://doi.org/10.1214/23-ba1383","url":null,"abstract":"","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48190225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Approximate Bayesian Inference Based on Expected Evaluation 基于期望评价的近似贝叶斯推理
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-01 DOI: 10.1214/23-ba1368
H. Hammer, M. Riegler, H. Tjelmeland
{"title":"Approximate Bayesian Inference Based on Expected Evaluation","authors":"H. Hammer, M. Riegler, H. Tjelmeland","doi":"10.1214/23-ba1368","DOIUrl":"https://doi.org/10.1214/23-ba1368","url":null,"abstract":"","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48228399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalized Geographically Weighted Regression Model within a Modularized Bayesian Framework. 模块化贝叶斯框架内的广义地理加权回归模型。
IF 4.9 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-01 DOI: 10.1214/22-BA1357
Yang Liu, Robert J B Goudie

Geographically weighted regression (GWR) models handle geographical dependence through a spatially varying coefficient model and have been widely used in applied science, but its general Bayesian extension is unclear because it involves a weighted log-likelihood which does not imply a probability distribution on data. We present a Bayesian GWR model and show that its essence is dealing with partial misspecification of the model. Current modularized Bayesian inference models accommodate partial misspecification from a single component of the model. We extend these models to handle partial misspecification in more than one component of the model, as required for our Bayesian GWR model. Information from the various spatial locations is manipulated via a geographically weighted kernel and the optimal manipulation is chosen according to a Kullback-Leibler (KL) divergence. We justify the model via an information risk minimization approach and show the consistency of the proposed estimator in terms of a geographically weighted KL divergence.

地理加权回归(GWR)模型通过空间变化系数模型来处理地理依赖性,已广泛应用于应用科学领域,但其一般贝叶斯扩展尚不明确,因为它涉及加权对数概率,而加权对数概率并不意味着数据的概率分布。我们提出了一种贝叶斯 GWR 模型,并说明其本质是处理模型的部分错误规范。目前的模块化贝叶斯推理模型可处理来自模型单个组成部分的部分误指定。我们对这些模型进行了扩展,以处理模型中不止一个部分的部分误设,这正是我们的贝叶斯 GWR 模型所需要的。来自不同空间位置的信息通过地理加权核进行处理,并根据库尔贝克-莱伯勒(KL)分歧选择最佳处理方式。我们通过信息风险最小化的方法来证明该模型的合理性,并用地理加权 KL 分歧来证明所提出的估计器的一致性。
{"title":"Generalized Geographically Weighted Regression Model within a Modularized Bayesian Framework.","authors":"Yang Liu, Robert J B Goudie","doi":"10.1214/22-BA1357","DOIUrl":"10.1214/22-BA1357","url":null,"abstract":"<p><p>Geographically weighted regression (GWR) models handle geographical dependence through a spatially varying coefficient model and have been widely used in applied science, but its general Bayesian extension is unclear because it involves a weighted log-likelihood which does not imply a probability distribution on data. We present a Bayesian GWR model and show that its essence is dealing with partial misspecification of the model. Current modularized Bayesian inference models accommodate partial misspecification from a single component of the model. We extend these models to handle partial misspecification in more than one component of the model, as required for our Bayesian GWR model. Information from the various spatial locations is manipulated via a geographically weighted kernel and the optimal manipulation is chosen according to a Kullback-Leibler (KL) divergence. We justify the model via an information risk minimization approach and show the consistency of the proposed estimator in terms of a geographically weighted KL divergence.</p>","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":"-1 -1","pages":"1-36"},"PeriodicalIF":4.9,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7614111/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10577363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Defining a Credible Interval Is Not Always Possible with “Point-Null” Priors: A Lesser-Known Correlate of the Jeffreys-Lindley Paradox 用“零点”先验定义可信区间并不总是可能的:Jeffrey-Lindley悖论的一个鲜为人知的相关性
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-09-30 DOI: 10.1214/23-ba1397
Harlan Campbell, P. Gustafson
In many common situations, a Bayesian credible interval will be, given the same data, very similar to a frequentist confidence interval, and researchers will interpret these intervals in a similar fashion. However, no predictable similarity exists when credible intervals are based on model-averaged posteriors whenever one of the two nested models under consideration is a so called ''point-null''. Not only can this model-averaged credible interval be quite different than the frequentist confidence interval, in some cases it may be undefined. This is a lesser-known correlate of the Jeffreys-Lindley paradox and is of particular interest given the popularity of the Bayes factor for testing point-null hypotheses.
在许多常见情况下,给定相同的数据,贝叶斯可信区间将与频率置信区间非常相似,研究人员将以类似的方式解释这些区间。然而,当考虑的两个嵌套模型中的一个是所谓的“点零”时,基于模型平均后验的可信区间不存在可预测的相似性。这种模型平均可信区间不仅可能与频率置信区间有很大不同,在某些情况下,它可能是未定义的。这是Jeffreys-Lindley悖论的一个鲜为人知的关联,考虑到用于检验点零假设的贝叶斯因子的流行,这是特别有趣的。
{"title":"Defining a Credible Interval Is Not Always Possible with “Point-Null” Priors: A Lesser-Known Correlate of the Jeffreys-Lindley Paradox","authors":"Harlan Campbell, P. Gustafson","doi":"10.1214/23-ba1397","DOIUrl":"https://doi.org/10.1214/23-ba1397","url":null,"abstract":"In many common situations, a Bayesian credible interval will be, given the same data, very similar to a frequentist confidence interval, and researchers will interpret these intervals in a similar fashion. However, no predictable similarity exists when credible intervals are based on model-averaged posteriors whenever one of the two nested models under consideration is a so called ''point-null''. Not only can this model-averaged credible interval be quite different than the frequentist confidence interval, in some cases it may be undefined. This is a lesser-known correlate of the Jeffreys-Lindley paradox and is of particular interest given the popularity of the Bayes factor for testing point-null hypotheses.","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2022-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46209104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Structure Induced by a Multiple Membership Transformation on the Conditional Autoregressive Model 条件自回归模型的多重隶属变换诱导的结构
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-08-13 DOI: 10.1214/23-ba1370
Marco Gramatica, S. Liverani, Peter Congdon
The objective of disease mapping is to model data aggregated at the areal level. In some contexts, however, (e.g. residential histories, general practitioner catchment areas) when data is arising from a variety of sources, not necessarily at the same spatial scale, it is possible to specify spatial random effects, or covariate effects, at the areal level, by using a multiple membership principle (MM) (Petrof et al. 2020, Gramatica et al. 2021). A weighted average of conditional autoregressive (CAR) spatial random effects embeds spatial information for a spatially-misaligned outcome and estimate relative risk for both frameworks (areas and memberships). In this paper we investigate the theoretical underpinnings of these application of the multiple membership principle to the CAR prior, in particular with regard to parameterisation, properness and identifiability. We carry out simulations involving different numbers of memberships as compared to number of areas and assess impact of this on estimating parameters of interest. Both analytical and simulation study results show under which conditions parameters of interest are identifiable, so that we can offer actionable recommendations to practitioners. Finally, we present the results of an application of the multiple membership model to diabetes prevalence data in South London, together with strategic implications for public health considerations
疾病绘图的目的是对区域层面上汇总的数据进行建模。然而,在某些情况下(例如,居住史、全科医生集水区),当数据来自各种来源,而不一定在同一空间尺度上时,可以通过使用多成员原理(MM)在区域水平上指定空间随机效应或协变量效应(Petrof等人2020,Gramatica等人2021)。条件自回归(CAR)空间随机效应的加权平均嵌入了空间错位结果的空间信息,并估计了两个框架(区域和成员)的相对风险。在本文中,我们研究了将多重隶属度原理应用于CAR先验的理论基础,特别是在参数化、适当性和可识别性方面。与区域数量相比,我们进行了涉及不同成员数量的模拟,并评估了这对估计感兴趣的参数的影响。分析和模拟研究结果都表明,在何种条件下,感兴趣的参数是可识别的,因此我们可以向从业者提供可操作的建议。最后,我们介绍了将多成员模型应用于伦敦南部糖尿病患病率数据的结果,以及对公共卫生考虑的战略意义
{"title":"Structure Induced by a Multiple Membership Transformation on the Conditional Autoregressive Model","authors":"Marco Gramatica, S. Liverani, Peter Congdon","doi":"10.1214/23-ba1370","DOIUrl":"https://doi.org/10.1214/23-ba1370","url":null,"abstract":"The objective of disease mapping is to model data aggregated at the areal level. In some contexts, however, (e.g. residential histories, general practitioner catchment areas) when data is arising from a variety of sources, not necessarily at the same spatial scale, it is possible to specify spatial random effects, or covariate effects, at the areal level, by using a multiple membership principle (MM) (Petrof et al. 2020, Gramatica et al. 2021). A weighted average of conditional autoregressive (CAR) spatial random effects embeds spatial information for a spatially-misaligned outcome and estimate relative risk for both frameworks (areas and memberships). In this paper we investigate the theoretical underpinnings of these application of the multiple membership principle to the CAR prior, in particular with regard to parameterisation, properness and identifiability. We carry out simulations involving different numbers of memberships as compared to number of areas and assess impact of this on estimating parameters of interest. Both analytical and simulation study results show under which conditions parameters of interest are identifiable, so that we can offer actionable recommendations to practitioners. Finally, we present the results of an application of the multiple membership model to diabetes prevalence data in South London, together with strategic implications for public health considerations","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2022-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47219745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Data Augmentation for Partially Observed Stochastic Compartmental Models 部分观测随机区室模型的贝叶斯数据增强
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-06-17 DOI: 10.1214/23-ba1398
Shuying Wang, S. Walker
Deterministic compartmental models are predominantly used in the modeling of infectious diseases, though stochastic models are considered more realistic, yet are complicated to estimate due to missing data. In this paper we present a novel algorithm for estimating the stochastic SIR/SEIR epidemic model within a Bayesian framework, which can be readily extended to more complex stochastic compartmental models. Specifically, based on the infinitesimal conditional independence properties of the model, we are able to find a proposal distribution for a Metropolis algorithm which is very close to the correct posterior distribution. As a consequence, rather than perform a Metropolis step updating one missing data point at a time, as in the current benchmark Markov chain Monte Carlo (MCMC) algorithm, we are able to extend our proposal to the entire set of missing observations. This improves the MCMC methods dramatically and makes the stochastic models now a viable modeling option. A number of real data illustrations and the necessary mathematical theory supporting our results are presented.
确定性分区模型主要用于传染病建模,尽管随机模型被认为更现实,但由于数据缺失,估计起来很复杂。在本文中,我们提出了一种在贝叶斯框架内估计随机SIR/SEIR流行病模型的新算法,该算法可以很容易地扩展到更复杂的随机分区模型。具体来说,基于模型的无穷小条件独立性,我们能够找到Metropolis算法的建议分布,该建议分布非常接近正确的后验分布。因此,我们可以将我们的建议扩展到整个缺失观测集,而不是像当前的基准马尔可夫链蒙特卡罗(MCMC)算法那样,执行Metropolis步骤一次更新一个缺失数据点。这大大改进了MCMC方法,使随机模型现在成为一种可行的建模选择。给出了一些实际数据的说明和必要的数学理论来支持我们的结果。
{"title":"Bayesian Data Augmentation for Partially Observed Stochastic Compartmental Models","authors":"Shuying Wang, S. Walker","doi":"10.1214/23-ba1398","DOIUrl":"https://doi.org/10.1214/23-ba1398","url":null,"abstract":"Deterministic compartmental models are predominantly used in the modeling of infectious diseases, though stochastic models are considered more realistic, yet are complicated to estimate due to missing data. In this paper we present a novel algorithm for estimating the stochastic SIR/SEIR epidemic model within a Bayesian framework, which can be readily extended to more complex stochastic compartmental models. Specifically, based on the infinitesimal conditional independence properties of the model, we are able to find a proposal distribution for a Metropolis algorithm which is very close to the correct posterior distribution. As a consequence, rather than perform a Metropolis step updating one missing data point at a time, as in the current benchmark Markov chain Monte Carlo (MCMC) algorithm, we are able to extend our proposal to the entire set of missing observations. This improves the MCMC methods dramatically and makes the stochastic models now a viable modeling option. A number of real data illustrations and the necessary mathematical theory supporting our results are presented.","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43825210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Consistent and Scalable Bayesian Joint Variable and Graph Selection for Disease Diagnosis Leveraging Functional Brain Network 基于功能脑网络的疾病诊断一致可扩展贝叶斯联合变量和图选择
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-03-14 DOI: 10.1214/23-ba1376
Xuan Cao, Kyoungjae Lee
We consider the joint inference of regression coefficients and the inverse covariance matrix for covariates in high-dimensional probit regression, where the predictors are both relevant to the binary response and functionally related to one another. A hierarchical model with spike and slab priors over regression coefficients and the elements in the inverse covariance matrix is employed to simultaneously perform variable and graph selection. We establish joint selection consistency for both the variable and the underlying graph when the dimension of predictors is allowed to grow much larger than the sample size, which is the first theoretical result in the Bayesian literature. A scalable Gibbs sampler is derived that performs better in high-dimensional simulation studies compared with other state-of-art methods. We illustrate the practical impact and utilities of the proposed method via a functional MRI dataset, where both the regions of interest with altered functional activities and the underlying functional brain network are inferred and integrated together for stratifying disease risk.
我们考虑了高维probit回归中回归系数和协变量的逆协方差矩阵的联合推断,其中预测因子既与二元响应相关,又在功能上相互关联。采用具有回归系数上的尖峰和板先验以及逆协方差矩阵中的元素的分层模型来同时执行变量和图的选择。当预测因子的维数增长远大于样本量时,我们为变量和基础图建立了联合选择一致性,这是贝叶斯文献中的第一个理论结果。导出了一种可扩展的吉布斯采样器,与其他现有技术相比,该采样器在高维模拟研究中表现更好。我们通过功能性MRI数据集说明了所提出方法的实际影响和实用性,其中推断出功能活动改变的感兴趣区域和潜在的功能性脑网络,并将其整合在一起,以对疾病风险进行分层。
{"title":"Consistent and Scalable Bayesian Joint Variable and Graph Selection for Disease Diagnosis Leveraging Functional Brain Network","authors":"Xuan Cao, Kyoungjae Lee","doi":"10.1214/23-ba1376","DOIUrl":"https://doi.org/10.1214/23-ba1376","url":null,"abstract":"We consider the joint inference of regression coefficients and the inverse covariance matrix for covariates in high-dimensional probit regression, where the predictors are both relevant to the binary response and functionally related to one another. A hierarchical model with spike and slab priors over regression coefficients and the elements in the inverse covariance matrix is employed to simultaneously perform variable and graph selection. We establish joint selection consistency for both the variable and the underlying graph when the dimension of predictors is allowed to grow much larger than the sample size, which is the first theoretical result in the Bayesian literature. A scalable Gibbs sampler is derived that performs better in high-dimensional simulation studies compared with other state-of-art methods. We illustrate the practical impact and utilities of the proposed method via a functional MRI dataset, where both the regions of interest with altered functional activities and the underlying functional brain network are inferred and integrated together for stratifying disease risk.","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2022-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48543252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scalable Approximate Bayesian Computation for Growing Network Models via Extrapolated and Sampled Summaries. 通过外推和采样摘要为成长网络模型提供可扩展的近似贝叶斯计算。
IF 4.9 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-03-01 Epub Date: 2020-12-08 DOI: 10.1214/20-ba1248
Louis Raynal, Sixing Chen, Antonietta Mira, Jukka-Pekka Onnela

Approximate Bayesian computation (ABC) is a simulation-based likelihood-free method applicable to both model selection and parameter estimation. ABC parameter estimation requires the ability to forward simulate datasets from a candidate model, but because the sizes of the observed and simulated datasets usually need to match, this can be computationally expensive. Additionally, since ABC inference is based on comparisons of summary statistics computed on the observed and simulated data, using computationally expensive summary statistics can lead to further losses in efficiency. ABC has recently been applied to the family of mechanistic network models, an area that has traditionally lacked tools for inference and model choice. Mechanistic models of network growth repeatedly add nodes to a network until it reaches the size of the observed network, which may be of the order of millions of nodes. With ABC, this process can quickly become computationally prohibitive due to the resource intensive nature of network simulations and evaluation of summary statistics. We propose two methodological developments to enable the use of ABC for inference in models for large growing networks. First, to save time needed for forward simulating model realizations, we propose a procedure to extrapolate (via both least squares and Gaussian processes) summary statistics from small to large networks. Second, to reduce computation time for evaluating summary statistics, we use sample-based rather than census-based summary statistics. We show that the ABC posterior obtained through this approach, which adds two additional layers of approximation to the standard ABC, is similar to a classic ABC posterior. Although we deal with growing network models, both extrapolated summaries and sampled summaries are expected to be relevant in other ABC settings where the data are generated incrementally.

近似贝叶斯计算(ABC)是一种基于模拟的无似然方法,适用于模型选择和参数估计。近似贝叶斯计算参数估计要求能够根据候选模型对数据集进行前向模拟,但由于观测数据集和模拟数据集的大小通常需要匹配,因此计算成本会很高。此外,由于 ABC 推理是基于对观察数据和模拟数据计算出的汇总统计量进行比较,因此使用计算成本高昂的汇总统计量会进一步降低效率。最近,ABC 被应用于机理网络模型系列,而这一领域历来缺乏推断和模型选择工具。网络增长机理模型会反复向网络中添加节点,直到达到观测到的网络规模,而观测到的网络规模可能达到数百万节点的数量级。在 ABC 中,由于网络模拟和汇总统计评估需要大量资源,这一过程很快就会变得难以计算。我们提出了两个方法上的发展,使 ABC 能够用于大型增长网络模型的推断。首先,为了节省前向模拟模型实现所需的时间,我们提出了一种从小型网络向大型网络推断(通过最小二乘法和高斯过程)汇总统计量的程序。其次,为了减少评估汇总统计的计算时间,我们使用了基于样本而非基于普查的汇总统计。我们表明,通过这种方法获得的 ABC 后验(在标准 ABC 的基础上增加了两层近似)与经典 ABC 后验相似。虽然我们处理的是增长型网络模型,但预计外推摘要和抽样摘要在数据增量生成的其他 ABC 环境中也是相关的。
{"title":"Scalable Approximate Bayesian Computation for Growing Network Models via Extrapolated and Sampled Summaries.","authors":"Louis Raynal, Sixing Chen, Antonietta Mira, Jukka-Pekka Onnela","doi":"10.1214/20-ba1248","DOIUrl":"10.1214/20-ba1248","url":null,"abstract":"<p><p>Approximate Bayesian computation (ABC) is a simulation-based likelihood-free method applicable to both model selection and parameter estimation. ABC parameter estimation requires the ability to forward simulate datasets from a candidate model, but because the sizes of the observed and simulated datasets usually need to match, this can be computationally expensive. Additionally, since ABC inference is based on comparisons of summary statistics computed on the observed and simulated data, using computationally expensive summary statistics can lead to further losses in efficiency. ABC has recently been applied to the family of mechanistic network models, an area that has traditionally lacked tools for inference and model choice. Mechanistic models of network growth repeatedly add nodes to a network until it reaches the size of the observed network, which may be of the order of millions of nodes. With ABC, this process can quickly become computationally prohibitive due to the resource intensive nature of network simulations and evaluation of summary statistics. We propose two methodological developments to enable the use of ABC for inference in models for large growing networks. First, to save time needed for forward simulating model realizations, we propose a procedure to extrapolate (via both least squares and Gaussian processes) summary statistics from small to large networks. Second, to reduce computation time for evaluating summary statistics, we use sample-based rather than census-based summary statistics. We show that the ABC posterior obtained through this approach, which adds two additional layers of approximation to the standard ABC, is similar to a classic ABC posterior. Although we deal with growing network models, both extrapolated summaries and sampled summaries are expected to be relevant in other ABC settings where the data are generated incrementally.</p>","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":"17 1","pages":"165-192"},"PeriodicalIF":4.9,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9541316/pdf/nihms-1803183.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33497387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Inference on Hierarchical Nonlocal Priors in Generalized Linear Models 广义线性模型中层次非局部先验的贝叶斯推理
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-01-01 DOI: 10.1214/22-ba1350
Xuan Cao, Kyoungjae Lee
Variable selection methods with nonlocal priors have been widely studied in linear regression models, and their theoretical and empirical performances have been reported. However, the crucial model selection properties for hierarchical nonlocal priors in high-dimensional generalized linear regression have rarely been investigated. In this paper, we consider a hierarchical nonlocal prior for high-dimensional logistic regression models and investigate theoretical properties of the posterior distribution. Specifically, a product moment (pMOM) nonlocal prior is imposed over the regression coefficients with an Inverse-Gamma prior on the tuning parameter. Under standard regularity assumptions, we establish strong model selection consistency in a high-dimensional setting, where the number of covariates is allowed to increase at a subexponential rate with the sample size. We implement the Laplace approximation for computing the posterior probabilities, and a modified shotgun stochastic search procedure is suggested for efficiently exploring the model space. We demonstrate the validity of the proposed method through simulation studies and an RNA-sequencing dataset for stratifying disease risk.
具有非局部先验的变量选择方法在线性回归模型中得到了广泛的研究,并报道了它们的理论和经验性能。然而,在高维广义线性回归中,层次非局部先验的关键模型选择特性很少被研究。在本文中,我们考虑了高维逻辑回归模型的一个层次非局部先验,并研究了后验分布的理论性质。具体地,在回归系数上施加乘积矩(pMOM)非局部先验,在调谐参数上施加逆伽马先验。在标准正则性假设下,我们在高维环境中建立了强大的模型选择一致性,其中协变量的数量可以随着样本量以亚指数率增加。我们实现了拉普拉斯近似来计算后验概率,并提出了一种改进的霰弹枪随机搜索程序来有效地探索模型空间。我们通过模拟研究和RNA测序数据集对疾病风险进行分层,证明了所提出方法的有效性。
{"title":"Bayesian Inference on Hierarchical Nonlocal Priors in Generalized Linear Models","authors":"Xuan Cao, Kyoungjae Lee","doi":"10.1214/22-ba1350","DOIUrl":"https://doi.org/10.1214/22-ba1350","url":null,"abstract":"Variable selection methods with nonlocal priors have been widely studied in linear regression models, and their theoretical and empirical performances have been reported. However, the crucial model selection properties for hierarchical nonlocal priors in high-dimensional generalized linear regression have rarely been investigated. In this paper, we consider a hierarchical nonlocal prior for high-dimensional logistic regression models and investigate theoretical properties of the posterior distribution. Specifically, a product moment (pMOM) nonlocal prior is imposed over the regression coefficients with an Inverse-Gamma prior on the tuning parameter. Under standard regularity assumptions, we establish strong model selection consistency in a high-dimensional setting, where the number of covariates is allowed to increase at a subexponential rate with the sample size. We implement the Laplace approximation for computing the posterior probabilities, and a modified shotgun stochastic search procedure is suggested for efficiently exploring the model space. We demonstrate the validity of the proposed method through simulation studies and an RNA-sequencing dataset for stratifying disease risk.","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49089074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Bayesian Learning of Graph Substructures 图子结构的贝叶斯学习
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-01-01 DOI: 10.1214/22-BA1338
W. V. Boom, M. Iorio, A. Beskos
Graphical models provide a powerful methodology for learning the conditional independence structure in multivariate data. Inference is often focused on estimating individual edges in the latent graph. Nonetheless, there is increasing interest in inferring more complex structures, such as communities, for multiple reasons, including more effective information retrieval and better interpretability. Stochastic blockmodels offer a powerful tool to detect such structure in a network. We thus propose to exploit advances in random graph theory and embed them within the graphical models framework. A consequence of this approach is the propagation of the uncertainty in graph estimation to large-scale structure learning. We consider Bayesian nonparametric stochastic blockmodels as priors on the graph. We extend such models to consider clique-based blocks and to multiple graph settings introducing a novel prior process based on a Dependent Dirichlet process. Moreover, we devise a tailored computation strategy of Bayes factors for block structure based on the Savage-Dickey ratio to test for presence of larger structure in a graph. We demonstrate our approach in simulations as well as on real data applications in finance and transcriptomics.
图形模型为学习多元数据中的条件独立性结构提供了一种强大的方法。推理通常集中在估计潜在图中的各个边上。尽管如此,由于多种原因,人们对推断更复杂的结构(如社区)越来越感兴趣,包括更有效的信息检索和更好的可解释性。随机块模型为检测网络中的此类结构提供了强大的工具。因此,我们建议利用随机图理论的进步,并将其嵌入到图形模型框架中。这种方法的结果是图估计中的不确定性传播到大规模结构学习中。我们将贝叶斯非参数随机块模型视为图上的先验。我们将这种模型扩展到考虑基于团的块,并扩展到引入基于依赖狄利克雷过程的新先验过程的多个图设置。此外,我们设计了一种基于Savage Dickey比率的块结构贝叶斯因子的定制计算策略,以测试图中是否存在较大结构。我们在模拟以及金融和转录组学中的真实数据应用中展示了我们的方法。
{"title":"Bayesian Learning of Graph Substructures","authors":"W. V. Boom, M. Iorio, A. Beskos","doi":"10.1214/22-BA1338","DOIUrl":"https://doi.org/10.1214/22-BA1338","url":null,"abstract":"Graphical models provide a powerful methodology for learning the conditional independence structure in multivariate data. Inference is often focused on estimating individual edges in the latent graph. Nonetheless, there is increasing interest in inferring more complex structures, such as communities, for multiple reasons, including more effective information retrieval and better interpretability. Stochastic blockmodels offer a powerful tool to detect such structure in a network. We thus propose to exploit advances in random graph theory and embed them within the graphical models framework. A consequence of this approach is the propagation of the uncertainty in graph estimation to large-scale structure learning. We consider Bayesian nonparametric stochastic blockmodels as priors on the graph. We extend such models to consider clique-based blocks and to multiple graph settings introducing a novel prior process based on a Dependent Dirichlet process. Moreover, we devise a tailored computation strategy of Bayes factors for block structure based on the Savage-Dickey ratio to test for presence of larger structure in a graph. We demonstrate our approach in simulations as well as on real data applications in finance and transcriptomics.","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44413832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Bayesian Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1