首页 > 最新文献

Bayesian Analysis最新文献

英文 中文
Bayesian Analysis of Exponential Random Graph Models Using Stochastic Gradient Markov Chain Monte Carlo. 基于随机梯度马尔可夫链蒙特卡罗的指数随机图模型的贝叶斯分析
IF 4.9 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-06-01 Epub Date: 2024-04-09 DOI: 10.1214/23-BA1364
Qian Zhang, Faming Liang

The exponential random graph model (ERGM) is a popular model for social networks, which is known to have an intractable likelihood function. Sampling from the posterior for such a model is a long-standing problem in statistical research. We analyze the performance of the stochastic gradient Langevin dynamics (SGLD) algorithm (also known as noisy Longevin Monte Carlo) in tackling this problem, where the stochastic gradient is calculated via running a short Markov chain (the so-called inner Markov chain in this paper) at each iteration. We show that if the model size grows with the network size slowly enough, then SGLD converges to the true posterior in 2-Wasserstein distance as the network size and iteration number become large regardless of the length of the inner Markov chain performed at each iteration. Our study provides a scalable algorithm for analyzing large-scale social networks with possibly high-dimensional ERGMs.

{"title":"Bayesian Analysis of Exponential Random Graph Models Using Stochastic Gradient Markov Chain Monte Carlo.","authors":"Qian Zhang, Faming Liang","doi":"10.1214/23-BA1364","DOIUrl":"10.1214/23-BA1364","url":null,"abstract":"<p><p>The exponential random graph model (ERGM) is a popular model for social networks, which is known to have an intractable likelihood function. Sampling from the posterior for such a model is a long-standing problem in statistical research. We analyze the performance of the stochastic gradient Langevin dynamics (SGLD) algorithm (also known as noisy Longevin Monte Carlo) in tackling this problem, where the stochastic gradient is calculated via running a short Markov chain (the so-called inner Markov chain in this paper) at each iteration. We show that if the model size grows with the network size slowly enough, then SGLD converges to the true posterior in 2-Wasserstein distance as the network size and iteration number become large regardless of the length of the inner Markov chain performed at each iteration. Our study provides a scalable algorithm for analyzing large-scale social networks with possibly high-dimensional ERGMs.</p>","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":"595-621"},"PeriodicalIF":4.9,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11756892/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46968695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reproducible Model Selection Using Bagged Posteriors. 利用袋装后验进行可重复性模型选择。
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-03-01 DOI: 10.1214/21-ba1301
Jonathan H Huggins, Jeffrey W Miller

Bayesian model selection is premised on the assumption that the data are generated from one of the postulated models. However, in many applications, all of these models are incorrect (that is, there is misspecification). When the models are misspecified, two or more models can provide a nearly equally good fit to the data, in which case Bayesian model selection can be highly unstable, potentially leading to self-contradictory findings. To remedy this instability, we propose to use bagging on the posterior distribution ("BayesBag") - that is, to average the posterior model probabilities over many bootstrapped datasets. We provide theoretical results characterizing the asymptotic behavior of the posterior and the bagged posterior in the (misspecified) model selection setting. We empirically assess the BayesBag approach on synthetic and real-world data in (i) feature selection for linear regression and (ii) phylogenetic tree reconstruction. Our theory and experiments show that, when all models are misspecified, BayesBag (a) provides greater reproducibility and (b) places posterior mass on optimal models more reliably, compared to the usual Bayesian posterior; on the other hand, under correct specification, BayesBag is slightly more conservative than the usual posterior, in the sense that BayesBag posterior probabilities tend to be slightly farther from the extremes of zero and one. Overall, our results demonstrate that BayesBag provides an easy-to-use and widely applicable approach that improves upon Bayesian model selection by making it more stable and reproducible.

贝叶斯模型选择的前提是假设数据是从假设模型之一产生的。然而,在许多应用程序中,所有这些模型都是不正确的(也就是说,存在错误的规范)。当模型被错误指定时,两个或两个以上的模型可以提供几乎相同的数据拟合,在这种情况下,贝叶斯模型选择可能非常不稳定,可能导致自相矛盾的结果。为了弥补这种不稳定性,我们建议对后验分布使用bagging(“BayesBag”)——也就是说,对许多自举数据集的后验模型概率进行平均。我们提供了理论结果表征后验和袋装后验的渐近行为在(错误指定的)模型选择设置。我们在以下方面对BayesBag方法进行了实证评估:(i)线性回归的特征选择和(ii)系统发育树重建。我们的理论和实验表明,与通常的贝叶斯后验相比,当所有模型都被错误指定时,BayesBag (a)提供了更大的再现性,(b)更可靠地将后验质量放在最优模型上;另一方面,在正确的规范下,BayesBag比通常的后验概率略保守,也就是说BayesBag后验概率往往离零和一的极值略远。总的来说,我们的结果表明BayesBag提供了一种易于使用且广泛适用的方法,通过使贝叶斯模型选择更加稳定和可重复性来改进贝叶斯模型选择。
{"title":"Reproducible Model Selection Using Bagged Posteriors.","authors":"Jonathan H Huggins,&nbsp;Jeffrey W Miller","doi":"10.1214/21-ba1301","DOIUrl":"https://doi.org/10.1214/21-ba1301","url":null,"abstract":"<p><p>Bayesian model selection is premised on the assumption that the data are generated from one of the postulated models. However, in many applications, all of these models are incorrect (that is, there is misspecification). When the models are misspecified, two or more models can provide a nearly equally good fit to the data, in which case Bayesian model selection can be highly unstable, potentially leading to self-contradictory findings. To remedy this instability, we propose to use bagging on the posterior distribution (\"BayesBag\") - that is, to average the posterior model probabilities over many bootstrapped datasets. We provide theoretical results characterizing the asymptotic behavior of the posterior and the bagged posterior in the (misspecified) model selection setting. We empirically assess the BayesBag approach on synthetic and real-world data in (i) feature selection for linear regression and (ii) phylogenetic tree reconstruction. Our theory and experiments show that, when all models are misspecified, BayesBag (a) provides greater reproducibility and (b) places posterior mass on optimal models more reliably, compared to the usual Bayesian posterior; on the other hand, under correct specification, BayesBag is slightly more conservative than the usual posterior, in the sense that BayesBag posterior probabilities tend to be slightly farther from the extremes of zero and one. Overall, our results demonstrate that BayesBag provides an easy-to-use and widely applicable approach that improves upon Bayesian model selection by making it more stable and reproducible.</p>","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":"18 1","pages":"79-104"},"PeriodicalIF":4.4,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9838736/pdf/nihms-1796997.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9229540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A Bayesian Nonparametric Latent Space Approach to Modeling Evolving Communities in Dynamic Networks 动态网络中演化群体建模的贝叶斯非参数隐空间方法
2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-03-01 DOI: 10.1214/21-ba1300
Joshua Daniel Loyal, Yuguo Chen
The evolution of communities in dynamic (time-varying) network data is a prominent topic of interest. A popular approach to understanding these dynamic networks is to embed the dyadic relations into a latent metric space. While methods for clustering with this approach exist for dynamic networks, they all assume a static community structure. This paper presents a Bayesian nonparametric model for dynamic networks that can model networks with evolving community structures. Our model extends existing latent space approaches by explicitly modeling the additions, deletions, splits, and mergers of groups with a hierarchical Dirichlet process hidden Markov model. Our proposed approach, the hierarchical Dirichlet process latent position cluster model (HDP-LPCM), incorporates transitivity, models both individual and group level aspects of the data, and avoids the computationally expensive selection of the number of groups required by most popular methods. We provide a Markov chain Monte Carlo estimation algorithm and demonstrate its ability to detect evolving community structure in a network of military alliances during the Cold War and a narrative network constructed from the Game of Thrones television series.
动态(时变)网络数据中社区的演化是一个重要的研究课题。理解这些动态网络的一种流行方法是将并矢关系嵌入到潜在度量空间中。虽然使用这种方法进行集群的方法适用于动态网络,但它们都假定是静态的社区结构。本文提出了一个动态网络的贝叶斯非参数模型,该模型可以对社区结构不断变化的网络进行建模。我们的模型扩展了现有的潜在空间方法,通过层次Dirichlet过程隐马尔可夫模型显式地建模组的添加、删除、分裂和合并。我们提出的分层Dirichlet过程潜在位置聚类模型(HDP-LPCM)结合了传递性,对数据的个体和群体层面进行建模,并避免了大多数流行方法所需的群体数量的计算成本高昂的选择。我们提供了一种马尔可夫链蒙特卡罗估计算法,并展示了它在冷战时期军事联盟网络和《权力的游戏》电视剧构建的叙事网络中检测不断变化的社区结构的能力。
{"title":"A Bayesian Nonparametric Latent Space Approach to Modeling Evolving Communities in Dynamic Networks","authors":"Joshua Daniel Loyal, Yuguo Chen","doi":"10.1214/21-ba1300","DOIUrl":"https://doi.org/10.1214/21-ba1300","url":null,"abstract":"The evolution of communities in dynamic (time-varying) network data is a prominent topic of interest. A popular approach to understanding these dynamic networks is to embed the dyadic relations into a latent metric space. While methods for clustering with this approach exist for dynamic networks, they all assume a static community structure. This paper presents a Bayesian nonparametric model for dynamic networks that can model networks with evolving community structures. Our model extends existing latent space approaches by explicitly modeling the additions, deletions, splits, and mergers of groups with a hierarchical Dirichlet process hidden Markov model. Our proposed approach, the hierarchical Dirichlet process latent position cluster model (HDP-LPCM), incorporates transitivity, models both individual and group level aspects of the data, and avoids the computationally expensive selection of the number of groups required by most popular methods. We provide a Markov chain Monte Carlo estimation algorithm and demonstrate its ability to detect evolving community structure in a network of military alliances during the Cold War and a narrative network constructed from the Game of Thrones television series.","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":"285 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135643392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Latent Shrinkage Position Model for Binary and Count Network Data 二进制和计数网络数据的潜在收缩位置模型
2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-01 DOI: 10.1214/23-ba1403
Xian Yao Gwee, Isobel Claire Gormley, Michael Fop
Interactions between actors are frequently represented using a network. The latent position model is widely used for analysing network data, whereby each actor is positioned in a latent space. Inferring the dimension of this space is challenging. Often, for simplicity, two dimensions are used or model selection criteria are employed to select the dimension, but this requires choosing a criterion and the computational expense of fitting multiple models. Here the latent shrinkage position model (LSPM) is proposed which intrinsically infers the effective dimension of the latent space. The LSPM employs a Bayesian nonparametric multiplicative truncated gamma process prior that ensures shrinkage of the variance of the latent positions across higher dimensions. Dimensions with non-negligible variance are deemed most useful to describe the observed network, inducing automatic inference on the latent space dimension. While the LSPM is applicable to many network types, logistic and Poisson LSPMs are developed here for binary and count networks respectively. Inference proceeds via a Markov chain Monte Carlo algorithm, where novel surrogate proposal distributions reduce the computational burden. The LSPM’s properties are assessed through simulation studies, and its utility is illustrated through application to real network datasets. Open source software assists wider implementation of the LSPM.
参与者之间的交互通常使用网络来表示。潜在位置模型被广泛用于分析网络数据,它将每个参与者定位在一个潜在空间中。推断这个空间的尺寸是有挑战性的。通常,为了简单起见,使用两个维度或使用模型选择标准来选择维度,但这需要选择一个标准和拟合多个模型的计算费用。提出了潜在收缩位置模型(LSPM),该模型从本质上推导了潜在空间的有效维数。LSPM采用贝叶斯非参数乘截断伽马过程,确保在更高维度上潜在位置的方差缩小。具有不可忽略方差的维度被认为对描述观察到的网络最有用,对潜在空间维度产生自动推理。虽然LSPM适用于许多网络类型,但本文分别针对二进制网络和计数网络开发了logistic和泊松LSPM。推理通过马尔可夫链蒙特卡罗算法进行,其中新颖的代理提议分布减少了计算负担。通过仿真研究评估了LSPM的性能,并通过实际网络数据集的应用说明了它的实用性。开源软件有助于更广泛地实现LSPM。
{"title":"A Latent Shrinkage Position Model for Binary and Count Network Data","authors":"Xian Yao Gwee, Isobel Claire Gormley, Michael Fop","doi":"10.1214/23-ba1403","DOIUrl":"https://doi.org/10.1214/23-ba1403","url":null,"abstract":"Interactions between actors are frequently represented using a network. The latent position model is widely used for analysing network data, whereby each actor is positioned in a latent space. Inferring the dimension of this space is challenging. Often, for simplicity, two dimensions are used or model selection criteria are employed to select the dimension, but this requires choosing a criterion and the computational expense of fitting multiple models. Here the latent shrinkage position model (LSPM) is proposed which intrinsically infers the effective dimension of the latent space. The LSPM employs a Bayesian nonparametric multiplicative truncated gamma process prior that ensures shrinkage of the variance of the latent positions across higher dimensions. Dimensions with non-negligible variance are deemed most useful to describe the observed network, inducing automatic inference on the latent space dimension. While the LSPM is applicable to many network types, logistic and Poisson LSPMs are developed here for binary and count networks respectively. Inference proceeds via a Markov chain Monte Carlo algorithm, where novel surrogate proposal distributions reduce the computational burden. The LSPM’s properties are assessed through simulation studies, and its utility is illustrated through application to real network datasets. Open source software assists wider implementation of the LSPM.","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":"124 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135051558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Fast Bayesian Functional Regression for Non-Gaussian Spatial Data 非高斯空间数据的快速贝叶斯函数回归
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-01 DOI: 10.1214/22-ba1354
Hyun Bin Kang, Yeo Jin Jung, Jaewoo Park
{"title":"Fast Bayesian Functional Regression for Non-Gaussian Spatial Data","authors":"Hyun Bin Kang, Yeo Jin Jung, Jaewoo Park","doi":"10.1214/22-ba1354","DOIUrl":"https://doi.org/10.1214/22-ba1354","url":null,"abstract":"","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48879362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
On the Use of a Local Rˆ to Improve MCMC Convergence Diagnostic 利用局部R -改进MCMC收敛诊断
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-01 DOI: 10.1214/23-ba1399
Théo Moins, Julyan Arbel, A. Dutfoy, S. Girard
Diagnosing convergence of Markov chain Monte Carlo is crucial and remains an essentially unsolved problem. Among the most popular methods, the potential scale reduction factor, commonly named ˆ R , is an indicator that monitors the convergence of output chains to a target distribution, based on a comparison of the between- and within-variances. Several improvements have been suggested since its introduction in the 90s. Here, we aim at better understanding the ˆ R behavior by proposing a localized version that focuses on quantiles of the target distribution. This new version relies on key theoretical properties of the associated population value. It naturally leads to proposing a new indicator ˆ R ∞ , which is shown to allow both for localizing the Markov chain Monte Carlo convergence in different quantiles of the target distribution, and at the same time for handling some convergence issues not detected by other ˆ R versions.
马尔可夫链蒙特卡罗收敛性诊断是一个关键问题,也是一个尚未解决的问题。在最流行的方法中,潜在尺度缩减因子,通常称为R,是一种监测输出链向目标分布收敛的指标,基于对差异之间和差异内的比较。自90年代推出以来,已经提出了一些改进建议。在这里,我们的目标是通过提出一个专注于目标分布的分位数的本地化版本来更好地理解R行为。这个新版本依赖于相关人口值的关键理论属性。这自然导致提出一个新的指标- R∞,它被证明可以在目标分布的不同分位数中定位马尔可夫链蒙特卡罗收敛,同时可以处理一些其他- R版本无法检测到的收敛问题。
{"title":"On the Use of a Local Rˆ to Improve MCMC Convergence Diagnostic","authors":"Théo Moins, Julyan Arbel, A. Dutfoy, S. Girard","doi":"10.1214/23-ba1399","DOIUrl":"https://doi.org/10.1214/23-ba1399","url":null,"abstract":"Diagnosing convergence of Markov chain Monte Carlo is crucial and remains an essentially unsolved problem. Among the most popular methods, the potential scale reduction factor, commonly named ˆ R , is an indicator that monitors the convergence of output chains to a target distribution, based on a comparison of the between- and within-variances. Several improvements have been suggested since its introduction in the 90s. Here, we aim at better understanding the ˆ R behavior by proposing a localized version that focuses on quantiles of the target distribution. This new version relies on key theoretical properties of the associated population value. It naturally leads to proposing a new indicator ˆ R ∞ , which is shown to allow both for localizing the Markov chain Monte Carlo convergence in different quantiles of the target distribution, and at the same time for handling some convergence issues not detected by other ˆ R versions.","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44085501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Objective Bayesian Model Selection for Spatial Hierarchical Models with Intrinsic Conditional Autoregressive Priors 目的基于条件自回归先验的空间层次模型的贝叶斯模型选择
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-01 DOI: 10.1214/23-ba1375
Erica M. Porter, C. Franck, Marco A. R. Ferreira
{"title":"Objective Bayesian Model Selection for Spatial Hierarchical Models with Intrinsic Conditional Autoregressive Priors","authors":"Erica M. Porter, C. Franck, Marco A. R. Ferreira","doi":"10.1214/23-ba1375","DOIUrl":"https://doi.org/10.1214/23-ba1375","url":null,"abstract":"","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42465458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Coarsened Mixtures of Hierarchical Skew Normal Kernels for Flow and Mass Cytometry Analyses 用于流式细胞术和流式细胞术分析的分层倾斜正态核的粗化混合物
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-01 DOI: 10.1214/22-ba1356
S. Gorsky, Cliburn Chan, Li Ma
{"title":"Coarsened Mixtures of Hierarchical Skew Normal Kernels for Flow and Mass Cytometry Analyses","authors":"S. Gorsky, Cliburn Chan, Li Ma","doi":"10.1214/22-ba1356","DOIUrl":"https://doi.org/10.1214/22-ba1356","url":null,"abstract":"","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47319315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Warped Gradient-Enhanced Gaussian Process Surrogate Models for Exponential Family Likelihoods with Intractable Normalizing Constants 具有难以处理的归一化常数的指数族似然的扭曲梯度增强高斯过程代理模型
2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-01 DOI: 10.1214/23-ba1400
Quan Vu, Matthew T. Moores, Andrew Zammit-Mangion
Markov chain Monte Carlo methods for exponential family models with intractable normalizing constant, such as the exchange algorithm, require simulations of the sufficient statistics at every iteration of the Markov chain, which often result in expensive computations. Surrogate models for the likelihood function have been developed to accelerate inference algorithms in this context. However, these surrogate models tend to be relatively inflexible, and often provide a poor approximation to the true likelihood function. In this article, we propose the use of a warped, gradient-enhanced, Gaussian process surrogate model for the likelihood function, which jointly models the sample means and variances of the sufficient statistics, and uses warping functions to capture covariance nonstationarity in the input parameter space. We show that both the consideration of nonstationarity and the inclusion of gradient information can be leveraged to obtain a surrogate model that outperforms the conventional stationary Gaussian process surrogate model when making inference, particularly in regions where the likelihood function exhibits a phase transition. We also show that the proposed surrogate model can be used to improve the effective sample size per unit time when embedded in exact inferential algorithms. The utility of our approach in speeding up inferential algorithms is demonstrated on simulated and real-world data.
对于具有难以处理的归一化常数的指数族模型的马尔可夫链蒙特卡罗方法,如交换算法,需要在马尔可夫链的每次迭代中模拟足够的统计量,这往往导致昂贵的计算。已经开发了似然函数的代理模型来加速这种情况下的推理算法。然而,这些替代模型往往相对不灵活,并且通常提供对真实似然函数的较差近似值。在本文中,我们提出使用一个扭曲的、梯度增强的、高斯过程的似然函数代理模型,该模型联合建模充分统计量的样本均值和方差,并使用扭曲函数来捕获输入参数空间中的协方差非平稳性。我们表明,在进行推理时,可以利用非平稳性和梯度信息的考虑来获得优于传统平稳高斯过程代理模型的代理模型,特别是在似然函数显示相变的区域。我们还表明,当嵌入精确的推理算法时,所提出的代理模型可以用于提高单位时间内的有效样本量。我们的方法在加速推理算法方面的效用在模拟和现实世界的数据上得到了证明。
{"title":"Warped Gradient-Enhanced Gaussian Process Surrogate Models for Exponential Family Likelihoods with Intractable Normalizing Constants","authors":"Quan Vu, Matthew T. Moores, Andrew Zammit-Mangion","doi":"10.1214/23-ba1400","DOIUrl":"https://doi.org/10.1214/23-ba1400","url":null,"abstract":"Markov chain Monte Carlo methods for exponential family models with intractable normalizing constant, such as the exchange algorithm, require simulations of the sufficient statistics at every iteration of the Markov chain, which often result in expensive computations. Surrogate models for the likelihood function have been developed to accelerate inference algorithms in this context. However, these surrogate models tend to be relatively inflexible, and often provide a poor approximation to the true likelihood function. In this article, we propose the use of a warped, gradient-enhanced, Gaussian process surrogate model for the likelihood function, which jointly models the sample means and variances of the sufficient statistics, and uses warping functions to capture covariance nonstationarity in the input parameter space. We show that both the consideration of nonstationarity and the inclusion of gradient information can be leveraged to obtain a surrogate model that outperforms the conventional stationary Gaussian process surrogate model when making inference, particularly in regions where the likelihood function exhibits a phase transition. We also show that the proposed surrogate model can be used to improve the effective sample size per unit time when embedded in exact inferential algorithms. The utility of our approach in speeding up inferential algorithms is demonstrated on simulated and real-world data.","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135103369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Objective Bayesian Meta-Analysis Based on Generalized Marginal Multivariate Random Effects Model 基于广义边际多变量随机效应模型的目标贝叶斯元分析
IF 4.4 2区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-01 DOI: 10.1214/23-ba1363
Olha Bodnar, Taras Bodnar
{"title":"Objective Bayesian Meta-Analysis Based on Generalized Marginal Multivariate Random Effects Model","authors":"Olha Bodnar, Taras Bodnar","doi":"10.1214/23-ba1363","DOIUrl":"https://doi.org/10.1214/23-ba1363","url":null,"abstract":"","PeriodicalId":55398,"journal":{"name":"Bayesian Analysis","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45398510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Bayesian Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1