首页 > 最新文献

arXiv: Computation最新文献

英文 中文
TMB: Automatic Differentiation and Laplace Approximation TMB:自动微分和拉普拉斯近似
Pub Date : 2015-09-02 DOI: 10.18637/jss.v070.i05
K. Kristensen, Anders Nielsen, Casper W. Berg, H. Skaug, B. Bell
TMB is an open source R package that enables quick implementation of complex nonlinear random effect (latent variable) models in a manner similar to the established AD Model Builder package (ADMB, admb-project.org). In addition, it offers easy access to parallel computations. The user defines the joint likelihood for the data and the random effects as a C++ template function, while all the other operations are done in R; e.g., reading in the data. The package evaluates and maximizes the Laplace approximation of the marginal likelihood where the random effects are automatically integrated out. This approximation, and its derivatives, are obtained using automatic differentiation (up to order three) of the joint likelihood. The computations are designed to be fast for problems with many random effects (~10^6) and parameters (~10^3). Computation times using ADMB and TMB are compared on a suite of examples ranging from simple models to large spatial models where the random effects are a Gaussian random field. Speedups ranging from 1.5 to about 100 are obtained with increasing gains for large problems. The package and examples are available at this http URL
TMB是一个开源的R包,可以快速实现复杂的非线性随机效应(潜在变量)模型,其方式类似于已建立的AD模型生成器包(ADMB, admbproject.org)。此外,它还提供了方便的并行计算访问。用户将数据和随机效应的联合似然定义为c++模板函数,而所有其他操作都在R中完成;例如,读入数据。包评估和最大化边际似然的拉普拉斯近似,其中随机效应被自动集成。这种近似及其导数是使用联合似然的自动微分(最高三阶)获得的。对于有许多随机效应(~10^6)和参数(~10^3)的问题,计算速度很快。从简单模型到随机效应为高斯随机场的大空间模型,比较了ADMB和TMB的计算时间。对于大型问题,加速范围从1.5到大约100不等,并且增益越来越大。该包和示例可在此http URL中获得
{"title":"TMB: Automatic Differentiation and Laplace Approximation","authors":"K. Kristensen, Anders Nielsen, Casper W. Berg, H. Skaug, B. Bell","doi":"10.18637/jss.v070.i05","DOIUrl":"https://doi.org/10.18637/jss.v070.i05","url":null,"abstract":"TMB is an open source R package that enables quick implementation of complex nonlinear random effect (latent variable) models in a manner similar to the established AD Model Builder package (ADMB, admb-project.org). In addition, it offers easy access to parallel computations. The user defines the joint likelihood for the data and the random effects as a C++ template function, while all the other operations are done in R; e.g., reading in the data. The package evaluates and maximizes the Laplace approximation of the marginal likelihood where the random effects are automatically integrated out. This approximation, and its derivatives, are obtained using automatic differentiation (up to order three) of the joint likelihood. The computations are designed to be fast for problems with many random effects (~10^6) and parameters (~10^3). Computation times using ADMB and TMB are compared on a suite of examples ranging from simple models to large spatial models where the random effects are a Gaussian random field. Speedups ranging from 1.5 to about 100 are obtained with increasing gains for large problems. The package and examples are available at this http URL","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"14 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81128888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 641
Transitional annealed adaptive slice sampling for Gaussian process hyper-parameter estimation 过渡退火自适应切片采样用于高斯过程超参数估计
Pub Date : 2015-09-01 DOI: 10.1615/Int.J.UncertaintyQuantification.2016018590
Alfredo Garbuno-Iñigo, F. DiazDelaO, K. Zuev
Surrogate models have become ubiquitous in science and engineering for their capability of emulating expensive computer codes, necessary to model and investigate complex phenomena. Bayesian emulators based on Gaussian processes adequately quantify the uncertainty that results from the cost of the original simulator, and thus the inability to evaluate it on the whole input space. However, it is common in the literature that only a partial Bayesian analysis is carried out, whereby the underlying hyper-parameters are estimated via gradient-free optimisation or genetic algorithms, to name a few methods. On the other hand, maximum a posteriori (MAP) estimation could discard important regions of the hyper-parameter space. In this paper, we carry out a more complete Bayesian inference, that combines Slice Sampling with some recently developed Sequential Monte Carlo samplers. The resulting algorithm improves the mixing in the sampling through delayed-rejection, the inclusion of an annealing scheme akin to Asymptotically Independent Markov Sampling and parallelisation via Transitional Markov Chain Monte Carlo. Examples related to the estimation of Gaussian process hyper-parameters are presented. For the purpose of reproducibility, further development, and use in other applications, the code to generate the examples in this paper is freely available for download at this http URL
代理模型在科学和工程中已经变得无处不在,因为它们能够模拟昂贵的计算机代码,这是建模和研究复杂现象所必需的。基于高斯过程的贝叶斯仿真器充分量化了原始仿真器成本导致的不确定性,从而无法在整个输入空间上对其进行评估。然而,在文献中,仅进行部分贝叶斯分析是很常见的,其中通过无梯度优化或遗传算法估计潜在的超参数,仅举几个方法。另一方面,最大后验估计(MAP)可以丢弃超参数空间的重要区域。在本文中,我们进行了一个更完整的贝叶斯推理,结合切片采样和一些最近发展的顺序蒙特卡罗采样器。所得到的算法通过延迟抑制、包含类似于渐近独立马尔可夫抽样的退火方案和通过过渡马尔可夫链蒙特卡罗并行化来改善采样中的混合。给出了高斯过程超参数估计的相关实例。出于可再现性、进一步开发和在其他应用程序中使用的目的,本文中生成示例的代码可以从这个http URL免费下载
{"title":"Transitional annealed adaptive slice sampling for Gaussian process hyper-parameter estimation","authors":"Alfredo Garbuno-Iñigo, F. DiazDelaO, K. Zuev","doi":"10.1615/Int.J.UncertaintyQuantification.2016018590","DOIUrl":"https://doi.org/10.1615/Int.J.UncertaintyQuantification.2016018590","url":null,"abstract":"Surrogate models have become ubiquitous in science and engineering for their capability of emulating expensive computer codes, necessary to model and investigate complex phenomena. Bayesian emulators based on Gaussian processes adequately quantify the uncertainty that results from the cost of the original simulator, and thus the inability to evaluate it on the whole input space. However, it is common in the literature that only a partial Bayesian analysis is carried out, whereby the underlying hyper-parameters are estimated via gradient-free optimisation or genetic algorithms, to name a few methods. On the other hand, maximum a posteriori (MAP) estimation could discard important regions of the hyper-parameter space. In this paper, we carry out a more complete Bayesian inference, that combines Slice Sampling with some recently developed Sequential Monte Carlo samplers. The resulting algorithm improves the mixing in the sampling through delayed-rejection, the inclusion of an annealing scheme akin to Asymptotically Independent Markov Sampling and parallelisation via Transitional Markov Chain Monte Carlo. Examples related to the estimation of Gaussian process hyper-parameters are presented. For the purpose of reproducibility, further development, and use in other applications, the code to generate the examples in this paper is freely available for download at this http URL","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89004560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Rare Event Simulation 罕见事件模拟
Pub Date : 2015-08-20 DOI: 10.1007/978-3-319-11259-6_24-1
J. Beck, K. Zuev
{"title":"Rare Event Simulation","authors":"J. Beck, K. Zuev","doi":"10.1007/978-3-319-11259-6_24-1","DOIUrl":"https://doi.org/10.1007/978-3-319-11259-6_24-1","url":null,"abstract":"","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"22 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85913723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Local likelihood estimation for covariance functions with spatially-varying parameters: the convoSPAT package for R 具有空间变化参数的协方差函数的局部似然估计:R的convoSPAT包
Pub Date : 2015-07-30 DOI: 10.18637/JSS.V081.I14
M. Risser, Catherine A. Calder
In spite of the interest in and appeal of convolution-based approaches for nonstationary spatial modeling, off-the-shelf software for model fitting does not as of yet exist. Convolution-based models are highly flexible yet notoriously difficult to fit, even with relatively small data sets. The general lack of pre-packaged options for model fitting makes it difficult to compare new methodology in nonstationary modeling with other existing methods, and as a result most new models are simply compared to stationary models. Using a convolution-based approach, we present a new nonstationary covariance function for spatial Gaussian process models that allows for efficient computing in two ways: first, by representing the spatially-varying parameters via a discrete mixture or "mixture component" model, and second, by estimating the mixture component parameters through a local likelihood approach. In order to make computation for a convolution-based nonstationary spatial model readily available, this paper also presents and describes the convoSPAT package for R. The nonstationary model is fit to both a synthetic data set and a real data application involving annual precipitation to demonstrate the capabilities of the package.
尽管基于卷积的非平稳空间建模方法引起了人们的兴趣和吸引力,但用于模型拟合的现成软件尚未存在。基于卷积的模型非常灵活,但即使是相对较小的数据集,也很难拟合。由于模型拟合通常缺乏预先打包的选项,因此很难将非平稳建模中的新方法与其他现有方法进行比较,因此大多数新模型只是与平稳模型进行比较。使用基于卷积的方法,我们为空间高斯过程模型提出了一个新的非平稳协方差函数,该函数允许以两种方式进行高效计算:首先,通过离散混合或“混合成分”模型表示空间变化的参数,其次,通过局部似然方法估计混合成分参数。为了方便地计算基于卷积的非平稳空间模型,本文还提出并描述了r的convoSPAT包。非平稳模型适合于合成数据集和涉及年降水的实际数据应用,以证明该包的能力。
{"title":"Local likelihood estimation for covariance functions with spatially-varying parameters: the convoSPAT package for R","authors":"M. Risser, Catherine A. Calder","doi":"10.18637/JSS.V081.I14","DOIUrl":"https://doi.org/10.18637/JSS.V081.I14","url":null,"abstract":"In spite of the interest in and appeal of convolution-based approaches for nonstationary spatial modeling, off-the-shelf software for model fitting does not as of yet exist. Convolution-based models are highly flexible yet notoriously difficult to fit, even with relatively small data sets. The general lack of pre-packaged options for model fitting makes it difficult to compare new methodology in nonstationary modeling with other existing methods, and as a result most new models are simply compared to stationary models. Using a convolution-based approach, we present a new nonstationary covariance function for spatial Gaussian process models that allows for efficient computing in two ways: first, by representing the spatially-varying parameters via a discrete mixture or \"mixture component\" model, and second, by estimating the mixture component parameters through a local likelihood approach. In order to make computation for a convolution-based nonstationary spatial model readily available, this paper also presents and describes the convoSPAT package for R. The nonstationary model is fit to both a synthetic data set and a real data application involving annual precipitation to demonstrate the capabilities of the package.","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"32 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85400302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Bayesian inference for diffusion driven mixed-effects models 扩散驱动混合效应模型的贝叶斯推理
Pub Date : 2015-07-24 DOI: 10.1214/16-BA1009
G. Whitaker, A. Golightly, R. Boys, C. Sherlock
Stochastic differential equations (SDEs) provide a natural framework for modelling intrinsic stochasticity inherent in many continuous-time physical processes. When such processes are observed in multiple individuals or experimental units, SDE driven mixed-effects models allow the quantification of between (as well as within) individual variation. Performing Bayesian inference for such models, using discrete time data that may be incomplete and subject to measurement error is a challenging problem and is the focus of this paper. We extend a recently proposed MCMC scheme to include the SDE driven mixed-effects framework. Fundamental to our approach is the development of a novel construct that allows for efficient sampling of conditioned SDEs that may exhibit nonlinear dynamics between observation times. We apply the resulting scheme to synthetic data generated from a simple SDE model of orange tree growth, and real data consisting of observations on aphid numbers recorded under a variety of different treatment regimes. In addition, we provide a systematic comparison of our approach with an inference scheme based on a tractable approximation of the SDE, that is, the linear noise approximation.
随机微分方程(SDEs)为许多连续时间物理过程的固有随机性建模提供了一个自然的框架。当在多个个体或实验单元中观察到这些过程时,SDE驱动的混合效应模型允许量化个体之间(以及内部)的变化。使用可能不完整且存在测量误差的离散时间数据对此类模型进行贝叶斯推理是一个具有挑战性的问题,也是本文的重点。我们扩展了最近提出的MCMC方案,以包括SDE驱动的混合效果框架。我们方法的基础是开发一种新的结构,允许对可能在观测时间之间表现出非线性动态的条件SDEs进行有效采样。我们将结果方案应用于由简单SDE模型生成的橙树生长合成数据,以及在各种不同处理制度下记录的蚜虫数量的实际数据。此外,我们还将我们的方法与基于SDE的可处理近似的推理方案(即线性噪声近似)进行了系统的比较。
{"title":"Bayesian inference for diffusion driven mixed-effects models","authors":"G. Whitaker, A. Golightly, R. Boys, C. Sherlock","doi":"10.1214/16-BA1009","DOIUrl":"https://doi.org/10.1214/16-BA1009","url":null,"abstract":"Stochastic differential equations (SDEs) provide a natural framework for modelling intrinsic stochasticity inherent in many continuous-time physical processes. When such processes are observed in multiple individuals or experimental units, SDE driven mixed-effects models allow the quantification of between (as well as within) individual variation. Performing Bayesian inference for such models, using discrete time data that may be incomplete and subject to measurement error is a challenging problem and is the focus of this paper. We extend a recently proposed MCMC scheme to include the SDE driven mixed-effects framework. Fundamental to our approach is the development of a novel construct that allows for efficient sampling of conditioned SDEs that may exhibit nonlinear dynamics between observation times. We apply the resulting scheme to synthetic data generated from a simple SDE model of orange tree growth, and real data consisting of observations on aphid numbers recorded under a variety of different treatment regimes. In addition, we provide a systematic comparison of our approach with an inference scheme based on a tractable approximation of the SDE, that is, the linear noise approximation.","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"285 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80252543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Fundamentals of cone regression 锥体回归原理
Pub Date : 2015-07-23 DOI: 10.1214/16-SS114
Mariella Dimiccoli
Cone regression is a particular case of quadratic programming that minimizes a weighted sum of squared residuals under a set of linear inequality constraints. Several important statistical problems such as isotonic, concave regression or ANOVA under partial orderings, just to name a few, can be considered as particular instances of the cone regression problem. Given its relevance in Statistics, this paper aims to address the fundamentals of cone regression from a theoretical and practical point of view. Several formulations of the cone regression problem are considered and, focusing on the particular case of concave regression as example, several algorithms are analyzed and compared both qualitatively and quantitatively through numerical simulations. Several improvements to enhance numerical stability and bound the computational cost are proposed. For each analyzed algorithm, the pseudo-code and its corresponding code in Scilab are provided. The results from this study demonstrate that the choice of the optimization approach strongly impacts the numerical performances. It is also shown that methods are not currently available to solve efficiently cone regression problems with large dimension (more than many thousands of points). We suggest further research to fill this gap by exploiting and adapting classical multi-scale strategy to compute an approximate solution.
圆锥回归是二次规划的一种特殊情况,它在一组线性不等式约束下最小化加权残差平方和。一些重要的统计问题,如等压、凹回归或偏序下的方差分析,仅举几例,可以被认为是锥回归问题的特殊实例。鉴于其在统计学中的相关性,本文旨在从理论和实践的角度解决锥体回归的基本原理。考虑了锥回归问题的几种表述,并以凹回归为例,通过数值模拟对几种算法进行了定性和定量的分析和比较。提出了提高数值稳定性和限制计算成本的若干改进措施。对于所分析的每个算法,都提供了伪代码及其在Scilab中的对应代码。研究结果表明,优化方法的选择对数值性能有很大影响。研究还表明,目前还没有有效解决大维度(超过数千个点)的锥回归问题的方法。我们建议通过进一步的研究来填补这一空白,利用和适应经典的多尺度策略来计算近似解。
{"title":"Fundamentals of cone regression","authors":"Mariella Dimiccoli","doi":"10.1214/16-SS114","DOIUrl":"https://doi.org/10.1214/16-SS114","url":null,"abstract":"Cone regression is a particular case of quadratic programming that minimizes a weighted sum of squared residuals under a set of linear inequality constraints. Several important statistical problems such as isotonic, concave regression or ANOVA under partial orderings, just to name a few, can be considered as particular instances of the cone regression problem. Given its relevance in Statistics, this paper aims to address the fundamentals of cone regression from a theoretical and practical point of view. Several formulations of the cone regression problem are considered and, focusing on the particular case of concave regression as example, several algorithms are analyzed and compared both qualitatively and quantitatively through numerical simulations. Several improvements to enhance numerical stability and bound the computational cost are proposed. For each analyzed algorithm, the pseudo-code and its corresponding code in Scilab are provided. The results from this study demonstrate that the choice of the optimization approach strongly impacts the numerical performances. It is also shown that methods are not currently available to solve efficiently cone regression problems with large dimension (more than many thousands of points). We suggest further research to fill this gap by exploiting and adapting classical multi-scale strategy to compute an approximate solution.","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"22 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83588590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Rare Event Simulation and Splitting for Discontinuous Random Variables 不连续随机变量的罕见事件模拟与分裂
Pub Date : 2015-07-03 DOI: 10.1051/PS/2015017
Clément Walter
Multilevel Splitting methods, also called Sequential Monte-Carlo or emph{Subset Simulation}, are widely used methods for estimating extreme probabilities of the form $P[S(mathbf{U}) > q]$ where $S$ is a deterministic real-valued function and $mathbf{U}$ can be a random finite- or infinite-dimensional vector. Very often, $X := S(mathbf{U})$ is supposed to be a continuous random variable and a lot of theoretical results on the statistical behaviour of the estimator are now derived with this hypothesis. However, as soon as some threshold effect appears in $S$ and/or $mathbf{U}$ is discrete or mixed discrete/continuous this assumption does not hold any more and the estimator is not consistent. In this paper, we study the impact of discontinuities in the emph{cdf} of $X$ and present three unbiased emph{corrected} estimators to handle them. These estimators do not require to know in advance if $X$ is actually discontinuous or not and become all equal if $X$ is continuous. Especially, one of them has the same statistical properties in any case. Efficiency is shown on a 2-D diffusive process as well as on the emph{Boolean SATisfiability problem} (SAT).
多层分裂方法,也称为顺序蒙特卡罗或emph{子集模拟},是广泛用于估计形式$P[S(mathbf{U}) > q]$的极端概率的方法,其中$S$是一个确定性的实值函数,$mathbf{U}$可以是一个随机的有限维或无限维向量。通常,$X := S(mathbf{U})$被认为是一个连续的随机变量,许多关于估计量的统计行为的理论结果现在都是用这个假设推导出来的。然而,一旦一些阈值效应出现在$S$和/或$mathbf{U}$是离散的或混合离散/连续的,这个假设不再成立,估计量不一致。本文研究了$X$的emph{cdf}中不连续的影响,并给出了三个无偏emph{校正}估计来处理它们。这些估计器不需要事先知道$X$是否实际上是不连续的,如果$X$是连续的,则它们都相等。特别是,它们中的一个在任何情况下都具有相同的统计属性。在二维扩散过程和emph{布尔可满足性问题}(SAT)上证明了该方法的有效性。
{"title":"Rare Event Simulation and Splitting for Discontinuous Random Variables","authors":"Clément Walter","doi":"10.1051/PS/2015017","DOIUrl":"https://doi.org/10.1051/PS/2015017","url":null,"abstract":"Multilevel Splitting methods, also called Sequential Monte-Carlo or emph{Subset Simulation}, are widely used methods for estimating extreme probabilities of the form $P[S(mathbf{U}) > q]$ where $S$ is a deterministic real-valued function and $mathbf{U}$ can be a random finite- or infinite-dimensional vector. Very often, $X := S(mathbf{U})$ is supposed to be a continuous random variable and a lot of theoretical results on the statistical behaviour of the estimator are now derived with this hypothesis. However, as soon as some threshold effect appears in $S$ and/or $mathbf{U}$ is discrete or mixed discrete/continuous this assumption does not hold any more and the estimator is not consistent. \u0000In this paper, we study the impact of discontinuities in the emph{cdf} of $X$ and present three unbiased emph{corrected} estimators to handle them. These estimators do not require to know in advance if $X$ is actually discontinuous or not and become all equal if $X$ is continuous. Especially, one of them has the same statistical properties in any case. Efficiency is shown on a 2-D diffusive process as well as on the emph{Boolean SATisfiability problem} (SAT).","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"21 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74675926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Adapting the ABC distance function 采用ABC距离函数
Pub Date : 2015-07-03 DOI: 10.1214/16-BA1002
D. Prangle
Approximate Bayesian computation performs approximate inference for models where likelihood computations are expensive or impossible. Instead simulations from the model are performed for various parameter values and accepted if they are close enough to the observations. There has been much progress on deciding which summary statistics of the data should be used to judge closeness, but less work on how to weight them. Typically weights are chosen at the start of the algorithm which normalise the summary statistics to vary on similar scales. However these may not be appropriate in iterative ABC algorithms, where the distribution from which the parameters are proposed is updated. This can substantially alter the resulting distribution of summary statistics, so that different weights are needed for normalisation. This paper presents two iterative ABC algorithms which adaptively update their weights and demonstrates improved results on test applications.
近似贝叶斯计算对似然计算昂贵或不可能的模型进行近似推理。相反,对各种参数值进行模型模拟,如果它们与观测值足够接近,则接受模型模拟。在决定应该使用哪些数据的汇总统计数据来判断接近程度方面,已经取得了很大进展,但在如何衡量这些数据的权重方面,工作却很少。通常,权重是在算法开始时选择的,它使汇总统计数据在相似的尺度上规范化。然而,这些可能不适用于迭代ABC算法,其中提出参数的分布是更新的。这可能会极大地改变汇总统计的结果分布,因此需要不同的权重来进行规范化。本文提出了两种自适应更新权值的迭代ABC算法,并在测试应用中证明了改进的结果。
{"title":"Adapting the ABC distance function","authors":"D. Prangle","doi":"10.1214/16-BA1002","DOIUrl":"https://doi.org/10.1214/16-BA1002","url":null,"abstract":"Approximate Bayesian computation performs approximate inference for models where likelihood computations are expensive or impossible. Instead simulations from the model are performed for various parameter values and accepted if they are close enough to the observations. There has been much progress on deciding which summary statistics of the data should be used to judge closeness, but less work on how to weight them. Typically weights are chosen at the start of the algorithm which normalise the summary statistics to vary on similar scales. However these may not be appropriate in iterative ABC algorithms, where the distribution from which the parameters are proposed is updated. This can substantially alter the resulting distribution of summary statistics, so that different weights are needed for normalisation. This paper presents two iterative ABC algorithms which adaptively update their weights and demonstrates improved results on test applications.","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"26 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75352534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 86
The Parallel C++ Statistical Library for Bayesian Inference: QUESO 并行c++贝叶斯推理统计库:QUESO
Pub Date : 2015-07-02 DOI: 10.1007/978-3-319-12385-1_57
Damon McDougall, Nicholas Malaya, R. Moser
{"title":"The Parallel C++ Statistical Library for Bayesian Inference: QUESO","authors":"Damon McDougall, Nicholas Malaya, R. Moser","doi":"10.1007/978-3-319-12385-1_57","DOIUrl":"https://doi.org/10.1007/978-3-319-12385-1_57","url":null,"abstract":"","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89786513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Leave Pima Indians alone: binary regression as a benchmark for Bayesian computation 离皮马印第安人远点:二元回归作为贝叶斯计算的基准
Pub Date : 2015-06-29 DOI: 10.1214/16-STS581
N. Chopin, James Ridgway
Abstract. Whenever a new approach to perform Bayesian computation is introduced, a common practice is to showcase this approach on a binary regression model and datasets of moderate size. This paper discusses to which extent this practice is sound. It also reviews the current state of the art of Bayesian computation, using binary regression as a running example. Both sampling-based algorithms (importance sampling, MCMC and SMC) and fast approximations (Laplace and EP) are covered. Extensive numerical results are provided, some of which might go against conventional wisdom regarding the effectiveness of certain algorithms. Implications for other problems (variable selection) and other models are also discussed.
摘要每当引入一种执行贝叶斯计算的新方法时,通常的做法是在中等大小的二元回归模型和数据集上展示这种方法。本文讨论了这种做法在多大程度上是合理的。它还回顾了贝叶斯计算技术的当前状态,使用二元回归作为一个运行的例子。包括基于采样的算法(重要性采样,MCMC和SMC)和快速逼近(拉普拉斯和EP)。提供了广泛的数值结果,其中一些可能违背关于某些算法有效性的传统智慧。对其他问题(变量选择)和其他模型的含义也进行了讨论。
{"title":"Leave Pima Indians alone: binary regression as a benchmark for Bayesian computation","authors":"N. Chopin, James Ridgway","doi":"10.1214/16-STS581","DOIUrl":"https://doi.org/10.1214/16-STS581","url":null,"abstract":"Abstract. Whenever a new approach to perform Bayesian computation is introduced, a common practice is to showcase this approach on a binary regression model and datasets of moderate size. This paper discusses to which extent this practice is sound. It also reviews the current state of the art of Bayesian computation, using binary regression as a running example. Both sampling-based algorithms (importance sampling, MCMC and SMC) and fast approximations (Laplace and EP) are covered. Extensive numerical results are provided, some of which might go against conventional wisdom regarding the effectiveness of certain algorithms. Implications for other problems (variable selection) and other models are also discussed.","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"32 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80903054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 63
期刊
arXiv: Computation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1