Stochastic Newton Sampler: R Package sns
A. S. Mahani, Asad Hasan, Marshall Jiang, M. Sharabiani
Pub Date: 2015-02-06. DOI: 10.18637/JSS.V074.C02
The R package sns implements the Stochastic Newton Sampler (SNS), a Metropolis-Hastings Markov chain Monte Carlo (MCMC) algorithm in which the proposal density is a multivariate Gaussian based on a local, second-order Taylor-series expansion of the log-density. The mean of the proposal is the full Newton step of the Newton-Raphson optimization algorithm. Taking advantage of the local, multivariate geometry captured in the log-density Hessian allows SNS to be more efficient than univariate samplers, approaching independent sampling as the density function increasingly resembles a multivariate Gaussian. SNS requires the log-density Hessian to be negative-definite everywhere in order to construct a valid proposal; this property holds, or can easily be checked, for many GLM-like models. When the initial point is far from the density peak, running SNS in non-stochastic mode, i.e., taking the full Newton step augmented with line search, allows the MCMC chain to converge to high-density areas faster. For high-dimensional problems, partitioning the state space into lower-dimensional subsets and applying SNS to the subsets within a Gibbs sampling framework can significantly improve the mixing of SNS chains. In addition to these strategies for improving convergence and mixing, sns offers diagnostics and visualization capabilities, as well as a function for sample-based calculation of Bayesian posterior predictive distributions.
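To make the proposal construction concrete, the following R snippet sketches a single SNS-style Metropolis-Hastings step on a toy logistic-regression log-posterior. It is a minimal illustration of the idea, not the sns package's API; the data, function names, and dimensions are chosen here purely for demonstration.

```r
# One SNS-style MH step: propose from N(x - H^{-1} g, -H^{-1}), i.e., a Gaussian
# matched to the local second-order Taylor expansion of the log-density.
library(mvtnorm)  # rmvnorm()/dmvnorm() for the Gaussian proposal

set.seed(1)
X <- cbind(1, rnorm(100))
y <- rbinom(100, 1, plogis(X %*% c(-0.5, 1)))           # toy logistic-regression data

logpost <- function(beta) sum(dbinom(y, 1, plogis(X %*% beta), log = TRUE))
grad    <- function(beta) drop(crossprod(X, y - plogis(X %*% beta)))
hess    <- function(beta) { p <- drop(plogis(X %*% beta)); -crossprod(X, p * (1 - p) * X) }

sns_step <- function(x) {
  Sig_x  <- solve(-hess(x))                              # proposal covariance = -H(x)^{-1}
  mu_x   <- drop(x + Sig_x %*% grad(x))                  # proposal mean = full Newton step
  y_prop <- drop(rmvnorm(1, mean = mu_x, sigma = Sig_x))
  Sig_y  <- solve(-hess(y_prop))                         # reverse-move proposal quantities
  mu_y   <- drop(y_prop + Sig_y %*% grad(y_prop))
  log_alpha <- logpost(y_prop) - logpost(x) +
    dmvnorm(x, mu_y, Sig_y, log = TRUE) - dmvnorm(y_prop, mu_x, Sig_x, log = TRUE)
  if (log(runif(1)) < log_alpha) y_prop else x           # Metropolis-Hastings accept/reject
}

chain <- matrix(0, nrow = 1000, ncol = 2)
for (i in 2:1000) chain[i, ] <- sns_step(chain[i - 1, ])
```

Because the Hessian of a Bernoulli-GLM log-likelihood is negative-definite, the proposal covariance -H^{-1} in this toy example is valid at every point of the chain, matching the abstract's remark about GLM-like models.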
Polynomial-Chaos-based Kriging
R. Schöbi, B. Sudret, J. Wiart
Pub Date: 2015-02-01. DOI: 10.1615/INT.J.UNCERTAINTYQUANTIFICATION.2015012467
Computer simulation has become the standard tool in many engineering fields for designing and optimizing systems, as well as for assessing their reliability. To cope with demanding analyses such as optimization and reliability assessment, surrogate models (a.k.a. meta-models) have been increasingly investigated over the last decade. Polynomial chaos expansions (PCE) and Kriging are two popular non-intrusive meta-modeling techniques. PCE surrogates the computational model with a series of orthonormal polynomials in the input variables, where the polynomials are chosen in accordance with the probability distributions of those input variables. Kriging, on the other hand, assumes that the computer model behaves as a realization of a Gaussian random process whose parameters are estimated from the available computer runs, i.e., input vectors and response values. These two techniques have so far been developed largely in parallel, with little interaction between the researchers in the two fields. In this paper, PC-Kriging is derived as a new non-intrusive meta-modeling approach combining PCE and Kriging. A sparse set of orthonormal polynomials (PCE) approximates the global behavior of the computational model, whereas Kriging captures the local variability of the model output. An adaptive algorithm similar to least angle regression determines the optimal sparse set of polynomials. PC-Kriging is validated on various benchmark analytical functions that are easy to sample for reference results. The numerical investigations show that PC-Kriging performs at least as well as, and often better than, either of the two individual meta-modeling techniques. The gain in accuracy is largest when the experimental design is small, which is an asset when dealing with demanding computational models.
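As described above, PC-Kriging can be viewed as universal Kriging whose trend is a sparse polynomial chaos expansion. A sketch of the resulting surrogate, with notation chosen here for illustration rather than taken from the paper:

```latex
% PC-Kriging surrogate (illustrative notation):
%   A            sparse index set selected by the adaptive (LAR-type) algorithm
%   \psi_\alpha  polynomials orthonormal w.r.t. the input distribution
%   a_\alpha     trend coefficients
%   Z            zero-mean stationary Gaussian process (the Kriging residual)
\[
  \mathcal{M}(\boldsymbol{x}) \;\approx\;
  \sum_{\boldsymbol{\alpha} \in \mathcal{A}} a_{\boldsymbol{\alpha}}\,
  \psi_{\boldsymbol{\alpha}}(\boldsymbol{x}) \;+\; Z(\boldsymbol{x}).
\]
```

Here the polynomial sum captures the global trend while $Z(\boldsymbol{x})$ denotes a zero-mean stationary Gaussian process whose variance and correlation parameters are estimated from the same experimental design, as in ordinary Kriging.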
Enhancing reproducibility and collaboration via management of R package cohorts
Gabriel Becker, C. Barr, R. Gentleman, Michael Lawrence
Pub Date: 2015-01-09. DOI: 10.18637/JSS.V082.I01
Science depends on collaboration, result reproduction, and the development of supporting software tools. Each of these requires careful management of software versions. We present a unified model for installing, managing, and publishing software contexts in R. It introduces the package manifest as a central data structure for representing version-specific, decentralized package cohorts. The manifest points to package sources on arbitrary hosts and in various forms, including tarballs and directories under version control. We provide a high-level interface for creating and switching between side-by-side package libraries derived from manifests. Finally, we extend package installation to support the retrieval of exact package versions as indicated by manifests, and to maintain provenance for installed packages. The provenance information enables the user to publish libraries or sessions as manifests, hence completing the loop between publication and deployment. We have implemented this model across two software packages, switchr and GRANbase, and have released the source code under the Artistic 2.0 license.
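A hypothetical usage sketch of the workflow described above follows. The function names switchTo(), switchBack(), libManifest(), and GithubManifest() are from the switchr package, but the argument names and repository names used here are illustrative, not authoritative.

```r
# Hypothetical sketch of the manifest-based workflow (details are illustrative,
# not taken from the package documentation).
library(switchr)

# Describe a decentralized package cohort: sources may be CRAN tarballs,
# GitHub repositories, or directories under version control.
man <- GithubManifest("gmbecker/switchr", "gmbecker/GRANBase")

# Create (or switch to) a side-by-side package library seeded from the
# manifest, do some work, then switch back to the default library.
switchTo("paper-revision-2015", seed = man)
# ... install packages, run analyses ...
switchBack()

# Publish the current library as a manifest, closing the loop between
# deployment and publication.
pub <- libManifest()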
On Particle Methods for Parameter Estimation in State-Space Models
N. Kantas, A. Doucet, Sumeetpal S. Singh, J. Maciejowski, N. Chopin
Pub Date: 2014-12-30. DOI: 10.1214/14-STS511
Nonlinear non-Gaussian state-space models are ubiquitous in statistics, econometrics, information engineering and signal processing. Particle methods, also known as Sequential Monte Carlo (SMC) methods, provide reliable numerical approximations to the associated state inference problems. However, in most applications, the state-space model of interest also depends on unknown static parameters that need to be estimated from the data. In this context, standard particle methods fail and it is necessary to rely on more sophisticated algorithms. The aim of this paper is to present a comprehensive review of particle methods that have been proposed to perform static parameter estimation in state-space models. We discuss the advantages and limitations of these methods and illustrate their performance on simple models.
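As a concrete reminder of the building block these methods share, the following R sketch (not taken from the paper) implements a bootstrap particle filter that returns a log-likelihood estimate for a fixed static parameter in a toy AR(1)-plus-noise model; such estimates are what particle-based parameter-estimation schemes, e.g., particle marginal Metropolis-Hastings, are built on. The model and all numerical values are chosen here for illustration.

```r
# Bootstrap particle filter returning an estimate of log p(y_{1:T} | theta)
# for the toy model x_t = phi * x_{t-1} + sigma_x * eps_t, y_t = x_t + sigma_y * eta_t.
bootstrap_pf <- function(y, theta, N = 500) {
  # theta = c(phi, sigma_x, sigma_y)
  loglik <- 0
  x <- rnorm(N, 0, theta[2] / sqrt(1 - theta[1]^2))     # start from the stationary distribution
  for (t in seq_along(y)) {
    x <- theta[1] * x + theta[2] * rnorm(N)             # propagate particles through the dynamics
    w <- dnorm(y[t], mean = x, sd = theta[3])           # weight by the observation density
    loglik <- loglik + log(mean(w))                     # accumulate the likelihood estimate
    x <- sample(x, N, replace = TRUE, prob = w)         # multinomial resampling
  }
  loglik
}

# Example: simulate data, then evaluate the estimated log-likelihood on a grid of phi values.
set.seed(42)
x_true <- as.numeric(arima.sim(list(ar = 0.9), n = 200, sd = 0.5))
y_obs  <- x_true + rnorm(200, sd = 1)
sapply(c(0.5, 0.7, 0.9), function(phi) bootstrap_pf(y_obs, c(phi, 0.5, 1)))
```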
Multivariate-from-Univariate MCMC Sampler: R Package MfUSampler
A. S. Mahani, M. Sharabiani
Pub Date: 2014-12-25. DOI: 10.18637/JSS.V078.C01
The R package MfUSampler provides Markov chain Monte Carlo machinery for generating samples from multivariate probability distributions using univariate sampling algorithms such as the slice sampler and the adaptive rejection sampler. The sampler function performs a full cycle of univariate sampling steps, one coordinate at a time. In each step, the latest sample values obtained for the other coordinates are used to form the conditional distributions. The concept is an extension of Gibbs sampling in which each step involves, not an independent sample from the conditional distribution, but a Markov transition for which the conditional distribution is invariant. The software relies on the proportionality of conditional distributions to the joint distribution to implement a thin wrapper for producing conditionals. Examples illustrate basic usage as well as methods for improving performance. By encapsulating the multivariate-from-univariate logic, MfUSampler provides a reliable library for rapid prototyping of custom Bayesian models, while allowing for incremental performance optimizations such as exploiting conjugacy and conditional independence, and porting function evaluations to compiled languages.
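The multivariate-from-univariate idea can be sketched in a few lines of R. This is a conceptual illustration rather than the MfUSampler API: a full cycle updates one coordinate at a time with a univariate Markov transition that leaves the corresponding conditional invariant, and a univariate random-walk Metropolis step stands in here for the slice and adaptive-rejection samplers the package actually wraps.

```r
# One full coordinate-wise cycle: each coordinate is updated by a univariate
# Markov transition whose invariant density is the conditional of that
# coordinate (proportional to the joint log-density).
mfu_cycle <- function(x, logpdf, step = 0.5) {
  for (k in seq_along(x)) {
    prop    <- x
    prop[k] <- x[k] + step * rnorm(1)                   # univariate proposal in coordinate k
    if (log(runif(1)) < logpdf(prop) - logpdf(x)) x <- prop
  }
  x
}

# Example: sample from a correlated bivariate Gaussian via its joint log-density.
logpdf <- function(x) -0.5 * drop(t(x) %*% solve(matrix(c(1, 0.8, 0.8, 1), 2)) %*% x)
draws <- matrix(0, 2000, 2)
for (i in 2:2000) draws[i, ] <- mfu_cycle(draws[i - 1, ], logpdf)
```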
Vectorized and Parallel Particle Filter SMC Parameter Estimation for Stiff ODEs
Andrea Arnold, D. Calvetti, E. Somersalo
Pub Date: 2014-11-06. DOI: 10.3934/proc.2015.0075
Particle filter (PF) sequential Monte Carlo (SMC) methods are very attractive for estimating the parameters of time-dependent systems where the data are either not all available at once, or the range of time constants is wide enough to create problems in the numerical time propagation of the states. The need to evolve a large number of particles makes PF-based methods computationally challenging, the main bottlenecks being the time propagation of each particle and the large number of particles. While parallelization is typically advocated to speed up the computing time, vectorization of the algorithm on a single processor may result in even larger speedups for certain problems. In this paper we present a formulation of the PF-SMC class of algorithms proposed in Arnold et al. (2013) which is particularly amenable to a parallel or vectorized computing environment, and we illustrate the performance with a few computed examples in MATLAB.
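The vectorization point can be illustrated with a small R analogue (the paper's examples are in MATLAB): propagating all particles through one explicit Euler step either particle-by-particle or as a single matrix operation over the whole particle ensemble. The model and sizes below are arbitrary.

```r
# Looped vs. vectorized propagation of N particles through one Euler step of
# dx/dt = theta * x (component-wise decay), on a single processor.
N <- 1e5; d <- 3; dt <- 0.01
particles <- matrix(rnorm(N * d), nrow = N, ncol = d)    # one row per particle
theta <- c(-0.5, -1.0, -2.0)                             # decay rate per state component

# Looped propagation: one particle at a time.
system.time({
  out1 <- particles
  for (i in seq_len(N)) out1[i, ] <- out1[i, ] + dt * theta * out1[i, ]
})

# Vectorized propagation: all particles in one sweep of matrix arithmetic.
system.time({
  out2 <- particles + dt * sweep(particles, 2, theta, `*`)
})

stopifnot(all.equal(out1, out2))                         # identical results, very different cost
```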
A Block Circulant Embedding Method for Simulation of Stationary Gaussian Random Fields on Block-regular Grids
M. Park, M. Tretyakov
Pub Date: 2014-11-06. DOI: 10.1615/Int.J.UncertaintyQuantification.2015013781
We propose a new method for sampling from a stationary Gaussian random field on a grid that is not regular but has a regular block structure, as is often the case in applications. The proposed block circulant embedding method (BCEM) can outperform the classical circulant embedding method (CEM), which requires a regularization of the irregular grid before it can be applied. BCEM and CEM are compared on some typical model problems.
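For background, the classical method that BCEM extends can be sketched in R for a one-dimensional regular grid: embed the covariance sequence in a circulant matrix, diagonalize it with the FFT, and color complex white noise. This is the standard CEM, not the paper's block method; the covariance function and grid size are illustrative.

```r
# Classical circulant embedding (CEM) on a regular 1D grid of n points with
# spacing delta: exact sampling of a stationary Gaussian process.
circembed_sample <- function(n, cov_fun, delta = 1) {
  r      <- cov_fun(delta * (0:(n - 1)))               # covariances on the grid
  c_row  <- c(r, rev(r[2:(n - 1)]))                    # first row of the circulant embedding
  lambda <- Re(fft(c_row))                             # circulant eigenvalues via FFT
  if (any(lambda < -1e-10 * max(lambda)))
    stop("embedding not nonnegative definite; enlarge the circulant")
  lambda <- pmax(lambda, 0)                            # clip tiny negative rounding errors
  m  <- length(c_row)
  xi <- rnorm(m) + 1i * rnorm(m)                       # complex white noise
  z  <- fft(sqrt(lambda / m) * xi)                     # Re and Im parts are two independent samples
  Re(z)[1:n]                                           # restrict to the original grid
}

# Example: exponential covariance exp(-|h| / 5) on a grid of 200 points.
set.seed(7)
path <- circembed_sample(200, function(h) exp(-abs(h) / 5))
```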
Software Alchemy: Turning Complex Statistical Computations into Embarrassingly-Parallel Ones
N. Matloff
Pub Date: 2014-09-19. DOI: 10.18637/JSS.V071.I04
The growth in the use of computationally intensive statistical procedures, especially with Big Data, has necessitated the use of parallel computation on diverse platforms such as multicore, GPU, clusters and clouds. However, slowdown due to interprocess communication costs typically limits such methods to "embarrassingly parallel" (EP) algorithms, especially on non-shared-memory platforms. This paper develops a broadly applicable method for converting many non-EP algorithms into statistically equivalent EP ones. The method is shown to yield excellent levels of speedup for a variety of statistical computations. It also overcomes certain problems of memory limitations.
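The core idea can be sketched with base R's parallel package (the code below is not the author's software): fit the same estimator independently on random chunks of the data, in parallel, and average the chunk-level estimates, which under the paper's conditions is asymptotically statistically equivalent to the full-data fit. Data and chunk counts here are illustrative.

```r
# Chunk-and-average sketch: each chunk fit is embarrassingly parallel, and the
# averaged estimates stand in for the full-data (non-EP) computation.
library(parallel)

set.seed(1)
n   <- 1e5
dat <- data.frame(x1 = rnorm(n), x2 = rnorm(n))
dat$y <- rbinom(n, 1, plogis(1 + 2 * dat$x1 - dat$x2))

cl     <- makeCluster(4)
chunks <- split(dat, sample(rep(1:4, length.out = n)))   # random partition into 4 chunks
fits   <- parLapply(cl, chunks, function(d)
  coef(glm(y ~ x1 + x2, family = binomial, data = d)))   # independent fit per chunk
stopCluster(cl)

chunk_avg <- Reduce(`+`, fits) / length(fits)            # averaged coefficient estimates
full_fit  <- coef(glm(y ~ x1 + x2, family = binomial, data = dat))
rbind(chunk_avg, full_fit)                               # compare with the full-data fit
```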
gamboostLSS: An R Package for Model Building and Variable Selection in the GAMLSS Framework
B. Hofner, A. Mayr, M. Schmid
Pub Date: 2014-07-07. DOI: 10.18637/JSS.V074.I01
Generalized additive models for location, scale and shape (GAMLSS) are a flexible class of regression models that allow multiple parameters of a distribution function, such as the mean and the standard deviation, to be modeled simultaneously. With the R package gamboostLSS, we provide a boosting method to fit these models. Variable selection and model choice are naturally available within this regularized regression framework. To introduce and illustrate the R package gamboostLSS and its infrastructure, we use a data set on stunted growth in India. In addition to the specification and application of the model itself, we present a variety of convenience functions, including methods for tuning parameter selection, prediction, and visualization of results. The package gamboostLSS is available from CRAN.
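A hypothetical usage sketch follows, assuming the gamboostLSS() / GaussianLSS() / cvrisk() interface outlined in the package documentation; the formula, simulated data, and control settings are illustrative only.

```r
# Boosting a Gaussian GAMLSS: both the mean and the standard deviation depend
# on covariates, with base-learner selection providing built-in variable selection.
library(gamboostLSS)

set.seed(1)
d <- data.frame(x1 = runif(500), x2 = runif(500))
d$y <- rnorm(500, mean = 2 * d$x1, sd = exp(0.5 + d$x2))

# Fit smooth base-learners for each distribution parameter.
mod <- gamboostLSS(y ~ bbs(x1) + bbs(x2), data = d, families = GaussianLSS(),
                   control = boost_control(mstop = 200))

# Tune the number of boosting iterations by cross-validation, then inspect
# which base-learners were selected for each parameter.
cvr <- cvrisk(mod)
mod[mstop(cvr)]
coef(mod)
```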