With the increasing prevalence of probabilistic programming languages, Hamiltonian Monte Carlo (HMC) has become the mainstay of applied Bayesian inference. However, HMC still struggles to sample from densities with multiscale geometry: a large step size is needed to efficiently explore low curvature regions while a small step size is needed to accurately explore high curvature regions. We introduce the delayed rejection generalized HMC (DR-G-HMC) sampler that overcomes this challenge by employing dynamic step size selection, inspired by differential equation solvers. In a single sampling iteration, DR-G-HMC sequentially makes proposals with geometrically decreasing step sizes if necessary. This simulates Hamiltonian dynamics with increasing fidelity that, in high curvature regions, generates proposals with a higher chance of acceptance. DR-G-HMC also makes generalized HMC competitive by decreasing the number of rejections which otherwise cause inefficient backtracking and prevent directed movement. We present experiments to demonstrate that DR-G-HMC (1) correctly samples from multiscale densities, (2) makes generalized HMC methods competitive with the state-of-the-art No-U-Turn sampler, and (3) is robust to tuning parameters.
{"title":"Sampling From Multiscale Densities With Delayed Rejection Generalized Hamiltonian Monte Carlo","authors":"Gilad Turok, Chirag Modi, Bob Carpenter","doi":"arxiv-2406.02741","DOIUrl":"https://doi.org/arxiv-2406.02741","url":null,"abstract":"With the increasing prevalence of probabilistic programming languages,\u0000Hamiltonian Monte Carlo (HMC) has become the mainstay of applied Bayesian\u0000inference. However HMC still struggles to sample from densities with multiscale\u0000geometry: a large step size is needed to efficiently explore low curvature\u0000regions while a small step size is needed to accurately explore high curvature\u0000regions. We introduce the delayed rejection generalized HMC (DR-G-HMC) sampler\u0000that overcomes this challenge by employing dynamic step size selection,\u0000inspired by differential equation solvers. In a single sampling iteration,\u0000DR-G-HMC sequentially makes proposals with geometrically decreasing step sizes\u0000if necessary. This simulates Hamiltonian dynamics with increasing fidelity\u0000that, in high curvature regions, generates proposals with a higher chance of\u0000acceptance. DR-G-HMC also makes generalized HMC competitive by decreasing the\u0000number of rejections which otherwise cause inefficient backtracking and\u0000prevents directed movement. We present experiments to demonstrate that DR-G-HMC\u0000(1) correctly samples from multiscale densities, (2) makes generalized HMC\u0000methods competitive with the state of the art No-U-Turn sampler, and (3) is\u0000robust to tuning parameters.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"22 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141531893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Distributed model fitting refers to the process of fitting a mathematical or statistical model to data using distributed computing resources, such that computing tasks are divided among multiple interconnected computers or nodes, often organized in a cluster or network. Most existing methods for distributed model fitting formulate it as a consensus optimization problem and then build algorithms based on the alternating direction method of multipliers (ADMM). This paper introduces a novel parallel framework for distributed model fitting. In contrast to previous consensus frameworks, the introduced parallel framework offers two notable advantages. Firstly, it exhibits insensitivity to sample partitioning, meaning that the solution of the algorithm remains unaffected by variations in the number of slave nodes and/or the amount of data each node carries. Secondly, fewer variables need to be updated at each iteration, so the proposed parallel framework performs more succinctly and efficiently and adapts to high-dimensional data. In addition, we prove that the algorithms under the new parallel framework have a worst-case linear convergence rate in theory. Numerical experiments confirm the generality, robustness, and accuracy of our proposed parallel framework.
{"title":"A Partition-insensitive Parallel Framework for Distributed Model Fitting","authors":"Xiaofei Wu, Rongmei Liang, Fabio Roli, Marcello Pelillo, Jing Yuan","doi":"arxiv-2406.00703","DOIUrl":"https://doi.org/arxiv-2406.00703","url":null,"abstract":"Distributed model fitting refers to the process of fitting a mathematical or\u0000statistical model to the data using distributed computing resources, such that\u0000computing tasks are divided among multiple interconnected computers or nodes,\u0000often organized in a cluster or network. Most of the existing methods for\u0000distributed model fitting are to formulate it in a consensus optimization\u0000problem, and then build up algorithms based on the alternating direction method\u0000of multipliers (ADMM). This paper introduces a novel parallel framework for\u0000achieving a distributed model fitting. In contrast to previous consensus\u0000frameworks, the introduced parallel framework offers two notable advantages.\u0000Firstly, it exhibits insensitivity to sample partitioning, meaning that the\u0000solution of the algorithm remains unaffected by variations in the number of\u0000slave nodes or/and the amount of data each node carries. Secondly, fewer\u0000variables are required to be updated at each iteration, so that the proposed\u0000parallel framework performs in a more succinct and efficient way, and adapts to\u0000high-dimensional data. In addition, we prove that the algorithms under the new\u0000parallel framework have a worst-case linear convergence rate in theory.\u0000Numerical experiments confirm the generality, robustness, and accuracy of our\u0000proposed parallel framework.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"75 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141255854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Oscar Trull, Angel Peiro-Signes, J. Carlos Garcia-Diaz, Marival Segarra-Ona
The increase in travelers and stays in tourist destinations is leading hotels to pay attention to their ecological management and the need for efficient energy consumption. To achieve this, hotels increasingly use digitalized systems and take more frequent measurements of the variables that affect their management. Electricity plays a significant role here: predicting electricity usage in hotels can enhance their circularity, an approach aimed at sustainable and efficient resource use. In this study, neural networks are trained to predict electricity usage patterns in two hotels based on historical data. The results indicate a good accuracy level of around 2.5% MAPE, showing the potential of these techniques for electricity forecasting in hotels. Additionally, neural network models can use climatological data to improve predictions. By accurately forecasting energy demand, hotels can optimize their energy procurement and usage, move energy-intensive activities to off-peak hours to reduce costs and strain on the grid, assist in the better integration of renewable energy sources, or identify patterns and anomalies in energy consumption that suggest areas for efficiency improvements, among others. Hence, by optimizing the allocation of resources, reducing waste, and improving efficiency, these models can improve hotels' circularity.
{"title":"Prediction of energy consumption in hotels using ANN","authors":"Oscar Trull, Angel Peiro-Signes, J. Carlos Garcia-Diaz, Marival Segarra-Ona","doi":"arxiv-2405.18076","DOIUrl":"https://doi.org/arxiv-2405.18076","url":null,"abstract":"The increase in travelers and stays in tourist destinations is leading hotels\u0000to be aware of their ecological management and the need for efficient energy\u0000consumption. To achieve this, hotels are increasingly using digitalized systems\u0000and more frequent measurements are made of the variables that affect their\u0000management. Electricity can play a significant role, predicting electricity\u0000usage in hotels, which in turn can enhance their circularity - an approach\u0000aimed at sustainable and efficient resource use. In this study, neural networks\u0000are trained to predict electricity usage patterns in two hotels based on\u0000historical data. The results indicate that the predictions have a good accuracy\u0000level of around 2.5% in MAPE, showing the potential of using these techniques\u0000for electricity forecasting in hotels. Additionally, neural network models can\u0000use climatological data to improve predictions. By accurately forecasting\u0000energy demand, hotels can optimize their energy procurement and usage, moving\u0000energy-intensive activities to off-peak hours to reduce costs and strain on the\u0000grid, assisting in the better integration of renewable energy sources, or\u0000identifying patterns and anomalies in energy consumption, suggesting areas for\u0000efficiency improvements, among other. Hence, by optimizing the allocation of\u0000resources, reducing waste and improving efficiency these models can improve\u0000hotel's circularity.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"96 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141171639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Matej Benko, Iwona Chlebicka, Jørgen Endal, Błażej Miasojedow
We study the spatially homogeneous granular medium equation \[\partial_t \mu = \mathrm{div}(\mu \nabla V) + \mathrm{div}(\mu (\nabla W \ast \mu)) + \Delta \mu,\] within a large and natural class of confinement potentials $V$ and interaction potentials $W$. The considered problem does not require $\nabla V$ or $\nabla W$ to be globally Lipschitz. With the aim of providing particle approximations of solutions, we design efficient forward-backward splitting algorithms. Sharp convergence rates in terms of the Wasserstein distance are provided.
{"title":"Convergence rates of particle approximation of forward-backward splitting algorithm for granular medium equations","authors":"Matej Benko, Iwona Chlebicka, Jørgen Endal, Błażej Miasojedow","doi":"arxiv-2405.18034","DOIUrl":"https://doi.org/arxiv-2405.18034","url":null,"abstract":"We study the spatially homogeneous granular medium equation\u0000[partial_tmu=rm{div}(munabla V)+rm{div}(mu(nabla W ast\u0000mu))+Deltamu,,] within a large and natural class of the confinement\u0000potentials $V$ and interaction potentials $W$. The considered problem do not\u0000need to assume that $nabla V$ or $nabla W$ are globally Lipschitz. With the\u0000aim of providing particle approximation of solutions, we design efficient\u0000forward-backward splitting algorithms. Sharp convergence rates in terms of the\u0000Wasserstein distance are provided.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"31 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141171661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The stochastic FitzHugh-Nagumo (FHN) model considered here is a two-dimensional nonlinear stochastic differential equation with additive degenerate noise, whose first component, the only one observed, describes the membrane voltage evolution of a single neuron. Due to its low dimensionality, its analytical and numerical tractability, and its neuronal interpretation, it has been used as a case study to test the performance of different statistical methods in estimating the underlying model parameters. Existing methods, however, often require complete observations, non-degeneracy of the noise, or a complex architecture (e.g., to estimate the transition density of the process, "recovering" the unobserved second component), and they may not (satisfactorily) estimate all model parameters simultaneously. Moreover, these studies lack real data applications for the stochastic FHN model. Here, we tackle all challenges (non-globally Lipschitz drift, non-explicit solution, lack of available transition density, degeneracy of the noise, and partial observations) via an intuitive and easy-to-implement sequential Monte Carlo approximate Bayesian computation algorithm. The proposed method relies on a recent computationally efficient and structure-preserving numerical splitting scheme for synthetic data generation, and on summary statistics exploiting the structural properties of the process. We succeed in estimating all model parameters from simulated data and, more remarkably, from real action potential data of rats. The presented novel real-data fit may broaden the scope and credibility of this classic and widely used neuronal model.
{"title":"Inference for the stochastic FitzHugh-Nagumo model from real action potential data via approximate Bayesian computation","authors":"Adeline Samson, Massimiliano Tamborrino, Irene Tubikanec","doi":"arxiv-2405.17972","DOIUrl":"https://doi.org/arxiv-2405.17972","url":null,"abstract":"The stochastic FitzHugh-Nagumo (FHN) model considered here is a\u0000two-dimensional nonlinear stochastic differential equation with additive\u0000degenerate noise, whose first component, the only one observed, describes the\u0000membrane voltage evolution of a single neuron. Due to its low dimensionality,\u0000its analytical and numerical tractability, and its neuronal interpretation, it\u0000has been used as a case study to test the performance of different statistical\u0000methods in estimating the underlying model parameters. Existing methods,\u0000however, often require complete observations, non-degeneracy of the noise or a\u0000complex architecture (e.g., to estimate the transition density of the process,\u0000\"recovering\" the unobserved second component), and they may not\u0000(satisfactorily) estimate all model parameters simultaneously. Moreover, these\u0000studies lack real data applications for the stochastic FHN model. Here, we\u0000tackle all challenges (non-globally Lipschitz drift, non-explicit solution,\u0000lack of available transition density, degeneracy of the noise, and partial\u0000observations) via an intuitive and easy-to-implement sequential Monte Carlo\u0000approximate Bayesian computation algorithm. The proposed method relies on a\u0000recent computationally efficient and structure-preserving numerical splitting\u0000scheme for synthetic data generation, and on summary statistics exploiting the\u0000structural properties of the process. We succeed in estimating all model\u0000parameters from simulated data and, more remarkably, real action potential data\u0000of rats. The presented novel real-data fit may broaden the scope and\u0000credibility of this classic and widely used neuronal model.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141172923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This is a concise mathematical introduction to Monte Carlo methods, a rich family of algorithms with far-reaching applications in science and engineering. Monte Carlo methods are an exciting subject for mathematical statisticians and computational and applied mathematicians: the design and analysis of modern algorithms are rooted in a broad mathematical toolbox that includes ergodic theory of Markov chains, Hamiltonian dynamical systems, transport maps, stochastic differential equations, information theory, optimization, Riemannian geometry, and gradient flows, among many others. These lecture notes celebrate the breadth of mathematical ideas that have led to tangible advancements in Monte Carlo methods and their applications. To accommodate a diverse audience, the level of mathematical rigor varies from chapter to chapter, giving only an intuitive treatment to the most technically demanding subjects. The aim is not to be comprehensive or encyclopedic, but rather to illustrate some key principles in the design and analysis of Monte Carlo methods through a carefully-crafted choice of topics that emphasizes timeless over timely ideas. Algorithms are presented in a way that is conducive to conceptual understanding and mathematical analysis -- clarity and intuition are favored over state-of-the-art implementations that are harder to comprehend or rely on ad-hoc heuristics. To help readers navigate the expansive landscape of Monte Carlo methods, each algorithm is accompanied by a summary of its pros and cons, and by a discussion of the type of problems for which they are most useful. The presentation is self-contained, and therefore adequate for self-guided learning or as a teaching resource. Each chapter contains a section with bibliographic remarks that will be useful for those interested in conducting research on Monte Carlo methods and their applications.
{"title":"A First Course in Monte Carlo Methods","authors":"Daniel Sanz-Alonso, Omar Al-Ghattas","doi":"arxiv-2405.16359","DOIUrl":"https://doi.org/arxiv-2405.16359","url":null,"abstract":"This is a concise mathematical introduction to Monte Carlo methods, a rich\u0000family of algorithms with far-reaching applications in science and engineering.\u0000Monte Carlo methods are an exciting subject for mathematical statisticians and\u0000computational and applied mathematicians: the design and analysis of modern\u0000algorithms are rooted in a broad mathematical toolbox that includes ergodic\u0000theory of Markov chains, Hamiltonian dynamical systems, transport maps,\u0000stochastic differential equations, information theory, optimization, Riemannian\u0000geometry, and gradient flows, among many others. These lecture notes celebrate\u0000the breadth of mathematical ideas that have led to tangible advancements in\u0000Monte Carlo methods and their applications. To accommodate a diverse audience,\u0000the level of mathematical rigor varies from chapter to chapter, giving only an\u0000intuitive treatment to the most technically demanding subjects. The aim is not\u0000to be comprehensive or encyclopedic, but rather to illustrate some key\u0000principles in the design and analysis of Monte Carlo methods through a\u0000carefully-crafted choice of topics that emphasizes timeless over timely ideas.\u0000Algorithms are presented in a way that is conducive to conceptual understanding\u0000and mathematical analysis -- clarity and intuition are favored over\u0000state-of-the-art implementations that are harder to comprehend or rely on\u0000ad-hoc heuristics. To help readers navigate the expansive landscape of Monte\u0000Carlo methods, each algorithm is accompanied by a summary of its pros and cons,\u0000and by a discussion of the type of problems for which they are most useful. The\u0000presentation is self-contained, and therefore adequate for self-guided learning\u0000or as a teaching resource. Each chapter contains a section with bibliographic\u0000remarks that will be useful for those interested in conducting research on\u0000Monte Carlo methods and their applications.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141171638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Samuel Livingstone, Nikolas Nüsken, Giorgos Vasdekis, Rui-Yang Zhang
We propose a new, simple, and explicit numerical scheme for time-homogeneous stochastic differential equations. The scheme is based on sampling increments at each time step from a skew-symmetric probability distribution, with the level of skewness determined by the drift and volatility of the underlying process. We show that as the step-size decreases the scheme converges weakly to the diffusion of interest. We then consider the problem of simulating from the limiting distribution of an ergodic diffusion process using the numerical scheme with a fixed step-size. We establish conditions under which the numerical scheme converges to equilibrium at a geometric rate, and quantify the bias between the equilibrium distributions of the scheme and of the true diffusion process. Notably, our results do not require a global Lipschitz assumption on the drift, in contrast to those required for the Euler--Maruyama scheme for long-time simulation at fixed step-sizes. Our weak convergence result relies on an extension of the theory of Milstein & Tretyakov to stochastic differential equations with non-Lipschitz drift, which could also be of independent interest. We support our theoretical results with numerical simulations.
{"title":"Skew-symmetric schemes for stochastic differential equations with non-Lipschitz drift: an unadjusted Barker algorithm","authors":"Samuel Livingstone, Nikolas Nüsken, Giorgos Vasdekis, Rui-Yang Zhang","doi":"arxiv-2405.14373","DOIUrl":"https://doi.org/arxiv-2405.14373","url":null,"abstract":"We propose a new simple and explicit numerical scheme for time-homogeneous\u0000stochastic differential equations. The scheme is based on sampling increments\u0000at each time step from a skew-symmetric probability distribution, with the\u0000level of skewness determined by the drift and volatility of the underlying\u0000process. We show that as the step-size decreases the scheme converges weakly to\u0000the diffusion of interest. We then consider the problem of simulating from the\u0000limiting distribution of an ergodic diffusion process using the numerical\u0000scheme with a fixed step-size. We establish conditions under which the\u0000numerical scheme converges to equilibrium at a geometric rate, and quantify the\u0000bias between the equilibrium distributions of the scheme and of the true\u0000diffusion process. Notably, our results do not require a global Lipschitz\u0000assumption on the drift, in contrast to those required for the Euler--Maruyama\u0000scheme for long-time simulation at fixed step-sizes. Our weak convergence\u0000result relies on an extension of the theory of Milstein & Tretyakov to\u0000stochastic differential equations with non-Lipschitz drift, which could also be\u0000of independent interest. We support our theoretical results with numerical\u0000simulations.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"22 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141146312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Iris Rammelmüller, Gottfried Hastermann, Jana de Wiljes
Data assimilation algorithms integrate prior information from numerical model simulations with observed data. Ensemble-based filters, regarded as state-of-the-art, are widely employed for large-scale estimation tasks in disciplines such as geoscience and meteorology. Despite their inability to produce the true posterior distribution for nonlinear systems, their robustness and capacity for state tracking are noteworthy. In contrast, particle filters yield the correct distribution in the ensemble limit but require substantially larger ensemble sizes than ensemble-based filters to maintain stability in higher-dimensional spaces. Transcending traditional Gaussian assumptions is essential for realistic quantification of uncertainties. One approach involves the hybridisation of filters, facilitated by tempering, to harness the complementary strengths of different filters. A new adaptive tempering method is proposed to tune the underlying schedule, aiming to systematically surpass previously achieved performance. Although promising numerical results for certain filter combinations in toy examples exist in the literature, the tuning of hyperparameters presents a considerable challenge, and a deeper understanding of these interactions is crucial for practical applications.
{"title":"Adaptive tempering schedules with approximative intermediate measures for filtering problems","authors":"Iris Rammelmüller, Gottfried Hastermann, Jana de Wiljes","doi":"arxiv-2405.14408","DOIUrl":"https://doi.org/arxiv-2405.14408","url":null,"abstract":"Data assimilation algorithms integrate prior information from numerical model\u0000simulations with observed data. Ensemble-based filters, regarded as\u0000state-of-the-art, are widely employed for large-scale estimation tasks in\u0000disciplines such as geoscience and meteorology. Despite their inability to\u0000produce the true posterior distribution for nonlinear systems, their robustness\u0000and capacity for state tracking are noteworthy. In contrast, Particle filters\u0000yield the correct distribution in the ensemble limit but require substantially\u0000larger ensemble sizes than ensemble-based filters to maintain stability in\u0000higher-dimensional spaces. It is essential to transcend traditional Gaussian\u0000assumptions to achieve realistic quantification of uncertainties. One approach\u0000involves the hybridisation of filters, facilitated by tempering, to harness the\u0000complementary strengths of different filters. A new adaptive tempering method\u0000is proposed to tune the underlying schedule, aiming to systematically surpass\u0000the performance previously achieved. Although promising numerical results for\u0000certain filter combinations in toy examples exist in the literature, the tuning\u0000of hyperparameters presents a considerable challenge. A deeper understanding of\u0000these interactions is crucial for practical applications.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"37 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141146400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Congye Wang, Wilson Chen, Heishiro Kanagawa, Chris J. Oates
An informal observation, made by several authors, is that the adaptive design of a Markov transition kernel has the flavour of a reinforcement learning task. Yet, to date it has remained unclear how to actually exploit modern reinforcement learning technologies for adaptive MCMC. The aim of this paper is to set out a general framework, called Reinforcement Learning Metropolis--Hastings, that is theoretically supported and empirically validated. Our principal focus is on learning fast-mixing Metropolis--Hastings transition kernels, which we cast as deterministic policies and optimise via a policy gradient. Control of the learning rate provably ensures that conditions for ergodicity are satisfied. The methodology is used to construct a gradient-free sampler that outperforms a popular gradient-free adaptive Metropolis--Hastings algorithm on $\approx 90\%$ of tasks in the PosteriorDB benchmark.
{"title":"Reinforcement Learning for Adaptive MCMC","authors":"Congye Wang, Wilson Chen, Heishiro Kanagawa, Chris. J. Oates","doi":"arxiv-2405.13574","DOIUrl":"https://doi.org/arxiv-2405.13574","url":null,"abstract":"An informal observation, made by several authors, is that the adaptive design\u0000of a Markov transition kernel has the flavour of a reinforcement learning task.\u0000Yet, to-date it has remained unclear how to actually exploit modern\u0000reinforcement learning technologies for adaptive MCMC. The aim of this paper is\u0000to set out a general framework, called Reinforcement Learning\u0000Metropolis--Hastings, that is theoretically supported and empirically\u0000validated. Our principal focus is on learning fast-mixing Metropolis--Hastings\u0000transition kernels, which we cast as deterministic policies and optimise via a\u0000policy gradient. Control of the learning rate provably ensures conditions for\u0000ergodicity are satisfied. The methodology is used to construct a gradient-free\u0000sampler that out-performs a popular gradient-free adaptive Metropolis--Hastings\u0000algorithm on $approx 90 %$ of tasks in the PosteriorDB benchmark.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"48 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141146315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In geostatistics, traditional spatial models often rely on the Gaussian Process (GP) to fit stationary covariances to data. It is well known that this approach becomes computationally infeasible when dealing with large data volumes, necessitating the use of approximate methods. A powerful class of methods approximates the GP as a sum of basis functions with random coefficients. Although this technique offers computational efficiency, it does not inherently guarantee a stationary covariance. To mitigate this issue, the basis functions can be "normalized" to maintain a constant marginal variance, avoiding unwanted artifacts and edge effects. This allows for the fitting of nearly stationary models to large, potentially non-stationary datasets, providing a rigorous basis for extension to more complex problems. Unfortunately, the process of normalizing these basis functions is computationally demanding. To address this, we introduce two fast and accurate algorithms for the normalization step, allowing for efficient prediction on fine grids. The practical value of these algorithms is showcased in the context of a spatial analysis on a large dataset, where significant computational speedups are achieved. While implementation and testing are done specifically within the LatticeKrig framework, these algorithms can be adapted to other basis function methods operating on regular grids.
{"title":"Normalizing Basis Functions: Approximate Stationary Models for Large Spatial Data","authors":"Antony Sikorski, Daniel McKenzie, Douglas Nychka","doi":"arxiv-2405.13821","DOIUrl":"https://doi.org/arxiv-2405.13821","url":null,"abstract":"In geostatistics, traditional spatial models often rely on the Gaussian\u0000Process (GP) to fit stationary covariances to data. It is well known that this\u0000approach becomes computationally infeasible when dealing with large data\u0000volumes, necessitating the use of approximate methods. A powerful class of\u0000methods approximate the GP as a sum of basis functions with random\u0000coefficients. Although this technique offers computational efficiency, it does\u0000not inherently guarantee a stationary covariance. To mitigate this issue, the\u0000basis functions can be \"normalized\" to maintain a constant marginal variance,\u0000avoiding unwanted artifacts and edge effects. This allows for the fitting of\u0000nearly stationary models to large, potentially non-stationary datasets,\u0000providing a rigorous base to extend to more complex problems. Unfortunately,\u0000the process of normalizing these basis functions is computationally demanding.\u0000To address this, we introduce two fast and accurate algorithms to the\u0000normalization step, allowing for efficient prediction on fine grids. The\u0000practical value of these algorithms is showcased in the context of a spatial\u0000analysis on a large dataset, where significant computational speedups are\u0000achieved. While implementation and testing are done specifically within the\u0000LatticeKrig framework, these algorithms can be adapted to other basis function\u0000methods operating on regular grids.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"37 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141146317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}