首页 > 最新文献

arXiv - STAT - Computation最新文献

英文 中文
Optimised Annealed Sequential Monte Carlo Samplers 优化的校正序列蒙特卡罗采样器
Pub Date : 2024-08-22 DOI: arxiv-2408.12057
Saifuddin Syed, Alexandre Bouchard-Côté, Kevin Chern, Arnaud Doucet
Annealed Sequential Monte Carlo (SMC) samplers are special cases of SMCsamplers where the sequence of distributions can be embedded in a smooth pathof distributions. Using this underlying path of distributions and a performancemodel based on the variance of the normalisation constant estimator, wesystematically study dense schedule and large particle limits. From our theoryand adaptive methods emerges a notion of global barrier capturing the inherentcomplexity of normalisation constant approximation under our performance model.We then turn the resulting approximations into surrogate objective functions ofalgorithm performance, and use them for methodology development. We obtainnovel adaptive methodologies, Sequential SMC (SSMC) and Sequential AIS (SAIS)samplers, which address practical difficulties inherent in previous adaptiveSMC methods. First, our SSMC algorithms are predictable: they produce asequence of increasingly precise estimates at deterministic and known times.Second, SAIS, a special case of SSMC, enables schedule adaptation at a memorycost constant in the number of particles and require much less communication.Finally, these characteristics make SAIS highly efficient on GPUs. We developan open-source, high-performance GPU implementation based on our methodologyand demonstrate up to a hundred-fold speed improvement compared tostate-of-the-art adaptive AIS methods.
退火连续蒙特卡罗(SMC)采样器是SMC采样器的特例,其中的分布序列可以嵌入平滑的分布路径中。利用这种基本的分布路径和基于归一化常数估计值方差的性能模型,我们对密集计划和大粒子极限进行了系统研究。从我们的理论和自适应方法中产生了一个全局障碍的概念,它捕捉了在我们的性能模型下归一化常数近似的内在复杂性。我们获得了新颖的自适应方法--序列 SMC(SSMC)和序列 AIS(SAIS)采样器,解决了以往自适应 SMC 方法中固有的实际困难。首先,我们的 SSMC 算法是可预测的:它们能在确定和已知的时间内产生一系列越来越精确的估计值。其次,SAIS 是 SSMC 的一种特例,它能在粒子数不变的内存成本下实现计划自适应,而且所需的通信量要少得多。最后,这些特性使得 SAIS 在 GPU 上非常高效。我们基于我们的方法开发了一个开源、高性能的 GPU 实现,并证明与目前最先进的自适应 AIS 方法相比,其速度最多可提高 100 倍。
{"title":"Optimised Annealed Sequential Monte Carlo Samplers","authors":"Saifuddin Syed, Alexandre Bouchard-Côté, Kevin Chern, Arnaud Doucet","doi":"arxiv-2408.12057","DOIUrl":"https://doi.org/arxiv-2408.12057","url":null,"abstract":"Annealed Sequential Monte Carlo (SMC) samplers are special cases of SMC\u0000samplers where the sequence of distributions can be embedded in a smooth path\u0000of distributions. Using this underlying path of distributions and a performance\u0000model based on the variance of the normalisation constant estimator, we\u0000systematically study dense schedule and large particle limits. From our theory\u0000and adaptive methods emerges a notion of global barrier capturing the inherent\u0000complexity of normalisation constant approximation under our performance model.\u0000We then turn the resulting approximations into surrogate objective functions of\u0000algorithm performance, and use them for methodology development. We obtain\u0000novel adaptive methodologies, Sequential SMC (SSMC) and Sequential AIS (SAIS)\u0000samplers, which address practical difficulties inherent in previous adaptive\u0000SMC methods. First, our SSMC algorithms are predictable: they produce a\u0000sequence of increasingly precise estimates at deterministic and known times.\u0000Second, SAIS, a special case of SSMC, enables schedule adaptation at a memory\u0000cost constant in the number of particles and require much less communication.\u0000Finally, these characteristics make SAIS highly efficient on GPUs. We develop\u0000an open-source, high-performance GPU implementation based on our methodology\u0000and demonstrate up to a hundred-fold speed improvement compared to\u0000state-of-the-art adaptive AIS methods.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"230 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Stereographic MCMC 自适应立体 MCMC
Pub Date : 2024-08-21 DOI: arxiv-2408.11780
Cameron Bell, Krzystof Łatuszyński, Gareth O. Roberts
In order to tackle the problem of sampling from heavy tailed, highdimensional distributions via Markov Chain Monte Carlo (MCMC) methods, Yang,Latuszy'nski, and Roberts (2022) (arXiv:2205.12112) introduces thestereographic projection as a tool to compactify $mathbb{R}^d$ and transformthe problem into sampling from a density on the unit sphere $mathbb{S}^d$.However, the improvement in algorithmic efficiency, as well as thecomputational cost of the implementation, are still significantly impacted bythe parameters used in this transformation. To address this, we introduce adaptive versions of the Stereographic RandomWalk (SRW), the Stereographic Slice Sampler (SSS), and the Stereographic BouncyParticle Sampler (SBPS), which automatically update the parameters of thealgorithms as the run progresses. The adaptive setup allows us to betterexploit the power of the stereographic projection, even when the targetdistribution is neither centered nor homogeneous. We present a simulation studyshowing each algorithm's robustness to starting far from the mean in heavytailed, high dimensional settings, as opposed to Hamiltonian Monte Carlo (HMC).We establish a novel framework for proving convergence of adaptive MCMCalgorithms over collections of simultaneously uniformly ergodic Markovoperators, including continuous time processes. This framework allows us toprove LLNs and a CLT for our adaptive Stereographic algorithms.
为了通过马尔可夫链蒙特卡罗(MCMC)方法解决从重尾高维分布中采样的问题,Yang, Latuszy'nski, and Roberts (2022) (arXiv:2205. 12112) 引入了立体投影作为工具,将 $mathbb{R}^d$ 压缩,并将问题转化为从单位球体上的密度 $mathbb{S}^d$ 中采样。12112)引入了立体投影作为工具,以压缩 $mathbb{R}^d$ 并将问题转化为从单位球体 $mathbb{S}^d$ 上的密度中采样。然而,算法效率的提高以及实现的计算成本仍然受到这种转化中使用的参数的显著影响。为了解决这个问题,我们引入了自适应版本的立体随机漫步(SRW)、立体切片采样器(SSS)和立体弹跳粒子采样器(SBPS),它们会在运行过程中自动更新算法参数。自适应设置使我们能够更好地发挥立体投影的威力,即使目标分布既不居中也不均匀。我们提出了一项仿真研究,展示了每种算法在重尾、高维设置下远离均值起始时的鲁棒性,这与汉密尔顿蒙特卡罗(HMC)不同。我们建立了一个新颖的框架,用于证明自适应 MCMC 算法在同时均匀遍历马尔可夫操作者集合(包括连续时间过程)上的收敛性。这个框架使我们能够为我们的自适应立体算法证明 LLN 和 CLT。
{"title":"Adaptive Stereographic MCMC","authors":"Cameron Bell, Krzystof Łatuszyński, Gareth O. Roberts","doi":"arxiv-2408.11780","DOIUrl":"https://doi.org/arxiv-2408.11780","url":null,"abstract":"In order to tackle the problem of sampling from heavy tailed, high\u0000dimensional distributions via Markov Chain Monte Carlo (MCMC) methods, Yang,\u0000Latuszy'nski, and Roberts (2022) (arXiv:2205.12112) introduces the\u0000stereographic projection as a tool to compactify $mathbb{R}^d$ and transform\u0000the problem into sampling from a density on the unit sphere $mathbb{S}^d$.\u0000However, the improvement in algorithmic efficiency, as well as the\u0000computational cost of the implementation, are still significantly impacted by\u0000the parameters used in this transformation. To address this, we introduce adaptive versions of the Stereographic Random\u0000Walk (SRW), the Stereographic Slice Sampler (SSS), and the Stereographic Bouncy\u0000Particle Sampler (SBPS), which automatically update the parameters of the\u0000algorithms as the run progresses. The adaptive setup allows us to better\u0000exploit the power of the stereographic projection, even when the target\u0000distribution is neither centered nor homogeneous. We present a simulation study\u0000showing each algorithm's robustness to starting far from the mean in heavy\u0000tailed, high dimensional settings, as opposed to Hamiltonian Monte Carlo (HMC).\u0000We establish a novel framework for proving convergence of adaptive MCMC\u0000algorithms over collections of simultaneously uniformly ergodic Markov\u0000operators, including continuous time processes. This framework allows us to\u0000prove LLNs and a CLT for our adaptive Stereographic algorithms.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"14 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multiple Random Scan Strategy for Latent Space Models 潜空间模型的多重随机扫描策略
Pub Date : 2024-08-21 DOI: arxiv-2408.11725
Antonio Peruzzi, Roberto Casarin
Latent Space (LS) network models project the nodes of a network on a$d$-dimensional latent space to achieve dimensionality reduction of the networkwhile preserving its relevant features. Inference is often carried out within aMarkov Chain Monte Carlo (MCMC) framework. Nonetheless, it is well-known thatthe computational time for this set of models increases quadratically with thenumber of nodes. In this work, we build on the Random-Scan (RS) approach topropose an MCMC strategy that alleviates the computational burden for LS modelswhile maintaining the benefits of a general-purpose technique. We call thisnovel strategy Multiple RS (MRS). This strategy is effective in reducing thecomputational cost by a factor without severe consequences on the MCMC draws.Moreover, we introduce a novel adaptation strategy that consists of aprobabilistic update of the set of latent coordinates of each node. OurAdaptive MRS adapts the acceptance rate of the Metropolis step to adjust theprobability of updating the latent coordinates. We show via simulation that theAdaptive MRS approach performs better than MRS in terms of mixing. Finally, weapply our algorithm to a multi-layer temporal LS model and show how ouradaptive strategy may be beneficial to empirical applications.
潜空间(LS)网络模型将网络节点投影到一个 $d$ 维的潜空间上,以实现网络的降维,同时保留其相关特征。推理通常在马尔可夫链蒙特卡罗(MCMC)框架内进行。然而,众所周知,这组模型的计算时间会随着节点数量的增加而呈二次曲线增长。在这项工作中,我们以随机扫描(RS)方法为基础,提出了一种 MCMC 策略,既减轻了 LS 模型的计算负担,又保持了通用技术的优点。我们称这种新颖的策略为多重 RS(MRS)。此外,我们还引入了一种新颖的适应策略,它包括对每个节点的潜在坐标集进行概率更新。我们的自适应 MRS 调整 Metropolis 步骤的接受率,以调整更新潜在坐标的概率。我们通过仿真证明,自适应 MRS 方法在混合方面的表现优于 MRS。最后,我们将算法应用于多层时态 LS 模型,并展示了我们的自适应策略如何有益于经验应用。
{"title":"A Multiple Random Scan Strategy for Latent Space Models","authors":"Antonio Peruzzi, Roberto Casarin","doi":"arxiv-2408.11725","DOIUrl":"https://doi.org/arxiv-2408.11725","url":null,"abstract":"Latent Space (LS) network models project the nodes of a network on a\u0000$d$-dimensional latent space to achieve dimensionality reduction of the network\u0000while preserving its relevant features. Inference is often carried out within a\u0000Markov Chain Monte Carlo (MCMC) framework. Nonetheless, it is well-known that\u0000the computational time for this set of models increases quadratically with the\u0000number of nodes. In this work, we build on the Random-Scan (RS) approach to\u0000propose an MCMC strategy that alleviates the computational burden for LS models\u0000while maintaining the benefits of a general-purpose technique. We call this\u0000novel strategy Multiple RS (MRS). This strategy is effective in reducing the\u0000computational cost by a factor without severe consequences on the MCMC draws.\u0000Moreover, we introduce a novel adaptation strategy that consists of a\u0000probabilistic update of the set of latent coordinates of each node. Our\u0000Adaptive MRS adapts the acceptance rate of the Metropolis step to adjust the\u0000probability of updating the latent coordinates. We show via simulation that the\u0000Adaptive MRS approach performs better than MRS in terms of mixing. Finally, we\u0000apply our algorithm to a multi-layer temporal LS model and show how our\u0000adaptive strategy may be beneficial to empirical applications.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"23 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optical ISAC: Fundamental Performance Limits and Transceiver Design 光学 ISAC:基本性能限制和收发器设计
Pub Date : 2024-08-21 DOI: arxiv-2408.11792
Alireza Ghazavi Khorasgani, Mahtab Mirmohseni, Ahmed Elzanaty
This paper characterizes the optimal capacity-distortion (C-D) tradeoff in anoptical point-to-point (P2P) system with single-input single-output forcommunication and single-input multiple-output for sensing (SISO-SIMO-C/S)within an integrated sensing and communication (ISAC) framework. We introducepractical, asymptotically optimal maximum a posteriori (MAP) and maximumlikelihood estimators (MLE) for target distance, addressing nonlinearmeasurement-to-state relationships and non-conjugate priors. Our results showthese estimators converge to the Bayesian Cramer-Rao bound (BCRB) as sensingantennas increase. We also demonstrate that the achievable rate-CRB (AR-CRB)serves as an outer bound (OB) for the optimal C-D region. To optimize inputdistribution across the Pareto boundary of the C-D region, we propose twoalgorithms: an iterative Blahut-Arimoto algorithm (BAA)-type method and amemory-efficient closed-form (CF) approach, including a CF optimal distributionfor high optical signal-to-noise ratio (O-SNR) conditions. Additionally, weextend and modify the Deterministic-Random Tradeoff (DRT) to this optical ISACcontext.
本文描述了在集成传感和通信(ISAC)框架内,单输入单输出通信和单输入多输出传感(SISO-SIMO-C/S)的光学点对点(P2P)系统中的最佳容量-失真(C-D)权衡。我们为目标距离引入了实用、渐进最优的最大后验(MAP)和最大似然估计(MLE),解决了测量与状态之间的非线性关系和非共轭先验问题。我们的研究结果表明,随着传感天线的增加,这些估计器会向贝叶斯克拉默-拉奥边界(BCRB)收敛。我们还证明,可实现速率-CRB(AR-CRB)可作为最优 C-D 区域的外部界限(OB)。为了优化 C-D 区域帕累托边界上的输入分配,我们提出了两种算法:一种是迭代布拉赫特-阿里莫托算法(BAA)类型的方法,另一种是内存效率闭式(CF)方法,包括针对高光信噪比(O-SNR)条件的 CF 最佳分配。此外,我们还对确定性-随机权衡(DRT)进行了扩展和修改,以适应这种光学 ISACcontext。
{"title":"Optical ISAC: Fundamental Performance Limits and Transceiver Design","authors":"Alireza Ghazavi Khorasgani, Mahtab Mirmohseni, Ahmed Elzanaty","doi":"arxiv-2408.11792","DOIUrl":"https://doi.org/arxiv-2408.11792","url":null,"abstract":"This paper characterizes the optimal capacity-distortion (C-D) tradeoff in an\u0000optical point-to-point (P2P) system with single-input single-output for\u0000communication and single-input multiple-output for sensing (SISO-SIMO-C/S)\u0000within an integrated sensing and communication (ISAC) framework. We introduce\u0000practical, asymptotically optimal maximum a posteriori (MAP) and maximum\u0000likelihood estimators (MLE) for target distance, addressing nonlinear\u0000measurement-to-state relationships and non-conjugate priors. Our results show\u0000these estimators converge to the Bayesian Cramer-Rao bound (BCRB) as sensing\u0000antennas increase. We also demonstrate that the achievable rate-CRB (AR-CRB)\u0000serves as an outer bound (OB) for the optimal C-D region. To optimize input\u0000distribution across the Pareto boundary of the C-D region, we propose two\u0000algorithms: an iterative Blahut-Arimoto algorithm (BAA)-type method and a\u0000memory-efficient closed-form (CF) approach, including a CF optimal distribution\u0000for high optical signal-to-noise ratio (O-SNR) conditions. Additionally, we\u0000extend and modify the Deterministic-Random Tradeoff (DRT) to this optical ISAC\u0000context.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"34 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
armadillo: An R Package to Use the Armadillo C++ Library 犰狳:使用犰狳 C++ 库的 R 软件包
Pub Date : 2024-08-19 DOI: arxiv-2408.11074
Mauricio Vargas Sepúlveda, Jonathan Schneider Malamud
This article introduces 'armadillo', a new R package that integrates thepowerful Armadillo C++ library for linear algebra into the R programmingenvironment. Targeted primarily at social scientists and other non-programmers,this article explains the computational benefits of moving code to C++ in termsof speed and syntax. We provide a comprehensive overview of Armadillo'scapabilities, highlighting its user-friendly syntax akin to MATLAB and itsefficiency for computationally intensive tasks. The 'armadillo' packagesimplifies a part of the process of using C++ within R by offering additionalease of integration for those who require high-performance linear algebraoperations in their R workflows. This work aims to bridge the gap betweencomputational efficiency and accessibility, making advanced linear algebraoperations more approachable for R users without extensive programmingbackgrounds.
本文介绍了一个新的 R 软件包 "犰狳",它将强大的线性代数 C++ 库犰狳集成到了 R 编程环境中。本文主要针对社会科学家和其他非程序员,从速度和语法方面解释了将代码转为 C++ 的计算优势。我们对 Armadillo 的功能进行了全面概述,重点介绍了其类似于 MATLAB 的用户友好语法,以及其在计算密集型任务中的效率。犰狳 "软件包简化了在 R 中使用 C++ 的部分过程,为那些需要在 R 工作流中进行高性能线性代数运算的人提供了额外的集成便利。这项工作旨在弥合计算效率和易用性之间的差距,让没有广泛编程背景的 R 用户更容易进行高级线性代数运算。
{"title":"armadillo: An R Package to Use the Armadillo C++ Library","authors":"Mauricio Vargas Sepúlveda, Jonathan Schneider Malamud","doi":"arxiv-2408.11074","DOIUrl":"https://doi.org/arxiv-2408.11074","url":null,"abstract":"This article introduces 'armadillo', a new R package that integrates the\u0000powerful Armadillo C++ library for linear algebra into the R programming\u0000environment. Targeted primarily at social scientists and other non-programmers,\u0000this article explains the computational benefits of moving code to C++ in terms\u0000of speed and syntax. We provide a comprehensive overview of Armadillo's\u0000capabilities, highlighting its user-friendly syntax akin to MATLAB and its\u0000efficiency for computationally intensive tasks. The 'armadillo' package\u0000simplifies a part of the process of using C++ within R by offering additional\u0000ease of integration for those who require high-performance linear algebra\u0000operations in their R workflows. This work aims to bridge the gap between\u0000computational efficiency and accessibility, making advanced linear algebra\u0000operations more approachable for R users without extensive programming\u0000backgrounds.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Issues of parameterization and computation for posterior inference in partially identified models 部分确定模型的参数化和后验推断计算问题
Pub Date : 2024-08-19 DOI: arxiv-2408.10416
Seren Lee, Paul Gustafson
A partially identified model, where the parameters can not be uniquelyidentified, often arises during statistical analysis. While researchersfrequently use Bayesian inference to analyze the models, when Bayesianinference with an off-the-shelf MCMC sampling algorithm is applied to apartially identified model, the computational performance can be poor. It isfound that using importance sampling with transparent reparameterization (TP)is one remedy. This method is preferable since the model is known to berendered as identified with respect to the new parameterization, and at thesame time, it may allow faster, i.i.d. Monte Carlo sampling by using conjugateconvenience priors. In this paper, we explain the importance sampling methodwith the TP and a pseudo-TP. We introduce the pseudo-TP, an alternative to TP,since finding a TP is sometimes difficult. Then, we test the methods'performance in some scenarios and compare it to the performance of theoff-the-shelf MCMC method - Gibbs sampling - applied in the originalparameterization. While the importance sampling with TP (ISTP) shows generallybetter results than off-the-shelf MCMC methods, as seen in the compute time andtrace plots, it is also seen that finding a TP which is necessary for themethod may not be easy. On the other hand, the pseudo-TP method shows a mixedresult and room for improvement since it relies on an approximation, which maynot be adequate for a given model and dataset.
在统计分析过程中,经常会出现无法唯一识别参数的部分识别模型。虽然研究人员经常使用贝叶斯推理来分析模型,但当使用现成的 MCMC 采样算法对部分识别的模型进行贝叶斯推理时,计算性能可能会很差。研究发现,使用透明重参数化(TP)的重要性采样是一种补救方法。这种方法是可取的,因为已知模型在新参数化后是确定的,同时,通过使用共轭先验,它可以更快地进行 i.i.d. 蒙特卡罗采样。本文将解释使用 TP 和伪 TP 的重要性抽样方法。我们介绍了伪 TP,它是 TP 的一种替代方法,因为有时很难找到 TP。然后,我们测试了这些方法在某些情况下的性能,并将其与应用于原始参数化的现成 MCMC 方法--吉布斯采样--的性能进行了比较。从计算时间和轨迹图中可以看出,带 TP 的重要度采样(ISTP)的结果总体上优于现成的 MCMC 方法,但也可以看出,找到一种该方法所需的 TP 可能并不容易。另一方面,伪 TP 方法的结果好坏参半,还有改进的余地,因为它依赖于近似值,而近似值对于给定的模型和数据集来说可能并不充分。
{"title":"Issues of parameterization and computation for posterior inference in partially identified models","authors":"Seren Lee, Paul Gustafson","doi":"arxiv-2408.10416","DOIUrl":"https://doi.org/arxiv-2408.10416","url":null,"abstract":"A partially identified model, where the parameters can not be uniquely\u0000identified, often arises during statistical analysis. While researchers\u0000frequently use Bayesian inference to analyze the models, when Bayesian\u0000inference with an off-the-shelf MCMC sampling algorithm is applied to a\u0000partially identified model, the computational performance can be poor. It is\u0000found that using importance sampling with transparent reparameterization (TP)\u0000is one remedy. This method is preferable since the model is known to be\u0000rendered as identified with respect to the new parameterization, and at the\u0000same time, it may allow faster, i.i.d. Monte Carlo sampling by using conjugate\u0000convenience priors. In this paper, we explain the importance sampling method\u0000with the TP and a pseudo-TP. We introduce the pseudo-TP, an alternative to TP,\u0000since finding a TP is sometimes difficult. Then, we test the methods'\u0000performance in some scenarios and compare it to the performance of the\u0000off-the-shelf MCMC method - Gibbs sampling - applied in the original\u0000parameterization. While the importance sampling with TP (ISTP) shows generally\u0000better results than off-the-shelf MCMC methods, as seen in the compute time and\u0000trace plots, it is also seen that finding a TP which is necessary for the\u0000method may not be easy. On the other hand, the pseudo-TP method shows a mixed\u0000result and room for improvement since it relies on an approximation, which may\u0000not be adequate for a given model and dataset.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"66 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
$statcheck$ is flawed by design and no valid spell checker for statistical results $statcheck$ 的设计存在缺陷,没有有效的统计结果拼写检查器
Pub Date : 2024-08-15 DOI: arxiv-2408.07948
Ingmar Böschen
The R package $statcheck$ is designed to extract statistical test resultsfrom text and check the consistency of the reported test statistics andcorresponding p-values. Recently, it has also been featured as a spell checkerfor statistical results, aimed at improving reporting accuracy in scientificpublications. In this study, I perform a check on $statcheck$ using anon-exhaustive list of 187 simple text strings with arbitrary statistical testresults. These strings represent a wide range of textual representations ofresults including correctly manageable results, non-targeted test statistics,variable reporting styles, and common typos. Since $statcheck$'s detectionheuristic is tied to a specific set of statistical test results that strictlyadhere to the American Psychological Association (APA) reporting guidelines, itis unable to detect and check any reported result that even slightly deviatesfrom this narrow style. In practice, $statcheck$ is unlikely to detect manystatistical test results reported in the literature. I conclude that thecapabilities and usefulness of the $statcheck$ software are very limited andthat it should not be used to detect irregularities in results nor as a spellchecker for statistical results. Future developments should aim to incorporatemore flexible algorithms capable of handling a broader variety of reportingstyles, such as those provided by $JATSdecoder$ and Large Language Models,which show promise in overcoming these limitations but they cannot replace thecritical eye of a knowledgeable reader.
R 软件包 $statcheck$ 设计用于从文本中提取统计检验结果,并检查报告的检验统计量和相应 p 值的一致性。最近,它还被用作统计结果的拼写检查工具,旨在提高科学出版物的报告准确性。在本研究中,我使用一个尚未穷尽的 187 个带有任意统计检验结果的简单文本字符串列表对 $statcheck$ 进行了检查。这些字符串代表了各种结果的文字表述,包括可正确管理的结果、非目标测试统计、多变的报告风格和常见错别字。由于$statcheck$的检测启发式与一组严格遵守美国心理学会(APA)报告指南的特定统计检验结果相联系,因此它无法检测和检查任何报告结果,哪怕是与这种狭隘的风格稍有偏差。实际上,$statcheck$ 不可能检测出文献中报告的许多统计检验结果。我的结论是,$statcheck$ 软件的能力和作用非常有限,它既不能用来检测结果中的不规范之处,也不能作为统计结果的拼写检查器。未来的发展应着眼于纳入更灵活的算法,能够处理更广泛的报告风格,例如 $JATSdecoder$ 和 Large Language Models 提供的算法,它们在克服这些局限性方面显示出前景,但它们无法取代知识渊博的读者的批判性眼光。
{"title":"$statcheck$ is flawed by design and no valid spell checker for statistical results","authors":"Ingmar Böschen","doi":"arxiv-2408.07948","DOIUrl":"https://doi.org/arxiv-2408.07948","url":null,"abstract":"The R package $statcheck$ is designed to extract statistical test results\u0000from text and check the consistency of the reported test statistics and\u0000corresponding p-values. Recently, it has also been featured as a spell checker\u0000for statistical results, aimed at improving reporting accuracy in scientific\u0000publications. In this study, I perform a check on $statcheck$ using a\u0000non-exhaustive list of 187 simple text strings with arbitrary statistical test\u0000results. These strings represent a wide range of textual representations of\u0000results including correctly manageable results, non-targeted test statistics,\u0000variable reporting styles, and common typos. Since $statcheck$'s detection\u0000heuristic is tied to a specific set of statistical test results that strictly\u0000adhere to the American Psychological Association (APA) reporting guidelines, it\u0000is unable to detect and check any reported result that even slightly deviates\u0000from this narrow style. In practice, $statcheck$ is unlikely to detect many\u0000statistical test results reported in the literature. I conclude that the\u0000capabilities and usefulness of the $statcheck$ software are very limited and\u0000that it should not be used to detect irregularities in results nor as a spell\u0000checker for statistical results. Future developments should aim to incorporate\u0000more flexible algorithms capable of handling a broader variety of reporting\u0000styles, such as those provided by $JATSdecoder$ and Large Language Models,\u0000which show promise in overcoming these limitations but they cannot replace the\u0000critical eye of a knowledgeable reader.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling of Measurement Error in Financial Returns Data 金融收益数据测量误差建模
Pub Date : 2024-08-14 DOI: arxiv-2408.07405
Ajay Jasra, Mohamed Maama, Aleksandar Mijatović
In this paper we consider the modeling of measurement error for fund returnsdata. In particular, given access to a time-series of discretely observedlog-returns and the associated maximum over the observation period, we developa stochastic model which models the true log-returns and maximum via a L'evyprocess and the data as a measurement error there-of. The main technicaldifficulty of trying to infer this model, for instance Bayesian parameterestimation, is that the joint transition density of the return and maximum isseldom known, nor can it be simulated exactly. Based upon the novel stickbreaking representation of [12] we provide an approximation of the model. Wedevelop a Markov chain Monte Carlo (MCMC) algorithm to sample from the Bayesianposterior of the approximated posterior and then extend this to a multilevelMCMC method which can reduce the computational cost to approximate posteriorexpectations, relative to ordinary MCMC. We implement our methodology onseveral applications including for real data.
在本文中,我们考虑了基金收益数据测量误差的建模问题。具体而言,在获得离散观测的对数收益率时间序列以及观测期内的相关最大值的情况下,我们建立了一个随机模型,该模型通过一个 L'evy 过程对真实的对数收益率和最大值进行建模,并将数据作为其测量误差。试图推断这一模型(例如贝叶斯参数估计)的主要技术难点在于,收益率和最大值的联合过渡密度很少为人所知,也无法精确模拟。基于 [12] 的新颖破粘表示法,我们提供了模型的近似值。我们开发了一种马尔科夫链蒙特卡罗(MCMC)算法,从近似后验的贝叶斯后验中采样,然后将其扩展为一种多级 MCMC 方法,相对于普通 MCMC,这种方法可以降低近似后验的计算成本。我们在包括真实数据在内的多个应用中实施了我们的方法。
{"title":"Modeling of Measurement Error in Financial Returns Data","authors":"Ajay Jasra, Mohamed Maama, Aleksandar Mijatović","doi":"arxiv-2408.07405","DOIUrl":"https://doi.org/arxiv-2408.07405","url":null,"abstract":"In this paper we consider the modeling of measurement error for fund returns\u0000data. In particular, given access to a time-series of discretely observed\u0000log-returns and the associated maximum over the observation period, we develop\u0000a stochastic model which models the true log-returns and maximum via a L'evy\u0000process and the data as a measurement error there-of. The main technical\u0000difficulty of trying to infer this model, for instance Bayesian parameter\u0000estimation, is that the joint transition density of the return and maximum is\u0000seldom known, nor can it be simulated exactly. Based upon the novel stick\u0000breaking representation of [12] we provide an approximation of the model. We\u0000develop a Markov chain Monte Carlo (MCMC) algorithm to sample from the Bayesian\u0000posterior of the approximated posterior and then extend this to a multilevel\u0000MCMC method which can reduce the computational cost to approximate posterior\u0000expectations, relative to ordinary MCMC. We implement our methodology on\u0000several applications including for real data.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142224599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gaussian mixture Taylor approximations of risk measures constrained by PDEs with Gaussian random field inputs 受高斯随机场输入 PDE 约束的风险度量的高斯混合泰勒近似值
Pub Date : 2024-08-13 DOI: arxiv-2408.06615
Dingcheng Luo, Joshua Chen, Peng Chen, Omar Ghattas
This work considers the computation of risk measures for quantities ofinterest governed by PDEs with Gaussian random field parameters using Taylorapproximations. While efficient, Taylor approximations are local to the pointof expansion, and hence may degrade in accuracy when the variances of the inputparameters are large. To address this challenge, we approximate the underlyingGaussian measure by a mixture of Gaussians with reduced variance in a dominantdirection of parameter space. Taylor approximations are constructed at themeans of each Gaussian mixture component, which are then combined toapproximate the risk measures. The formulation is presented in the setting ofinfinite-dimensional Gaussian random parameters for risk measures including themean, variance, and conditional value-at-risk. We also provide detailedanalysis of the approximations errors arising from two sources: the Gaussianmixture approximation and the Taylor approximations. Numerical experiments areconducted for a semilinear advection-diffusion-reaction equation with a randomdiffusion coefficient field and for the Helmholtz equation with a random wavespeed field. For these examples, the proposed approximation strategy canachieve less than $1%$ relative error in estimating CVaR with only$mathcal{O}(10)$ state PDE solves, which is comparable to a standard MonteCarlo estimate with $mathcal{O}(10^4)$ samples, thus achieving significantreduction in computational cost. The proposed method can therefore serve as away to rapidly and accurately estimate risk measures under limitedcomputational budgets.
本研究考虑使用泰勒近似法计算由具有高斯随机场参数的 PDE 所控制的相关量的风险度量。泰勒近似虽然效率高,但由于是局部扩展点,因此当输入参数的方差较大时,精度可能会下降。为了应对这一挑战,我们用在参数空间的主导方向上方差减小的高斯混合物来近似底层高斯度量。在每个高斯混合物分量的主题点上构建泰勒近似值,然后将其组合起来以近似风险度量。该公式是在无限维高斯随机参数的背景下提出的,风险度量包括主题矢量、方差和条件风险值。我们还详细分析了近似误差的两个来源:高斯混合近似和泰勒近似。我们对具有随机扩散系数场的半线性平流-扩散-反应方程和具有随机波速场的亥姆霍兹方程进行了数值实验。对于这些示例,所提出的近似策略只需要$mathcal{O}(10)$状态的PDE求解,就能在估计CVaR时实现小于$1%$的相对误差,这与使用$mathcal{O}(10^4)$样本的标准蒙特卡洛估计不相上下,从而显著降低了计算成本。因此,所提出的方法可以在有限的计算预算下快速、准确地估计风险度量。
{"title":"Gaussian mixture Taylor approximations of risk measures constrained by PDEs with Gaussian random field inputs","authors":"Dingcheng Luo, Joshua Chen, Peng Chen, Omar Ghattas","doi":"arxiv-2408.06615","DOIUrl":"https://doi.org/arxiv-2408.06615","url":null,"abstract":"This work considers the computation of risk measures for quantities of\u0000interest governed by PDEs with Gaussian random field parameters using Taylor\u0000approximations. While efficient, Taylor approximations are local to the point\u0000of expansion, and hence may degrade in accuracy when the variances of the input\u0000parameters are large. To address this challenge, we approximate the underlying\u0000Gaussian measure by a mixture of Gaussians with reduced variance in a dominant\u0000direction of parameter space. Taylor approximations are constructed at the\u0000means of each Gaussian mixture component, which are then combined to\u0000approximate the risk measures. The formulation is presented in the setting of\u0000infinite-dimensional Gaussian random parameters for risk measures including the\u0000mean, variance, and conditional value-at-risk. We also provide detailed\u0000analysis of the approximations errors arising from two sources: the Gaussian\u0000mixture approximation and the Taylor approximations. Numerical experiments are\u0000conducted for a semilinear advection-diffusion-reaction equation with a random\u0000diffusion coefficient field and for the Helmholtz equation with a random wave\u0000speed field. For these examples, the proposed approximation strategy can\u0000achieve less than $1%$ relative error in estimating CVaR with only\u0000$mathcal{O}(10)$ state PDE solves, which is comparable to a standard Monte\u0000Carlo estimate with $mathcal{O}(10^4)$ samples, thus achieving significant\u0000reduction in computational cost. The proposed method can therefore serve as a\u0000way to rapidly and accurately estimate risk measures under limited\u0000computational budgets.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"35 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the generalizability of the optimal 0.234 acceptance rate in random-walk Metropolis and parallel tempering algorithms 探索随机漫步 Metropolis 算法和并行调节算法中 0.234 最佳接受率的通用性
Pub Date : 2024-08-13 DOI: arxiv-2408.06894
Aidan Li, Liyan Wang, Tianye Dou, Jeffrey S. Rosenthal
For random-walk Metropolis (RWM) and parallel tempering (PT) algorithms, anasymptotic acceptance rate of around 0.234 is known to be optimal in thehigh-dimensional limit. Yet, the practical relevance of this value is uncertaindue to the restrictive conditions underlying its derivation. We synthesiseprevious theoretical advances in extending the 0.234 acceptance rate to moregeneral settings, and demonstrate the applicability and generalizability of the0.234 theory for practitioners with a comprehensive empirical simulation studyon a variety of examples examining how acceptance rates affect Expected SquaredJumping Distance (ESJD). Our experiments show the optimality of the 0.234acceptance rate for RWM is surprisingly robust even in lower dimensions acrossvarious proposal and multimodal target distributions which may or may not havean i.i.d. product density. Experiments on parallel tempering also show that theidealized 0.234 spacing of inverse temperatures may be approximately optimalfor low dimensions and non i.i.d. product target densities, and thatconstructing an inverse temperature ladder with spacings given by a swapacceptance of 0.234 is a viable strategy. However, we observe the applicabilityof the 0.234 acceptance rate heuristic diminishes for both RWM and PTalgorithms below a certain dimension which differs based on the target density,and that inhomogeneously scaled components in the target density furtherreduces its applicability in lower dimensions.
对于随机漫步 Metropolis(RWM)和并行调质(PT)算法来说,0.234 左右的渐近接受率是高维极限下的最佳值。然而,由于其推导所依据的限制性条件,该值的实际意义并不确定。我们综合了之前的理论进展,将 0.234 接受率扩展到了更一般的环境中,并通过对各种实例进行全面的实证模拟研究,考察了接受率如何影响期望平方跳转距离(ESJD),从而证明了 0.234 理论对实践者的适用性和普适性。我们的实验表明,0.234 接受率对于 RWM 的最优性出奇地稳健,即使是在较低维度上,也能跨越各种提议和多模式目标分布(可能有也可能没有 i.i.d. 乘积密度)。平行回火的实验还表明,理想化的 0.234 逆温间距对于低尺寸和非 i.i.d. 产品目标密度来说可能是近似最佳的,而且用 0.234 的交换接受度给出的间距来构建逆温阶梯是一种可行的策略。然而,我们观察到,0.234 接受率启发式对 RWM 和 PT 算法的适用性在一定维度以下会减弱,该维度根据目标密度的不同而不同,而且目标密度中不均匀缩放的成分会进一步降低其在较低维度的适用性。
{"title":"Exploring the generalizability of the optimal 0.234 acceptance rate in random-walk Metropolis and parallel tempering algorithms","authors":"Aidan Li, Liyan Wang, Tianye Dou, Jeffrey S. Rosenthal","doi":"arxiv-2408.06894","DOIUrl":"https://doi.org/arxiv-2408.06894","url":null,"abstract":"For random-walk Metropolis (RWM) and parallel tempering (PT) algorithms, an\u0000asymptotic acceptance rate of around 0.234 is known to be optimal in the\u0000high-dimensional limit. Yet, the practical relevance of this value is uncertain\u0000due to the restrictive conditions underlying its derivation. We synthesise\u0000previous theoretical advances in extending the 0.234 acceptance rate to more\u0000general settings, and demonstrate the applicability and generalizability of the\u00000.234 theory for practitioners with a comprehensive empirical simulation study\u0000on a variety of examples examining how acceptance rates affect Expected Squared\u0000Jumping Distance (ESJD). Our experiments show the optimality of the 0.234\u0000acceptance rate for RWM is surprisingly robust even in lower dimensions across\u0000various proposal and multimodal target distributions which may or may not have\u0000an i.i.d. product density. Experiments on parallel tempering also show that the\u0000idealized 0.234 spacing of inverse temperatures may be approximately optimal\u0000for low dimensions and non i.i.d. product target densities, and that\u0000constructing an inverse temperature ladder with spacings given by a swap\u0000acceptance of 0.234 is a viable strategy. However, we observe the applicability\u0000of the 0.234 acceptance rate heuristic diminishes for both RWM and PT\u0000algorithms below a certain dimension which differs based on the target density,\u0000and that inhomogeneously scaled components in the target density further\u0000reduces its applicability in lower dimensions.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"23 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - STAT - Computation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1