{"title":"Stratification and Optimal Resampling for Sequential Monte Carlo","authors":"Yichao Li, Wenshuo Wang, Ke Deng, Jun S. Liu","doi":"10.1093/BIOMET/ASAB004","DOIUrl":null,"url":null,"abstract":"Sequential Monte Carlo (SMC), also known as particle filters, has been widely accepted as a powerful computational tool for making inference with dynamical systems. A key step in SMC is resampling, which plays the role of steering the algorithm towards the future dynamics. Several strategies have been proposed and used in practice, including multinomial resampling, residual resampling (Liu and Chen 1998), optimal resampling (Fearnhead and Clifford 2003), stratified resampling (Kitagawa 1996), and optimal transport resampling (Reich 2013). We show that, in the one dimensional case, optimal transport resampling is equivalent to stratified resampling on the sorted particles, and they both minimize the resampling variance as well as the expected squared energy distance between the original and resampled empirical distributions; in the multidimensional case, the variance of stratified resampling after sorting particles using Hilbert curve (Gerber et al. 2019) in $\\mathbb{R}^d$ is $O(m^{-(1+2/d)})$, an improved rate compared to the original $O(m^{-(1+1/d)})$, where $m$ is the number of particles. This improved rate is the lowest for ordered stratified resampling schemes, as conjectured in Gerber et al. (2019). We also present an almost sure bound on the Wasserstein distance between the original and Hilbert-curve-resampled empirical distributions. In light of these theoretical results, we propose the stratified multiple-descendant growth (SMG) algorithm, which allows us to explore the sample space more efficiently compared to the standard i.i.d. multiple-descendant sampling-resampling approach as measured by the Wasserstein metric. Numerical evidence is provided to demonstrate the effectiveness of our proposed method.","PeriodicalId":186390,"journal":{"name":"arXiv: Methodology","volume":"57 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv: Methodology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/BIOMET/ASAB004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Sequential Monte Carlo (SMC), also known as particle filters, has been widely accepted as a powerful computational tool for making inference with dynamical systems. A key step in SMC is resampling, which plays the role of steering the algorithm towards the future dynamics. Several strategies have been proposed and used in practice, including multinomial resampling, residual resampling (Liu and Chen 1998), optimal resampling (Fearnhead and Clifford 2003), stratified resampling (Kitagawa 1996), and optimal transport resampling (Reich 2013). We show that, in the one dimensional case, optimal transport resampling is equivalent to stratified resampling on the sorted particles, and they both minimize the resampling variance as well as the expected squared energy distance between the original and resampled empirical distributions; in the multidimensional case, the variance of stratified resampling after sorting particles using Hilbert curve (Gerber et al. 2019) in $\mathbb{R}^d$ is $O(m^{-(1+2/d)})$, an improved rate compared to the original $O(m^{-(1+1/d)})$, where $m$ is the number of particles. This improved rate is the lowest for ordered stratified resampling schemes, as conjectured in Gerber et al. (2019). We also present an almost sure bound on the Wasserstein distance between the original and Hilbert-curve-resampled empirical distributions. In light of these theoretical results, we propose the stratified multiple-descendant growth (SMG) algorithm, which allows us to explore the sample space more efficiently compared to the standard i.i.d. multiple-descendant sampling-resampling approach as measured by the Wasserstein metric. Numerical evidence is provided to demonstrate the effectiveness of our proposed method.
序列蒙特卡罗(SMC),也被称为粒子滤波,作为一种对动态系统进行推理的强大计算工具已被广泛接受。SMC的关键步骤是重采样,它起着引导算法走向未来动态的作用。已经提出并在实践中使用了几种策略,包括多项重采样、残差重采样(Liu and Chen 1998)、最优重采样(Fearnhead and Clifford 2003)、分层重采样(Kitagawa 1996)和最优输运重采样(Reich 2013)。我们发现,在一维情况下,最优输运重采样等同于对已排序粒子进行分层重采样,它们都最小化了重采样方差以及原始和重采样经验分布之间的期望能量距离的平方;在多维情况下,使用希尔伯特曲线(Gerber et al. 2019)在$\mathbb{R}^d$中对粒子进行排序后的分层重采样方差为$O(m^{-(1+2/d)})$,与原始的$O(m^{-(1+1/d)})$相比,提高了速率,其中$m$为粒子数。根据Gerber等人(2019)的推测,对于有序分层重采样方案,这一改进率是最低的。我们还提出了原始和hilbert曲线重新抽样的经验分布之间的Wasserstein距离的几乎确定的界限。根据这些理论结果,我们提出了分层多后代生长(SMG)算法,与使用Wasserstein度量测量的标准i - id多后代采样-重采样方法相比,该算法使我们能够更有效地探索样本空间。数值实验证明了该方法的有效性。