Rare Event Simulation and Splitting for Discontinuous Random Variables

arXiv: Computation. Publication date: 2015-07-03. DOI: 10.1051/PS/2015017

Clément Walter


Citations: 4

Abstract



Multilevel Splitting methods, also called Sequential Monte-Carlo or \emph{Subset Simulation}, are widely used for estimating extreme probabilities of the form $P[S(\mathbf{U}) > q]$, where $S$ is a deterministic real-valued function and $\mathbf{U}$ is a random vector of finite or infinite dimension. Very often, $X := S(\mathbf{U})$ is assumed to be a continuous random variable, and many theoretical results on the statistical behaviour of the estimator have been derived under this hypothesis. However, as soon as some threshold effect appears in $S$ and/or $\mathbf{U}$ is discrete or mixed discrete/continuous, this assumption no longer holds and the estimator is not consistent. In this paper, we study the impact of discontinuities in the \emph{cdf} of $X$ and present three unbiased \emph{corrected} estimators to handle them. These estimators do not require knowing in advance whether $X$ is actually discontinuous, and they all coincide if $X$ is continuous. In particular, one of them has the same statistical properties in either case. Efficiency is shown on a 2-D diffusive process as well as on the \emph{Boolean SATisfiability problem} (SAT).
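To make the setting concrete, the following is a minimal sketch of a plain (uncorrected) splitting estimator in its "last particle" form, under the continuity hypothesis the abstract refers to. It is not one of the corrected estimators of the paper. The toy model $X \sim \mathrm{Exp}(1)$, the function name `last_particle_estimate`, and the parameter values are assumptions chosen for illustration; the exponential law is used only because sampling $X$ conditionally on $X > x$ is then exact.

```python
import math
import random


def last_particle_estimate(q, n_particles, rng):
    """Plain splitting estimate of P[X > q] for the toy case X ~ Exp(1).

    Hypothetical helper for illustration; assumes X is continuous,
    so that ties between particles occur with probability zero.
    """
    # Start from N i.i.d. copies of X.
    particles = [rng.expovariate(1.0) for _ in range(n_particles)]
    n_moves = 0
    while True:
        # The lowest particle defines the current level.
        i_min = min(range(n_particles), key=particles.__getitem__)
        level = particles[i_min]
        if level > q:
            break
        # Resample it from the law of X given X > level.
        # For Exp(1) this is exact by memorylessness: level + Exp(1).
        particles[i_min] = level + rng.expovariate(1.0)
        n_moves += 1
    # Each move contributes a factor (1 - 1/N) to the estimate.
    return (1.0 - 1.0 / n_particles) ** n_moves


rng = random.Random(0)
q = 10.0
estimate = last_particle_estimate(q, n_particles=200, rng=rng)
exact = math.exp(-q)
print(f"estimate = {estimate:.3e}, exact = {exact:.3e}")
```

The factor $(1 - 1/N)$ per move relies on exactly one particle sitting at the current level. When the cdf of $X$ has jumps, several particles can share the same value, which is the situation the corrected estimators are designed to handle.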
