{"title":"不连续随机变量的罕见事件模拟与分裂","authors":"Clément Walter","doi":"10.1051/PS/2015017","DOIUrl":null,"url":null,"abstract":"Multilevel Splitting methods, also called Sequential Monte-Carlo or \\emph{Subset Simulation}, are widely used methods for estimating extreme probabilities of the form $P[S(\\mathbf{U}) > q]$ where $S$ is a deterministic real-valued function and $\\mathbf{U}$ can be a random finite- or infinite-dimensional vector. Very often, $X := S(\\mathbf{U})$ is supposed to be a continuous random variable and a lot of theoretical results on the statistical behaviour of the estimator are now derived with this hypothesis. However, as soon as some threshold effect appears in $S$ and/or $\\mathbf{U}$ is discrete or mixed discrete/continuous this assumption does not hold any more and the estimator is not consistent. \nIn this paper, we study the impact of discontinuities in the \\emph{cdf} of $X$ and present three unbiased \\emph{corrected} estimators to handle them. These estimators do not require to know in advance if $X$ is actually discontinuous or not and become all equal if $X$ is continuous. Especially, one of them has the same statistical properties in any case. Efficiency is shown on a 2-D diffusive process as well as on the \\emph{Boolean SATisfiability problem} (SAT).","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"21 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2015-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Rare Event Simulation and Splitting for Discontinuous Random Variables\",\"authors\":\"Clément Walter\",\"doi\":\"10.1051/PS/2015017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multilevel Splitting methods, also called Sequential Monte-Carlo or \\\\emph{Subset Simulation}, are widely used methods for estimating extreme probabilities of the form $P[S(\\\\mathbf{U}) > q]$ where $S$ is a deterministic real-valued function and $\\\\mathbf{U}$ can be a random finite- or infinite-dimensional vector. Very often, $X := S(\\\\mathbf{U})$ is supposed to be a continuous random variable and a lot of theoretical results on the statistical behaviour of the estimator are now derived with this hypothesis. However, as soon as some threshold effect appears in $S$ and/or $\\\\mathbf{U}$ is discrete or mixed discrete/continuous this assumption does not hold any more and the estimator is not consistent. \\nIn this paper, we study the impact of discontinuities in the \\\\emph{cdf} of $X$ and present three unbiased \\\\emph{corrected} estimators to handle them. These estimators do not require to know in advance if $X$ is actually discontinuous or not and become all equal if $X$ is continuous. Especially, one of them has the same statistical properties in any case. Efficiency is shown on a 2-D diffusive process as well as on the \\\\emph{Boolean SATisfiability problem} (SAT).\",\"PeriodicalId\":8446,\"journal\":{\"name\":\"arXiv: Computation\",\"volume\":\"21 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv: Computation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1051/PS/2015017\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv: Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1051/PS/2015017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Rare Event Simulation and Splitting for Discontinuous Random Variables
Multilevel Splitting methods, also called Sequential Monte-Carlo or \emph{Subset Simulation}, are widely used methods for estimating extreme probabilities of the form $P[S(\mathbf{U}) > q]$ where $S$ is a deterministic real-valued function and $\mathbf{U}$ can be a random finite- or infinite-dimensional vector. Very often, $X := S(\mathbf{U})$ is supposed to be a continuous random variable and a lot of theoretical results on the statistical behaviour of the estimator are now derived with this hypothesis. However, as soon as some threshold effect appears in $S$ and/or $\mathbf{U}$ is discrete or mixed discrete/continuous this assumption does not hold any more and the estimator is not consistent.
In this paper, we study the impact of discontinuities in the \emph{cdf} of $X$ and present three unbiased \emph{corrected} estimators to handle them. These estimators do not require to know in advance if $X$ is actually discontinuous or not and become all equal if $X$ is continuous. Especially, one of them has the same statistical properties in any case. Efficiency is shown on a 2-D diffusive process as well as on the \emph{Boolean SATisfiability problem} (SAT).