{"title":"Symbolic control for stochastic systems via finite parity games","authors":"Rupak Majumdar , Kaushik Mallik , Anne-Kathrin Schmuck , Sadegh Soudjani","doi":"10.1016/j.nahs.2023.101430","DOIUrl":null,"url":null,"abstract":"<div><p>We consider the problem of computing the maximal probability of satisfying an <span><math><mi>ω</mi></math></span>-regular specification for stochastic, continuous-state, nonlinear systems evolving in discrete time. The problem reduces, after automata-theoretic constructions, to finding the maximal probability of satisfying a parity condition on a (possibly hybrid) state space. While characterizing the exact satisfaction probability is open, we show that a lower bound on this probability can be obtained by <strong>(I)</strong> computing an under-approximation of the <em>qualitative winning region</em>, i.e., states from which the parity condition can be enforced almost surely, and <strong>(II)</strong> computing the maximal probability of reaching this qualitative winning region.</p><p>The heart of our approach is a technique to <em>symbolically</em> compute the under-approximation of the qualitative winning region in step <strong>(I)</strong> via a <em>finite-state abstraction</em> of the original system as a <span><math><mrow><mn>2</mn><mfrac><mrow><mn>1</mn></mrow><mrow><mn>2</mn></mrow></mfrac></mrow></math></span>-<em>player parity game</em>. Our abstraction procedure uses only the support of the probabilistic evolution; it does not use precise numerical transition probabilities. We prove that the winning set in the abstract <span><math><mrow><mn>2</mn><mfrac><mrow><mn>1</mn></mrow><mrow><mn>2</mn></mrow></mfrac></mrow></math></span>-player game induces an under-approximation of the qualitative winning region in the original synthesis problem, along with a policy to solve it. By combining these contributions with (a) a symbolic fixpoint algorithm to solve <span><math><mrow><mn>2</mn><mfrac><mrow><mn>1</mn></mrow><mrow><mn>2</mn></mrow></mfrac></mrow></math></span>-player games and (b) existing techniques for reachability policy synthesis in stochastic nonlinear systems, we get an <em>abstraction-based algorithm</em> for finding a lower bound on the maximal satisfaction probability.</p><p>We have implemented the abstraction-based algorithm in <span>Mascot-SDS</span>, where we combined the outlined abstraction step with our tool <span>Genie</span> (Majumdar et al., 2023) that solves <span><math><mrow><mn>2</mn><mfrac><mrow><mn>1</mn></mrow><mrow><mn>2</mn></mrow></mfrac></mrow></math></span>-player parity games (through a reduction to Rabin games) more efficiently than existing algorithms. We evaluated our implementation on the nonlinear model of a perturbed bistable switch from the literature. We show empirically that the lower bound on the winning region computed by our approach is precise, by comparing against an over-approximation of the qualitative winning region. Moreover, our implementation outperforms a recently proposed tool for solving this problem by a large margin.</p></div>","PeriodicalId":49011,"journal":{"name":"Nonlinear Analysis-Hybrid Systems","volume":"51 ","pages":"Article 101430"},"PeriodicalIF":3.7000,"publicationDate":"2023-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nonlinear Analysis-Hybrid Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1751570X23001012","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 1
Abstract
We consider the problem of computing the maximal probability of satisfying an -regular specification for stochastic, continuous-state, nonlinear systems evolving in discrete time. The problem reduces, after automata-theoretic constructions, to finding the maximal probability of satisfying a parity condition on a (possibly hybrid) state space. While characterizing the exact satisfaction probability is open, we show that a lower bound on this probability can be obtained by (I) computing an under-approximation of the qualitative winning region, i.e., states from which the parity condition can be enforced almost surely, and (II) computing the maximal probability of reaching this qualitative winning region.
The heart of our approach is a technique to symbolically compute the under-approximation of the qualitative winning region in step (I) via a finite-state abstraction of the original system as a -player parity game. Our abstraction procedure uses only the support of the probabilistic evolution; it does not use precise numerical transition probabilities. We prove that the winning set in the abstract -player game induces an under-approximation of the qualitative winning region in the original synthesis problem, along with a policy to solve it. By combining these contributions with (a) a symbolic fixpoint algorithm to solve -player games and (b) existing techniques for reachability policy synthesis in stochastic nonlinear systems, we get an abstraction-based algorithm for finding a lower bound on the maximal satisfaction probability.
We have implemented the abstraction-based algorithm in Mascot-SDS, where we combined the outlined abstraction step with our tool Genie (Majumdar et al., 2023) that solves -player parity games (through a reduction to Rabin games) more efficiently than existing algorithms. We evaluated our implementation on the nonlinear model of a perturbed bistable switch from the literature. We show empirically that the lower bound on the winning region computed by our approach is precise, by comparing against an over-approximation of the qualitative winning region. Moreover, our implementation outperforms a recently proposed tool for solving this problem by a large margin.
我们考虑在离散时间内演化的随机、连续状态、非线性系统满足ω-正则规范的最大概率的计算问题。在自动机理论构造之后,该问题简化为在(可能是混合的)状态空间上找到满足奇偶条件的最大概率。虽然表征精确满足概率是开放的,但我们表明,该概率的下界可以通过以下方式获得:(I)计算定性获胜区域的欠近似,即,几乎可以肯定地执行奇偶条件的状态,以及(II)计算到达该定性获胜区域的最大概率。我们方法的核心是一种技术,通过将原始系统的有限状态抽象为212人的奇偶游戏,在步骤(I)中象征性地计算定性获胜区域的欠近似。我们的抽象过程只使用概率进化的支持;它不使用精确的数值转移概率。我们证明了抽象212人博弈中的获胜集导致了原始综合问题中定性获胜区域的欠近似,以及解决该问题的策略。通过将这些贡献与(a)解决212人博弈的符号不动点算法和(b)随机非线性系统中可达性策略综合的现有技术相结合,我们得到了一个基于抽象的算法来寻找最大满足概率的下界。我们在Mascot SDS中实现了基于抽象的算法,其中我们将概述的抽象步骤与我们的工具Genie(Majumdar et al.,2023)相结合,该工具比现有算法更有效地解决了212个玩家奇偶性游戏(通过减少到Rabin游戏)。我们评估了我们在文献中扰动双稳态开关的非线性模型上的实现。通过与定性获胜区域的过近似进行比较,我们从经验上表明,通过我们的方法计算的获胜区域的下界是精确的。此外,我们的实现在很大程度上优于最近提出的解决此问题的工具。
期刊介绍:
Nonlinear Analysis: Hybrid Systems welcomes all important research and expository papers in any discipline. Papers that are principally concerned with the theory of hybrid systems should contain significant results indicating relevant applications. Papers that emphasize applications should consist of important real world models and illuminating techniques. Papers that interrelate various aspects of hybrid systems will be most welcome.