Distributionally Robust Policy and Lyapunov-Certificate Learning

Kehan Long, Jorge Cortés, Nikolay Atanasov
{"title":"Distributionally Robust Policy and Lyapunov-Certificate Learning","authors":"Kehan Long;Jorge Cortés;Nikolay Atanasov","doi":"10.1109/OJCSYS.2024.3440051","DOIUrl":null,"url":null,"abstract":"This article presents novel methods for synthesizing distributionally robust stabilizing neural controllers and certificates for control systems under model uncertainty. A key challenge in designing controllers with stability guarantees for uncertain systems is the accurate determination of and adaptation to shifts in model parametric uncertainty during online deployment. We tackle this with a novel distributionally robust formulation of the Lyapunov derivative chance constraint ensuring a monotonic decrease of the Lyapunov certificate. To avoid the computational complexity involved in dealing with the space of probability measures, we identify a sufficient condition in the form of deterministic convex constraints that ensures the Lyapunov derivative constraint is satisfied. We integrate this condition into a loss function for training a neural network-based controller and show that, for the resulting closed-loop system, the global asymptotic stability of its equilibrium can be certified with high confidence, even with Out-of-Distribution (OoD) model uncertainties. To demonstrate the efficacy and efficiency of the proposed methodology, we compare it with an uncertainty-agnostic baseline approach and several reinforcement learning approaches in two control problems in simulation. Open-source implementations of the examples are available at \n<uri>https://github.com/KehanLong/DR_Stabilizing_Policy</uri>\n.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"375-388"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10629071","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE open journal of control systems","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10629071/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This article presents novel methods for synthesizing distributionally robust stabilizing neural controllers and certificates for control systems under model uncertainty. A key challenge in designing controllers with stability guarantees for uncertain systems is the accurate determination of and adaptation to shifts in model parametric uncertainty during online deployment. We tackle this with a novel distributionally robust formulation of the Lyapunov derivative chance constraint ensuring a monotonic decrease of the Lyapunov certificate. To avoid the computational complexity involved in dealing with the space of probability measures, we identify a sufficient condition in the form of deterministic convex constraints that ensures the Lyapunov derivative constraint is satisfied. We integrate this condition into a loss function for training a neural network-based controller and show that, for the resulting closed-loop system, the global asymptotic stability of its equilibrium can be certified with high confidence, even with Out-of-Distribution (OoD) model uncertainties. To demonstrate the efficacy and efficiency of the proposed methodology, we compare it with an uncertainty-agnostic baseline approach and several reinforcement learning approaches in two control problems in simulation. Open-source implementations of the examples are available at https://github.com/KehanLong/DR_Stabilizing_Policy.
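The following is a minimal, illustrative sketch (not the authors' implementation) of how a Lyapunov-decrease condition of the form ∇V(x)ᵀ f(x, π(x), ξ) ≤ −α V(x) can be folded into a training loss for a neural controller and certificate. It assumes PyTorch and a toy pendulum-style system with one uncertain parameter ξ, and it replaces the paper's distributionally robust chance constraint with a simple empirical hinge penalty averaged over a few sampled parameter values; all names here (dynamics, Policy, LyapunovCertificate, lyapunov_decrease_loss) are hypothetical.

```python
import torch
import torch.nn as nn

# Hypothetical pendulum-style dynamics with one uncertain parameter xi
# (e.g., a gravity/length scaling); this is NOT the system used in the paper.
def dynamics(x, u, xi):
    theta, omega = x[:, 0], x[:, 1]
    dtheta = omega
    domega = xi * torch.sin(theta) + u.squeeze(-1)
    return torch.stack([dtheta, domega], dim=-1)

class Policy(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(2, 64), nn.Tanh(), nn.Linear(64, 1))

    def forward(self, x):
        return self.net(x)

class LyapunovCertificate(nn.Module):
    """V(x) = ||phi(x) - phi(0)||^2 + eps*||x||^2, so V(0) = 0 and V(x) > 0 otherwise."""
    def __init__(self, eps=1e-2):
        super().__init__()
        self.phi = nn.Sequential(nn.Linear(2, 64), nn.Tanh(), nn.Linear(64, 16))
        self.eps = eps

    def forward(self, x):
        d = self.phi(x) - self.phi(torch.zeros_like(x))
        return (d * d).sum(-1) + self.eps * (x * x).sum(-1)

def lyapunov_decrease_loss(policy, V, x, xi_samples, alpha=0.1):
    """Average hinge penalty on violations of dV/dt <= -alpha * V(x) over sampled xi."""
    x = x.requires_grad_(True)
    v = V(x)
    grad_v = torch.autograd.grad(v.sum(), x, create_graph=True)[0]
    u = policy(x)
    loss = torch.zeros(())
    for xi in xi_samples:  # empirical surrogate for the distributionally robust constraint
        vdot = (grad_v * dynamics(x, u, xi)).sum(-1)
        loss = loss + torch.relu(vdot + alpha * v).mean()
    return loss / len(xi_samples)

policy, V = Policy(), LyapunovCertificate()
opt = torch.optim.Adam(list(policy.parameters()) + list(V.parameters()), lr=1e-3)
xi_samples = [torch.tensor(0.8), torch.tensor(1.0), torch.tensor(1.2)]

for step in range(1000):
    x = 4.0 * (torch.rand(256, 2) - 0.5)  # states sampled from the training region
    loss = lyapunov_decrease_loss(policy, V, x, xi_samples)
    opt.zero_grad()
    loss.backward()
    opt.step()
```

In the paper, by contrast, the chance constraint is reformulated into deterministic convex constraints, which is what enables high-confidence stability certificates even under out-of-distribution parameter shifts; the averaging over xi_samples above is only a stand-in for that machinery.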