Online Optimization of Dynamical Systems With Deep Learning Perception

IEEE open journal of control systems Pub Date : 2022-09-14 DOI:10.1109/OJCSYS.2022.3205871

Liliaokeawawa Cothren;Gianluca Bianchin;Emiliano Dall'Anese

{"title":"Online Optimization of Dynamical Systems With Deep Learning Perception","authors":"Liliaokeawawa Cothren;Gianluca Bianchin;Emiliano Dall'Anese","doi":"10.1109/OJCSYS.2022.3205871","DOIUrl":null,"url":null,"abstract":"This paper considers the problem of controlling a dynamical system when the state cannot be directly measured and the control performance metrics are unknown or only partially known. In particular, we focus on the design of data-driven controllers to regulate a dynamical system to the solution of a constrained convex optimization problem where: i) the state must be estimated from nonlinear and possibly high-dimensional data; and ii) the cost of the optimization problem – which models control objectives associated with inputs and states of the system – is not available and must be learned from data. We propose a data-driven feedback controller that is based on adaptations of a projected gradient-flow method; the controller includes neural networks as integral components for the estimation of the unknown functions. Leveraging stability theory for perturbed systems, we derive sufficient conditions to guarantee exponential input-to-state stability (ISS) of the control loop. In particular, we show that the interconnected system is ISS with respect to the approximation errors of the neural network and unknown disturbances affecting the system. The transient bounds combine the universal approximation property of deep neural networks with the ISS characterization. Illustrative numerical results are presented in the context of robotics and control of epidemics.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"306-321"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09891838.pdf","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE open journal of control systems","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/9891838/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

This paper considers the problem of controlling a dynamical system when the state cannot be directly measured and the control performance metrics are unknown or only partially known. In particular, we focus on the design of data-driven controllers to regulate a dynamical system to the solution of a constrained convex optimization problem where: i) the state must be estimated from nonlinear and possibly high-dimensional data; and ii) the cost of the optimization problem – which models control objectives associated with inputs and states of the system – is not available and must be learned from data. We propose a data-driven feedback controller that is based on adaptations of a projected gradient-flow method; the controller includes neural networks as integral components for the estimation of the unknown functions. Leveraging stability theory for perturbed systems, we derive sufficient conditions to guarantee exponential input-to-state stability (ISS) of the control loop. In particular, we show that the interconnected system is ISS with respect to the approximation errors of the neural network and unknown disturbances affecting the system. The transient bounds combine the universal approximation property of deep neural networks with the ISS characterization. Illustrative numerical results are presented in the context of robotics and control of epidemics.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

具有深度学习感知的动态系统在线优化

本文考虑了当状态不能直接测量并且控制性能指标未知或仅部分已知时控制动态系统的问题。特别地，我们专注于数据驱动控制器的设计，以将动力系统调节到约束凸优化问题的解，其中：i）状态必须从非线性和可能的高维数据中估计；以及ii）优化问题的成本——对与系统输入和状态相关的控制目标进行建模——是不可用的，必须从数据中学习。我们提出了一种基于投影梯度流方法自适应的数据驱动反馈控制器；控制器包括神经网络作为用于估计未知函数的积分部件。利用扰动系统的稳定性理论，我们导出了保证控制回路状态稳定性（ISS）的指数输入的充分条件。特别地，我们证明了关于神经网络的近似误差和影响系统的未知扰动，互连系统是ISS。瞬态边界将深度神经网络的普遍逼近特性与ISS特征相结合。在机器人和流行病控制的背景下给出了说明性的数值结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE open journal of control systems

自引率

0.00%

发文量