A Time-Delay Modeling Approach for Data-Driven Predictive Control of Continuous-Time Systems

IF 6.4 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automation Science and Engineering Pub Date : 2024-08-26 DOI:10.1109/TASE.2024.3445335

Juan Liu;Xindi Yang;Hao Zhang;Zhuping Wang;Huaicheng Yan

{"title":"A Time-Delay Modeling Approach for Data-Driven Predictive Control of Continuous-Time Systems","authors":"Juan Liu;Xindi Yang;Hao Zhang;Zhuping Wang;Huaicheng Yan","doi":"10.1109/TASE.2024.3445335","DOIUrl":null,"url":null,"abstract":"This paper aims to predict future optimal control inputs for unknown continuous-time linear systems. In contrast to most existing approaches that generate control input without observation loss, the proposed control scheme analyzes causality between observation and sequence feedback from the time-delay series model, allowing for the existence of observation loss. These time-delay series can be viewed as multiplayer games over a temporal scale, then a temporal game-theoretic approach ensures system stability and performance. By integrating the Bellman principle, a data-driven adaptive dynamic programming algorithm is proposed to avoid system knowledge. Furthermore, the designed parallel data-driven predictive algorithm reduces the computational complexity. Finally, the applicability and effectiveness of the methodology are demonstrated through numerical simulations and practical experiments. Note to Practitioners—This paper mainly concerns the predictive control of unknown continuous systems, which suffer from unknown dynamics and state observation loss. The proposed methods are suitable for weak information feedback and dynamically changing scenarios, such as autonomous driving in low visibility scenarios and endoscopic surgical robots. Most of the current processing methods are model-driven, which makes them unsuitable for unknown system dynamics changes. To address this issue, we propose a time-delay switched strategy for control prediction with stability and optimality guarantees. The practical application can be divided into three parts: i) Data collection: Collect historical multi-intervals accumulated inputs and outputs data, the amount of data should reach the requirement of the full rank of the data matrix; ii) Iterative learning: the optimal control strategy is learned from the historical data through an adaptive dynamic programming method; iii) Deployment: Control intervals are extended through temporal time-delay feedback on historical state trajectory, and length trigger conditions will switch these feedbacks logically for actuators. Finally, the Quanser QBot 2e robot is used as a demonstration example.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"6408-6420"},"PeriodicalIF":6.4000,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10645795/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

This paper aims to predict future optimal control inputs for unknown continuous-time linear systems. In contrast to most existing approaches that generate control input without observation loss, the proposed control scheme analyzes causality between observation and sequence feedback from the time-delay series model, allowing for the existence of observation loss. These time-delay series can be viewed as multiplayer games over a temporal scale, then a temporal game-theoretic approach ensures system stability and performance. By integrating the Bellman principle, a data-driven adaptive dynamic programming algorithm is proposed to avoid system knowledge. Furthermore, the designed parallel data-driven predictive algorithm reduces the computational complexity. Finally, the applicability and effectiveness of the methodology are demonstrated through numerical simulations and practical experiments. Note to Practitioners—This paper mainly concerns the predictive control of unknown continuous systems, which suffer from unknown dynamics and state observation loss. The proposed methods are suitable for weak information feedback and dynamically changing scenarios, such as autonomous driving in low visibility scenarios and endoscopic surgical robots. Most of the current processing methods are model-driven, which makes them unsuitable for unknown system dynamics changes. To address this issue, we propose a time-delay switched strategy for control prediction with stability and optimality guarantees. The practical application can be divided into three parts: i) Data collection: Collect historical multi-intervals accumulated inputs and outputs data, the amount of data should reach the requirement of the full rank of the data matrix; ii) Iterative learning: the optimal control strategy is learned from the historical data through an adaptive dynamic programming method; iii) Deployment: Control intervals are extended through temporal time-delay feedback on historical state trajectory, and length trigger conditions will switch these feedbacks logically for actuators. Finally, the Quanser QBot 2e robot is used as a demonstration example.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

连续时间系统数据驱动预测控制的时延建模方法

本文旨在预测未知连续时间线性系统的未来最优控制输入。与大多数不产生观测损失的控制输入方法不同，本文提出的控制方案分析了观测值与时滞序列模型序列反馈之间的因果关系，考虑了观测损失的存在。这些时滞序列可以看作是时间尺度上的多人游戏，然后时间博弈论方法确保系统的稳定性和性能。结合Bellman原理，提出了一种数据驱动的自适应动态规划算法。此外，所设计的并行数据驱动预测算法降低了计算复杂度。最后，通过数值模拟和实际实验验证了该方法的适用性和有效性。本文主要研究具有未知动态和状态观测损失的未知连续系统的预测控制问题。提出的方法适用于弱信息反馈和动态变化的场景，如低能见度场景下的自动驾驶和内镜手术机器人。目前的处理方法大多是模型驱动的，这使得它们不适用于未知的系统动力学变化。为了解决这个问题，我们提出了一种具有稳定性和最优性保证的时滞切换控制预测策略。实际应用可分为三个部分：i)数据采集：采集历史多区间累积的输入输出数据，数据量应达到数据矩阵满秩的要求；ii)迭代学习：通过自适应动态规划方法从历史数据中学习最优控制策略；iii)部署：通过历史状态轨迹的时间延迟反馈扩展控制间隔，长度触发条件将这些反馈逻辑切换到执行器。最后，以qanser QBot 2e机器人为例进行了演示。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统

CiteScore

12.50

自引率

14.30%

发文量

404

审稿时长

3.0 months

期刊介绍： The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.