USLC: Universal self-learning control via physical performance policy-optimization neural network

IF 2.2 4区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS IET Control Theory and Applications Pub Date : 2024-11-11 DOI:10.1049/cth2.12758
Yanhui Zhang, Xiaoling Liang, Weifang Chen, Kunfeng Lu, Chao Xu, Shuzhi Sam Ge
{"title":"USLC: Universal self-learning control via physical performance policy-optimization neural network","authors":"Yanhui Zhang,&nbsp;Xiaoling Liang,&nbsp;Weifang Chen,&nbsp;Kunfeng Lu,&nbsp;Chao Xu,&nbsp;Shuzhi Sam Ge","doi":"10.1049/cth2.12758","DOIUrl":null,"url":null,"abstract":"<p>This article proposes an online universal self-learning control (USLC) algorithm based on a physical performance policy-optimization neural network, which aims to solve the problem of universal self-learning optimal control laws for nonlinear systems with various uncertain dynamics. As a key system characterization, this algorithm predicts the discrepancy between the optimal and current control laws by evaluating overall performance in each iterative learning cycle, leveraging an offline-trained universal policy network. This approach is universal, as it does not rely on an exact system model and can adaptively control performance preferences across various tasks by customizing the physical performance cost weights. Using the established control law-performance surface and contraction Lyapunov function, the necessary assumptions and proofs for the stable convergence of the system within a three-dimensional manifold space are provided. To demonstrate the universality of USLC, simulation experiments are conducted on two different systems: a low-order circuit system and a high-order variable-span aircraft attitude control system. The stable control achieved under varying initial values and boundary conditions in each system illustrates the effectiveness of the proposed method. Finally, the limitations of this study are discussed.</p>","PeriodicalId":50382,"journal":{"name":"IET Control Theory and Applications","volume":"18 18","pages":"2944-2960"},"PeriodicalIF":2.2000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.12758","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Control Theory and Applications","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cth2.12758","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

This article proposes an online universal self-learning control (USLC) algorithm based on a physical performance policy-optimization neural network, which aims to solve the problem of universal self-learning optimal control laws for nonlinear systems with various uncertain dynamics. As a key system characterization, this algorithm predicts the discrepancy between the optimal and current control laws by evaluating overall performance in each iterative learning cycle, leveraging an offline-trained universal policy network. This approach is universal, as it does not rely on an exact system model and can adaptively control performance preferences across various tasks by customizing the physical performance cost weights. Using the established control law-performance surface and contraction Lyapunov function, the necessary assumptions and proofs for the stable convergence of the system within a three-dimensional manifold space are provided. To demonstrate the universality of USLC, simulation experiments are conducted on two different systems: a low-order circuit system and a high-order variable-span aircraft attitude control system. The stable control achieved under varying initial values and boundary conditions in each system illustrates the effectiveness of the proposed method. Finally, the limitations of this study are discussed.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
USLC:通过物理性能策略优化神经网络实现通用自学习控制
本文提出了一种基于物理性能策略优化神经网络的在线通用自学习控制(USLC)算法,旨在解决具有各种不确定动态的非线性系统的通用自学习最优控制律问题。作为一个关键的系统特征,该算法通过评估每个迭代学习周期中的整体性能来预测最优控制律和当前控制律之间的差异,利用离线训练的通用策略网络。这种方法是通用的,因为它不依赖于精确的系统模型,并且可以通过自定义物理性能成本权重来自适应地控制各种任务的性能首选项。利用所建立的控制律性能曲面和收缩Lyapunov函数,给出了系统在三维流形空间稳定收敛的必要假设和证明。为了证明USLC的通用性,对低阶电路系统和高阶变跨度飞行器姿态控制系统进行了仿真实验。各系统在不同初始值和边界条件下的稳定控制表明了所提方法的有效性。最后,讨论了本研究的局限性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IET Control Theory and Applications
IET Control Theory and Applications 工程技术-工程:电子与电气
CiteScore
5.70
自引率
7.70%
发文量
167
审稿时长
5.1 months
期刊介绍: IET Control Theory & Applications is devoted to control systems in the broadest sense, covering new theoretical results and the applications of new and established control methods. Among the topics of interest are system modelling, identification and simulation, the analysis and design of control systems (including computer-aided design), and practical implementation. The scope encompasses technological, economic, physiological (biomedical) and other systems, including man-machine interfaces. Most of the papers published deal with original work from industrial and government laboratories and universities, but subject reviews and tutorial expositions of current methods are welcomed. Correspondence discussing published papers is also welcomed. Applications papers need not necessarily involve new theory. Papers which describe new realisations of established methods, or control techniques applied in a novel situation, or practical studies which compare various designs, would be of interest. Of particular value are theoretical papers which discuss the applicability of new work or applications which engender new theoretical applications.
期刊最新文献
Delayed teleoperated robotic eye surgical system; controller design and real-time experiments Design of an Improved Robust Fractional-Order PID Controller for Buck–Boost Converter using Snake Optimization Algorithm Quasi-bipartite synchronization for delayed coupled inertial neural networks having cooperative and competitive communication via pinning control strategy Robust Distributed Fault Estimation in Disturbed Uncertain Interconnected Systems With Time-Varying Delays A Finite-Time Non-Singular Fast Terminal Sliding Mode Control of Wheeled Mobile Robots With Prescribed Performance
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1