执行器失效情况下非线性典型多代理系统的容错优化共识控制的自适应强化学习

IF 4 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Systems Journal Pub Date : 2024-08-13 DOI:10.1109/JSYST.2024.3433023
Boyan Zhu;Liang Zhang;Ben Niu;Ning Zhao
{"title":"执行器失效情况下非线性典型多代理系统的容错优化共识控制的自适应强化学习","authors":"Boyan Zhu;Liang Zhang;Ben Niu;Ning Zhao","doi":"10.1109/JSYST.2024.3433023","DOIUrl":null,"url":null,"abstract":"This article addresses the adaptive optimized consensus tracking control problem of nonlinear multiagent systems (MASs) via a reinforcement learning (RL) algorithm. Specifically, the nonlinear high-order MASs are formulated in a canonical form, with considerations for both actuator effectiveness loss and time-varying bias faults. First, neural networks (NNs) are utilized to approximate unknown nonlinear dynamics, and a state identifier and a fault estimator based on NNs are established, both of which are essential for evaluating state information and bias faults, respectively. Second, to achieve a high-order canonical dynamic consensus and enhance the efficiency of the consensus control strategy, a sliding-mode mechanism is employed to regulate tracking errors. Moreover, we develop an adaptive NN-based fault-tolerant optimal control method by integrating the sliding-mode mechanism with an actor–critic structured RL algorithm. It is proved that the outputs of the MASs precisely align with the desired reference signals, while ensuring the boundedness of all closed-loop signals. Finally, the proposed control methodology's effectiveness is validated through a simulation example.","PeriodicalId":55017,"journal":{"name":"IEEE Systems Journal","volume":"18 3","pages":"1681-1692"},"PeriodicalIF":4.0000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adaptive Reinforcement Learning for Fault-Tolerant Optimal Consensus Control of Nonlinear Canonical Multiagent Systems With Actuator Loss of Effectiveness\",\"authors\":\"Boyan Zhu;Liang Zhang;Ben Niu;Ning Zhao\",\"doi\":\"10.1109/JSYST.2024.3433023\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article addresses the adaptive optimized consensus tracking control problem of nonlinear multiagent systems (MASs) via a reinforcement learning (RL) algorithm. Specifically, the nonlinear high-order MASs are formulated in a canonical form, with considerations for both actuator effectiveness loss and time-varying bias faults. First, neural networks (NNs) are utilized to approximate unknown nonlinear dynamics, and a state identifier and a fault estimator based on NNs are established, both of which are essential for evaluating state information and bias faults, respectively. Second, to achieve a high-order canonical dynamic consensus and enhance the efficiency of the consensus control strategy, a sliding-mode mechanism is employed to regulate tracking errors. Moreover, we develop an adaptive NN-based fault-tolerant optimal control method by integrating the sliding-mode mechanism with an actor–critic structured RL algorithm. It is proved that the outputs of the MASs precisely align with the desired reference signals, while ensuring the boundedness of all closed-loop signals. Finally, the proposed control methodology's effectiveness is validated through a simulation example.\",\"PeriodicalId\":55017,\"journal\":{\"name\":\"IEEE Systems Journal\",\"volume\":\"18 3\",\"pages\":\"1681-1692\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2024-08-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Systems Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10634586/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Systems Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10634586/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

本文通过强化学习(RL)算法解决了非线性多代理系统(MAS)的自适应优化共识跟踪控制问题。具体来说,非线性高阶 MAS 采用典型形式,同时考虑了执行器效力损失和时变偏差故障。首先,利用神经网络(NN)来逼近未知的非线性动力学,并建立了基于 NN 的状态识别器和故障估计器,这两者分别对评估状态信息和偏差故障至关重要。其次,为了实现高阶典型动态共识并提高共识控制策略的效率,我们采用了滑模机制来调节跟踪误差。此外,我们还将滑模机制与行为批判结构化 RL 算法相结合,开发了一种基于 NN 的自适应容错优化控制方法。事实证明,MAS 的输出与所需的参考信号精确一致,同时确保所有闭环信号的有界性。最后,通过一个仿真实例验证了所提出的控制方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Adaptive Reinforcement Learning for Fault-Tolerant Optimal Consensus Control of Nonlinear Canonical Multiagent Systems With Actuator Loss of Effectiveness
This article addresses the adaptive optimized consensus tracking control problem of nonlinear multiagent systems (MASs) via a reinforcement learning (RL) algorithm. Specifically, the nonlinear high-order MASs are formulated in a canonical form, with considerations for both actuator effectiveness loss and time-varying bias faults. First, neural networks (NNs) are utilized to approximate unknown nonlinear dynamics, and a state identifier and a fault estimator based on NNs are established, both of which are essential for evaluating state information and bias faults, respectively. Second, to achieve a high-order canonical dynamic consensus and enhance the efficiency of the consensus control strategy, a sliding-mode mechanism is employed to regulate tracking errors. Moreover, we develop an adaptive NN-based fault-tolerant optimal control method by integrating the sliding-mode mechanism with an actor–critic structured RL algorithm. It is proved that the outputs of the MASs precisely align with the desired reference signals, while ensuring the boundedness of all closed-loop signals. Finally, the proposed control methodology's effectiveness is validated through a simulation example.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Systems Journal
IEEE Systems Journal 工程技术-电信学
CiteScore
9.80
自引率
6.80%
发文量
572
审稿时长
4.9 months
期刊介绍: This publication provides a systems-level, focused forum for application-oriented manuscripts that address complex systems and system-of-systems of national and global significance. It intends to encourage and facilitate cooperation and interaction among IEEE Societies with systems-level and systems engineering interest, and to attract non-IEEE contributors and readers from around the globe. Our IEEE Systems Council job is to address issues in new ways that are not solvable in the domains of the existing IEEE or other societies or global organizations. These problems do not fit within traditional hierarchical boundaries. For example, disaster response such as that triggered by Hurricane Katrina, tsunamis, or current volcanic eruptions is not solvable by pure engineering solutions. We need to think about changing and enlarging the paradigm to include systems issues.
期刊最新文献
Relationship between emotional state and masticatory system function in a group of healthy volunteers aged 18-21. Table of Contents Front Cover Editorial IEEE Systems Council Information
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1