Pub Date : 2024-08-13DOI: 10.1109/TSMC.2024.3426923
Dongdong Li;Jiuxiang Dong
In this article, a feasible reinforcement learning (RL) scheme is proposed for partially unknown fractional-order nonlinear systems (FONSs). First, the fractional Hamilton-Jacobi–Bellman (HJB) equation containing the dynamics of FONSs is proposed by constructing an auxiliary system and equivalent transformation. Then, the optimal solution of FONSs optimal control under a performance constraint is obtained. It is proved that the optimal cost function and optimal control policy can be approximated gradually by the policy iteration. By using the backstepping control, RL, and identifier-actor-critic neural networks (NNs), the unknown dynamics functions are approximated and the approximate optimal controllers are obtained. A Lyapunov function based on optimality error is constructed, then the fractional-order update laws of NNs weights are designed to ensure that the weights converge to the optimum. Thus, the use of the gradient descent algorithm in the context of the fractional-order calculus to train the NNs is avoided. Finally, the error signals are proved to be bounded and the effectiveness of the proposed algorithm is verified by the simulation of two practical examples.
{"title":"Approximate Optimized Backstepping Control of Uncertain Fractional-Order Nonlinear Systems Based on Reinforcement Learning","authors":"Dongdong Li;Jiuxiang Dong","doi":"10.1109/TSMC.2024.3426923","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3426923","url":null,"abstract":"In this article, a feasible reinforcement learning (RL) scheme is proposed for partially unknown fractional-order nonlinear systems (FONSs). First, the fractional Hamilton-Jacobi–Bellman (HJB) equation containing the dynamics of FONSs is proposed by constructing an auxiliary system and equivalent transformation. Then, the optimal solution of FONSs optimal control under a performance constraint is obtained. It is proved that the optimal cost function and optimal control policy can be approximated gradually by the policy iteration. By using the backstepping control, RL, and identifier-actor-critic neural networks (NNs), the unknown dynamics functions are approximated and the approximate optimal controllers are obtained. A Lyapunov function based on optimality error is constructed, then the fractional-order update laws of NNs weights are designed to ensure that the weights converge to the optimum. Thus, the use of the gradient descent algorithm in the context of the fractional-order calculus to train the NNs is avoided. Finally, the error signals are proved to be bounded and the effectiveness of the proposed algorithm is verified by the simulation of two practical examples.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142443082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-09DOI: 10.1109/TSMC.2024.3426931
Jun Zhang;Yi Zuo;Shaocheng Tong
In this article, the adaptive fuzzy dynamic event-triggered output feedback resilient consensus control issue is investigated for nonlinear multiagent systems (MASs) subject to denial-of-service (DoS) attacks. Fuzzy logic systems (FLSs) are employed to model uncertain agents, and a state observer is constructed to estimate unmeasurable states. An event-triggered distributed resilient observer is designed to save the communication resources between agents, and estimate the unknown leader and its high-order derivatives in case of the communication topology being interrupted by DoS attacks. By the designed state observer and distributed resilient observer, a dynamic event-triggered resilient consensus control method is presented. It is proved that the controlled MASs are stable, and the followers can track the leader under DoS attacks. Moreover, the Zeno behavior can be excluded. Finally, we apply the developed resilient consensus control algorithm to multiple unmanned surface vehicles (USVs), the simulation results verify its effectiveness.
本文研究了受拒绝服务(DoS)攻击的非线性多代理系统(MAS)的自适应模糊动态事件触发输出反馈弹性共识控制问题。研究采用模糊逻辑系统(FLS)来模拟不确定的代理,并构建了一个状态观测器来估计不可测量的状态。设计了一个事件触发的分布式弹性观测器,以节省代理之间的通信资源,并在通信拓扑被 DoS 攻击中断时估计未知领导者及其高阶导数。通过所设计的状态观测器和分布式弹性观测器,提出了一种动态事件触发弹性共识控制方法。实验证明,受控的 MAS 是稳定的,在 DoS 攻击下,跟随者可以跟踪领导者。此外,还可以排除 Zeno 行为。最后,我们将所开发的弹性共识控制算法应用于多个无人水面飞行器(USV),仿真结果验证了该算法的有效性。
{"title":"Dynamic Event-Triggered Fuzzy Adaptive Resilient Consensus Control for Nonlinear MASs Under DoS Attacks","authors":"Jun Zhang;Yi Zuo;Shaocheng Tong","doi":"10.1109/TSMC.2024.3426931","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3426931","url":null,"abstract":"In this article, the adaptive fuzzy dynamic event-triggered output feedback resilient consensus control issue is investigated for nonlinear multiagent systems (MASs) subject to denial-of-service (DoS) attacks. Fuzzy logic systems (FLSs) are employed to model uncertain agents, and a state observer is constructed to estimate unmeasurable states. An event-triggered distributed resilient observer is designed to save the communication resources between agents, and estimate the unknown leader and its high-order derivatives in case of the communication topology being interrupted by DoS attacks. By the designed state observer and distributed resilient observer, a dynamic event-triggered resilient consensus control method is presented. It is proved that the controlled MASs are stable, and the followers can track the leader under DoS attacks. Moreover, the Zeno behavior can be excluded. Finally, we apply the developed resilient consensus control algorithm to multiple unmanned surface vehicles (USVs), the simulation results verify its effectiveness.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142443057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-09DOI: 10.1109/TSMC.2024.3426083
Giulio Ferro;Michela Robba;Roberto Sacile
Shortly, power distribution grids will incorporate large amounts of distributed energy resources and flexible loads, allowing the operation of a portion of the network in islanded mode to increase the reliability and resilience of the whole power system. A fully distributed robust model predictive control (MPC) strategy for voltage and frequency regulation in interconnected distribution grids is stated. Each grid node represents a collection of prosumers with a large active and reactive power regulation capacity. The advantages of this approach rely on the capability to afford any type of uncertainties, without making any assumption on the probability density function, on distributed generation and load nowcasting. We propose a two-stage architecture: at the first stage, an MPC approach, based on the distributed alternating direction method of multipliers (dADMM), is performed, considering the data nowcasting; instead, the second stage (based on robust distributed team decision theory) takes as input the trajectory of the first stage to compensate the noise that affects the system. The developed architecture has been tested on a modified IEEE5 bus system, considering multiple loads and renewable generation.
{"title":"A Fully Distributed Robust MPC Approach for Frequency and Voltage Regulation in Smart Grids With Active and Reactive Power Constraints","authors":"Giulio Ferro;Michela Robba;Roberto Sacile","doi":"10.1109/TSMC.2024.3426083","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3426083","url":null,"abstract":"Shortly, power distribution grids will incorporate large amounts of distributed energy resources and flexible loads, allowing the operation of a portion of the network in islanded mode to increase the reliability and resilience of the whole power system. A fully distributed robust model predictive control (MPC) strategy for voltage and frequency regulation in interconnected distribution grids is stated. Each grid node represents a collection of prosumers with a large active and reactive power regulation capacity. The advantages of this approach rely on the capability to afford any type of uncertainties, without making any assumption on the probability density function, on distributed generation and load nowcasting. We propose a two-stage architecture: at the first stage, an MPC approach, based on the distributed alternating direction method of multipliers (dADMM), is performed, considering the data nowcasting; instead, the second stage (based on robust distributed team decision theory) takes as input the trajectory of the first stage to compensate the noise that affects the system. The developed architecture has been tested on a modified IEEE5 bus system, considering multiple loads and renewable generation.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142443017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-06DOI: 10.1109/TSMC.2024.3432615
Lien-Wu Chen;Chi-Ren Chen
This study proposes a centimeter-level indoor positioning (CLIP) framework to achieve highly accurate localization with facing direction detection for the microlocation-aware Internet of Things (IoT). The CLIP framework can provide accurate centimeter-level positioning information to people indoors by integrating installed surveillance cameras with the IoT, where the efficient operation of microlocation-aware IoT applications and services can be enabled for smart spaces. CLIP can be used to accurately determine the position and facing direction of an individual. According to our review of relevant research, CLIP is the first indoor positioning framework that includes the following features: 1) centimeter-level positioning accuracy for the microlocation-aware IoT that can detect the facing direction of individuals; 2) employment of existing surveillance cameras with low-additional installation cost; and 3) innovative infrastructure for microlocation-aware IoT applications that can enable accurate centimeter-level path planning for individuals, emergency evacuation for groups of people, and geofencing with microlocation awareness. An Android-based system was implemented to verify the feasibility and effectiveness of the CLIP framework, and experimental results indicate that CLIP outperforms existing indoor positioning methods and can achieve centimeter-level accuracy with the improvement ratio of 94.6% over Sextant.
{"title":"Centimeter-Level Indoor Positioning With Facing Direction Detection for Microlocation-Aware Services","authors":"Lien-Wu Chen;Chi-Ren Chen","doi":"10.1109/TSMC.2024.3432615","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3432615","url":null,"abstract":"This study proposes a centimeter-level indoor positioning (CLIP) framework to achieve highly accurate localization with facing direction detection for the microlocation-aware Internet of Things (IoT). The CLIP framework can provide accurate centimeter-level positioning information to people indoors by integrating installed surveillance cameras with the IoT, where the efficient operation of microlocation-aware IoT applications and services can be enabled for smart spaces. CLIP can be used to accurately determine the position and facing direction of an individual. According to our review of relevant research, CLIP is the first indoor positioning framework that includes the following features: 1) centimeter-level positioning accuracy for the microlocation-aware IoT that can detect the facing direction of individuals; 2) employment of existing surveillance cameras with low-additional installation cost; and 3) innovative infrastructure for microlocation-aware IoT applications that can enable accurate centimeter-level path planning for individuals, emergency evacuation for groups of people, and geofencing with microlocation awareness. An Android-based system was implemented to verify the feasibility and effectiveness of the CLIP framework, and experimental results indicate that CLIP outperforms existing indoor positioning methods and can achieve centimeter-level accuracy with the improvement ratio of 94.6% over Sextant.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142275005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-06DOI: 10.1109/TSMC.2024.3431222
Zixiang Nie;Kwang-Cheng Chen
Smart factories employ intelligent transportaton systems such as autonomous mobile robots (AMRs) to support real-time adjusted production flows for agile and flexible production. While decentralized transportation task execution provides a scalable multirobot system (MRS) for a smart factory, new coordination challenges arise in implementing such a system. Transportation-MRS collaborates with production-MRS to accommodate just-in-time (JIT) production, leading to nonstationary transportation tasks that transportation-MRS must learn and adapt to. Also, decentralized operation on a shared shop floor means that one robot cannot factor in peer robots’ task execution planning, leading to competitive collisions. Meanwhile, predictive coordination with communication among multiple learning and adapting intelligent robots is still an open problem. On top of identifying the aforementioned challenges, this article first proposes a multifloor transportation graph model to discretize transportation task execution and allow real-time adjustment of transportation paths toward collision-free. We introduce a unique collaborative multi-intelligent robot system approach taking each robot as a cyber–physical agent with automated artificial intelligence (AI) workflow. First, it includes a novel multiagent reinforcement learning (MARL) algorithm, where each robot predictively plans collision-avoidant paths. Second, we introduce a token-passing mechanism to resolve inevitable competitive collisions due to nonstationary tasks. The proposed approach innovatively uses the multifloor model as a domain model for planning. By allowing competitive collision to occur and resolve, a robot only needs to learn and adapt to uncertain parts of the environment—nonstationary tasks and peer robots’ paths. Computational experiments show that our approach is both sample-efficient and computationally efficient. The transportation-MRS quickly reaches near-optimal performance levels, which are empirically shown to scale with the number of robots involved.
{"title":"Predictive Path Coordination of Collaborative Transportation Multirobot System in a Smart Factory","authors":"Zixiang Nie;Kwang-Cheng Chen","doi":"10.1109/TSMC.2024.3431222","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3431222","url":null,"abstract":"Smart factories employ intelligent transportaton systems such as autonomous mobile robots (AMRs) to support real-time adjusted production flows for agile and flexible production. While decentralized transportation task execution provides a scalable multirobot system (MRS) for a smart factory, new coordination challenges arise in implementing such a system. Transportation-MRS collaborates with production-MRS to accommodate just-in-time (JIT) production, leading to nonstationary transportation tasks that transportation-MRS must learn and adapt to. Also, decentralized operation on a shared shop floor means that one robot cannot factor in peer robots’ task execution planning, leading to competitive collisions. Meanwhile, predictive coordination with communication among multiple learning and adapting intelligent robots is still an open problem. On top of identifying the aforementioned challenges, this article first proposes a multifloor transportation graph model to discretize transportation task execution and allow real-time adjustment of transportation paths toward collision-free. We introduce a unique collaborative multi-intelligent robot system approach taking each robot as a cyber–physical agent with automated artificial intelligence (AI) workflow. First, it includes a novel multiagent reinforcement learning (MARL) algorithm, where each robot predictively plans collision-avoidant paths. Second, we introduce a token-passing mechanism to resolve inevitable competitive collisions due to nonstationary tasks. The proposed approach innovatively uses the multifloor model as a domain model for planning. By allowing competitive collision to occur and resolve, a robot only needs to learn and adapt to uncertain parts of the environment—nonstationary tasks and peer robots’ paths. Computational experiments show that our approach is both sample-efficient and computationally efficient. The transportation-MRS quickly reaches near-optimal performance levels, which are empirically shown to scale with the number of robots involved.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142274908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}