首页 > 最新文献

IEEE Transactions on Cybernetics最新文献

英文 中文
Reinforcement Learning for Synchronization of Heterogeneous Multiagent Systems by Improved Q-Functions 通过改进的 Q$ 函数实现异构多代理系统同步的强化学习
IF 9.4 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-24 DOI: 10.1109/TCYB.2024.3440333
Jinna Li;Lin Yuan;Weiran Cheng;Tianyou Chai;Frank L. Lewis
This article dedicates to investigating a methodology for enhancing adaptability to environmental changes of reinforcement learning (RL) techniques with data efficiency, by which a joint control protocol is learned using only data for multiagent systems (MASs). Thus, all followers are able to synchronize themselves with the leader and minimize their individual performance. To this end, an optimal synchronization problem of heterogeneous MASs is first formulated, and then an arbitration RL mechanism is developed for well addressing key challenges faced by the current RL techniques, that is, insufficient data and environmental changes. In the developed mechanism, an improved Q-function with an arbitration factor is designed for accommodating the fact that control protocols tend to be made by historic experiences and instinctive decision-making, such that the degree of control over agents’ behaviors can be adaptively allocated by on-policy and off-policy RL techniques for the optimal multiagent synchronization problem. Finally, an arbitration RL algorithm with critic-only neural networks is proposed, and theoretical analysis and proofs of synchronization and performance optimality are provided. Simulation results verify the effectiveness of the proposed method.
本文致力于研究一种利用数据效率提高强化学习(RL)技术对环境变化适应性的方法,通过这种方法,多代理系统(MAS)只需使用数据就能学习联合控制协议。这样,所有追随者都能与领导者同步,并最大限度地降低各自的绩效。为此,首先提出了异构 MAS 的最优同步问题,然后开发了一种仲裁 RL 机制,以很好地解决当前 RL 技术面临的主要挑战,即数据不足和环境变化。在所开发的机制中,设计了一种带有仲裁因子的改进 Q 函数,以适应控制协议往往由历史经验和本能决策所决定的事实,这样就可以通过最优多代理同步问题的策略上和策略下 RL 技术自适应地分配代理行为的控制程度。最后,提出了一种使用纯批判神经网络的仲裁 RL 算法,并提供了同步和性能优化的理论分析和证明。仿真结果验证了所提方法的有效性。
{"title":"Reinforcement Learning for Synchronization of Heterogeneous Multiagent Systems by Improved Q-Functions","authors":"Jinna Li;Lin Yuan;Weiran Cheng;Tianyou Chai;Frank L. Lewis","doi":"10.1109/TCYB.2024.3440333","DOIUrl":"10.1109/TCYB.2024.3440333","url":null,"abstract":"This article dedicates to investigating a methodology for enhancing adaptability to environmental changes of reinforcement learning (RL) techniques with data efficiency, by which a joint control protocol is learned using only data for multiagent systems (MASs). Thus, all followers are able to synchronize themselves with the leader and minimize their individual performance. To this end, an optimal synchronization problem of heterogeneous MASs is first formulated, and then an arbitration RL mechanism is developed for well addressing key challenges faced by the current RL techniques, that is, insufficient data and environmental changes. In the developed mechanism, an improved Q-function with an arbitration factor is designed for accommodating the fact that control protocols tend to be made by historic experiences and instinctive decision-making, such that the degree of control over agents’ behaviors can be adaptively allocated by on-policy and off-policy RL techniques for the optimal multiagent synchronization problem. Finally, an arbitration RL algorithm with critic-only neural networks is proposed, and theoretical analysis and proofs of synchronization and performance optimality are provided. Simulation results verify the effectiveness of the proposed method.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"54 11","pages":"6545-6558"},"PeriodicalIF":9.4,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142317311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust Prescribed Performance Control of Nonlinear Systems With Unknown Odd Powers 具有未知奇异幂的非线性系统的鲁棒性规定性能控制
IF 9.4 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-24 DOI: 10.1109/TCYB.2024.3453948
Jin-Xi Zhang;Tianyou Chai
This article is concerned with the problem of reference tracking for the lower-triangular nonlinear systems with a chain of odd powers. Contrary to most of the related studies, this work is focused on the case where neither the odd powers nor their bounds are known. This renders the majority of the existing methods for stability analysis and control design for the odd-power systems infeasible. To surmount this challenge, a robust prescribed performance control strategy together with a constraint analysis by contradiction is put forward. Instead of the well-established adding one power integrator technique, a group of barrier functions are employed to combat the tracking error and the intermediate errors. In lieu of the Lyapunov stability theory, a constraint analysis by contradiction is carried out, which discloses the inherent robustness of the control system against the nonparametric uncertainties, the unmatched disturbances and the unknown odd powers. It is guaranteed that the tracking error enters into a preassigned neighborhood of zero after a given time, with a predefined bound on the overshoot. In addition, the proposed control exhibits a striking simplicity. Despite the severe model uncertainties and the recursive control design, no effort needs to be paid for parameter identification, function approximation, disturbance estimation, or derivative calculation. The above theoretical findings are substantiated by the comparative simulation results.
本文关注具有奇数幂链的下三角非线性系统的参考跟踪问题。与大多数相关研究不同的是,本文的研究重点是奇数幂及其边界均未知的情况。这使得大多数现有的奇数幂系统稳定性分析和控制设计方法变得不可行。为了克服这一难题,我们提出了一种稳健的规定性能控制策略,并通过矛盾进行约束分析。该方法采用了一组障碍函数来消除跟踪误差和中间误差,而不是采用行之有效的增加一个功率积分器的技术。通过矛盾约束分析,代替了 Lyapunov 稳定性理论,揭示了控制系统对非参数不确定性、不匹配干扰和未知奇异功率的内在鲁棒性。保证了跟踪误差在给定时间后进入预设的零邻域,并预设了超调边界。此外,所提出的控制还具有惊人的简洁性。尽管存在严重的模型不确定性和递归控制设计,但在参数识别、函数近似、扰动估计或导数计算方面无需付出任何努力。比较仿真结果证实了上述理论结论。
{"title":"Robust Prescribed Performance Control of Nonlinear Systems With Unknown Odd Powers","authors":"Jin-Xi Zhang;Tianyou Chai","doi":"10.1109/TCYB.2024.3453948","DOIUrl":"10.1109/TCYB.2024.3453948","url":null,"abstract":"This article is concerned with the problem of reference tracking for the lower-triangular nonlinear systems with a chain of odd powers. Contrary to most of the related studies, this work is focused on the case where neither the odd powers nor their bounds are known. This renders the majority of the existing methods for stability analysis and control design for the odd-power systems infeasible. To surmount this challenge, a robust prescribed performance control strategy together with a constraint analysis by contradiction is put forward. Instead of the well-established adding one power integrator technique, a group of barrier functions are employed to combat the tracking error and the intermediate errors. In lieu of the Lyapunov stability theory, a constraint analysis by contradiction is carried out, which discloses the inherent robustness of the control system against the nonparametric uncertainties, the unmatched disturbances and the unknown odd powers. It is guaranteed that the tracking error enters into a preassigned neighborhood of zero after a given time, with a predefined bound on the overshoot. In addition, the proposed control exhibits a striking simplicity. Despite the severe model uncertainties and the recursive control design, no effort needs to be paid for parameter identification, function approximation, disturbance estimation, or derivative calculation. The above theoretical findings are substantiated by the comparative simulation results.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"54 11","pages":"6485-6494"},"PeriodicalIF":9.4,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142317309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fully Actuated System Approach for Control: An Overview 全驱动系统控制方法:概述
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-24 DOI: 10.1109/tcyb.2024.3457584
Guang-Ren Duan
{"title":"Fully Actuated System Approach for Control: An Overview","authors":"Guang-Ren Duan","doi":"10.1109/tcyb.2024.3457584","DOIUrl":"https://doi.org/10.1109/tcyb.2024.3457584","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"9 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142317310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stealthy Measurement-Aided Pole-Dynamics Attacks With Nominal Models 利用标称模型进行隐蔽的测量辅助极动力攻击
IF 9.4 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-23 DOI: 10.1109/TCYB.2024.3456084
Dajun Du;Changda Zhang;Chen Peng;Minrui Fei;Huiyu Zhou
When traditional pole-dynamics attacks (TPDAs) are implemented with nominal models, model mismatch between exact and nominal models often affects their stealthiness, or even makes the stealthiness lost. To solve this problem, this article presents a novel stealthy measurement-aided pole-dynamics attacks (MAPDAs) method with model mismatch. First, the limitations of TPDAs using exact models are revealed. Second, to handle the limitations, the proposed MAPDAs method is designed by using an adaptive control strategy, which can keep the stealthiness. Moreover, it is easier to implement as only the measurements are needed in comparison with the existing methods requiring both measurements and control inputs. Third, the performance of the proposed MAPDAs method is explored using convergence of multivariate measurements, and MAPDAs with model mismatch have the same stealthiness and similar destructiveness as TPDAs. Finally, experimental results from a networked inverted pendulum system confirm the feasibility and effectiveness of the proposed method.
当传统的极点动力学攻击(TPDAs)使用标称模型实现时,精确模型和标称模型之间的模型不匹配往往会影响其隐蔽性,甚至使隐蔽性丧失。为了解决这个问题,本文提出了一种新型的隐身测量辅助极点动力学攻击(MAPDAs)方法。首先,揭示了使用精确模型的 TPDAs 的局限性。其次,针对这些局限性,提出的 MAPDAs 方法采用了自适应控制策略,从而保持了隐蔽性。此外,与需要测量和控制输入的现有方法相比,该方法只需要测量,因此更易于实施。第三,利用多变量测量的收敛性探讨了所提出的 MAPDAs 方法的性能,发现模型不匹配的 MAPDAs 具有与 TPDAs 相同的隐蔽性和相似的破坏性。最后,一个联网倒立摆系统的实验结果证实了所提方法的可行性和有效性。
{"title":"Stealthy Measurement-Aided Pole-Dynamics Attacks With Nominal Models","authors":"Dajun Du;Changda Zhang;Chen Peng;Minrui Fei;Huiyu Zhou","doi":"10.1109/TCYB.2024.3456084","DOIUrl":"10.1109/TCYB.2024.3456084","url":null,"abstract":"When traditional pole-dynamics attacks (TPDAs) are implemented with nominal models, model mismatch between exact and nominal models often affects their stealthiness, or even makes the stealthiness lost. To solve this problem, this article presents a novel stealthy measurement-aided pole-dynamics attacks (MAPDAs) method with model mismatch. First, the limitations of TPDAs using exact models are revealed. Second, to handle the limitations, the proposed MAPDAs method is designed by using an adaptive control strategy, which can keep the stealthiness. Moreover, it is easier to implement as only the measurements are needed in comparison with the existing methods requiring both measurements and control inputs. Third, the performance of the proposed MAPDAs method is explored using convergence of multivariate measurements, and MAPDAs with model mismatch have the same stealthiness and similar destructiveness as TPDAs. Finally, experimental results from a networked inverted pendulum system confirm the feasibility and effectiveness of the proposed method.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"54 11","pages":"6653-6666"},"PeriodicalIF":9.4,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142307655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fuzzy Adaptive Event-Triggered Consensus Control for Nonlinear Multiagent Systems With Output Constraints and DoS Attacks 具有输出约束和 DoS 攻击的非线性多代理系统的模糊自适应事件触发共识控制
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-17 DOI: 10.1109/tcyb.2024.3456821
Yongming Li, Ge Lu, Kewen Li
{"title":"Fuzzy Adaptive Event-Triggered Consensus Control for Nonlinear Multiagent Systems With Output Constraints and DoS Attacks","authors":"Yongming Li, Ge Lu, Kewen Li","doi":"10.1109/tcyb.2024.3456821","DOIUrl":"https://doi.org/10.1109/tcyb.2024.3456821","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"4 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142236420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Indefinite Robust Linear Quadratic Optimal Regulator for Discrete-Time Uncertain Singular Markov Jump Systems 离散时间不确定奇异马尔可夫跃迁系统的无限鲁棒线性二次优化调节器
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-17 DOI: 10.1109/tcyb.2024.3454530
Yichun Li, Wei Xing Zheng, Zheng-Guang Wu, Yang Tang, Shuping Ma
{"title":"Indefinite Robust Linear Quadratic Optimal Regulator for Discrete-Time Uncertain Singular Markov Jump Systems","authors":"Yichun Li, Wei Xing Zheng, Zheng-Guang Wu, Yang Tang, Shuping Ma","doi":"10.1109/tcyb.2024.3454530","DOIUrl":"https://doi.org/10.1109/tcyb.2024.3454530","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"23 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142236389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Surface Intelligent Robust Control of Nonlinear Systems With Fixed-Time Sliding-Mode Observer 带固定时间滑模观测器的非线性系统动态表面智能鲁棒控制
IF 9.4 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-17 DOI: 10.1109/TCYB.2024.3456089
Hong-Gui Han;Cheng-Cheng Feng;Hao-Yuan Sun;Jun-Fei Qiao
The high tracking control precision and fast finite-time convergence for nonlinear systems is a significant challenge due to complex nonlinearity and unknown disturbances. To address this challenge, a dynamic surface intelligent robust control strategy with fixed-time sliding-mode observer (DSIRC-SMO) is proposed to improve the tracking control performance in a finite time. First, adaptive fuzzy neural network based on a predictor (P-AFNN) is designed to imitate the complex nonlinearity. In particular, the weight adaptive law is formulated by utilizing the prediction error information, which improves the accuracy of approximating the nonlinear system. Second, the fixed-time sliding-mode observer (SMO) is integrated into the dynamic surface control technique to deal with unknown disturbances and modeling errors in a fixed time. This integration allows for timely updates the dynamic surface using observation information, thereby enhancing the system’s anti-interference capability. Then, the fixed-time convergence of SMO is proven. Third, the proposed DSIRC-SMO can be effectively implemented and the finite-time convergence of DSIRC-SMO is proven in detail based on the fixed-time convergence of SMO. Finally, numerical simulation and actual wastewater treatment processes simulation are conducted to validate the effectiveness of DSIRC-SMO.
由于复杂的非线性和未知的干扰,非线性系统的高跟踪控制精度和快速有限时间收敛是一个重大挑战。为应对这一挑战,本文提出了一种带有固定时间滑模观测器(DSIRC-SMO)的动态表面智能鲁棒控制策略,以提高有限时间内的跟踪控制性能。首先,设计了基于预测器的自适应模糊神经网络(P-AFNN)来模仿复杂的非线性。其中,利用预测误差信息制定了权重自适应法则,从而提高了近似非线性系统的精度。其次,将固定时间滑动模式观测器(SMO)集成到动态表面控制技术中,以便在固定时间内处理未知干扰和建模误差。这种集成可以利用观测信息及时更新动态曲面,从而增强系统的抗干扰能力。然后,证明了 SMO 的固定时间收敛性。第三,基于 SMO 的固定时间收敛性,可以有效地实现所提出的 DSIRC-SMO,并详细证明了 DSIRC-SMO 的有限时间收敛性。最后,通过数值模拟和实际污水处理过程模拟来验证 DSIRC-SMO 的有效性。
{"title":"Dynamic Surface Intelligent Robust Control of Nonlinear Systems With Fixed-Time Sliding-Mode Observer","authors":"Hong-Gui Han;Cheng-Cheng Feng;Hao-Yuan Sun;Jun-Fei Qiao","doi":"10.1109/TCYB.2024.3456089","DOIUrl":"10.1109/TCYB.2024.3456089","url":null,"abstract":"The high tracking control precision and fast finite-time convergence for nonlinear systems is a significant challenge due to complex nonlinearity and unknown disturbances. To address this challenge, a dynamic surface intelligent robust control strategy with fixed-time sliding-mode observer (DSIRC-SMO) is proposed to improve the tracking control performance in a finite time. First, adaptive fuzzy neural network based on a predictor (P-AFNN) is designed to imitate the complex nonlinearity. In particular, the weight adaptive law is formulated by utilizing the prediction error information, which improves the accuracy of approximating the nonlinear system. Second, the fixed-time sliding-mode observer (SMO) is integrated into the dynamic surface control technique to deal with unknown disturbances and modeling errors in a fixed time. This integration allows for timely updates the dynamic surface using observation information, thereby enhancing the system’s anti-interference capability. Then, the fixed-time convergence of SMO is proven. Third, the proposed DSIRC-SMO can be effectively implemented and the finite-time convergence of DSIRC-SMO is proven in detail based on the fixed-time convergence of SMO. Finally, numerical simulation and actual wastewater treatment processes simulation are conducted to validate the effectiveness of DSIRC-SMO.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"54 11","pages":"6767-6779"},"PeriodicalIF":9.4,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142236379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Minimum-Cost State-Flipped Control for Reachability of Boolean Control Networks Using Reinforcement Learning 利用强化学习实现布尔控制网络可达性的最小成本状态翻转控制
IF 9.4 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-17 DOI: 10.1109/TCYB.2024.3454253
Jingjie Ni;Yang Tang;Fangfei Li
This article proposes model-free reinforcement learning methods for minimum-cost state-flipped control in Boolean control networks (BCNs). We tackle two questions: 1) finding the flipping kernel, namely, the flip set with the smallest cardinality ensuring reachability and 2) deriving optimal policies to minimize the number of flipping actions for reachability based on the obtained flipping kernel. For Question 1), Q-learning’s capability in determining reachability is demonstrated. To expedite convergence, we incorporate two improvements: 1) demonstrating that previously reachable states remain reachable after adding elements to the flip set, followed by employing transfer learning and 2) initiating each episode with special initial states whose reachability to the target state set are currently unknown. For Question 2), it is challenging to encapsulate the objective of simultaneously reducing control costs and satisfying terminal constraints exclusively through the reward function employed in the Q-learning framework. To bridge the gap, we propose a BCN-characteristics-based reward scheme and prove its optimality. Questions 1) and 2) with large-scale BCNs are addressed by employing small memory Q-learning, which reduces memory usage by only recording visited action-values. An upper bound on memory usage is provided to assess the algorithm’s feasibility. To expedite convergence for Question 2) in large-scale BCNs, we introduce adaptive variable rewards based on the known maximum steps needed to reach the target state set without cycles. Finally, the effectiveness of the proposed methods is validated on both small- and large-scale BCNs.
本文提出了布尔控制网络(BCN)中最小成本状态翻转控制的无模型强化学习方法。我们解决了两个问题:1) 找到翻转内核,即确保可达性的最小卡方的翻转集;2) 根据获得的翻转内核推导出最优策略,以最小化可达性的翻转操作次数。对于问题 1),Q-learning 在确定可达性方面的能力得到了证明。为了加快收敛速度,我们做了两点改进:1) 在向翻转集添加元素后,证明之前可到达的状态仍然是可到达的,然后采用迁移学习;以及 2) 以特殊的初始状态启动每一集,这些初始状态与目标状态集的可到达性目前是未知的。对于问题 2),完全通过 Q-learning 框架中使用的奖励函数来囊括同时降低控制成本和满足终端约束的目标具有挑战性。为了弥补这一差距,我们提出了基于 BCN 特征的奖励方案,并证明了其最优性。通过采用小内存 Q-learning,我们解决了大规模 BCN 的问题 1) 和 2),Q-learning 只记录访问过的动作值,从而减少了内存使用量。我们提供了内存使用量的上限,以评估算法的可行性。为了加快问题 2) 在大规模 BCN 中的收敛速度,我们根据达到目标状态集所需的已知最大步骤,引入了自适应变量奖励。最后,我们在小型和大型 BCN 上验证了所提方法的有效性。
{"title":"Minimum-Cost State-Flipped Control for Reachability of Boolean Control Networks Using Reinforcement Learning","authors":"Jingjie Ni;Yang Tang;Fangfei Li","doi":"10.1109/TCYB.2024.3454253","DOIUrl":"10.1109/TCYB.2024.3454253","url":null,"abstract":"This article proposes model-free reinforcement learning methods for minimum-cost state-flipped control in Boolean control networks (BCNs). We tackle two questions: 1) finding the flipping kernel, namely, the flip set with the smallest cardinality ensuring reachability and 2) deriving optimal policies to minimize the number of flipping actions for reachability based on the obtained flipping kernel. For Question 1), Q-learning’s capability in determining reachability is demonstrated. To expedite convergence, we incorporate two improvements: 1) demonstrating that previously reachable states remain reachable after adding elements to the flip set, followed by employing transfer learning and 2) initiating each episode with special initial states whose reachability to the target state set are currently unknown. For Question 2), it is challenging to encapsulate the objective of simultaneously reducing control costs and satisfying terminal constraints exclusively through the reward function employed in the Q-learning framework. To bridge the gap, we propose a BCN-characteristics-based reward scheme and prove its optimality. Questions 1) and 2) with large-scale BCNs are addressed by employing small memory Q-learning, which reduces memory usage by only recording visited action-values. An upper bound on memory usage is provided to assess the algorithm’s feasibility. To expedite convergence for Question 2) in large-scale BCNs, we introduce adaptive variable rewards based on the known maximum steps needed to reach the target state set without cycles. Finally, the effectiveness of the proposed methods is validated on both small- and large-scale BCNs.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"54 11","pages":"7103-7115"},"PeriodicalIF":9.4,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142236380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fully Event-Triggered Practical Leader–Following Consensus of Multiple Euler–Lagrange Systems Over Switching Networks 交换网络上多个欧拉-拉格朗日系统的完全事件触发实用领导者-追随共识
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-17 DOI: 10.1109/tcyb.2024.3454337
Rui Zhang, Jie Huang
{"title":"Fully Event-Triggered Practical Leader–Following Consensus of Multiple Euler–Lagrange Systems Over Switching Networks","authors":"Rui Zhang, Jie Huang","doi":"10.1109/tcyb.2024.3454337","DOIUrl":"https://doi.org/10.1109/tcyb.2024.3454337","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"32 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142236386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Secure Decentralized Event-Triggered Load Frequency Control Design for Multiarea Power Systems Under Multiple DoS Attacks 多重 DoS 攻击下多区域电力系统的安全分散式事件触发负载频率控制设计
IF 9.4 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-17 DOI: 10.1109/TCYB.2024.3454053
Kun Xu;Yugang Niu;James Lam
This article deals with the load frequency control problem of multiarea power systems subject to multiple intermittent denial-of-service (DoS) attacks that can interrupt the data transmission of each area independently. A decentralized event-triggering (ET) scheme under periodic sampling is proposed to reduce the transmission burden. Then, a decentralized ET-based controller is designed by tuning several parameters. Such control scheme can reduce the computational complexity while achieving privacy preserving. Moreover, sufficient conditions are derived for ensuring the input-to-state stability of multiarea power systems and an optimization solution method via particle swarm algorithm is provided. Finally, a three-area power system is employed to verify the effectiveness of the proposed scheme.
本文讨论了多区域电力系统的负载频率控制问题,该系统受到多种间歇性拒绝服务(DoS)攻击,这些攻击会独立中断每个区域的数据传输。本文提出了一种周期性采样下的分散式事件触发(ET)方案,以减轻传输负担。然后,通过调整几个参数,设计了一种基于 ET 的分散控制器。这种控制方案既能降低计算复杂度,又能实现隐私保护。此外,还推导出了确保多区域电力系统输入到状态稳定性的充分条件,并提供了一种通过粒子群算法的优化求解方法。最后,采用了一个三区电力系统来验证所提方案的有效性。
{"title":"Secure Decentralized Event-Triggered Load Frequency Control Design for Multiarea Power Systems Under Multiple DoS Attacks","authors":"Kun Xu;Yugang Niu;James Lam","doi":"10.1109/TCYB.2024.3454053","DOIUrl":"10.1109/TCYB.2024.3454053","url":null,"abstract":"This article deals with the load frequency control problem of multiarea power systems subject to multiple intermittent denial-of-service (DoS) attacks that can interrupt the data transmission of each area independently. A decentralized event-triggering (ET) scheme under periodic sampling is proposed to reduce the transmission burden. Then, a decentralized ET-based controller is designed by tuning several parameters. Such control scheme can reduce the computational complexity while achieving privacy preserving. Moreover, sufficient conditions are derived for ensuring the input-to-state stability of multiarea power systems and an optimization solution method via particle swarm algorithm is provided. Finally, a three-area power system is employed to verify the effectiveness of the proposed scheme.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"54 11","pages":"6423-6435"},"PeriodicalIF":9.4,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142236382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Cybernetics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1