首页 > 最新文献

IEEE Transactions on Cybernetics最新文献

英文 中文
Inverse Dynamic Games With Process Noise and Unknown Target States: A Linear Estimation Approach 具有过程噪声和未知目标状态的逆动态对策:一种线性估计方法
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-03 DOI: 10.1109/tcyb.2026.3656420
Yao Li, Chengpu Yu, Renshuo Cheng, Fang Deng, Jie Chen
{"title":"Inverse Dynamic Games With Process Noise and Unknown Target States: A Linear Estimation Approach","authors":"Yao Li, Chengpu Yu, Renshuo Cheng, Fang Deng, Jie Chen","doi":"10.1109/tcyb.2026.3656420","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3656420","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"100 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146110399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Cybernetics Publication Information IEEE控制论学报出版信息
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-03 DOI: 10.1109/tcyb.2026.3652051
{"title":"IEEE Transactions on Cybernetics Publication Information","authors":"","doi":"10.1109/tcyb.2026.3652051","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3652051","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"10 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146110400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tensor-Based Ant Colony Optimization for Set Meal Design in Online-to-Offline Restaurants 基于张量的蚁群优化线上到线下餐厅套餐设计
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-02 DOI: 10.1109/tcyb.2026.3656946
Bing Sun, Wei-Jie Yu, Xiao-Fang Liu, Jinghui Zhong, Jian-Yu Li, Zhi-Hui Zhan, Sam Kwong, Jun Zhang
{"title":"Tensor-Based Ant Colony Optimization for Set Meal Design in Online-to-Offline Restaurants","authors":"Bing Sun, Wei-Jie Yu, Xiao-Fang Liu, Jinghui Zhong, Jian-Yu Li, Zhi-Hui Zhan, Sam Kwong, Jun Zhang","doi":"10.1109/tcyb.2026.3656946","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3656946","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"1 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146101330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ADR-DMOEA: A Dynamic Multiobjective Optimization Evolutionary Algorithm Based on Adaptive Dynamic Response Strategy. 基于自适应动态响应策略的动态多目标优化进化算法。
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-28 DOI: 10.1109/tcyb.2026.3652642
Yuying Wang,Ping Zhou,Shengxiang Yang,Tianyou Chai
Optimization problems in real-world applications often involve dynamic environmental changes, requiring algorithms to adapt quickly, track optimal solutions, and maintain efficiency. Existing dynamic multiobjective optimization evolutionary algorithms (DMOEAs) typically rely on fixed or limited dynamic response mechanisms, which are often insufficient to handle complex and varied dynamic environments. To overcome these limitations, this article proposes an adaptive dynamic response-based DMOEA (ADR-DMOEA), which employs a subpopulation-level adaptive mechanism to coordinate diversity-driven, prediction-driven, and memory-driven strategies. The strategy weights are dynamically adjusted according to the static optimization distance of each subpopulation, ensuring that appropriate strategies are adaptively deployed in different environments. This design overcomes the inefficiency of fixed assignments and the instability of individual-level perturbations, enabling coordinated and stable evolution. Extensive experiments on DF benchmark functions and a blast furnace (BF) ironmaking case study demonstrate that ADR-DMOEA achieves superior convergence, diversity, and robustness compared to state-of-the-art algorithms, effectively supporting real-world decision-making under dynamic conditions.
现实应用中的优化问题通常涉及动态环境变化,要求算法快速适应、跟踪最优解决方案并保持效率。现有的动态多目标优化进化算法(dmoea)通常依赖于固定或有限的动态响应机制,往往不足以处理复杂多变的动态环境。为了克服这些限制,本文提出了一种基于自适应动态响应的DMOEA (ADR-DMOEA),它采用亚种群水平的自适应机制来协调多样性驱动、预测驱动和记忆驱动的策略。根据每个子种群的静态优化距离动态调整策略权重,确保在不同环境下自适应部署合适的策略。这种设计克服了固定分配的低效率和个体扰动的不稳定性,实现了协调和稳定的进化。在DF基准函数和高炉炼铁案例研究上的大量实验表明,与最先进的算法相比,ADR-DMOEA具有更好的收敛性、多样性和鲁棒性,有效地支持了动态条件下的现实决策。
{"title":"ADR-DMOEA: A Dynamic Multiobjective Optimization Evolutionary Algorithm Based on Adaptive Dynamic Response Strategy.","authors":"Yuying Wang,Ping Zhou,Shengxiang Yang,Tianyou Chai","doi":"10.1109/tcyb.2026.3652642","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3652642","url":null,"abstract":"Optimization problems in real-world applications often involve dynamic environmental changes, requiring algorithms to adapt quickly, track optimal solutions, and maintain efficiency. Existing dynamic multiobjective optimization evolutionary algorithms (DMOEAs) typically rely on fixed or limited dynamic response mechanisms, which are often insufficient to handle complex and varied dynamic environments. To overcome these limitations, this article proposes an adaptive dynamic response-based DMOEA (ADR-DMOEA), which employs a subpopulation-level adaptive mechanism to coordinate diversity-driven, prediction-driven, and memory-driven strategies. The strategy weights are dynamically adjusted according to the static optimization distance of each subpopulation, ensuring that appropriate strategies are adaptively deployed in different environments. This design overcomes the inefficiency of fixed assignments and the instability of individual-level perturbations, enabling coordinated and stable evolution. Extensive experiments on DF benchmark functions and a blast furnace (BF) ironmaking case study demonstrate that ADR-DMOEA achieves superior convergence, diversity, and robustness compared to state-of-the-art algorithms, effectively supporting real-world decision-making under dynamic conditions.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146069935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-Evolution of Hybrid Data-Physics Equipment Digital Twin Using Meta Learning and Continual Learning. 基于元学习和持续学习的混合数据-物理设备数字孪生的自进化
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-28 DOI: 10.1109/tcyb.2026.3651627
Kunyu Wang,Lin Zhang,Zhen Chen,Hongbo Cheng,Han Lu,Wentong Cai,Qingsha S Cheng,M Jamal Deen
This article introduces a novel hybrid method to enable the self-evolution of equipment digital twins (DTs), allowing them to continuously and accurately mirror their physical counterparts. Self-evolution is the process by which a DT autonomously updates its models using real-time sensor data, adapting to dynamic real-world behavior. To enhance this process, we propose a data-physics driven approach that synergistically integrates meta-learning and continual learning. Our method begins by designing an extended residual model using a Koopman autoencoder (KAE) neural network. This component bridges the gap between an imperfect analytical physics model and actual equipment behavior. Next, we employ the Reptile meta-learning algorithm to train offline a versatile foundation model on historical data, endowing it with strong adaptability for rapid learning from new information. A key innovation is a periodic event-triggered mechanism, which monitors the DT's simulation accuracy against a fixed time window. When a performance discrepancy is detected, it automatically triggers a self-evolution cycle. The foundation model is then updated through a fine-tuning strategy based on continual learning with random reinitialization. This fusion of offline meta-learning and online continual learning allows the DT to quickly adapt to new, unseen scenarios, ensuring it reflects the physical equipment's state in real-time. We validate the effectiveness and improved performance of our proposed framework through a comprehensive robot simulation case study.
本文介绍了一种新的混合方法来实现设备数字孪生(dt)的自我进化,使它们能够连续准确地反映其物理对偶。自进化是DT使用实时传感器数据自主更新模型的过程,以适应动态的现实世界行为。为了加强这一过程,我们提出了一种数据物理驱动的方法,该方法协同集成了元学习和持续学习。我们的方法首先使用Koopman自编码器(KAE)神经网络设计一个扩展残差模型。该组件弥合了不完美的分析物理模型和实际设备行为之间的差距。接下来,我们采用Reptile元学习算法在历史数据上离线训练一个通用的基础模型,使其具有快速学习新信息的强适应性。一个关键的创新是周期性事件触发机制,该机制可以根据固定的时间窗口监控DT的模拟精度。当检测到性能差异时,它会自动触发一个自我进化周期。然后,通过基于随机重新初始化的持续学习的微调策略更新基础模型。这种离线元学习和在线持续学习的融合使DT能够快速适应新的、看不见的场景,确保它实时反映物理设备的状态。我们通过一个全面的机器人仿真案例研究验证了我们提出的框架的有效性和改进的性能。
{"title":"Self-Evolution of Hybrid Data-Physics Equipment Digital Twin Using Meta Learning and Continual Learning.","authors":"Kunyu Wang,Lin Zhang,Zhen Chen,Hongbo Cheng,Han Lu,Wentong Cai,Qingsha S Cheng,M Jamal Deen","doi":"10.1109/tcyb.2026.3651627","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3651627","url":null,"abstract":"This article introduces a novel hybrid method to enable the self-evolution of equipment digital twins (DTs), allowing them to continuously and accurately mirror their physical counterparts. Self-evolution is the process by which a DT autonomously updates its models using real-time sensor data, adapting to dynamic real-world behavior. To enhance this process, we propose a data-physics driven approach that synergistically integrates meta-learning and continual learning. Our method begins by designing an extended residual model using a Koopman autoencoder (KAE) neural network. This component bridges the gap between an imperfect analytical physics model and actual equipment behavior. Next, we employ the Reptile meta-learning algorithm to train offline a versatile foundation model on historical data, endowing it with strong adaptability for rapid learning from new information. A key innovation is a periodic event-triggered mechanism, which monitors the DT's simulation accuracy against a fixed time window. When a performance discrepancy is detected, it automatically triggers a self-evolution cycle. The foundation model is then updated through a fine-tuning strategy based on continual learning with random reinitialization. This fusion of offline meta-learning and online continual learning allows the DT to quickly adapt to new, unseen scenarios, ensuring it reflects the physical equipment's state in real-time. We validate the effectiveness and improved performance of our proposed framework through a comprehensive robot simulation case study.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"272 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146069937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mode Cluster-Based Event-Triggered Control for Stochastic Markovian Jump Systems Under Denial-of-Service Attack. 拒绝服务攻击下随机马尔可夫跳跃系统的模式聚类事件触发控制。
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-28 DOI: 10.1109/tcyb.2026.3652011
Siyong Song,Yingchun Wang,Jiayue Sun,Yunfei Mu
This article investigates the mode cluster-based event-triggered control (MCETC) of stochastic Markovian jump systems (SMJSs) under denial-of-service (DoS) attack. First, a novel MCETC framework is designed by considering the interplay among subsystems, DoS attacks, and the event-triggered mechanism (ETM). In this framework, the controller mode is reconstructed, and the number of controller modes is reduced by reclustering the system modes. It significantly reduces the conservatism of the system compared to existing mode-dependent/-independent controllers. Second, a switching ETM is designed for scenarios with and without DoS attack activation, which can effectively save network bandwidth resources and reduce computational load. Third, a multi-Lyapunov function based on DoS attacks is proposed to ensure the stability of the closed-loop SMJSs. Then, the controller gains and event-triggered parameters are jointly solved via the linear matrix inequality (LMI) technique. Moreover, the maximum allowable sampling interval (MASI) is given such that the controller can restore the control signals as soon as a DoS attack ends, which enables faster stabilization of the closed-loop system. Finally, a numerical example is used to verify the effectiveness and superiority of the proposed method.
本文研究了随机马尔可夫跳变系统在拒绝服务攻击下基于模式集群的事件触发控制。首先,从子系统之间的相互作用、DoS攻击和事件触发机制(ETM)三个方面,设计了一种新的MCETC框架。在该框架中,对控制器模式进行重构,并通过对系统模式进行重新聚类来减少控制器模式的数量。与现有的模式依赖/独立控制器相比,它显著降低了系统的保守性。其次,针对DoS攻击激活和非DoS攻击激活两种情况设计了切换ETM,有效地节省了网络带宽资源,降低了计算负荷。第三,提出了一种基于DoS攻击的多重lyapunov函数,以保证闭环smjs的稳定性。然后,通过线性矩阵不等式(LMI)技术联合求解控制器增益和事件触发参数。此外,给出了最大允许采样间隔(MASI),使得控制器可以在DoS攻击结束后立即恢复控制信号,从而使闭环系统能够更快地稳定。最后通过一个算例验证了所提方法的有效性和优越性。
{"title":"Mode Cluster-Based Event-Triggered Control for Stochastic Markovian Jump Systems Under Denial-of-Service Attack.","authors":"Siyong Song,Yingchun Wang,Jiayue Sun,Yunfei Mu","doi":"10.1109/tcyb.2026.3652011","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3652011","url":null,"abstract":"This article investigates the mode cluster-based event-triggered control (MCETC) of stochastic Markovian jump systems (SMJSs) under denial-of-service (DoS) attack. First, a novel MCETC framework is designed by considering the interplay among subsystems, DoS attacks, and the event-triggered mechanism (ETM). In this framework, the controller mode is reconstructed, and the number of controller modes is reduced by reclustering the system modes. It significantly reduces the conservatism of the system compared to existing mode-dependent/-independent controllers. Second, a switching ETM is designed for scenarios with and without DoS attack activation, which can effectively save network bandwidth resources and reduce computational load. Third, a multi-Lyapunov function based on DoS attacks is proposed to ensure the stability of the closed-loop SMJSs. Then, the controller gains and event-triggered parameters are jointly solved via the linear matrix inequality (LMI) technique. Moreover, the maximum allowable sampling interval (MASI) is given such that the controller can restore the control signals as soon as a DoS attack ends, which enables faster stabilization of the closed-loop system. Finally, a numerical example is used to verify the effectiveness and superiority of the proposed method.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"42 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146069949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse Identification of Nonlinear Dynamics With Library Optimization Mechanism: Recursive Long-Term Prediction Perspective 基于库优化机制的非线性动力学稀疏识别:递归长期预测视角
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-27 DOI: 10.1109/tcyb.2026.3652850
Ansei Yonezawa, Heisei Yonezawa, Shuichi Yahagi, Itsuro Kajiwara, Shinya Kijimoto, Hikaru Taniuchi, Kentaro Murakami
{"title":"Sparse Identification of Nonlinear Dynamics With Library Optimization Mechanism: Recursive Long-Term Prediction Perspective","authors":"Ansei Yonezawa, Heisei Yonezawa, Shuichi Yahagi, Itsuro Kajiwara, Shinya Kijimoto, Hikaru Taniuchi, Kentaro Murakami","doi":"10.1109/tcyb.2026.3652850","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3652850","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"8 1","pages":"1-14"},"PeriodicalIF":11.8,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146056303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Blockchain-Assisted Intelligent Resilient Tracking Control of Networked Systems. 区块链辅助的网络系统智能弹性跟踪控制。
IF 10.5 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-26 DOI: 10.1109/TCYB.2025.3646492
Yi Yu, Guo-Ping Liu, Zhong-Hua Pang, Jian Sun, Rongni Yang

With the increasingly integrated nature of networked control systems (NCSs), security has become a challenging issue for their widespread deployment. Although resilient control methods against various attacks have been reported, the analysis and design of defense mechanisms for NCSs still require fresh efforts. To this end, this article is concerned with the security control of a class of NCSs vulnerable to smart false data injection (FDI) attacks. Specifically, the scenario of output tracking of NCSs is considered, where the communication between sensors and controllers, as well as between controllers and actuators, is compromised by sophisticated malicious adversaries. To enhance security, peer-to-peer (P2P) networks with blockchain technologies are utilized instead of traditional communication patterns to transmit measurement and control signals. Unlike previous work, this work carefully designs an optimal blockchain consensus policy by perceiving the performance of NCSs and develops a resilient dynamic output tracking controller based on this policy. The formulation of the consensus policy is derived from a game-theoretic framework that models the interaction between the blockchain and the malicious adversary, enabling deep integration of blockchain technology with NCSs. With the proposed approach, the adverse effects of malicious FDI attacks can be greatly mitigated by balancing energy consumption and tracking performance. Finally, the applicability of the proposed security control strategy is verified in a real-world power system.

随着网络控制系统(NCSs)的集成化程度越来越高,安全性已成为其广泛部署的一个具有挑战性的问题。虽然针对各种攻击的弹性控制方法已经被报道,但对ncs防御机制的分析和设计仍然需要新的努力。为此,本文关注一类易受智能虚假数据注入(FDI)攻击的ncs的安全控制。具体来说,考虑了ncs的输出跟踪场景,其中传感器和控制器之间以及控制器和执行器之间的通信被复杂的恶意对手破坏。为了提高安全性,采用区块链技术的点对点(P2P)网络代替传统的通信方式来传输测控信号。与之前的工作不同,这项工作通过感知ncs的性能,仔细设计了一个最优区块链共识策略,并基于该策略开发了一个弹性动态输出跟踪控制器。共识策略的制定源自博弈论框架,该框架模拟了区块链与恶意对手之间的交互,从而实现了区块链技术与NCSs的深度集成。利用所提出的方法,通过平衡能耗和跟踪性能,可以大大减轻恶意FDI攻击的不利影响。最后,在实际电力系统中验证了所提安全控制策略的适用性。
{"title":"Blockchain-Assisted Intelligent Resilient Tracking Control of Networked Systems.","authors":"Yi Yu, Guo-Ping Liu, Zhong-Hua Pang, Jian Sun, Rongni Yang","doi":"10.1109/TCYB.2025.3646492","DOIUrl":"https://doi.org/10.1109/TCYB.2025.3646492","url":null,"abstract":"<p><p>With the increasingly integrated nature of networked control systems (NCSs), security has become a challenging issue for their widespread deployment. Although resilient control methods against various attacks have been reported, the analysis and design of defense mechanisms for NCSs still require fresh efforts. To this end, this article is concerned with the security control of a class of NCSs vulnerable to smart false data injection (FDI) attacks. Specifically, the scenario of output tracking of NCSs is considered, where the communication between sensors and controllers, as well as between controllers and actuators, is compromised by sophisticated malicious adversaries. To enhance security, peer-to-peer (P2P) networks with blockchain technologies are utilized instead of traditional communication patterns to transmit measurement and control signals. Unlike previous work, this work carefully designs an optimal blockchain consensus policy by perceiving the performance of NCSs and develops a resilient dynamic output tracking controller based on this policy. The formulation of the consensus policy is derived from a game-theoretic framework that models the interaction between the blockchain and the malicious adversary, enabling deep integration of blockchain technology with NCSs. With the proposed approach, the adverse effects of malicious FDI attacks can be greatly mitigated by balancing energy consumption and tracking performance. Finally, the applicability of the proposed security control strategy is verified in a real-world power system.</p>","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"PP ","pages":""},"PeriodicalIF":10.5,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146051779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Output-Feedback Control of Linear Continuous-Time Systems Using Discounted Inverse Reinforcement Learning. 基于折现逆强化学习的线性连续系统输出反馈控制。
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-23 DOI: 10.1109/tcyb.2026.3651519
Han Wu,Qinglei Hu,Jianying Zheng,Xiaodong Shao,Yueyang Liu,Dongyu Li
This article proposes a novel discounted inverse reinforcement learning (DIRL) algorithm for linear quadratic (LQ) control of unknown continuous-time (CT) systems with partially observable states and an unknown discounted value function. Existing DIRL methods predominantly rely on full-state feedback, limiting their applicability to practical scenarios where only input-output data are available. To this end, a state reconstruction method is designed for the system controlled by an expert using the measured desired output. Based on this, a model-free output-feedback (OPFB) DIRL algorithm is presented to iteratively solve the unknown value function and the corresponding optimal OPFB control policy equivalent to the expert control policy. The convergence of the proposed algorithm and the nonuniqueness of solutions are rigorously analyzed. Finally, comprehensive simulations reveal the effectiveness of the proposed algorithm in recovering the expert control policy and its superior computational efficiency compared to state-of-the-art (SOTA) methods.
针对具有部分可观察状态和未知折现值函数的未知连续时间系统的线性二次控制,提出了一种新的折现逆强化学习(DIRL)算法。现有的DIRL方法主要依赖于全状态反馈,限制了它们在只有输入-输出数据可用的实际场景中的适用性。为此,设计了一种由专家利用测量到的期望输出对系统进行状态重构的方法。在此基础上,提出了一种无模型输出反馈(OPFB) DIRL算法,迭代求解未知值函数和相应的最优OPFB控制策略(相当于专家控制策略)。严格分析了算法的收敛性和解的非唯一性。最后,综合仿真表明,该算法在恢复专家控制策略方面的有效性,以及与最先进的(SOTA)方法相比,其优越的计算效率。
{"title":"Output-Feedback Control of Linear Continuous-Time Systems Using Discounted Inverse Reinforcement Learning.","authors":"Han Wu,Qinglei Hu,Jianying Zheng,Xiaodong Shao,Yueyang Liu,Dongyu Li","doi":"10.1109/tcyb.2026.3651519","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3651519","url":null,"abstract":"This article proposes a novel discounted inverse reinforcement learning (DIRL) algorithm for linear quadratic (LQ) control of unknown continuous-time (CT) systems with partially observable states and an unknown discounted value function. Existing DIRL methods predominantly rely on full-state feedback, limiting their applicability to practical scenarios where only input-output data are available. To this end, a state reconstruction method is designed for the system controlled by an expert using the measured desired output. Based on this, a model-free output-feedback (OPFB) DIRL algorithm is presented to iteratively solve the unknown value function and the corresponding optimal OPFB control policy equivalent to the expert control policy. The convergence of the proposed algorithm and the nonuniqueness of solutions are rigorously analyzed. Finally, comprehensive simulations reveal the effectiveness of the proposed algorithm in recovering the expert control policy and its superior computational efficiency compared to state-of-the-art (SOTA) methods.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"42 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146034075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Task Optimization for Fixed-Time Control of Intermittent Human-Robot Interaction With Time-Varying Exponents and Coefficients. 具有时变指数和系数的间歇性人机交互固定时间控制的任务优化。
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-23 DOI: 10.1109/tcyb.2026.3650907
Zhi-Hui Fu,Ming-Feng Ge,Teng-Fei Ding,Zhi-Wei Liu
In this article, we investigate the task optimization for fixed-time control of intermittent human-robot interaction, where a human operator assists the robot intermittently in selecting the most appropriate Pareto solution. First, as for the Lyapunov fixed-time stability criterion inequality with and without the constant term, we all derive the Lyapunov stability conditions with time-varying exponents and coefficients, providing us with more flexibility and freedom to shape the contour of the convergence near the Lyapunov stable equilibrium. We then use them to propose a hierarchical fixed-time event-triggered optimization (HFTEO) algorithm based on human-oriented scheme, where the so-called human-oriented scheme means that the components constituting task information are known only to the human operator, but not to the robot, which is beneficial to ensure the confidentiality and security of the task. Simulation results are given to show the effectiveness of the proposed Lyapunov stability conditions and algorithm.
在本文中,我们研究了间歇性人机交互的固定时间控制的任务优化,其中人类操作员间歇性地帮助机器人选择最合适的帕累托解。首先,对于有常数项和没有常数项的Lyapunov定时稳定性判据不等式,我们都推导出了指数和系数随时间变化的Lyapunov稳定性条件,为我们在Lyapunov稳定平衡点附近的收敛轮廓的形成提供了更大的灵活性和自由度。然后,我们利用它们提出了一种基于以人为本方案的分层固定时间事件触发优化(HFTEO)算法,其中所谓的以人为本方案是指构成任务信息的组件仅由人类操作员知道,而机器人不知道,这有利于确保任务的机密性和安全性。仿真结果表明了所提出的李雅普诺夫稳定性条件和算法的有效性。
{"title":"Task Optimization for Fixed-Time Control of Intermittent Human-Robot Interaction With Time-Varying Exponents and Coefficients.","authors":"Zhi-Hui Fu,Ming-Feng Ge,Teng-Fei Ding,Zhi-Wei Liu","doi":"10.1109/tcyb.2026.3650907","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3650907","url":null,"abstract":"In this article, we investigate the task optimization for fixed-time control of intermittent human-robot interaction, where a human operator assists the robot intermittently in selecting the most appropriate Pareto solution. First, as for the Lyapunov fixed-time stability criterion inequality with and without the constant term, we all derive the Lyapunov stability conditions with time-varying exponents and coefficients, providing us with more flexibility and freedom to shape the contour of the convergence near the Lyapunov stable equilibrium. We then use them to propose a hierarchical fixed-time event-triggered optimization (HFTEO) algorithm based on human-oriented scheme, where the so-called human-oriented scheme means that the components constituting task information are known only to the human operator, but not to the robot, which is beneficial to ensure the confidentiality and security of the task. Simulation results are given to show the effectiveness of the proposed Lyapunov stability conditions and algorithm.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"7 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146034077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Cybernetics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1