Ieee-Caa Journal of Automatica Sinica最新文献_第2页

Letter Leader-Follower Formation Control of Quadrotor UAVs with Stochastic Impulsive Deception Attacks 随机脉冲欺骗攻击下四旋翼无人机的信导从者编队控制

IF 19.2 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2026-03-10 DOI: 10.1109/JAS.2025.125615

Wenhao Song;Chang Liu;Xiuping Han;Xiaodi Li

Dear Editor, This letter presents some control strategies for quadrotor unmanned aerial vehicle (UAV) leader-follower formation model, where the stochastic impulsive deception attacks are fully considered. Based on Lyapunov method, the outer loop and the inner loop controllers of quadrotor UAV are designed, respectively. Moreover, a relationship between continuous control laws, stochastic impulsive sequences, and impulsive intensity is established in this letter. Finally, the simulation results are presented to validate the proposed results.

本文提出了充分考虑随机脉冲欺骗攻击的四旋翼无人机（UAV） leader-follower编队模型的控制策略。基于李雅普诺夫方法，分别设计了四旋翼无人机的外环和内环控制器。建立了连续控制律、随机脉冲序列和脉冲强度之间的关系。最后给出了仿真结果，验证了所提结果的正确性。

引用次数: 0

KIG: A Knowledge Graph-Guided Iterative-Updating Graph Neural Network for Multisensor Time Series Time-Delay Estimation 基于知识图的多传感器时间序列时延估计迭代更新图神经网络

IF 19.2 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2026-03-10 DOI: 10.1109/JAS.2025.125897

Siyuan Xu;Dong Pan;Zhaohui Jiang;Zhiwen Chen;Haoyang Yu;Weihua Gui

Temporal alignment of multisensor time series (MTS) is a critical prerequisite for accurate modeling and optimal control in subsequent data-driven applications. Nevertheless, many approaches frequently neglect to consider the complex interdependencies between different sensors in MTS, and temporal alignment in many methods is typically treated as an isolated task disconnected from the downstream objectives, leading to unsatisfactory performances in follow-up applications. To address these challenges, this paper proposes a novel knowledge graph (KG)-guided iterative-updating graph neural network (GNN) for time-delay estimation (TDE) in MTS. Initially, a domain-specific KG is constructed from domain mechanism knowledge, providing a foundation for GNN's initialization. Next, capitalizing on the inherent structure of the graph topology, a GNN-based TDE method is developed. Then, a customized loss function is constructed, which synthesizes both the performances of downstream tasks and graph-based constraints. Moreover, an innovative algorithm for GNN structure learning and iterative-updating is proposed to renovate the graph structure further. Finally, experimental results across various regression and classification tasks on numerical simulation, public datasets, and the real blast furnace ironmaking dataset demonstrate that the proposed method can achieve accurate temporal alignment of MTS.

多传感器时间序列（MTS）的时序对齐是后续数据驱动应用中精确建模和最优控制的关键前提。然而，许多方法经常忽略考虑MTS中不同传感器之间复杂的相互依赖关系，并且许多方法中的时间对准通常被视为与下游目标断开的孤立任务，导致后续应用中的性能不理想。针对这些问题，本文提出了一种基于知识图（KG）引导的时延估计迭代更新图神经网络（GNN），该网络首先利用领域机制知识构建了特定领域的迭代更新图神经网络，为初始化GNN提供了基础。其次，利用图拓扑的固有结构，开发了一种基于gnn的TDE方法。然后，构建了一个定制的损失函数，该函数综合了下游任务的性能和基于图的约束。此外，提出了一种创新的GNN结构学习和迭代更新算法，进一步更新图的结构。最后，在数值模拟、公开数据集和实际高炉炼铁数据集上进行的各种回归和分类任务的实验结果表明，该方法可以实现MTS的精确时序对准。

{"title":"KIG: A Knowledge Graph-Guided Iterative-Updating Graph Neural Network for Multisensor Time Series Time-Delay Estimation","authors":"Siyuan Xu;Dong Pan;Zhaohui Jiang;Zhiwen Chen;Haoyang Yu;Weihua Gui","doi":"10.1109/JAS.2025.125897","DOIUrl":"https://doi.org/10.1109/JAS.2025.125897","url":null,"abstract":"Temporal alignment of multisensor time series (MTS) is a critical prerequisite for accurate modeling and optimal control in subsequent data-driven applications. Nevertheless, many approaches frequently neglect to consider the complex interdependencies between different sensors in MTS, and temporal alignment in many methods is typically treated as an isolated task disconnected from the downstream objectives, leading to unsatisfactory performances in follow-up applications. To address these challenges, this paper proposes a novel knowledge graph (KG)-guided iterative-updating graph neural network (GNN) for time-delay estimation (TDE) in MTS. Initially, a domain-specific KG is constructed from domain mechanism knowledge, providing a foundation for GNN's initialization. Next, capitalizing on the inherent structure of the graph topology, a GNN-based TDE method is developed. Then, a customized loss function is constructed, which synthesizes both the performances of downstream tasks and graph-based constraints. Moreover, an innovative algorithm for GNN structure learning and iterative-updating is proposed to renovate the graph structure further. Finally, experimental results across various regression and classification tasks on numerical simulation, public datasets, and the real blast furnace ironmaking dataset demonstrate that the proposed method can achieve accurate temporal alignment of MTS.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"13 2","pages":"327-345"},"PeriodicalIF":19.2,"publicationDate":"2026-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147383095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Control-Communication Co-Optimization for Wireless Cloud Robotic System via Multi-Agent Transfer Reinforcement Learning 基于多智能体转移强化学习的无线云机器人控制-通信协同优化

IF 19.2 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2026-03-10 DOI: 10.1109/JAS.2025.125894

Chi Xu;Junyuan Zhang;Haibin Yu

The wireless cloud robotic system (WCRS), which fully integrates sensing, communication, computing, and control capabilities as an intelligent agent, is a promising way to achieve intelligent manufacturing due to easy deployment and flexible expansion. However, the high-precision control of WCRS requires deterministic wireless communication, which is always challenging in the complex and dynamic radio space. This paper employs the reconfigurable intelligent surface (RIS) to establish a novel RIS-assisted WCRS architecture, where the radio channel is controlled to achieve ultra-reliable, low-delay, and low-jitter communication for high-precision closed-loop motion control. However, control and communication are strongly coupled and should be co-optimized. Fully considering the constraints of control input threshold, control delay deadline, beam phase, antenna power, and information distortion, we establish a stability maximization problem to jointly optimize control input compensation, RIS phase shift, and beamforming. Herein, a new jitter-oriented system stability objective with respect to control error and communication jitter is defined and the closed-form expression of control delay deadline is derived based on the Jensen Inequality and Lyapunov-Krasovskii functional. Due to the time-varying and partial observability of the channel and robot states, we model the problem as a partially observable Markov decision process (pOMDP). To solve this complex problem, we propose a multi-agent transfer reinforcement learning algorithm named LSTM-PPO-MATRL, where the LSTM-enhanced proximal policy optimization (PPO) is designed to approximate an optimal solution and the option-guided policy transfer learning is proposed to facilitate the learning process. By centralized training and decentralized execution, LSTM-PPO-MATRL is validated by extensive experiments on MuJoCo tasks for both low-mobility and high-mobility robotic control scenarios. The results demonstrate that LSTM-PPO-MATRL not only realizes high learning efficiency, but also supports low-delay, low-jitter communication for low error control, where 71.9% control accuracy improvement and 68.7% delay jitter reduction are achieved compared to the PPO-MADRL baseline.

无线云机器人系统（WCRS）作为一个智能体，将传感、通信、计算和控制能力充分集成，具有易于部署和灵活扩展的特点，是实现智能制造的一种很有前途的方式。然而，WCRS的高精度控制需要确定性的无线通信，这在复杂动态的无线电空间中一直是一个挑战。本文利用可重构智能表面（RIS）建立了一种新的RIS辅助WCRS架构，通过控制无线信道实现超可靠、低延迟、低抖动通信，实现高精度闭环运动控制。然而，控制和通信是强耦合的，应该共同优化。充分考虑控制输入阈值、控制延迟期限、波束相位、天线功率、信息失真等约束，建立稳定性最大化问题，共同优化控制输入补偿、RIS相移和波束形成。在此基础上，基于Jensen不等式和Lyapunov-Krasovskii泛函，定义了一个考虑控制误差和通信抖动的新的面向抖动的系统稳定性目标，导出了控制延迟期限的封闭表达式。由于通道和机器人状态的时变和部分可观察性，我们将问题建模为部分可观察的马尔可夫决策过程（pOMDP）。为了解决这一复杂问题，我们提出了一种名为lstm -PPO- matl的多智能体迁移强化学习算法，其中lstm增强的近端策略优化（PPO）被设计为近似最优解，而选项引导的策略迁移学习被提出以促进学习过程。通过集中训练和分散执行，lstm - ppo - matl在低机动性和高机动性机器人控制场景下的MuJoCo任务中得到了广泛的实验验证。结果表明，LSTM-PPO-MATRL不仅实现了高学习效率，而且支持低延迟、低抖动通信，实现了低误差控制，与PPO-MADRL基线相比，控制精度提高了71.9%，延迟抖动降低了68.7%。

{"title":"Control-Communication Co-Optimization for Wireless Cloud Robotic System via Multi-Agent Transfer Reinforcement Learning","authors":"Chi Xu;Junyuan Zhang;Haibin Yu","doi":"10.1109/JAS.2025.125894","DOIUrl":"https://doi.org/10.1109/JAS.2025.125894","url":null,"abstract":"The wireless cloud robotic system (WCRS), which fully integrates sensing, communication, computing, and control capabilities as an intelligent agent, is a promising way to achieve intelligent manufacturing due to easy deployment and flexible expansion. However, the high-precision control of WCRS requires deterministic wireless communication, which is always challenging in the complex and dynamic radio space. This paper employs the reconfigurable intelligent surface (RIS) to establish a novel RIS-assisted WCRS architecture, where the radio channel is controlled to achieve ultra-reliable, low-delay, and low-jitter communication for high-precision closed-loop motion control. However, control and communication are strongly coupled and should be co-optimized. Fully considering the constraints of control input threshold, control delay deadline, beam phase, antenna power, and information distortion, we establish a stability maximization problem to jointly optimize control input compensation, RIS phase shift, and beamforming. Herein, a new jitter-oriented system stability objective with respect to control error and communication jitter is defined and the closed-form expression of control delay deadline is derived based on the Jensen Inequality and Lyapunov-Krasovskii functional. Due to the time-varying and partial observability of the channel and robot states, we model the problem as a partially observable Markov decision process (pOMDP). To solve this complex problem, we propose a multi-agent transfer reinforcement learning algorithm named LSTM-PPO-MATRL, where the LSTM-enhanced proximal policy optimization (PPO) is designed to approximate an optimal solution and the option-guided policy transfer learning is proposed to facilitate the learning process. By centralized training and decentralized execution, LSTM-PPO-MATRL is validated by extensive experiments on MuJoCo tasks for both low-mobility and high-mobility robotic control scenarios. The results demonstrate that LSTM-PPO-MATRL not only realizes high learning efficiency, but also supports low-delay, low-jitter communication for low error control, where 71.9% control accuracy improvement and 68.7% delay jitter reduction are achieved compared to the PPO-MADRL baseline.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"13 2","pages":"311-326"},"PeriodicalIF":19.2,"publicationDate":"2026-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147383096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Collaboration Better Than Integration: A Novel Time-Frequency-Assisted Deep Feature Enhancement Mechanism for Few-Shot Transfer Learning in Anomaly Detection 协同优于集成：一种新的时频辅助深度特征增强机制，用于异常检测中的少量迁移学习

IF 19.2 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2026-03-10 DOI: 10.1109/JAS.2025.125702

Wentao Mao;Jianing Wu;Shubin Du;Ke Feng;Zidong Wang

Deep transfer learning has achieved significant success in anomaly detection over the past decade, but data acquisition challenges in practical engineering hinder high-quality feature representation for few-shot learning tasks. To address this issue, a novel time-frequency-assisted deep feature enhancement (TFE) mechanism is proposed. Unlike traditional methods that integrate time-frequency analysis with deep neural networks, TFE employs a wavelet scattering transform to establish a parallel time-frequency feature space, where a dual interaction strategy facilitates collaboration between deep feature and time-frequency spaces through two operations: 1) Enhancement, where a frequency-importance-driven contrastive learning (FICL) network transfers physically-aware information from wavelet scattering features to deep features, and 2) Feedback, which uses a detection rule adaptation module to minimize bias in wavelet scattering features based on deep feature performance. TFE is applied to a domain-adversarial anomaly detection framework and, through alternating training, significantly enhances both deep feature discriminative power and few-shot anomaly detection. Theoretical analysis confirms that the proposed dual interaction strategy reduces the upper bound of classification error. Experiments on benchmark datasets and a real-world industrial dataset from a large steel factory demonstrate TFE's superior performance and highlight the importance of frequency saliency in transfer learning. Thus, collaboration is shown to outperform integration for few-shot transfer learning in anomaly detection.

在过去的十年中，深度迁移学习在异常检测方面取得了显著的成功，但实际工程中的数据采集挑战阻碍了对少量学习任务的高质量特征表示。为了解决这个问题，提出了一种新的时频辅助深度特征增强（TFE）机制。与传统将时频分析与深度神经网络相结合的方法不同，TFE采用小波散射变换建立并行时频特征空间，其中双重交互策略通过两种操作促进了深度特征与时频空间的协同：1)增强，其中频率重要度驱动的对比学习（FICL）网络将物理感知信息从小波散射特征转移到深度特征；2)反馈，使用检测规则自适应模块根据深度特征性能最小化小波散射特征中的偏差。将TFE应用于域对抗异常检测框架中，通过交替训练，显著提高了深度特征判别能力和少量异常检测能力。理论分析证实，所提出的双交互策略减小了分类误差的上界。在基准数据集和来自大型钢铁厂的真实工业数据集上的实验证明了TFE的卓越性能，并强调了频率显著性在迁移学习中的重要性。因此，在异常检测中，协作学习优于集成学习。

{"title":"Collaboration Better Than Integration: A Novel Time-Frequency-Assisted Deep Feature Enhancement Mechanism for Few-Shot Transfer Learning in Anomaly Detection","authors":"Wentao Mao;Jianing Wu;Shubin Du;Ke Feng;Zidong Wang","doi":"10.1109/JAS.2025.125702","DOIUrl":"https://doi.org/10.1109/JAS.2025.125702","url":null,"abstract":"Deep transfer learning has achieved significant success in anomaly detection over the past decade, but data acquisition challenges in practical engineering hinder high-quality feature representation for few-shot learning tasks. To address this issue, a novel time-frequency-assisted deep feature enhancement (TFE) mechanism is proposed. Unlike traditional methods that integrate time-frequency analysis with deep neural networks, TFE employs a wavelet scattering transform to establish a parallel time-frequency feature space, where a dual interaction strategy facilitates collaboration between deep feature and time-frequency spaces through two operations: 1) Enhancement, where a frequency-importance-driven contrastive learning (FICL) network transfers physically-aware information from wavelet scattering features to deep features, and 2) Feedback, which uses a detection rule adaptation module to minimize bias in wavelet scattering features based on deep feature performance. TFE is applied to a domain-adversarial anomaly detection framework and, through alternating training, significantly enhances both deep feature discriminative power and few-shot anomaly detection. Theoretical analysis confirms that the proposed dual interaction strategy reduces the upper bound of classification error. Experiments on benchmark datasets and a real-world industrial dataset from a large steel factory demonstrate TFE's superior performance and highlight the importance of frequency saliency in transfer learning. Thus, collaboration is shown to outperform integration for few-shot transfer learning in anomaly detection.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"13 2","pages":"366-382"},"PeriodicalIF":19.2,"publicationDate":"2026-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147383124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Representation Then Augmentation: Wide Graph Clustering Network with Multi-Order Filter Fusion and Double-Level Contrastive Learning 先表示后增强：多阶滤波器融合和双层对比学习的宽图聚类网络

IF 19.2 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2026-03-10 DOI: 10.1109/JAS.2025.125564

Youqing Wang;Tianxiang Zhao;Mingliang Cui;Junbin Gao;Li Liang;Jipeng Guo

Deep graph contrastive clustering has attracted widespread attentions due to its self-supervised representation learning paradigm and superior clustering performance. Although, two challenges emerge and result in high computational costs. Most existing contrastive methods adopt the data augmentation and then representation learning strategy, where representation learning with trainable graph convolution is coupled with complex and fixed data augmentation, inevitably limiting the efficiency and flexibility. The similarity metric between positive-negative sample pairs is complex and contrastive objective is partial, limiting the discriminability of representation learning. To solve these challenges, a novel wide graph clustering network (WGCN) adhering to representation and then augmentation framework is proposed, which mainly consists of multi-order filter fusion (MFF) and double-level contrastive learning (DCL) modules. Specifically, the MFF module integrates multi-order low-pass filters to extract smooth and multi-scale topological features, utilizing self-attention fusion to reduce redundancy and obtain comprehensive embedding representation. Further, the DCL module constructs two augmented views by the parallel parameter-unshared Siamese encoders rather than complex augmentations on graph. To achieve simple yet effective self-supervised learning, representation self-supervision and structural consistency oriented double-level contrastive loss is designed, where representation self-supervision maximizes the agreement between pairwise augmented embedding representations and structural consistency promotes the mutual information correlation between appending neighborhoods with similar semantics. Extensive experiments on six benchmark datasets demonstrate the superiority of the proposed WGCN, especially highlighting its time-saving characteristic. The code could be available in the https://github.com/TianxiangZhao0474/WGCN.

深度图对比聚类由于其自监督表示学习范式和优异的聚类性能而受到广泛关注。但是，出现了两个挑战并导致高计算成本。现有的对比方法大多采用数据增强后再表示学习的策略，其中具有可训练图卷积的表示学习与复杂且固定的数据增强相结合，不可避免地限制了效率和灵活性。正负样本对之间的相似性度量比较复杂，对比目标比较片面，限制了表征学习的可判别性。为了解决这些问题，提出了一种基于表示-增强框架的新型宽图聚类网络（WGCN），该网络主要由多阶滤波器融合（MFF）和双级对比学习（DCL）模块组成。其中，MFF模块集成多阶低通滤波器提取平滑多尺度拓扑特征，利用自关注融合减少冗余，获得全面的嵌入表示。此外，DCL模块通过并行参数非共享Siamese编码器构造两个增强视图，而不是在图上进行复杂的增强。为了实现简单而有效的自监督学习，设计了面向表征自监督和结构一致性的双层对比损失，其中表征自监督最大化了成对增强嵌入表征之间的一致性，结构一致性促进了具有相似语义的附加邻域之间的相互信息关联。在6个基准数据集上的大量实验证明了WGCN的优越性，尤其突出了其节省时间的特点。代码可以在https://github.com/TianxiangZhao0474/WGCN中获得。

{"title":"Representation Then Augmentation: Wide Graph Clustering Network with Multi-Order Filter Fusion and Double-Level Contrastive Learning","authors":"Youqing Wang;Tianxiang Zhao;Mingliang Cui;Junbin Gao;Li Liang;Jipeng Guo","doi":"10.1109/JAS.2025.125564","DOIUrl":"https://doi.org/10.1109/JAS.2025.125564","url":null,"abstract":"Deep graph contrastive clustering has attracted widespread attentions due to its self-supervised representation learning paradigm and superior clustering performance. Although, two challenges emerge and result in high computational costs. Most existing contrastive methods adopt the data augmentation and then representation learning strategy, where representation learning with trainable graph convolution is coupled with complex and fixed data augmentation, inevitably limiting the efficiency and flexibility. The similarity metric between positive-negative sample pairs is complex and contrastive objective is partial, limiting the discriminability of representation learning. To solve these challenges, a novel wide graph clustering network (WGCN) adhering to representation and then augmentation framework is proposed, which mainly consists of multi-order filter fusion (MFF) and double-level contrastive learning (DCL) modules. Specifically, the MFF module integrates multi-order low-pass filters to extract smooth and multi-scale topological features, utilizing self-attention fusion to reduce redundancy and obtain comprehensive embedding representation. Further, the DCL module constructs two augmented views by the parallel parameter-unshared Siamese encoders rather than complex augmentations on graph. To achieve simple yet effective self-supervised learning, representation self-supervision and structural consistency oriented double-level contrastive loss is designed, where representation self-supervision maximizes the agreement between pairwise augmented embedding representations and structural consistency promotes the mutual information correlation between appending neighborhoods with similar semantics. Extensive experiments on six benchmark datasets demonstrate the superiority of the proposed WGCN, especially highlighting its time-saving characteristic. The code could be available in the https://github.com/TianxiangZhao0474/WGCN.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"13 2","pages":"421-435"},"PeriodicalIF":19.2,"publicationDate":"2026-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147383110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Data-Driven Predictive Control for Continuous-Time Nonlinear Systems: A Nonzero-Sum Game Approach 连续时间非线性系统的数据驱动预测控制：一个非零和博弈方法

IF 19.2 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2026-03-10 DOI: 10.1109/JAS.2025.125660

Juan Liu;Hao Zhang;Yifan Xie;Frank Allgöwer

Dear Editor, This letter proposes a reinforcement learning-based predictive learning algorithm for unknown continuous-time nonlinear systems with observation loss. Firstly, we construct a temporal nonzero-sum game over predictive control input sequences, deriving multiple optimal predictive control input sequences from its solution. To obtain the Nash equilibrium solution of the temporal nonzero-sum game, we solve the problem through policy iteration of reinforcement learning. Then, we train the actor neural network and critic neural network to estimate the control policy and action value function, respectively, using the collected offline and online input-state data. Compared to traditional predictive control methods, the proposed framework does not rely on an explicit model and obtains a data-driven controller design. Finally, the effectiveness of the proposed algorithm is validated through a numerical example.

这封信提出了一种基于强化学习的预测学习算法，用于未知的具有观测损失的连续时间非线性系统。首先，在预测控制输入序列上构造一个时间非零和博弈，从其解中导出多个最优预测控制输入序列。为了得到时间非零和博弈的纳什均衡解，我们通过强化学习的策略迭代来求解问题。然后，我们利用收集到的离线和在线输入状态数据，分别训练行动者神经网络和评论家神经网络来估计控制策略和动作值函数。与传统的预测控制方法相比，该框架不依赖于显式模型，实现了数据驱动的控制器设计。最后，通过一个算例验证了所提算法的有效性。

引用次数: 0

Relative Motion Based Predictive Adaptive Control: A Case Study of AUV 3D Trajectory Tracking 基于相对运动的预测自适应控制：AUV三维轨迹跟踪实例研究

IF 19.2 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2026-03-10 DOI: 10.1109/JAS.2025.125624

Daxiong Ji;Xinwei Wang;Yuanchang Liu

Dear Editor, This letter deals with the autonomous underwater vehicle (AUV) three dimensional (3D) trajectory tracking control chronically suffering from poor accuracy and efficiency in complex hydrodynamics. A state-of-the-art predictive adaptive controller (PAC) is proposed with a distinct dual closed-loop structure. Based on the essence of AUV's relative hydrodynamics, the ocean currents are no longer compensated in the outer loop, where a novel predictive controller is investigated to obtain absolute target velocities, optimized by an advanced dung beetle optimizer (ADBO). In the inner loop, an adaptive controller is elaborately derived from the AUV dynamic model of relative motion, while an ocean current observer is adopted to acquire the relative target velocities. A comprehensive series of experiments proves the advantage of PAC compared to the existing representative control approaches.

这封信是关于自主水下航行器（AUV）三维（3D）轨迹跟踪控制在复杂的流体动力学中长期存在精度和效率不高的问题。提出了一种具有独特双闭环结构的预测自适应控制器。根据水下航行器相对流体力学的特点，在水下航行器外环不再对海流进行补偿，研究了一种新的预测控制器，通过先进的屎壳虫优化器（ADBO）对其进行优化，获得绝对目标速度。在内环中，根据水下机器人的相对运动动力学模型，精心推导出自适应控制器，并采用海流观测器获取相对目标速度。一系列的实验证明了PAC与现有的代表性控制方法相比的优势。

引用次数: 0

Dynamic Event-Triggered Mechanisms with Positive Minimum Inter-Event Times for Linear Multiagent Consensus on Directed Graphs 有向图上线性多智能体一致性的正最小事件间时间动态事件触发机制

IF 19.2 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2026-03-10 DOI: 10.1109/JAS.2025.125822

Sikang Zhan;Xianwei Li;Yuanyuan Zou;Shaoyuan Li

This article studies the consensus problem with directed graphs for general linear multi-agent systems. New distributed state-feedback protocols with dynamic event-triggered (DET) mechanisms are proposed for directed graphs that are strongly connected and weight-balanced, general strongly connected, and have spanning trees, respectively. It is proven that strictly positive minimum inter-event times (MIETs) are ensured using the designed DET mechanisms. Several numerical examples are presented to illustrate the effectiveness of the theoretical results. Compared with existing results, our results have the following merits: 1) DET mechanisms are designed to determine the sampling instants, which can reduce the communication frequency between agents compared with static mechanisms; 2) We focus on the consensus problem on directed graphs, which is more general than existing related results on undirected graphs; 3) The existence of positive MIETs is shown to be guaranteed by the designed DET sampling strategies while existing related results can only exclude Zeno behavior.

研究一般线性多智能体系统的有向图一致性问题。针对具有强连接和权平衡、一般强连接和生成树的有向图，提出了具有动态事件触发（DET）机制的分布式状态反馈协议。证明了使用所设计的DET机制可以确保严格正的最小事件间时间（MIETs）。通过数值算例说明了理论结果的有效性。与已有结果相比，我们的结果具有以下优点：1)设计了DET机制来确定采样时刻，与静态机制相比，可以减少agent之间的通信频率；2)重点研究了有向图上的一致性问题，这比已有的无向图上的相关结果更具有一般性；3)所设计的DET采样策略保证了正MIETs的存在性，而现有的相关结果只能排除Zeno行为。

{"title":"Dynamic Event-Triggered Mechanisms with Positive Minimum Inter-Event Times for Linear Multiagent Consensus on Directed Graphs","authors":"Sikang Zhan;Xianwei Li;Yuanyuan Zou;Shaoyuan Li","doi":"10.1109/JAS.2025.125822","DOIUrl":"https://doi.org/10.1109/JAS.2025.125822","url":null,"abstract":"This article studies the consensus problem with directed graphs for general linear multi-agent systems. New distributed state-feedback protocols with dynamic event-triggered (DET) mechanisms are proposed for directed graphs that are strongly connected and weight-balanced, general strongly connected, and have spanning trees, respectively. It is proven that strictly positive minimum inter-event times (MIETs) are ensured using the designed DET mechanisms. Several numerical examples are presented to illustrate the effectiveness of the theoretical results. Compared with existing results, our results have the following merits: 1) DET mechanisms are designed to determine the sampling instants, which can reduce the communication frequency between agents compared with static mechanisms; 2) We focus on the consensus problem on directed graphs, which is more general than existing related results on undirected graphs; 3) The existence of positive MIETs is shown to be guaranteed by the designed DET sampling strategies while existing related results can only exclude Zeno behavior.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"13 2","pages":"436-450"},"PeriodicalIF":19.2,"publicationDate":"2026-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147383102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Human-Robot Interaction-Based Model Predictive Control for Exoskeleton Robots Driven by Series Elastic Actuators 基于人机交互的串联弹性作动器外骨骼机器人模型预测控制

IF 19.2 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2026-03-10 DOI: 10.1109/JAS.2025.125477

Changxian Xu;Keping Liu;Zhongbo Sun

Dear Editor, This letter presents a model predictive control (MPC) scheme for human-robot interaction (HRI) in a multi-joint exoskeleton robot (ER) driven by series elastic actuator (SEA). The proposed scheme in robot-in-charge (RIC) mode facilitates the ER driven by SEA to provide the required assistance and support for the subject. Meanwhile, it enables the ER driven by SEA to closely follow the movement of the subject and provides the least resistance in the human-in-charge (HIC) mode. In addition, a finite-time convergent zeroing neural network (FTCZNN) model is designed to solve the optimal solution of the MPC scheme. The convex activation function is designed to impose the error function to converge to zero in finite time, which guarantees the recursive feasibility of the MPC optimization. The convergence of the proposed FTCZNN algorithm is demonstrated through theoretical analyses. Finally, numerical simulations verify that the MPC scheme solved by the FTCZNN model enables ER driven by SEA to perform HRI tasks with high efficiency, rapid response, and strong robustness.

本文提出了一种基于串联弹性致动器（SEA）驱动的多关节外骨骼机器人（ER）人机交互（HRI）的模型预测控制（MPC）方案。在机器人主管（RIC）模式下提出的方案使SEA驱动的ER为受试者提供所需的帮助和支持。同时，它使SEA驱动的ER能够密切跟随受试者的运动，并且在human-in-charge （HIC）模式下提供最小的阻力。此外，设计了一个有限时间收敛归零神经网络（FTCZNN）模型来求解MPC方案的最优解。设计了凸激活函数，使误差函数在有限时间内收敛于零，保证了MPC优化的递归可行性。通过理论分析证明了该算法的收敛性。最后，通过数值仿真验证了由FTCZNN模型求解的MPC方案能够使SEA驱动的ER以高效率、快速响应和强鲁棒性执行HRI任务。

{"title":"Human-Robot Interaction-Based Model Predictive Control for Exoskeleton Robots Driven by Series Elastic Actuators","authors":"Changxian Xu;Keping Liu;Zhongbo Sun","doi":"10.1109/JAS.2025.125477","DOIUrl":"https://doi.org/10.1109/JAS.2025.125477","url":null,"abstract":"Dear Editor, This letter presents a model predictive control (MPC) scheme for human-robot interaction (HRI) in a multi-joint exoskeleton robot (ER) driven by series elastic actuator (SEA). The proposed scheme in robot-in-charge (RIC) mode facilitates the ER driven by SEA to provide the required assistance and support for the subject. Meanwhile, it enables the ER driven by SEA to closely follow the movement of the subject and provides the least resistance in the human-in-charge (HIC) mode. In addition, a finite-time convergent zeroing neural network (FTCZNN) model is designed to solve the optimal solution of the MPC scheme. The convex activation function is designed to impose the error function to converge to zero in finite time, which guarantees the recursive feasibility of the MPC optimization. The convergence of the proposed FTCZNN algorithm is demonstrated through theoretical analyses. Finally, numerical simulations verify that the MPC scheme solved by the FTCZNN model enables ER driven by SEA to perform HRI tasks with high efficiency, rapid response, and strong robustness.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"13 2","pages":"486-488"},"PeriodicalIF":19.2,"publicationDate":"2026-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11429630","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147383111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Indefinite Linear-Quadratic Mean-Field Game of Regime-Switching System 状态切换系统的不定线性二次平均场对策

IF 19.2 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2026-01-30 DOI: 10.1109/JAS.2025.125456

Tian Chen;Kai Du;Zhen Wu

This paper studies an indefinite mean-field game with Markov jump parameters, where all agents' diffusion terms depend on control variables and both state and control average terms $(x^{(N)}, u^{(N)})$ are considered. One notable aspect is the relaxation of the assumption regarding the positivity or non-negativity of weight matrices within costs, allowing for zero or even negative values. By virtue of mean-field methods and decomposition techniques, we have derived decentralized strategies presented by Hamiltonian systems and a new type of consistency condition system. These systems consist of fully coupled regime-switching forward-backward stochastic differential equations that do not conform to the Monotonicity condition. The well-posedness of these strategies is established by employing a relaxed compensator method with an easily verifiable Condition (RC) and the decomposition technique. Furthermore, we demonstrate that the resulting decentralized strategies achieve an $epsilon$-Nash equilibrium in the indefinite case without any assumptions on admissible control sets using novel estimates of the disturbed state and cost function. Finally, our theoretical results are applied to resolve a class of mean-variance portfolio selection problems. We provide corresponding numerical simulation results and economic explanations.

本文研究了一个具有马尔可夫跳跃参数的不定平均场对策，其中所有智能体的扩散项依赖于控制变量，并且同时考虑状态和控制平均项$(x^{(N)}, u^{(N)})$。一个值得注意的方面是在成本范围内放宽了关于权矩阵的正或非负性的假设，允许零甚至负值。利用平均场方法和分解技术，导出了哈密顿系统的分散策略和一类新的一致性条件系统。这些系统由不符合单调性条件的完全耦合状态切换的前向后随机微分方程组成。采用具有易验证条件的松弛补偿器方法和分解技术建立了这些策略的适定性。此外，我们证明了所得到的分散策略在不确定情况下实现了$epsilon$-Nash均衡，而不需要对允许的控制集进行任何假设，使用新的扰动状态和成本函数估计。最后，将我们的理论结果应用于解决一类均值-方差投资组合问题。给出了相应的数值模拟结果和经济解释。

{"title":"Indefinite Linear-Quadratic Mean-Field Game of Regime-Switching System","authors":"Tian Chen;Kai Du;Zhen Wu","doi":"10.1109/JAS.2025.125456","DOIUrl":"https://doi.org/10.1109/JAS.2025.125456","url":null,"abstract":"This paper studies an indefinite mean-field game with Markov jump parameters, where all agents' diffusion terms depend on control variables and both state and control average terms <tex>$(x^{(N)}, u^{(N)})$</tex> are considered. One notable aspect is the relaxation of the assumption regarding the positivity or non-negativity of weight matrices within costs, allowing for zero or even negative values. By virtue of mean-field methods and decomposition techniques, we have derived decentralized strategies presented by Hamiltonian systems and a new type of consistency condition system. These systems consist of fully coupled regime-switching forward-backward stochastic differential equations that do not conform to the Monotonicity condition. The well-posedness of these strategies is established by employing a relaxed compensator method with an easily verifiable Condition (RC) and the decomposition technique. Furthermore, we demonstrate that the resulting decentralized strategies achieve an <tex>$epsilon$</tex>-Nash equilibrium in the indefinite case without any assumptions on admissible control sets using novel estimates of the disturbed state and cost function. Finally, our theoretical results are applied to resolve a class of mean-variance portfolio selection problems. We provide corresponding numerical simulation results and economic explanations.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"13 1","pages":"83-97"},"PeriodicalIF":19.2,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0