首页 > 最新文献

Automatica最新文献

英文 中文
CBF-based safety design for adaptive control of uncertain nonlinear strict-feedback systems 基于cbf的不确定非线性严格反馈系统自适应控制安全设计
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-03-01 Epub Date: 2025-12-19 DOI: 10.1016/j.automatica.2025.112782
Liwei An , Can Zhao , Guang-Hong Yang
This paper studies the problem of adaptive tracking control for nonlinear strict-feedback systems with parametric uncertainties and safety constraints. A co-design strategy of control barrier functions (CBFs) and barrier Lyapunov functions (BLFs) is proposed, which inherits the robustness and stability of traditional adaptive backstepping controllers. First, a safe virtual control signal is generated by the CBF-induced quadratic programming (QP), which is the suboptimal and smooth solution of the QP with a shifting function. Then, a BLF-based backstepping controller is designed by following the safe virtual control signal. It is shown that the co-design can minimize the damage to the original tracking performance on the premise of safety guarantees. The distinguishing point of the safety design over the existing results is to avoid constructing the high-order CBFs that lead to conservative feasible sets of inputs for ensuring high-relative-degree safety constraints. The simulation results show that the proposed scheme achieves better tracking performance compared with the existing high-order CBF-based method.
研究了具有参数不确定性和安全约束的非线性严格反馈系统的自适应跟踪控制问题。提出了一种控制障碍函数(cbf)和障碍李雅普诺夫函数(blf)的协同设计策略,继承了传统自适应反演控制器的鲁棒性和稳定性。首先,利用cbf诱导的二次规划(QP)生成一个安全的虚拟控制信号,该信号是带移位函数的二次规划的次优光滑解。然后,根据安全虚拟控制信号设计了基于blf的反步控制器。研究表明,在保证安全的前提下,协同设计能最大限度地降低对原有跟踪性能的破坏。安全设计与现有结果的区别在于避免构建高阶cbf,从而导致保守的可行输入集,以确保高相对程度的安全约束。仿真结果表明,与现有的基于高阶cbf的方法相比,该方法具有更好的跟踪性能。
{"title":"CBF-based safety design for adaptive control of uncertain nonlinear strict-feedback systems","authors":"Liwei An ,&nbsp;Can Zhao ,&nbsp;Guang-Hong Yang","doi":"10.1016/j.automatica.2025.112782","DOIUrl":"10.1016/j.automatica.2025.112782","url":null,"abstract":"<div><div>This paper studies the problem of adaptive tracking control for nonlinear strict-feedback systems with parametric uncertainties and safety constraints. A co-design strategy of control barrier functions (CBFs) and barrier Lyapunov functions (BLFs) is proposed, which inherits the robustness and stability of traditional adaptive backstepping controllers. First, a safe virtual control signal is generated by the CBF-induced quadratic programming (QP), which is the suboptimal and smooth solution of the QP with a shifting function. Then, a BLF-based backstepping controller is designed by following the safe virtual control signal. It is shown that the co-design can minimize the damage to the original tracking performance on the premise of safety guarantees. The distinguishing point of the safety design over the existing results is to avoid constructing the high-order CBFs that lead to conservative feasible sets of inputs for ensuring high-relative-degree safety constraints. The simulation results show that the proposed scheme achieves better tracking performance compared with the existing high-order CBF-based method.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112782"},"PeriodicalIF":5.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145799183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonholonomic mechanics and virtual constraints on Riemannian homogeneous spaces 黎曼齐次空间的非完整力学和虚约束
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-03-01 Epub Date: 2025-12-16 DOI: 10.1016/j.automatica.2025.112754
Efstratios Stratoglou , Alexandre Anahory Simoes , Anthony Bloch , Leonardo J. Colombo
Nonholonomic systems are, so to speak, mechanical systems with a prescribed restriction on the velocities. A virtual nonholonomic constraint is a controlled invariant distribution associated with an affine connection mechanical control system. A Riemannian homogeneous space is, a Riemannian manifold that looks the same everywhere, as you move through it by the action of a Lie group. These Riemannian manifolds are not necessarily Lie groups themselves, but nonetheless possess certain symmetries and invariances that allow for similar results to be obtained. In this work, we introduce the notion of virtual constraint on Riemannian homogeneous spaces in a geometric framework which is a generalization of the classical controlled invariant distribution setting and we show the existence and uniqueness of a control law preserving the invariant distribution. Moreover we characterize the closed-loop dynamics obtained using the unique control law in terms of an affine connection. We illustrate the theory with new examples of nonholonomic control systems inspired by robotics applications.
可以说,非完整系统是对速度有规定限制的机械系统。虚非完整约束是与仿射连接机械控制系统相关的受控不变分布。一个黎曼齐次空间是,一个黎曼流形在任何地方看起来都是一样的,当你通过李群的作用穿过它的时候。这些黎曼流形本身不一定是李群,但仍然具有一定的对称性和不变性,从而可以得到类似的结果。本文在几何框架中引入了riemann齐次空间上的虚约束的概念,这是经典控制不变分布设置的推广,并证明了保持不变分布的控制律的存在唯一性。此外,我们还描述了利用唯一控制律在仿射连接下得到的闭环动力学。我们用机器人应用启发的非完整控制系统的新例子来说明这一理论。
{"title":"Nonholonomic mechanics and virtual constraints on Riemannian homogeneous spaces","authors":"Efstratios Stratoglou ,&nbsp;Alexandre Anahory Simoes ,&nbsp;Anthony Bloch ,&nbsp;Leonardo J. Colombo","doi":"10.1016/j.automatica.2025.112754","DOIUrl":"10.1016/j.automatica.2025.112754","url":null,"abstract":"<div><div>Nonholonomic systems are, so to speak, mechanical systems with a prescribed restriction on the velocities. A virtual nonholonomic constraint is a controlled invariant distribution associated with an affine connection mechanical control system. A Riemannian homogeneous space is, a Riemannian manifold that looks the same everywhere, as you move through it by the action of a Lie group. These Riemannian manifolds are not necessarily Lie groups themselves, but nonetheless possess certain symmetries and invariances that allow for similar results to be obtained. In this work, we introduce the notion of virtual constraint on Riemannian homogeneous spaces in a geometric framework which is a generalization of the classical controlled invariant distribution setting and we show the existence and uniqueness of a control law preserving the invariant distribution. Moreover we characterize the closed-loop dynamics obtained using the unique control law in terms of an affine connection. We illustrate the theory with new examples of nonholonomic control systems inspired by robotics applications.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112754"},"PeriodicalIF":5.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145799601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stationary Stackelberg equilibrium in two-person zero-sum stochastic Stackelberg games 二人零和随机Stackelberg对策中的平稳Stackelberg均衡
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-03-01 Epub Date: 2025-12-19 DOI: 10.1016/j.automatica.2025.112758
Kaiyun Xie , Junlin Xiong
This paper investigates the existence and computability of the stationary Stackelberg equilibrium (SSE) in two-person zero-sum stochastic Stackelberg games (SSGs). First, an operator-based approach is developed to illustrate that such games admit a fixed-point equilibrium (FPE). It is further proven that the FPE strategy pair constitutes an SSE. Building on this foundation, a value iteration (VI) algorithm is proposed to compute the SSE strategies. However, due to the curse of dimensionality, the exact computation of SSE strategies involves high computational complexity. To address this issue, an ϵ-sacrifice strategy is introduced to approximate the leader’s SSE strategy by performing finite iterations, with the degree of approximation quantified by ϵ. The relationship between ϵ and the number of iterations n is established, ensuring a trade-off between computational efficiency and strategic performance. An information flow control example demonstrates the efficiency of the designed strategies.
研究了二人零和随机Stackelberg对策中平稳Stackelberg均衡(SSE)的存在性和可计算性。首先,提出了一种基于算子的方法来说明这种博弈承认不动点均衡(FPE)。进一步证明了FPE策略对构成SSE。在此基础上,提出了一种计算SSE策略的值迭代(VI)算法。然而,由于维度的诅咒,SSE策略的精确计算涉及到很高的计算复杂度。为了解决这个问题,引入ϵ-sacrifice策略,通过执行有限迭代来近似领导者的SSE策略,近似程度由λ量化。建立了λ和迭代次数n之间的关系,确保了计算效率和策略性能之间的权衡。一个信息流控制实例验证了所设计策略的有效性。
{"title":"Stationary Stackelberg equilibrium in two-person zero-sum stochastic Stackelberg games","authors":"Kaiyun Xie ,&nbsp;Junlin Xiong","doi":"10.1016/j.automatica.2025.112758","DOIUrl":"10.1016/j.automatica.2025.112758","url":null,"abstract":"<div><div>This paper investigates the existence and computability of the stationary Stackelberg equilibrium (SSE) in two-person zero-sum stochastic Stackelberg games (SSGs). First, an operator-based approach is developed to illustrate that such games admit a fixed-point equilibrium (FPE). It is further proven that the FPE strategy pair constitutes an SSE. Building on this foundation, a value iteration (VI) algorithm is proposed to compute the SSE strategies. However, due to the curse of dimensionality, the exact computation of SSE strategies involves high computational complexity. To address this issue, an <span><math><mi>ϵ</mi></math></span>-sacrifice strategy is introduced to approximate the leader’s SSE strategy by performing finite iterations, with the degree of approximation quantified by <span><math><mi>ϵ</mi></math></span>. The relationship between <span><math><mi>ϵ</mi></math></span> and the number of iterations <span><math><mi>n</mi></math></span> is established, ensuring a trade-off between computational efficiency and strategic performance. An information flow control example demonstrates the efficiency of the designed strategies.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112758"},"PeriodicalIF":5.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145799602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed control to steer dynamical systems to the generalized Nash equilibria for monotone aggregative games with operational constraints 带操作约束的单调聚集对策的分布式控制
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-03-01 Epub Date: 2025-12-19 DOI: 10.1016/j.automatica.2025.112794
Ming Li , Zhaojian Wang , Mengshuo Jia , Feng Liu , Bo Yang , Xinping Guan
In this paper, we propose a distributed feedback controller to steer a multi-agent dynamical system to the generalized Nash equilibrium (GNE) for an aggregative game with only a monotone pseudo-gradient. First, a monotone aggregative game among the agents is formulated, which considers operational constraints in both the transient process and the steady state. Then, a distributed controller based on primal–dual gradient dynamics is designed for each agent to steer the system to track the GNE of the predefined game autonomously. The controller relaxes the requirement of the strictly/strongly monotone pseudo-gradient. At the same time, the hard limits of the control input are satisfied both at equilibrium and during the transient process. We further prove that the closed-loop system is asymptotically stable and its equilibrium attains the GNE. Finally, a practical example of frequency control on IEEE 68-bus system verifies the effectiveness of the proposed method.
在本文中,我们提出了一种分布式反馈控制器来引导多智能体动态系统在只有单调伪梯度的聚集对策下达到广义纳什均衡(GNE)。首先,建立了agent间的单调聚合对策,同时考虑了瞬态过程和稳态过程的操作约束。然后,为每个智能体设计基于原对偶梯度动力学的分布式控制器,引导系统自主跟踪预定义博弈的GNE;该控制器放宽了对严格/强单调伪梯度的要求。同时,在平衡态和暂态过程中都满足控制输入的硬极限。进一步证明了闭环系统是渐近稳定的,其平衡点达到了GNE。最后,通过IEEE 68总线系统的频率控制实例验证了所提方法的有效性。
{"title":"Distributed control to steer dynamical systems to the generalized Nash equilibria for monotone aggregative games with operational constraints","authors":"Ming Li ,&nbsp;Zhaojian Wang ,&nbsp;Mengshuo Jia ,&nbsp;Feng Liu ,&nbsp;Bo Yang ,&nbsp;Xinping Guan","doi":"10.1016/j.automatica.2025.112794","DOIUrl":"10.1016/j.automatica.2025.112794","url":null,"abstract":"<div><div>In this paper, we propose a distributed feedback controller to steer a multi-agent dynamical system to the generalized Nash equilibrium (GNE) for an aggregative game with only a monotone pseudo-gradient. First, a monotone aggregative game among the agents is formulated, which considers operational constraints in both the transient process and the steady state. Then, a distributed controller based on primal–dual gradient dynamics is designed for each agent to steer the system to track the GNE of the predefined game autonomously. The controller relaxes the requirement of the strictly/strongly monotone pseudo-gradient. At the same time, the hard limits of the control input are satisfied both at equilibrium and during the transient process. We further prove that the closed-loop system is asymptotically stable and its equilibrium attains the GNE. Finally, a practical example of frequency control on IEEE 68-bus system verifies the effectiveness of the proposed method.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112794"},"PeriodicalIF":5.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145799647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust reinforcement learning-based controller for a class of nonlinear multiplayer Stackelberg game with input constraints 一类具有输入约束的非线性多人Stackelberg博弈的鲁棒强化学习控制器
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-03-01 Epub Date: 2025-12-23 DOI: 10.1016/j.automatica.2025.112760
Reza Samsami, Mohammad Mehdi Arefi
This paper introduces a novel policy iteration (PI) reinforcement learning (RL) technique for a hierarchical Stackelberg dynamic game for a class of continuous-time nonlinear dynamical systems including one-leader-multi-followers that concurrently handles both zero-sum (ZS) and nonzero-sum (NZS) games in the game model. At first, the leader makes decisions to learn optimal strategies of all followers. Meanwhile, each follower independently reacts optimally to the leader’s strategy by playing a Nash game with followers and a ZS game with the disturbances simultaneously. To overcome the difficulty of solving the Hamilton–Jacobi (HJ) equation resulting from this hierarchical game, and consequently finding the multiplayer Stackelberg–Nash-saddle (MSNS) equilibrium solution of the game, an adaptive dynamic programming (ADP) algorithm that employs neural network (NN) is developed. An actor-critic NNs framework is presented to solve HJ equations by estimating value functions, control policies, and disturbances. To obtain the leader’s optimal strategy, an optimal adaptive control method using NN is proposed. The convergence of NN weights and the states of the closed-loop optimal system are shown to be uniformly ultimately bounded (UUB). Finally, an example is provided to show the effectiveness of the proposed algorithm in solving the MSNS equilibrium solution with input constraints.
本文介绍了一种新的策略迭代(PI)强化学习(RL)技术,用于解决一类包含一个领导者-多追随者的连续时间非线性动态系统的分层Stackelberg动态博弈,该系统同时处理博弈模型中的零和(ZS)和非零和(NZS)博弈。首先,领导者做出决策,学习所有追随者的最优策略。同时,每个追随者通过与追随者进行纳什博弈和与干扰同时进行ZS博弈,独立地对领导者的策略做出最优反应。为了克服该分层博弈导致的Hamilton-Jacobi (HJ)方程求解困难,从而找到该博弈的多人Stackelberg-Nash-saddle (MSNS)均衡解,提出了一种基于神经网络的自适应动态规划(ADP)算法。提出了一个行动者批判神经网络框架,通过估计值函数、控制策略和干扰来求解HJ方程。为了获得领导者的最优策略,提出了一种基于神经网络的最优自适应控制方法。证明了神经网络权值的收敛性和闭环最优系统的状态是一致最终有界的。最后,给出了一个算例,证明了该算法在求解带有输入约束的MSNS平衡解时的有效性。
{"title":"Robust reinforcement learning-based controller for a class of nonlinear multiplayer Stackelberg game with input constraints","authors":"Reza Samsami,&nbsp;Mohammad Mehdi Arefi","doi":"10.1016/j.automatica.2025.112760","DOIUrl":"10.1016/j.automatica.2025.112760","url":null,"abstract":"<div><div>This paper introduces a novel policy iteration (PI) reinforcement learning (RL) technique for a hierarchical Stackelberg dynamic game for a class of continuous-time nonlinear dynamical systems including one-leader-multi-followers that concurrently handles both zero-sum (ZS) and nonzero-sum (NZS) games in the game model. At first, the leader makes decisions to learn optimal strategies of all followers. Meanwhile, each follower independently reacts optimally to the leader’s strategy by playing a Nash game with followers and a ZS game with the disturbances simultaneously. To overcome the difficulty of solving the Hamilton–Jacobi (HJ) equation resulting from this hierarchical game, and consequently finding the multiplayer Stackelberg–Nash-saddle (MSNS) equilibrium solution of the game, an adaptive dynamic programming (ADP) algorithm that employs neural network (NN) is developed. An actor-critic NNs framework is presented to solve HJ equations by estimating value functions, control policies, and disturbances. To obtain the leader’s optimal strategy, an optimal adaptive control method using NN is proposed. The convergence of NN weights and the states of the closed-loop optimal system are shown to be uniformly ultimately bounded (UUB). Finally, an example is provided to show the effectiveness of the proposed algorithm in solving the MSNS equilibrium solution with input constraints.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112760"},"PeriodicalIF":5.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145822797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Event-based probability-guaranteed tracking control with amplify-and-forward relay schemes 基于事件概率保证的放大前向中继跟踪控制
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-03-01 Epub Date: 2025-12-24 DOI: 10.1016/j.automatica.2025.112761
Licheng Wang , Zidong Wang
This paper investigates the event-based probability guaranteed H tracking control problem for a class of discrete-time systems with a relay-aided communication scheme. The uncertainties of the system matrices are taken into account, which is described by mutually independent uniformly distributed random variables. An adaptive event-triggering scheme is introduced during the signal exchange from the sensor to the relay where the triggering threshold is dynamically regulated. Moreover, an amplify-to-forward relay is employed to improve the communication quality between the sensor and the controller. With the help of the Lyapunov stability theory, sufficient conditions are established to guarantee that the tracking error dynamics achieves the asymptotical stability and the H performance with probability constraints. The desired controller parameters are derived by means of the solution to certain optimization problem subject to matrix inequality constraints. Finally, a numerical example is provided to show the effectiveness of the developed tracking control scheme.
研究了一类具有中继辅助通信方案的离散系统的基于事件概率保证的H∞跟踪控制问题。考虑系统矩阵的不确定性,用相互独立的均匀分布随机变量来描述。在从传感器到继电器的信号交换过程中引入了一种自适应事件触发方案,触发阈值是动态调节的。此外,为了提高传感器和控制器之间的通信质量,还采用了放大前向继电器。利用Lyapunov稳定性理论,建立了跟踪误差动力学在概率约束下达到渐近稳定性和H∞性能的充分条件。通过求解一类矩阵不等式约束下的优化问题,得到了期望的控制器参数。最后,通过数值算例验证了所提出的跟踪控制方案的有效性。
{"title":"Event-based probability-guaranteed tracking control with amplify-and-forward relay schemes","authors":"Licheng Wang ,&nbsp;Zidong Wang","doi":"10.1016/j.automatica.2025.112761","DOIUrl":"10.1016/j.automatica.2025.112761","url":null,"abstract":"<div><div>This paper investigates the event-based probability guaranteed <span><math><msub><mrow><mi>H</mi></mrow><mrow><mi>∞</mi></mrow></msub></math></span> tracking control problem for a class of discrete-time systems with a relay-aided communication scheme. The uncertainties of the system matrices are taken into account, which is described by mutually independent uniformly distributed random variables. An adaptive event-triggering scheme is introduced during the signal exchange from the sensor to the relay where the triggering threshold is dynamically regulated. Moreover, an amplify-to-forward relay is employed to improve the communication quality between the sensor and the controller. With the help of the Lyapunov stability theory, sufficient conditions are established to guarantee that the tracking error dynamics achieves the asymptotical stability and the <span><math><msub><mrow><mi>H</mi></mrow><mrow><mi>∞</mi></mrow></msub></math></span> performance with probability constraints. The desired controller parameters are derived by means of the solution to certain optimization problem subject to matrix inequality constraints. Finally, a numerical example is provided to show the effectiveness of the developed tracking control scheme.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112761"},"PeriodicalIF":5.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145822795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The role of network connectivity in distributed k-agreement protocols 网络连接在分布式k协议中的作用
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-03-01 Epub Date: 2025-12-29 DOI: 10.1016/j.automatica.2025.112753
Gianluca Bianchin , Miguel Vaquero , Jorge Cortés , Emiliano Dall’Anese
Given a network of agents, we say that the agents achieve a k-agreement when their state variables converge to a point that corresponds to the projection of the agents’ states onto a k-dimensional linear subspace. The k-agreement problem generalizes the classical consensus problem; unlike in consensus, where the agents’ states must asymptotically coincide, in k-agreement the agents reach an agreement in a generalized sense (within a linear subspace, where the states do not necessarily coincide). In this paper, we investigate which interaction topologies enable a network of agents to reach an agreement on a prescribed k-dimensional subspace through local coordination algorithms. We show that achieving k-agreement requires communication over highly connected graphs; specifically, the number of edges in the interaction graph must grow linearly with the dimension k of the agreement subspace. Our characterization reveals that the presence of cycles in the communication graph (particularly, independent families of cycles) constitutes the fundamental structural feature enabling the agents to achieve k-agreement. We also investigate the use of common graph topologies, such as path and circulant graphs, for k-agreement, deriving insights into the relationship between the subspace dimension k and the required network connectivity. The effectiveness of the proposed framework is demonstrated through simulations in robotic formation control problems.
给定一个智能体网络,当它们的状态变量收敛到与智能体状态在k维线性子空间上的投影相对应的点时,我们说智能体达到了k协议。k-一致问题推广了经典共识问题;不像在consensus中,agent的状态必须渐近地重合,在k-agreement中,agent在广义意义上达成一致(在线性子空间中,状态不一定重合)。在本文中,我们研究了哪些交互拓扑使智能体网络通过局部协调算法在规定的k维子空间上达成一致。我们证明了实现k协议需要在高度连通的图上进行通信;具体来说,交互图中的边数必须随协议子空间的维数k线性增长。我们的表征揭示了通信图中循环的存在(特别是,独立的循环族)构成了使代理实现k-协议的基本结构特征。我们还研究了常见图拓扑(如路径图和循环图)对k-一致性的使用,从而深入了解了子空间维度k与所需网络连通性之间的关系。通过机器人编队控制问题的仿真验证了该框架的有效性。
{"title":"The role of network connectivity in distributed k-agreement protocols","authors":"Gianluca Bianchin ,&nbsp;Miguel Vaquero ,&nbsp;Jorge Cortés ,&nbsp;Emiliano Dall’Anese","doi":"10.1016/j.automatica.2025.112753","DOIUrl":"10.1016/j.automatica.2025.112753","url":null,"abstract":"<div><div>Given a network of agents, we say that the agents achieve a <span><math><mi>k</mi></math></span>-<em>agreement</em> when their state variables converge to a point that corresponds to the projection of the agents’ states onto a <span><math><mi>k</mi></math></span>-dimensional linear subspace. The <span><math><mi>k</mi></math></span>-agreement problem generalizes the classical consensus problem; unlike in consensus, where the agents’ states must asymptotically coincide, in <span><math><mi>k</mi></math></span>-agreement the agents reach an agreement in a generalized sense (within a linear subspace, where the states do not necessarily coincide). In this paper, we investigate which interaction topologies enable a network of agents to reach an agreement on a prescribed <span><math><mi>k</mi></math></span>-dimensional subspace through local coordination algorithms. We show that achieving <span><math><mi>k</mi></math></span>-agreement requires communication over highly connected graphs; specifically, the number of edges in the interaction graph must grow linearly with the dimension <span><math><mi>k</mi></math></span> of the agreement subspace. Our characterization reveals that the presence of cycles in the communication graph (particularly, independent families of cycles) constitutes the fundamental structural feature enabling the agents to achieve <span><math><mi>k</mi></math></span>-agreement. We also investigate the use of common graph topologies, such as path and circulant graphs, for <span><math><mi>k</mi></math></span>-agreement, deriving insights into the relationship between the subspace dimension <span><math><mi>k</mi></math></span> and the required network connectivity. The effectiveness of the proposed framework is demonstrated through simulations in robotic formation control problems.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112753"},"PeriodicalIF":5.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145884874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-driven inverse optimal control for linear quadratic tracking with unknown target states 未知目标状态线性二次跟踪的数据驱动逆最优控制
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-03-01 Epub Date: 2026-01-08 DOI: 10.1016/j.automatica.2026.112822
Renshuo Cheng, Chengpu Yu, Yao Li
This paper studies the inverse optimal control for discrete-time finite-horizon linear quadratic tracking with unknown target states. Due to the time-varying feedback policies caused by the finite-horizon setting and the unknown system dynamics, the concerned inverse optimal control becomes challenging. To deal with it, a novel data driven inverse identification approach is developed, for which the corresponding identifiability conditions are provided and the statistical consistency is analyzed in the presence of observation noise. Compared to the existing solutions, the proposed approach requires only optimal trajectories, possibly corrupted by additive observation noise with zero mean and bounded covariance, and achieves consistent results without knowledge of the noise covariance. Finally, simulation examples are presented to show the effectiveness of the proposed approach.
研究了目标状态未知的离散有限视界线性二次跟踪的逆最优控制问题。由于有限视界设置和未知系统动力学所引起的时变反馈策略,使得逆最优控制变得具有挑战性。为了解决这一问题,提出了一种新的数据驱动的逆识别方法,给出了相应的可识别条件,并分析了在观测噪声存在下的统计一致性。与现有的解决方案相比,该方法只需要最优轨迹,可能被具有零平均值和有界协方差的附加观测噪声破坏,并且在不知道噪声协方差的情况下获得一致的结果。最后,通过仿真实例验证了该方法的有效性。
{"title":"Data-driven inverse optimal control for linear quadratic tracking with unknown target states","authors":"Renshuo Cheng,&nbsp;Chengpu Yu,&nbsp;Yao Li","doi":"10.1016/j.automatica.2026.112822","DOIUrl":"10.1016/j.automatica.2026.112822","url":null,"abstract":"<div><div>This paper studies the inverse optimal control for discrete-time finite-horizon linear quadratic tracking with unknown target states. Due to the time-varying feedback policies caused by the finite-horizon setting and the unknown system dynamics, the concerned inverse optimal control becomes challenging. To deal with it, a novel data driven inverse identification approach is developed, for which the corresponding identifiability conditions are provided and the statistical consistency is analyzed in the presence of observation noise. Compared to the existing solutions, the proposed approach requires only optimal trajectories, possibly corrupted by additive observation noise with zero mean and bounded covariance, and achieves consistent results without knowledge of the noise covariance. Finally, simulation examples are presented to show the effectiveness of the proposed approach.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112822"},"PeriodicalIF":5.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145939140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive dynamic programming for unknown continuous-time nonlinear systems via bias-policy iteration 基于偏策略迭代的未知连续非线性系统自适应动态规划
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-03-01 Epub Date: 2026-01-09 DOI: 10.1016/j.automatica.2026.112821
Ruiqing Zhang, Huaiyuan Jiang, Bin Zhou
In this paper, a bias-policy iteration (Bias-PI) method is proposed to relax the requirement of the policy iteration method on the initial admissible control and achieve optimal control for unknown continuous-time nonlinear systems. First, a model-based Bias-PI method is introduced that uses a bias value function to ease the constraints of the initial admissible control. The boundedness of the bias value function and the convergence of the algorithm are demonstrated through rigorous mathematical proofs. Further, the data-driven implementation of the Bias-PI method is detailed, highlighting its ability to learn an optimal controller without prior system information, and simultaneously retaining the fast convergence properties of the traditional policy iteration algorithm. The effectiveness of the data-driven Bias-PI method is illustrated through two simulation examples.
针对未知连续时间非线性系统,提出了一种偏置策略迭代(Bias-PI)方法,放宽了策略迭代法对初始允许控制的要求,实现了系统的最优控制。首先,引入了一种基于模型的偏置- pi方法,该方法使用偏置值函数来缓解初始允许控制的约束。通过严格的数学证明,证明了偏置值函数的有界性和算法的收敛性。此外,详细介绍了Bias-PI方法的数据驱动实现,强调了它在没有先验系统信息的情况下学习最优控制器的能力,同时保留了传统策略迭代算法的快速收敛特性。通过两个仿真实例说明了数据驱动的Bias-PI方法的有效性。
{"title":"Adaptive dynamic programming for unknown continuous-time nonlinear systems via bias-policy iteration","authors":"Ruiqing Zhang,&nbsp;Huaiyuan Jiang,&nbsp;Bin Zhou","doi":"10.1016/j.automatica.2026.112821","DOIUrl":"10.1016/j.automatica.2026.112821","url":null,"abstract":"<div><div>In this paper, a bias-policy iteration (Bias-PI) method is proposed to relax the requirement of the policy iteration method on the initial admissible control and achieve optimal control for unknown continuous-time nonlinear systems. First, a model-based Bias-PI method is introduced that uses a bias value function to ease the constraints of the initial admissible control. The boundedness of the bias value function and the convergence of the algorithm are demonstrated through rigorous mathematical proofs. Further, the data-driven implementation of the Bias-PI method is detailed, highlighting its ability to learn an optimal controller without prior system information, and simultaneously retaining the fast convergence properties of the traditional policy iteration algorithm. The effectiveness of the data-driven Bias-PI method is illustrated through two simulation examples.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112821"},"PeriodicalIF":5.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145939238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fusion estimation for multi-rate systems with probabilistic quantization and transmission power constraints: A federated-filtering-based method 具有概率量化和传输功率约束的多速率系统的融合估计:一种基于联邦滤波的方法
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-03-01 Epub Date: 2026-01-09 DOI: 10.1016/j.automatica.2026.112824
Na Li , Lei Zou , Jiayue Sun , Derui Ding
The federated-filtering-based (FFB) fusion estimation problem is investigated in this paper for networked multi-rate systems, where the measurement signals are transmitted over a wireless network with limited transmission power. A probabilistic quantization mechanism is introduced to handle the raw measurement signals for the purpose of facilitating digital communication over network. Certain transmission models are proposed to describe the behaviors under the effects of multi-rate dynamics, probabilistic quantization and limited transmission power. A delicately designed FFB fusion scheme is proposed to acquire the desired state estimates, where the local filters will receive feedback from the fusion center to reset their estimates. The parameters for the local filters are calculated by recursively minimizing their upper-bounds for the estimation error covariances. Furthermore, new conditions have been derived to analyze the ultimately boundedness of the estimation error covariance for the fusion center. Subsequently, a power allocation strategy is designed by minimizing such ultimate bound subject to the given transmission power constraint. Finally, the effectiveness of the proposed fusion estimation strategy and its optimal power allocation scheme is verified through a simulation example.
研究了基于联邦滤波的多速率系统的融合估计问题,其中测量信号在有限传输功率的无线网络中传输。为了方便网络上的数字通信,引入了一种概率量化机制对原始测量信号进行处理。在多速率动力学、概率量化和有限传输功率的影响下,提出了一定的传输模型来描述传输行为。提出了一种精心设计的FFB融合方案,以获取期望的状态估计,其中局部滤波器将接收来自融合中心的反馈以重置其估计。通过递归最小化估计误差协方差的上界来计算局部滤波器的参数。在此基础上,导出了分析融合中心估计误差协方差最终有界性的新条件。然后,在给定的传输功率约束下,通过最小化传输功率极限来设计功率分配策略。最后,通过仿真实例验证了所提出的融合估计策略及其最优功率分配方案的有效性。
{"title":"Fusion estimation for multi-rate systems with probabilistic quantization and transmission power constraints: A federated-filtering-based method","authors":"Na Li ,&nbsp;Lei Zou ,&nbsp;Jiayue Sun ,&nbsp;Derui Ding","doi":"10.1016/j.automatica.2026.112824","DOIUrl":"10.1016/j.automatica.2026.112824","url":null,"abstract":"<div><div>The federated-filtering-based (FFB) fusion estimation problem is investigated in this paper for networked multi-rate systems, where the measurement signals are transmitted over a wireless network with limited transmission power. A probabilistic quantization mechanism is introduced to handle the raw measurement signals for the purpose of facilitating digital communication over network. Certain transmission models are proposed to describe the behaviors under the effects of multi-rate dynamics, probabilistic quantization and limited transmission power. A delicately designed FFB fusion scheme is proposed to acquire the desired state estimates, where the local filters will receive feedback from the fusion center to reset their estimates. The parameters for the local filters are calculated by recursively minimizing their upper-bounds for the estimation error covariances. Furthermore, new conditions have been derived to analyze the ultimately boundedness of the estimation error covariance for the fusion center. Subsequently, a power allocation strategy is designed by minimizing such ultimate bound subject to the given transmission power constraint. Finally, the effectiveness of the proposed fusion estimation strategy and its optimal power allocation scheme is verified through a simulation example.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112824"},"PeriodicalIF":5.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145939237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Automatica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1