首页 > 最新文献

Automatica最新文献

英文 中文
Distributed output synchronization of discrete-time multi-agent systems: A data-driven approach 离散时间多智能体系统的分布式输出同步:一种数据驱动方法
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-12-22 DOI: 10.1016/j.automatica.2025.112789
Giulio Fattore, Maria Elena Valcher
Given an autonomous exosystem generating a reference output, we consider the problem of designing a distributed data-driven control law for a network of discrete-time heterogeneous LTI agents, to synchronize the agents’ outputs to the reference one. The agents split into two groups: leaders, with direct access to the exosystem output, and followers, which only receive information from their neighbors. Each agent implements a state feedback strategy relying on its own state and on an estimate of the exogenous system state, provided by an internal state observer. Necessary and sufficient conditions for the existence of a solution are first derived in the model-based setup and then in a data-driven context. An algorithm is provided to design from data the matrices of the distributed control law. A numerical example illustrates the effectiveness of the proposed approach.
给定一个产生参考输出的自治系统,我们考虑为离散时间异构LTI智能体网络设计一个分布式数据驱动控制律的问题,以使智能体的输出与参考输出同步。代理人分成两组:领导者,直接访问外部系统的输出,追随者,只能从他们的邻居那里接收信息。每个智能体根据自己的状态和由内部状态观测器提供的外生系统状态的估计实现状态反馈策略。首先在基于模型的设置中推导出解决方案存在的充分必要条件,然后在数据驱动的上下文中推导。给出了一种从数据中设计分布式控制律矩阵的算法。数值算例说明了该方法的有效性。
{"title":"Distributed output synchronization of discrete-time multi-agent systems: A data-driven approach","authors":"Giulio Fattore,&nbsp;Maria Elena Valcher","doi":"10.1016/j.automatica.2025.112789","DOIUrl":"10.1016/j.automatica.2025.112789","url":null,"abstract":"<div><div>Given an autonomous exosystem generating a reference output, we consider the problem of designing a distributed data-driven control law for a network of discrete-time heterogeneous LTI agents, to synchronize the agents’ outputs to the reference one. The agents split into two groups: leaders, with direct access to the exosystem output, and followers, which only receive information from their neighbors. Each agent implements a state feedback strategy relying on its own state and on an estimate of the exogenous system state, provided by an internal state observer. Necessary and sufficient conditions for the existence of a solution are first derived in the model-based setup and then in a data-driven context. An algorithm is provided to design from data the matrices of the distributed control law. A numerical example illustrates the effectiveness of the proposed approach.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112789"},"PeriodicalIF":5.9,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A multi-objective hierarchical control framework for battery units in DC microgrids using improved adaptive droop control 基于改进自适应下垂控制的直流微电网电池单元多目标分层控制框架
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-12-22 DOI: 10.1016/j.automatica.2025.112762
Yangyang Qian , Zongli Lin , Yacov A. Shamash
We investigate the primary/secondary control problem for battery units with adaptive droop control in DC microgrids. A majority of existing control methods for state-of-charge (SoC) balancing among battery units require that the terminal voltage of each battery unit remains constant. To remove such a requirement, at the primary control level, we propose an improved adaptive droop control method by introducing an auxiliary battery state in terms of the battery parameters, the terminal voltage, and a power function of the SoC. The designed droop coefficient is inversely proportional to the auxiliary battery state. It is shown that the proposed adaptive droop control achieves proportional power sharing and SoC balancing, albeit at a cost of compromised voltage regulation. A salient feature of the proposed adaptive droop control method is that the battery’s terminal voltage is allowed to be time-varying. Moreover, to compensate for the voltage deviation caused by the improved adaptive droop control, we propose a consensus-based distributed control scheme at the secondary control level. It is shown that the resulting hierarchical control framework, consisting of the primary and secondary control levels, guarantees that the control objectives of voltage regulation, proportional power sharing, and SoC balancing are all achieved. Simulation studies based on an accurate electrical battery model validate the effectiveness of our multi-objective hierarchical control framework.
研究了直流微电网中具有自适应下垂控制的电池单元的一次/二次控制问题。现有的电池荷电状态(SoC)平衡控制方法大多要求每个电池单元的终端电压保持恒定。为了消除这样的要求,在主控制层面,我们提出了一种改进的自适应下垂控制方法,通过引入电池参数、终端电压和SoC的功率函数等辅助电池状态。设计的下垂系数与辅助电池状态成反比。结果表明,尽管以降低电压调节为代价,但所提出的自适应下垂控制实现了比例功率共享和SoC平衡。提出的自适应下垂控制方法的一个显著特点是允许电池端电压随时间变化。此外,为了补偿由改进的自适应下垂控制引起的电压偏差,我们提出了一种基于共识的二次控制分布式控制方案。结果表明,由一级控制和二级控制组成的分层控制框架保证了电压调节、比例功率共享和SoC平衡的控制目标都能实现。基于精确电池模型的仿真研究验证了多目标分层控制框架的有效性。
{"title":"A multi-objective hierarchical control framework for battery units in DC microgrids using improved adaptive droop control","authors":"Yangyang Qian ,&nbsp;Zongli Lin ,&nbsp;Yacov A. Shamash","doi":"10.1016/j.automatica.2025.112762","DOIUrl":"10.1016/j.automatica.2025.112762","url":null,"abstract":"<div><div>We investigate the primary/secondary control problem for battery units with adaptive droop control in DC microgrids. A majority of existing control methods for state-of-charge (SoC) balancing among battery units require that the terminal voltage of each battery unit remains constant. To remove such a requirement, at the primary control level, we propose an improved adaptive droop control method by introducing an auxiliary battery state in terms of the battery parameters, the terminal voltage, and a power function of the SoC. The designed droop coefficient is inversely proportional to the auxiliary battery state. It is shown that the proposed adaptive droop control achieves proportional power sharing and SoC balancing, albeit at a cost of compromised voltage regulation. A salient feature of the proposed adaptive droop control method is that the battery’s terminal voltage is allowed to be time-varying. Moreover, to compensate for the voltage deviation caused by the improved adaptive droop control, we propose a consensus-based distributed control scheme at the secondary control level. It is shown that the resulting hierarchical control framework, consisting of the primary and secondary control levels, guarantees that the control objectives of voltage regulation, proportional power sharing, and SoC balancing are all achieved. Simulation studies based on an accurate electrical battery model validate the effectiveness of our multi-objective hierarchical control framework.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112762"},"PeriodicalIF":5.9,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling nonlinear control systems via Koopman Control Family: Universal forms and subspace invariance proximity 用Koopman控制族建模非线性控制系统:通用形式和子空间不变性接近
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-12-22 DOI: 10.1016/j.automatica.2025.112722
Masih Haseli , Jorge Cortés
This paper introduces the Koopman Control Family (KCF), a mathematical framework for modeling general (not necessarily control-affine) discrete-time nonlinear control systems with the aim of providing a solid theoretical foundation for the use of Koopman-based methods in systems with inputs. We demonstrate that the concept of KCF captures the behavior of nonlinear control systems on a (potentially infinite-dimensional) function space. By employing a generalized notion of subspace invariance under the KCF, we establish a universal form for finite-dimensional models, which encompasses the commonly used lifted linear, bilinear, and linear switched models as specific instances. In cases where the subspace is not invariant under the KCF, we propose a method for approximating models in general form and characterize the model’s accuracy using the concept of invariance proximity. We end by discussing how the proposed framework naturally lends itself to data-driven modeling of control systems.
本文介绍了Koopman控制族(KCF),这是一个用于一般(不一定是仿射控制)离散时间非线性控制系统建模的数学框架,旨在为在具有输入的系统中使用基于Koopman的方法提供坚实的理论基础。我们证明了KCF的概念捕获了非线性控制系统在(潜在的无限维)函数空间上的行为。利用KCF下子空间不变性的广义概念,建立了有限维模型的通用形式,其中包括常用的提升线性、双线性和线性切换模型作为具体实例。对于子空间在KCF下不是不变的情况,我们提出了一种以一般形式逼近模型的方法,并使用不变接近的概念来表征模型的精度。最后,我们将讨论所提出的框架如何自然地适用于控制系统的数据驱动建模。
{"title":"Modeling nonlinear control systems via Koopman Control Family: Universal forms and subspace invariance proximity","authors":"Masih Haseli ,&nbsp;Jorge Cortés","doi":"10.1016/j.automatica.2025.112722","DOIUrl":"10.1016/j.automatica.2025.112722","url":null,"abstract":"<div><div>This paper introduces the Koopman Control Family (KCF), a mathematical framework for modeling general (not necessarily control-affine) discrete-time nonlinear control systems with the aim of providing a solid theoretical foundation for the use of Koopman-based methods in systems with inputs. We demonstrate that the concept of KCF captures the behavior of nonlinear control systems on a (potentially infinite-dimensional) function space. By employing a generalized notion of subspace invariance under the KCF, we establish a universal form for finite-dimensional models, which encompasses the commonly used lifted linear, bilinear, and linear switched models as specific instances. In cases where the subspace is not invariant under the KCF, we propose a method for approximating models in general form and characterize the model’s accuracy using the concept of invariance proximity. We end by discussing how the proposed framework naturally lends itself to data-driven modeling of control systems.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112722"},"PeriodicalIF":5.9,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Backstepping-based fault-tolerant control for strict-feedback nonlinear multi-agent systems: An encoding–decoding scheme 基于退步的严格反馈非线性多智能体系统容错控制:一种编解码方案
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-12-22 DOI: 10.1016/j.automatica.2025.112800
Chao Zhou, Zehui Mao, Bin Jiang
In this paper, the fault-tolerant consensus tracking control problem is investigated for a class of discrete-time strict-feedback nonlinear multi-agent systems with sensor faults. The sensor faults are described by the first-order difference equations. By leveraging the augmented state observer structure, an auxiliary nonlinear system is first constructed in a non-strict feedback form, based on which a backstepping-based control framework is established. The consensus tracking errors are directly incorporated into the virtual control to preserve the distributed structure of the controller. Furthermore, in order to alleviate the communication burden and enhance the transmission security, an encoding–decoding scheme is adopted during the information exchange among agents. Accordingly, an encoding–decoding-based distributed controller is proposed, guaranteeing that the output tracking error is uniformly ultimately bounded and the transmitted data is conditionally bounded. Finally, a numerical example is provided to validate the effectiveness of the proposed backstepping-based control framework.
研究了一类具有传感器故障的离散时间严格反馈非线性多智能体系统的容错一致跟踪控制问题。传感器故障用一阶差分方程描述。利用增广状态观测器结构,首先以非严格反馈形式构造了辅助非线性系统,在此基础上建立了基于后阶的控制框架。将一致性跟踪误差直接纳入虚拟控制中,保持了控制器的分布式结构。此外,为了减轻通信负担和提高传输安全性,在agent之间的信息交换中采用了编解码的方式。据此,提出了一种基于编解码的分布式控制器,保证了输出跟踪误差最终统一有界,传输数据有条件有界。最后,给出了一个数值算例,验证了该控制框架的有效性。
{"title":"Backstepping-based fault-tolerant control for strict-feedback nonlinear multi-agent systems: An encoding–decoding scheme","authors":"Chao Zhou,&nbsp;Zehui Mao,&nbsp;Bin Jiang","doi":"10.1016/j.automatica.2025.112800","DOIUrl":"10.1016/j.automatica.2025.112800","url":null,"abstract":"<div><div>In this paper, the fault-tolerant consensus tracking control problem is investigated for a class of discrete-time strict-feedback nonlinear multi-agent systems with sensor faults. The sensor faults are described by the first-order difference equations. By leveraging the augmented state observer structure, an auxiliary nonlinear system is first constructed in a non-strict feedback form, based on which a backstepping-based control framework is established. The consensus tracking errors are directly incorporated into the virtual control to preserve the distributed structure of the controller. Furthermore, in order to alleviate the communication burden and enhance the transmission security, an encoding–decoding scheme is adopted during the information exchange among agents. Accordingly, an encoding–decoding-based distributed controller is proposed, guaranteeing that the output tracking error is uniformly ultimately bounded and the transmitted data is conditionally bounded. Finally, a numerical example is provided to validate the effectiveness of the proposed backstepping-based control framework.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112800"},"PeriodicalIF":5.9,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic mode decomposition of higher order systems 高阶系统的动态模态分解
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-12-22 DOI: 10.1016/j.automatica.2025.112776
Joel A. Rosenfeld , Benjamin P. Russo , Rushikesh Kamalapurkar
Conventionally, data driven identification and control problems for higher order dynamical systems are solved by augmenting the system state by the derivatives of the output to formulate first order dynamical systems in higher dimensions. However, solution of the augmented problem typically requires knowledge of the full augmented state, which requires numerical differentiation of the original output, frequently resulting in noisy signals. This manuscript develops the theory necessary for a direct analysis of higher order dynamical systems using higher order Liouville operators. Fundamental to this theoretical development is the introduction of signal valued RKHSs and new operators posed over these spaces. Ultimately, it is observed that despite the added abstractions, the necessary computations are remarkably similar to that of first order DMD methods using occupation kernels.
传统上,高阶动力系统的数据驱动识别和控制问题是通过将系统状态通过输出的导数进行增广来实现高维一阶动力系统。然而,增广问题的解决通常需要了解完整的增广状态,这需要对原始输出进行数值微分,经常导致噪声信号。本文发展了使用高阶刘维尔算子直接分析高阶动力系统所必需的理论。这一理论发展的基础是引入了信号值RKHSs和在这些空间上提出的新算子。最后,可以观察到,尽管添加了抽象,但必要的计算与使用占用核的一阶DMD方法非常相似。
{"title":"Dynamic mode decomposition of higher order systems","authors":"Joel A. Rosenfeld ,&nbsp;Benjamin P. Russo ,&nbsp;Rushikesh Kamalapurkar","doi":"10.1016/j.automatica.2025.112776","DOIUrl":"10.1016/j.automatica.2025.112776","url":null,"abstract":"<div><div>Conventionally, data driven identification and control problems for higher order dynamical systems are solved by augmenting the system state by the derivatives of the output to formulate first order dynamical systems in higher dimensions. However, solution of the augmented problem typically requires knowledge of the full augmented state, which requires numerical differentiation of the original output, frequently resulting in noisy signals. This manuscript develops the theory necessary for a direct analysis of higher order dynamical systems using higher order Liouville operators. Fundamental to this theoretical development is the introduction of signal valued RKHSs and new operators posed over these spaces. Ultimately, it is observed that despite the added abstractions, the necessary computations are remarkably similar to that of first order DMD methods using occupation kernels.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112776"},"PeriodicalIF":5.9,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Safety-critical optimal control of switched systems 切换系统的安全关键优化控制
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-12-22 DOI: 10.1016/j.automatica.2025.112775
Lijun Long , Chunxiao Huang , Zhendong Sun
This paper investigates the problem of safety-critical optimal control for switched systems via weak topology. A novel safety-critical optimal control framework is established for finding an optimal switching signal and an optimal controller that minimize a switched cost function and ensure safety along the whole optimization time interval. A necessary and sufficient condition is presented to reveal the essence of the safety for switched systems. Also, a single barrier function (SBF) method is proposed to ensure safety of switched systems, where each subsystem does not necessarily possess the safety in the whole safe set. In order to analyze the safety of switched systems and asymptotic stability of the safe set, a new version of the comparison function result is presented, which handles a class of extended comparison functions. Moreover, an optimization algorithm based on weak topology is developed to solve the safety-critical optimal control problem of switched systems, where additional dynamics are introduced to overcome the limitations caused by the unavailability of the traditional weak topology. Finally, two examples including a continuously stirred tank reactor system are given to verify the effectiveness of the methods proposed.
本文研究了基于弱拓扑的切换系统安全临界最优控制问题。建立了一种新的安全关键最优控制框架,用于寻找最优切换信号和最优控制器,以最小化切换代价函数并保证整个优化时间区间的安全性。给出了揭示开关系统安全本质的充分必要条件。同时,提出了一种单屏障函数(SBF)方法来保证开关系统的安全性,其中每个子系统不一定具有整个安全集的安全性。为了分析切换系统的安全性和安全集的渐近稳定性,提出了一种新的比较函数结果,它处理了一类扩展的比较函数。此外,提出了一种基于弱拓扑的优化算法来解决切换系统的安全关键型最优控制问题,该算法引入了附加动力学来克服传统弱拓扑不可用性的局限性。最后,以一个连续搅拌罐式反应器系统为例,验证了所提方法的有效性。
{"title":"Safety-critical optimal control of switched systems","authors":"Lijun Long ,&nbsp;Chunxiao Huang ,&nbsp;Zhendong Sun","doi":"10.1016/j.automatica.2025.112775","DOIUrl":"10.1016/j.automatica.2025.112775","url":null,"abstract":"<div><div>This paper investigates the problem of safety-critical optimal control for switched systems via weak topology. A novel safety-critical optimal control framework is established for finding an optimal switching signal and an optimal controller that minimize a switched cost function and ensure safety along the whole optimization time interval. A necessary and sufficient condition is presented to reveal the essence of the safety for switched systems. Also, a single barrier function (SBF) method is proposed to ensure safety of switched systems, where each subsystem does not necessarily possess the safety in the whole safe set. In order to analyze the safety of switched systems and asymptotic stability of the safe set, a new version of the comparison function result is presented, which handles a class of extended comparison functions. Moreover, an optimization algorithm based on weak topology is developed to solve the safety-critical optimal control problem of switched systems, where additional dynamics are introduced to overcome the limitations caused by the unavailability of the traditional weak topology. Finally, two examples including a continuously stirred tank reactor system are given to verify the effectiveness of the methods proposed.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112775"},"PeriodicalIF":5.9,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145813909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exact solution to the least squares realization problem as a multiparameter eigenvalue problem 将最小二乘实现问题精确解为多参数特征值问题
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-12-20 DOI: 10.1016/j.automatica.2025.112792
Sibren Lagauw, Lukas Vanpoucke, Bart De Moor
We consider the least squares (LS) realization of an autonomous single-output linear time-invariant dynamical model for a given sequence of output data. As opposed to standard system identification practices, which often rely on (heuristic) iterative optimization techniques, we propose an exact solution to this non-convex optimization problem: the globally optimal solution(s) are identified by means of a deterministic eigenvalue procedure. In particular, we illustrate that for all (local) minimizers, the corresponding misfit can be characterized as the result of filtering an unknown signal twice through a finite-impulse response filter. Exploiting this insight, we propose a novel (rectangular) multiparameter eigenvalue problem (MEP), the eigentuples of which allow to retrieve all local and global minimizers of the identification problem. The proposed MEP is of great theoretical interest and offers new insights into the structure of the LS realization problem, which we explore in detail. We provide numerical examples to illustrate our findings.
考虑了给定输出数据序列的自主单输出线性定常动态模型的最小二乘实现。与通常依赖于(启发式)迭代优化技术的标准系统识别实践相反,我们提出了这种非凸优化问题的精确解决方案:通过确定性特征值过程识别全局最优解。特别地,我们说明了对于所有(局部)最小化,相应的失配可以表征为通过有限脉冲响应滤波器对未知信号进行两次滤波的结果。利用这一见解,我们提出了一种新的(矩形)多参数特征值问题(MEP),其特征组允许检索识别问题的所有局部和全局最小值。所提出的MEP具有很大的理论价值,并为LS实现问题的结构提供了新的见解,我们对此进行了详细的探讨。我们提供了数值例子来说明我们的发现。
{"title":"Exact solution to the least squares realization problem as a multiparameter eigenvalue problem","authors":"Sibren Lagauw,&nbsp;Lukas Vanpoucke,&nbsp;Bart De Moor","doi":"10.1016/j.automatica.2025.112792","DOIUrl":"10.1016/j.automatica.2025.112792","url":null,"abstract":"<div><div>We consider the least squares (LS) realization of an autonomous single-output linear time-invariant dynamical model for a given sequence of output data. As opposed to standard system identification practices, which often rely on (heuristic) iterative optimization techniques, we propose an exact solution to this non-convex optimization problem: the <em>globally optimal</em> solution(s) are identified by means of a <em>deterministic</em> eigenvalue procedure. In particular, we illustrate that for all (local) minimizers, the corresponding misfit can be characterized as the result of filtering an unknown signal twice through a finite-impulse response filter. Exploiting this insight, we propose a novel (rectangular) multiparameter eigenvalue problem (MEP), the eigentuples of which allow to retrieve all local and global minimizers of the identification problem. The proposed MEP is of great theoretical interest and offers new insights into the structure of the LS realization problem, which we explore in detail. We provide numerical examples to illustrate our findings.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112792"},"PeriodicalIF":5.9,"publicationDate":"2025-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145784437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-enabled iterative learning control: A zero-sum game design for time-scale-varying tasks 数据支持迭代学习控制:时间尺度变化任务的零和游戏设计
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-12-20 DOI: 10.1016/j.automatica.2025.112781
Zhihe Zhuang , Rodrigo A. González , Hongfeng Tao , Wojciech Paszke , Tom Oomen
Iterative learning control (ILC) is an intelligent control methodology for tackling iteration-invariant exogenous inputs. It is of great significance to develop its extrapolation for more general repetitive tasks with mutual similarity, e.g., tasks with different time scales. In practice, discrete-time ILC with sampling behavior for time-scale-varying tasks suffers from the failure of perfect corresponding learning and environment-dependent iteration-varying disturbances. This paper develops a novel direct data-based ILC algorithm using off-policy Q-learning for tasks with varying time scales, enabling the robust learning of an optimal ILC policy from experimental input/output (I/O) data. From a two-player zero-sum game perspective, the iteration-varying disturbance generated from the varying time scales of repetitive tasks is tackled quantitatively with a preset disturbance attenuation level. Further, to emphasize the importance of theoretical guarantees of reinforcement learning (RL)-based ILC designs, the data efficiency of the developed algorithm is enhanced based on Willems’ Fundamental Lemma, and a rigorous convergence analysis is given. The simulation model of an F-16 aircraft autopilot is employed to show the effectiveness of the developed approach.
迭代学习控制(ILC)是一种用于处理迭代不变外生输入的智能控制方法。对于更一般的具有相互相似性的重复性任务,如不同时间尺度的任务,发展其外推具有重要意义。在实际应用中,具有采样行为的离散时间ILC在处理时标变化任务时,会遇到相应的完美学习失败和环境相关的迭代变化干扰。本文开发了一种新的直接基于数据的ILC算法,该算法使用离策略q学习来处理具有不同时间尺度的任务,从而能够从实验输入/输出(I/O)数据中鲁棒学习最优ILC策略。从二人零和博弈的角度出发,利用预设的干扰衰减水平定量地处理重复性任务的不同时间尺度所产生的迭代变化的干扰。此外,为了强调基于强化学习(RL)的ILC设计的理论保证的重要性,基于Willems基本引理提高了所开发算法的数据效率,并给出了严格的收敛分析。通过F-16飞机自动驾驶仪的仿真模型验证了该方法的有效性。
{"title":"Data-enabled iterative learning control: A zero-sum game design for time-scale-varying tasks","authors":"Zhihe Zhuang ,&nbsp;Rodrigo A. González ,&nbsp;Hongfeng Tao ,&nbsp;Wojciech Paszke ,&nbsp;Tom Oomen","doi":"10.1016/j.automatica.2025.112781","DOIUrl":"10.1016/j.automatica.2025.112781","url":null,"abstract":"<div><div>Iterative learning control (ILC) is an intelligent control methodology for tackling iteration-invariant exogenous inputs. It is of great significance to develop its extrapolation for more general repetitive tasks with mutual similarity, e.g., tasks with different time scales. In practice, discrete-time ILC with sampling behavior for time-scale-varying tasks suffers from the failure of perfect corresponding learning and environment-dependent iteration-varying disturbances. This paper develops a novel direct data-based ILC algorithm using off-policy Q-learning for tasks with varying time scales, enabling the robust learning of an optimal ILC policy from experimental input/output (I/O) data. From a two-player zero-sum game perspective, the iteration-varying disturbance generated from the varying time scales of repetitive tasks is tackled quantitatively with a preset disturbance attenuation level. Further, to emphasize the importance of theoretical guarantees of reinforcement learning (RL)-based ILC designs, the data efficiency of the developed algorithm is enhanced based on Willems’ Fundamental Lemma, and a rigorous convergence analysis is given. The simulation model of an F-16 aircraft autopilot is employed to show the effectiveness of the developed approach.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112781"},"PeriodicalIF":5.9,"publicationDate":"2025-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145784438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decentralized control of finite state systems: A game theoretic approach 有限状态系统的分散控制:一种博弈论方法
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-12-20 DOI: 10.1016/j.automatica.2025.112773
Giordano Pola, Elena De Santis, Maria Domenica Di Benedetto
In this paper we consider a pair of interconnected, nondeterministic and metric finite state systems and address a control problem where controllers are designed for enforcing local specifications expressed in terms of regular languages, up to a desired accuracy. The control architecture considered is decentralized, that is each controller can only communicate with the corresponding plant. Since plant systems are interconnected, the part of the specification that can be enforced on one system depends on the part that can be applied on the other one. We show how this dependency can be formalized in terms of equilibria, by extending game theory to the present framework. We introduce notions of equilibria, Nash equilibria and dominant equilibria. When controlled plants are at an equilibrium, they satisfy a part of their specification; when they are at a Nash equilibrium, deviation of each plant from its control strategy may correspond to a loss in terms of the part of specification enforced; when they are at a dominant equilibrium, there is no other equilibrium where plants can achieve larger parts of the corresponding specifications. A characterization of these notions is derived and checkable conditions are discussed. An example in the context of multi-agent systems with shared resources is also included.
在本文中,我们考虑了一对相互连接的,不确定的和度量的有限状态系统,并解决了一个控制问题,其中控制器被设计用于执行用常规语言表示的局部规范,达到所需的精度。所考虑的控制体系结构是分散的,即每个控制器只能与相应的设备通信。由于工厂系统是相互连接的,规范中可以在一个系统上执行的部分取决于可以在另一个系统上应用的部分。通过将博弈论扩展到目前的框架,我们展示了这种依赖如何在均衡方面形式化。我们介绍均衡,纳什均衡和优势均衡的概念。当受控植物处于平衡状态时,它们满足了其规格的一部分;当它们处于纳什均衡时,每个工厂对其控制策略的偏离可能对应于所执行规范部分的损失;当它们处于优势平衡时,就没有其他平衡可以使植物达到相应规格的较大部分。导出了这些概念的一个表征,并讨论了可检验的条件。本文还提供了具有共享资源的多代理系统上下文中的一个示例。
{"title":"Decentralized control of finite state systems: A game theoretic approach","authors":"Giordano Pola,&nbsp;Elena De Santis,&nbsp;Maria Domenica Di Benedetto","doi":"10.1016/j.automatica.2025.112773","DOIUrl":"10.1016/j.automatica.2025.112773","url":null,"abstract":"<div><div>In this paper we consider a pair of interconnected, nondeterministic and metric finite state systems and address a control problem where controllers are designed for enforcing local specifications expressed in terms of regular languages, up to a desired accuracy. The control architecture considered is decentralized, that is each controller can only communicate with the corresponding plant. Since plant systems are interconnected, the part of the specification that can be enforced on one system depends on the part that can be applied on the other one. We show how this dependency can be formalized in terms of equilibria, by extending game theory to the present framework. We introduce notions of equilibria, Nash equilibria and dominant equilibria. When controlled plants are at an equilibrium, they satisfy a part of their specification; when they are at a Nash equilibrium, deviation of each plant from its control strategy may correspond to a loss in terms of the part of specification enforced; when they are at a dominant equilibrium, there is no other equilibrium where plants can achieve larger parts of the corresponding specifications. A characterization of these notions is derived and checkable conditions are discussed. An example in the context of multi-agent systems with shared resources is also included.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112773"},"PeriodicalIF":5.9,"publicationDate":"2025-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145784432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online learning of nonlinear parametric models under non-smooth regularization using EKF and ADMM 基于EKF和ADMM的非光滑正则化非线性参数模型在线学习
IF 5.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-12-19 DOI: 10.1016/j.automatica.2025.112787
Lapo Frascati , Alberto Bemporad
This paper proposes a novel combination of extended Kalman filtering (EKF) with the alternating direction method of multipliers (ADMM) for learning parametric nonlinear models online under non-smooth regularization terms, including 1 and 0 penalties and bound constraints on model parameters. For the case of linear time-varying models and non-smooth convex regularization terms, we provide a sublinear regret bound that ensures the proper behavior of the online learning strategy. The approach is computationally efficient for a wide range of regularization terms, which makes it appealing for its use in embedded control applications for online model adaptation. We show the performance of the proposed method in three simulation examples, highlighting its effectiveness compared to other batch and online algorithms.
本文提出了一种扩展卡尔曼滤波(EKF)与乘法器交替方向法(ADMM)的新组合,用于非光滑正则化项下参数非线性模型的在线学习,包括对模型参数的约束和约束。对于线性时变模型和非光滑凸正则化项的情况,我们提供了一个次线性后悔界,以确保在线学习策略的适当行为。该方法对于广泛的正则化项具有计算效率,这使得它在用于在线模型适应的嵌入式控制应用中具有吸引力。我们在三个仿真示例中展示了所提出方法的性能,突出了与其他批处理和在线算法相比的有效性。
{"title":"Online learning of nonlinear parametric models under non-smooth regularization using EKF and ADMM","authors":"Lapo Frascati ,&nbsp;Alberto Bemporad","doi":"10.1016/j.automatica.2025.112787","DOIUrl":"10.1016/j.automatica.2025.112787","url":null,"abstract":"<div><div>This paper proposes a novel combination of extended Kalman filtering (EKF) with the alternating direction method of multipliers (ADMM) for learning parametric nonlinear models online under non-smooth regularization terms, including <span><math><msub><mrow><mi>ℓ</mi></mrow><mrow><mn>1</mn></mrow></msub></math></span> and <span><math><msub><mrow><mi>ℓ</mi></mrow><mrow><mn>0</mn></mrow></msub></math></span> penalties and bound constraints on model parameters. For the case of linear time-varying models and non-smooth convex regularization terms, we provide a sublinear regret bound that ensures the proper behavior of the online learning strategy. The approach is computationally efficient for a wide range of regularization terms, which makes it appealing for its use in embedded control applications for online model adaptation. We show the performance of the proposed method in three simulation examples, highlighting its effectiveness compared to other batch and online algorithms.</div></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":"185 ","pages":"Article 112787"},"PeriodicalIF":5.9,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145784442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Automatica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1