首页 > 最新文献

European Journal of Control最新文献

英文 中文
Optimal control of linear cost networks 线性成本网络的优化控制
IF 2.5 3区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-01 DOI: 10.1016/j.ejcon.2024.101068
David Ohlin, Emma Tegling, Anders Rantzer
We present a method for optimal control with respect to a linear cost function for positive linear systems with coupled input constraints. We show that the Bellman equation giving the optimal cost function and resulting sparse state feedback for these systems can be stated explicitly, with the solution given by a linear program. Our framework admits a range of network routing problems with underlying linear dynamics. These dynamics can be used to model traditional graph-theoretical problems like shortest path as a special case, but can also capture more complex behaviors. We provide an asynchronous and distributed value iteration algorithm for obtaining the optimal cost function and control law.
我们提出了一种针对具有耦合输入约束条件的正线性系统的线性成本函数的最优控制方法。我们证明,这些系统的贝尔曼方程给出了最优成本函数和由此产生的稀疏状态反馈,可以用线性规划给出解。我们的框架适用于一系列具有基本线性动力学的网络路由问题。这些动力学可以用来模拟传统的图论问题,如作为特例的最短路径,也可以捕捉更复杂的行为。我们提供了一种异步分布式值迭代算法,用于获得最佳成本函数和控制法则。
{"title":"Optimal control of linear cost networks","authors":"David Ohlin,&nbsp;Emma Tegling,&nbsp;Anders Rantzer","doi":"10.1016/j.ejcon.2024.101068","DOIUrl":"10.1016/j.ejcon.2024.101068","url":null,"abstract":"<div><div>We present a method for optimal control with respect to a linear cost function for positive linear systems with coupled input constraints. We show that the Bellman equation giving the optimal cost function and resulting sparse state feedback for these systems can be stated explicitly, with the solution given by a linear program. Our framework admits a range of network routing problems with underlying linear dynamics. These dynamics can be used to model traditional graph-theoretical problems like shortest path as a special case, but can also capture more complex behaviors. We provide an asynchronous and distributed value iteration algorithm for obtaining the optimal cost function and control law.</div></div>","PeriodicalId":50489,"journal":{"name":"European Journal of Control","volume":"80 ","pages":"Article 101068"},"PeriodicalIF":2.5,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141510764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Constructive synchronous observer design for inertial navigation with delayed GNSS measurements 利用延迟全球导航卫星系统测量进行惯性导航的建设性同步观测器设计
IF 2.5 3区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-01 DOI: 10.1016/j.ejcon.2024.101047
Pieter van Goor, Punjaya Wickramasinghe, Matthew Hampsey, Robert Mahony
Inertial Navigation Systems (INS) estimate a vehicle’s navigation states (attitude, velocity, and position) by combining measurements from an Inertial Measurement Unit (IMU) with other supporting sensors, typically including a Global Navigation Satellite System (GNSS) and a magnetometer. Recent nonlinear observer designs for INS provide powerful stability guarantees but do not account for some of the real-world challenges of INS. One of the key challenges is to account for the time-delay characteristic of GNSS measurements. This paper addresses this question by extending recent work on synchronous observer design for INS. The delayed GNSS measurements are related to the state at the current time using recursively-computable delay matrices, and this is used to design correction terms that leads to almost-globally asymptotic and locally exponential stability of the error. Simulation results verify the proposed observer and show that the compensation of time-delay is key to both transient and steady-state performance.
惯性导航系统(INS)通过将惯性测量单元(IMU)的测量数据与其他支持传感器(通常包括全球导航卫星系统(GNSS)和磁力计)的测量数据相结合来估计车辆的导航状态(姿态、速度和位置)。最近针对 INS 的非线性观测器设计提供了强大的稳定性保证,但没有考虑到 INS 在现实世界中面临的一些挑战。其中一个主要挑战是如何考虑 GNSS 测量的时延特性。本文通过扩展最近有关 INS 同步观测器设计的工作来解决这一问题。利用可递归计算的延迟矩阵,将延迟的 GNSS 测量与当前时间的状态联系起来,并以此设计校正项,从而实现误差的几乎全局渐近稳定性和局部指数稳定性。仿真结果验证了所提出的观测器,并表明时间延迟补偿是瞬态和稳态性能的关键。
{"title":"Constructive synchronous observer design for inertial navigation with delayed GNSS measurements","authors":"Pieter van Goor,&nbsp;Punjaya Wickramasinghe,&nbsp;Matthew Hampsey,&nbsp;Robert Mahony","doi":"10.1016/j.ejcon.2024.101047","DOIUrl":"10.1016/j.ejcon.2024.101047","url":null,"abstract":"<div><div>Inertial Navigation Systems (INS) estimate a vehicle’s navigation states (attitude, velocity, and position) by combining measurements from an Inertial Measurement Unit (IMU) with other supporting sensors, typically including a Global Navigation Satellite System (GNSS) and a magnetometer. Recent nonlinear observer designs for INS provide powerful stability guarantees but do not account for some of the real-world challenges of INS. One of the key challenges is to account for the time-delay characteristic of GNSS measurements. This paper addresses this question by extending recent work on synchronous observer design for INS. The delayed GNSS measurements are related to the state at the current time using recursively-computable delay matrices, and this is used to design correction terms that leads to almost-globally asymptotic and locally exponential stability of the error. Simulation results verify the proposed observer and show that the compensation of time-delay is key to both transient and steady-state performance.</div></div>","PeriodicalId":50489,"journal":{"name":"European Journal of Control","volume":"80 ","pages":"Article 101047"},"PeriodicalIF":2.5,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141510832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reinforcement learning based MPC with neural dynamical models 基于神经动力学模型的强化学习 MPC
IF 2.5 3区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-01 DOI: 10.1016/j.ejcon.2024.101048
Saket Adhau , Sébastien Gros , Sigurd Skogestad
This paper presents an end-to-end learning approach to developing a Nonlinear Model Predictive Control (NMPC) policy, which does not require an explicit first-principles model and assumes that the system dynamics are either unknown or partially known. The paper proposes the use of available measurements to identify a nominal Recurrent Neural Network (RNN) model to capture the nonlinear dynamics, which includes constraints on the state variables and inputs. To address the issue of suboptimal control policies resulting from simply fitting the model to the data, this paper uses Reinforcement learning (RL) to tune the NMPC scheme and generate an optimal policy for the real system. The approach’s novelty lies in the use of RL to overcome the limitations of the nominal RNN model and generate a more accurate control policy. The paper discusses the implementation aspects of initial state estimation for RNN models and integration of neural models in MPC. The presented method is demonstrated on a classic benchmark control problem: cascaded two tank system (CTS).
本文提出了一种开发非线性模型预测控制(NMPC)策略的端到端学习方法,该方法不需要明确的第一原理模型,并假定系统动态是未知或部分已知的。本文建议利用现有的测量数据来确定一个名义递归神经网络 (RNN) 模型,以捕捉非线性动态,其中包括状态变量和输入的约束条件。为了解决简单地根据数据拟合模型所产生的次优控制策略问题,本文使用强化学习(RL)来调整 NMPC 方案,并为实际系统生成最优策略。这种方法的新颖之处在于利用 RL 克服了名义 RNN 模型的局限性,并生成了更精确的控制策略。论文讨论了 RNN 模型的初始状态估计和 MPC 中神经模型集成的实施问题。本文提出的方法在一个经典的基准控制问题上进行了演示:级联双油箱系统 (CTS)。
{"title":"Reinforcement learning based MPC with neural dynamical models","authors":"Saket Adhau ,&nbsp;Sébastien Gros ,&nbsp;Sigurd Skogestad","doi":"10.1016/j.ejcon.2024.101048","DOIUrl":"10.1016/j.ejcon.2024.101048","url":null,"abstract":"<div><div>This paper presents an end-to-end learning approach to developing a Nonlinear Model Predictive Control (NMPC) policy, which does not require an explicit first-principles model and assumes that the system dynamics are either unknown or partially known. The paper proposes the use of available measurements to identify a nominal Recurrent Neural Network (RNN) model to capture the nonlinear dynamics, which includes constraints on the state variables and inputs. To address the issue of suboptimal control policies resulting from simply fitting the model to the data, this paper uses Reinforcement learning (RL) to tune the NMPC scheme and generate an optimal policy for the real system. The approach’s novelty lies in the use of RL to overcome the limitations of the nominal RNN model and generate a more accurate control policy. The paper discusses the implementation aspects of initial state estimation for RNN models and integration of neural models in MPC. The presented method is demonstrated on a classic benchmark control problem: cascaded two tank system (CTS).</div></div>","PeriodicalId":50489,"journal":{"name":"European Journal of Control","volume":"80 ","pages":"Article 101048"},"PeriodicalIF":2.5,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141510834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Closed-loop identification of stabilized models using dual input–output parameterization 利用双输入输出参数化对稳定模型进行闭环识别
IF 2.5 3区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-01 DOI: 10.1016/j.ejcon.2024.101089
Ran Chen , Amber Srivastava , Mingzhou Yin , Roy S. Smith
This paper introduces a dual input–output parameterization (dual IOP) for the identification of linear time-invariant systems from closed-loop data. It draws inspiration from the recent input–output parameterization developed to synthesize a stabilizing controller. The controller is parameterized in terms of closed-loop transfer functions, from the external disturbances to the input and output of the system, constrained to lie in a given subspace. Analogously, the dual IOP method parameterizes the unknown plant with analogous closed-loop transfer functions, also referred to as dual parameters. In this case, these closed-loop transfer functions are constrained to lie in an affine subspace guaranteeing that the identified plant is stabilized by the known controller. Compared with existing closed-loop identification techniques guaranteeing closed-loop stability, such as the dual Youla parameterization, the dual IOP requires neither a doubly-coprime factorization of the controller nor a nominal plant that is stabilized by the controller. The dual IOP does not depend on the order and the state-space realization of the controller either, as in the dual system-level parameterization. Simulation shows that the dual IOP outperforms the existing benchmark methods.
本文介绍了从闭环数据识别线性时变系统的双输入输出参数化(dual IOP)。它借鉴了最近为合成稳定控制器而开发的输入输出参数化方法。控制器是以闭环传递函数为参数的,从外部干扰到系统的输入和输出,都受限于一个给定的子空间。与此类似,对偶 IOP 方法也是通过类似的闭环传递函数(也称为对偶参数)对未知工厂进行参数化。在这种情况下,这些闭环传递函数受限于一个仿射子空间,从而保证识别出的工厂由已知控制器控制。与现有的保证闭环稳定性的闭环识别技术(如双 Youla 参数化)相比,双 IOP 既不需要控制器的双同调因子化,也不需要控制器稳定的标称工厂。双 IOP 也不像双系统级参数化那样取决于控制器的阶数和状态空间实现。仿真表明,双 IOP 优于现有的基准方法。
{"title":"Closed-loop identification of stabilized models using dual input–output parameterization","authors":"Ran Chen ,&nbsp;Amber Srivastava ,&nbsp;Mingzhou Yin ,&nbsp;Roy S. Smith","doi":"10.1016/j.ejcon.2024.101089","DOIUrl":"10.1016/j.ejcon.2024.101089","url":null,"abstract":"<div><div>This paper introduces a dual input–output parameterization (dual IOP) for the identification of linear time-invariant systems from closed-loop data. It draws inspiration from the recent input–output parameterization developed to synthesize a stabilizing controller. The controller is parameterized in terms of closed-loop transfer functions, from the external disturbances to the input and output of the system, constrained to lie in a given subspace. Analogously, the dual IOP method parameterizes the unknown plant with analogous closed-loop transfer functions, also referred to as dual parameters. In this case, these closed-loop transfer functions are constrained to lie in an affine subspace guaranteeing that the identified plant is <em>stabilized</em> by the known controller. Compared with existing closed-loop identification techniques guaranteeing closed-loop stability, such as the dual Youla parameterization, the dual IOP requires neither a doubly-coprime factorization of the controller nor a nominal plant that is stabilized by the controller. The dual IOP does not depend on the order and the state-space realization of the controller either, as in the dual system-level parameterization. Simulation shows that the dual IOP outperforms the existing benchmark methods.</div></div>","PeriodicalId":50489,"journal":{"name":"European Journal of Control","volume":"80 ","pages":"Article 101089"},"PeriodicalIF":2.5,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141781381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards fully autonomous orbit management for low-earth orbit satellites based on neuro-evolutionary algorithms and deep reinforcement learning 基于神经进化算法和深度强化学习实现低地轨道卫星的完全自主轨道管理
IF 2.5 3区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-01 DOI: 10.1016/j.ejcon.2024.101052
Alexander Kyuroson, Avijit Banerjee, Nektarios Aristeidis Tafanidis, Sumeet Satpute, George Nikolakopoulos
The recent advances in space technology are focusing on fully autonomous, real-time, long-term orbit management and mission planning for large-scale satellite constellations in Low-Earth Orbit (LEO). Thus, a pioneering approach for autonomous orbital station-keeping has been introduced using a model-free Deep Policy Gradient-based Reinforcement Learning (DPGRL) strategy explicitly tailored for LEO. Addressing the critical need for more efficient and self-regulating orbit management in LEO satellite constellations, this work explores the potential synergy between Deep Reinforcement Learning (DRL) and Neuro-Evolution of Augmenting Topology (NEAT) to optimize station-keeping strategies with the primary goal to empower satellite to autonomously maintain their orbit in the presence of external perturbations within an allowable tolerance margin, thereby significantly reducing operational costs while maintaining precise and consistent station-keeping throughout their life cycle. The study specifically tailors DPGRL algorithms for LEO satellites, considering low-thrust constraints for maneuvers and integrating dense reward schemes and domain-based reward shaping techniques. By showcasing the adaptability and scalability of the combined NEAT and DRL framework in diverse operational scenarios, this approach holds immense promise for revolutionizing autonomous orbit management, paving the way for more efficient and adaptable satellite operations while incorporating the physical constraints of satellite, such as thruster limitations.
空间技术的最新进展主要集中在为低地轨道(LEO)上的大型卫星群提供完全自主、实时、长期的轨道管理和任务规划。因此,一种开创性的自主轨道站保持方法已被引入,该方法采用了明确为低地轨道量身定制的无模型深度策略梯度强化学习(DPGRL)策略。为了满足低地轨道卫星群对更高效、更自律的轨道管理的迫切需求,这项研究探索了深度强化学习(DRL)与增强拓扑神经进化(NEAT)之间的潜在协同作用,以优化定点保持策略,其主要目标是使卫星能够在允许的容差范围内,在出现外部扰动时自主保持轨道,从而在整个生命周期内保持精确一致的定点保持,同时大幅降低运营成本。这项研究专门为低地球轨道卫星量身定制了 DPGRL 算法,考虑到了机动的低推力约束,并集成了密集奖励方案和基于域的奖励塑造技术。通过展示 NEAT 和 DRL 组合框架在不同运行场景中的适应性和可扩展性,这种方法有望彻底改变自主轨道管理,为更高效、适应性更强的卫星运行铺平道路,同时将推进器限制等卫星物理限制因素纳入其中。
{"title":"Towards fully autonomous orbit management for low-earth orbit satellites based on neuro-evolutionary algorithms and deep reinforcement learning","authors":"Alexander Kyuroson,&nbsp;Avijit Banerjee,&nbsp;Nektarios Aristeidis Tafanidis,&nbsp;Sumeet Satpute,&nbsp;George Nikolakopoulos","doi":"10.1016/j.ejcon.2024.101052","DOIUrl":"10.1016/j.ejcon.2024.101052","url":null,"abstract":"<div><div>The recent advances in space technology are focusing on fully autonomous, real-time, long-term orbit management and mission planning for large-scale satellite constellations in Low-Earth Orbit (LEO). Thus, a pioneering approach for autonomous orbital station-keeping has been introduced using a model-free Deep Policy Gradient-based Reinforcement Learning (DPGRL) strategy explicitly tailored for LEO. Addressing the critical need for more efficient and self-regulating orbit management in LEO satellite constellations, this work explores the potential synergy between Deep Reinforcement Learning (DRL) and Neuro-Evolution of Augmenting Topology (NEAT) to optimize station-keeping strategies with the primary goal to empower satellite to autonomously maintain their orbit in the presence of external perturbations within an allowable tolerance margin, thereby significantly reducing operational costs while maintaining precise and consistent station-keeping throughout their life cycle. The study specifically tailors DPGRL algorithms for LEO satellites, considering low-thrust constraints for maneuvers and integrating dense reward schemes and domain-based reward shaping techniques. By showcasing the adaptability and scalability of the combined NEAT and DRL framework in diverse operational scenarios, this approach holds immense promise for revolutionizing autonomous orbit management, paving the way for more efficient and adaptable satellite operations while incorporating the physical constraints of satellite, such as thruster limitations.</div></div>","PeriodicalId":50489,"journal":{"name":"European Journal of Control","volume":"80 ","pages":"Article 101052"},"PeriodicalIF":2.5,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141397533","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Secure state estimation of networked switched systems under denial-of-service attacks 拒绝服务攻击下网络交换系统的安全状态估计
IF 2.5 3区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-01 DOI: 10.1016/j.ejcon.2024.101037
Qingkai Meng , Andreas Kasis , Hao Yang , Marios M. Polycarpou
This paper studies the problem of secure state estimation of networked switched systems in the presence of denial-of-service (DoS) attacks, as well as disturbances and measurement noise. Firstly, a state transformation rule is designed to partition the original system into two subsystems, facilitating the design of discrete and continuous state observers. Secondly, by modifying the traditional super-twisting sliding-mode method and taking into account the frequency and duration characteristics of DoS attacks, we employ dynamic differential properties between different modes to design a switching law identification strategy. We show that this strategy can accurately estimate the switching state without imposing any requirement on the switching times and sequences. Thirdly, based on the identified activated mode, a set of mode-dependent continuous state sliding-mode observers is designed, that achieves continuous state estimation in finite time. The practicality and applicability of the developed results are validated through numerical simulations.
本文研究了存在拒绝服务(DoS)攻击以及干扰和测量噪声的网络交换系统的安全状态估计问题。首先,本文设计了一种状态变换规则,将原始系统划分为两个子系统,从而方便设计离散和连续状态观测器。其次,通过修改传统的超扭曲滑动模式方法,并考虑到 DoS 攻击的频率和持续时间特征,我们利用不同模式之间的动态差分特性设计了一种切换规律识别策略。我们的研究表明,这种策略可以准确估计切换状态,而无需对切换时间和顺序提出任何要求。第三,基于识别出的激活模式,我们设计了一组与模式相关的连续状态滑动模式观测器,可在有限时间内实现连续状态估计。通过数值模拟验证了所开发成果的实用性和适用性。
{"title":"Secure state estimation of networked switched systems under denial-of-service attacks","authors":"Qingkai Meng ,&nbsp;Andreas Kasis ,&nbsp;Hao Yang ,&nbsp;Marios M. Polycarpou","doi":"10.1016/j.ejcon.2024.101037","DOIUrl":"10.1016/j.ejcon.2024.101037","url":null,"abstract":"<div><div><span>This paper studies the problem of secure state estimation of networked </span>switched systems in the presence of denial-of-service (DoS) attacks, as well as disturbances and measurement noise. Firstly, a state transformation rule is designed to partition the original system into two subsystems, facilitating the design of discrete and continuous state observers. Secondly, by modifying the traditional super-twisting sliding-mode method and taking into account the frequency and duration characteristics of DoS attacks, we employ dynamic differential properties between different modes to design a switching law identification strategy. We show that this strategy can accurately estimate the switching state without imposing any requirement on the switching times and sequences. Thirdly, based on the identified activated mode, a set of mode-dependent continuous state sliding-mode observers is designed, that achieves continuous state estimation in finite time. The practicality and applicability of the developed results are validated through numerical simulations.</div></div>","PeriodicalId":50489,"journal":{"name":"European Journal of Control","volume":"80 ","pages":"Article 101037"},"PeriodicalIF":2.5,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141510835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Hamiltonian dynamics with reproducing kernel Hilbert spaces and random features 利用重现核希尔伯特空间和随机特征学习哈密顿动力学
IF 2.5 3区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-01 DOI: 10.1016/j.ejcon.2024.101128
Torbjørn Smith, Olav Egeland
A method for learning Hamiltonian dynamics from a limited and noisy dataset is proposed. The method learns a Hamiltonian vector field on a reproducing kernel Hilbert space (RKHS) of inherently Hamiltonian vector fields, and in particular, odd Hamiltonian vector fields. This is done with a symplectic kernel, and it is shown how the kernel can be modified to an odd symplectic kernel to impose the odd symmetry. A random feature approximation is developed for the proposed odd kernel to reduce the problem size. The performance of the method is validated in simulations for three Hamiltonian systems. It is demonstrated that the use of an odd symplectic kernel improves prediction accuracy and data efficiency, and that the learned vector fields are Hamiltonian and exhibit the imposed odd symmetry characteristics.
本文提出了一种从有限的噪声数据集学习哈密顿动力学的方法。该方法在哈密顿向量场,特别是奇数哈密顿向量场的重现核希尔伯特空间(RKHS)上学习哈密顿向量场。该方法使用交映核,并展示了如何将交映核修改为奇数交映核,以实现奇数对称性。针对所提出的奇数核开发了一种随机特征近似方法,以减小问题的规模。通过对三个哈密顿系统的模拟验证了该方法的性能。结果表明,奇对称核的使用提高了预测精度和数据效率,而且学习到的矢量场是哈密尔顿矢量场,并表现出所强加的奇对称特征。
{"title":"Learning Hamiltonian dynamics with reproducing kernel Hilbert spaces and random features","authors":"Torbjørn Smith,&nbsp;Olav Egeland","doi":"10.1016/j.ejcon.2024.101128","DOIUrl":"10.1016/j.ejcon.2024.101128","url":null,"abstract":"<div><div>A method for learning Hamiltonian dynamics from a limited and noisy dataset is proposed. The method learns a Hamiltonian vector field on a reproducing kernel Hilbert space (RKHS) of inherently Hamiltonian vector fields, and in particular, odd Hamiltonian vector fields. This is done with a symplectic kernel, and it is shown how the kernel can be modified to an odd symplectic kernel to impose the odd symmetry. A random feature approximation is developed for the proposed odd kernel to reduce the problem size. The performance of the method is validated in simulations for three Hamiltonian systems. It is demonstrated that the use of an odd symplectic kernel improves prediction accuracy and data efficiency, and that the learned vector fields are Hamiltonian and exhibit the imposed odd symmetry characteristics.</div></div>","PeriodicalId":50489,"journal":{"name":"European Journal of Control","volume":"80 ","pages":"Article 101128"},"PeriodicalIF":2.5,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142572252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
One superior pursuer and multiple-evader differential games with two lifelines 有两条生命线的一优一劣追逐者和多诱饵差分赛
IF 2.5 3区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-01 DOI: 10.1016/j.ejcon.2024.101130
Ningsheng Xu , Fang Deng , Weiwen Huang , Li Liang , Xiang Shi
This paper investigates a pursuit-evasion game with multiple evaders and a superior pursuer situated on a two-dimensional plane, divided by two lifelines into the play area and goal areas. The goal of the evaders is to reach one of the two goal areas, while the pursuer aims to capture them before they reach the lifeline. This paper constructs barriers without time delay and with time delay, respectively. For each evader, the entire barrier divides the game area into regions of dominance for the evader and pursuer, respectively. Cooperative and non-cooperative strategies between two evaders are studied when the evaders’ positions are within the pursuer’s dominance region. We consider the impact of different strategies adopted by the evaders and variations in the distance between the two lifelines on the number of captures by the pursuer. Furthermore, Apollonius circles and Cartesian ovals are used to determine optimal target points for the evaders under different circumstances. Subsequently, we extend the cooperative strategies of the evaders to multiplayer cooperative games, transform them into optimization problems, and use optimization algorithm to derive the cooperative strategies of multiple evaders. Finally, numerical simulations for various cases are presented in this paper.
本文研究的是一种追逐-逃避博弈,在一个二维平面上有多个逃避者和一个卓越的追逐者,平面被两条生命线分为游戏区和目标区。逃避者的目标是到达两个目标区域之一,而追击者的目标是在他们到达生命线之前抓住他们。本文分别构建了无时间延迟和有时间延迟的障碍。对于每个逃逸者,整个障碍将游戏区域划分为逃逸者和追捕者分别占优势的区域。当逃避者的位置位于追逐者的优势区域内时,我们将研究两个逃避者之间的合作和非合作策略。我们考虑了逃避者采取的不同策略以及两条生命线之间的距离变化对追逐者捕获数量的影响。此外,我们还利用阿波罗尼斯圆和笛卡尔椭圆来确定不同情况下逃逸者的最佳目标点。随后,我们将逃逸者的合作策略扩展到多人合作博弈中,将其转化为优化问题,并利用优化算法推导出多个逃逸者的合作策略。最后,本文介绍了各种情况下的数值模拟。
{"title":"One superior pursuer and multiple-evader differential games with two lifelines","authors":"Ningsheng Xu ,&nbsp;Fang Deng ,&nbsp;Weiwen Huang ,&nbsp;Li Liang ,&nbsp;Xiang Shi","doi":"10.1016/j.ejcon.2024.101130","DOIUrl":"10.1016/j.ejcon.2024.101130","url":null,"abstract":"<div><div>This paper investigates a pursuit-evasion game with multiple evaders and a superior pursuer situated on a two-dimensional plane, divided by two lifelines into the play area and goal areas. The goal of the evaders is to reach one of the two goal areas, while the pursuer aims to capture them before they reach the lifeline. This paper constructs barriers without time delay and with time delay, respectively. For each evader, the entire barrier divides the game area into regions of dominance for the evader and pursuer, respectively. Cooperative and non-cooperative strategies between two evaders are studied when the evaders’ positions are within the pursuer’s dominance region. We consider the impact of different strategies adopted by the evaders and variations in the distance between the two lifelines on the number of captures by the pursuer. Furthermore, Apollonius circles and Cartesian ovals are used to determine optimal target points for the evaders under different circumstances. Subsequently, we extend the cooperative strategies of the evaders to multiplayer cooperative games, transform them into optimization problems, and use optimization algorithm to derive the cooperative strategies of multiple evaders. Finally, numerical simulations for various cases are presented in this paper.</div></div>","PeriodicalId":50489,"journal":{"name":"European Journal of Control","volume":"80 ","pages":"Article 101130"},"PeriodicalIF":2.5,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142554567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A frequency-domain approach for enhanced performance and task flexibility in finite-time ILC 在有限时间 ILC 中提高性能和任务灵活性的频域方法
IF 2.5 3区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-01 DOI: 10.1016/j.ejcon.2024.101033
Max van Haren , Kentaro Tsurumoto , Masahiro Mae , Lennart Blanken , Wataru Ohnishi , Tom Oomen
Iterative learning control (ILC) techniques are capable of improving the tracking performance of control systems that repeatedly perform similar tasks by utilizing data from past iterations. The aim of this paper is to achieve both the task flexibility enabled by ILC with basis functions and the performance of frequency-domain ILC, with an intuitive design procedure. The cost function of norm-optimal ILC is determined that recovers frequency-domain ILC, and consequently, the feedforward signal is parameterized in terms of basis functions and frequency-domain ILC. The resulting method has the performance and design procedure of frequency-domain ILC and the task flexibility of basis functions ILC, and are complimentary to each other. Validation on a benchmark example confirms the capabilities of the framework.
迭代学习控制(ILC)技术能够利用过去迭代的数据,改善重复执行类似任务的控制系统的跟踪性能。本文的目的是通过直观的设计程序,实现基函数 ILC 的任务灵活性和频域 ILC 的性能。本文确定了能恢复频域 ILC 的规范最优 ILC 成本函数,并因此根据基函数和频域 ILC 对前馈信号进行了参数化。由此产生的方法既有频域 ILC 的性能和设计程序,又有基函数 ILC 的任务灵活性,两者相得益彰。在一个基准实例上的验证证实了该框架的能力。
{"title":"A frequency-domain approach for enhanced performance and task flexibility in finite-time ILC","authors":"Max van Haren ,&nbsp;Kentaro Tsurumoto ,&nbsp;Masahiro Mae ,&nbsp;Lennart Blanken ,&nbsp;Wataru Ohnishi ,&nbsp;Tom Oomen","doi":"10.1016/j.ejcon.2024.101033","DOIUrl":"10.1016/j.ejcon.2024.101033","url":null,"abstract":"<div><div>Iterative learning control (ILC) techniques are capable of improving the tracking performance of control systems that repeatedly perform similar tasks by utilizing data from past iterations. The aim of this paper is to achieve both the task flexibility enabled by ILC with basis functions and the performance of frequency-domain ILC, with an intuitive design procedure. The cost function of norm-optimal ILC is determined that recovers frequency-domain ILC, and consequently, the feedforward signal is parameterized in terms of basis functions and frequency-domain ILC. The resulting method has the performance and design procedure of frequency-domain ILC and the task flexibility of basis functions ILC, and are complimentary to each other. Validation on a benchmark example confirms the capabilities of the framework.</div></div>","PeriodicalId":50489,"journal":{"name":"European Journal of Control","volume":"80 ","pages":"Article 101033"},"PeriodicalIF":2.5,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141510831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decentralized PI-control and anti-windup in resource sharing networks 资源共享网络中的分散 PI 控制和防倒转功能
IF 2.5 3区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-01 DOI: 10.1016/j.ejcon.2024.101049
Felix Agner , Jonas Hansson , Pauline Kergus , Anders Rantzer , Sophie Tarbouriech , Luca Zaccarian
We consider control of multiple stable first-order agents which have a control coupling described by an M-matrix. These agents are subject to incremental sector-bounded input nonlinearities. We show that such plants can be globally asymptotically stabilized to a unique equilibrium using fully decentralized proportional–integral controllers equipped with anti-windup and subject to local tuning rules. In addition, we show that when the nonlinearities correspond to the saturation function, the closed loop asymptotically minimizes a weighted 1-norm of the agents state mismatch. The control strategy is finally compared to other state-of-the-art controllers on a numerical district heating example.
我们考虑对多个稳定的一阶代理进行控制,这些代理具有由 M 矩阵描述的控制耦合。这些代理受增量部门有界输入非线性的影响。我们的研究表明,使用配备防倒退功能的全分散比例积分控制器,并根据局部调整规则,可以将这些植物全局渐近地稳定在一个唯一的平衡点上。此外,我们还证明,当非线性与饱和函数相对应时,闭环可近似最小化代理状态失配的加权 1 正态。最后,我们在一个区域供热数值示例中将该控制策略与其他最先进的控制器进行了比较。
{"title":"Decentralized PI-control and anti-windup in resource sharing networks","authors":"Felix Agner ,&nbsp;Jonas Hansson ,&nbsp;Pauline Kergus ,&nbsp;Anders Rantzer ,&nbsp;Sophie Tarbouriech ,&nbsp;Luca Zaccarian","doi":"10.1016/j.ejcon.2024.101049","DOIUrl":"10.1016/j.ejcon.2024.101049","url":null,"abstract":"<div><div>We consider control of multiple stable first-order agents which have a control coupling described by an M-matrix. These agents are subject to incremental sector-bounded input nonlinearities. We show that such plants can be globally asymptotically stabilized to a unique equilibrium using fully decentralized proportional–integral controllers equipped with anti-windup and subject to local tuning rules. In addition, we show that when the nonlinearities correspond to the saturation function, the closed loop asymptotically minimizes a weighted 1-norm of the agents state mismatch. The control strategy is finally compared to other state-of-the-art controllers on a numerical district heating example.</div></div>","PeriodicalId":50489,"journal":{"name":"European Journal of Control","volume":"80 ","pages":"Article 101049"},"PeriodicalIF":2.5,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141406400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
European Journal of Control
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1