首页 > 最新文献

Automatica最新文献

英文 中文
Controllability and Pareto improvability on Nash equilibriums in game-based control systems 基于博弈的控制系统中纳什均衡的可控性和帕累托改进性
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-07 DOI: 10.1016/j.automatica.2024.111893

To investigate objects driven by external input and players’ interests, the game-based control system (GBCS) was established. In this system, the high-level leader does not participate directly but regulates the low-level game to make followers’ Nash equilibrium(NE) “better”. This article focuses on a specific type of GBCSs with rational players, where the open-loop NE is unique under any given initial state and macro-regulation. We discuss two kinds of regulations on NEs: achieving reachability among NEs through macro-regulation and Pareto improvability on NEs that can benefit at least one follower without harming anyone else. These regulations help reduce the widespread inconsistency between individual and collective rationality. Moreover, conditions are provided to determine controllability and Pareto improvability on NEs. Finally, an example on opinion dynamics is presented to demonstrate the effectiveness of obtained theoretical results.

为了研究由外部输入和参与者利益驱动的对象,建立了基于游戏的控制系统(GBCS)。在这个系统中,高层领导者并不直接参与,而是调节低层博弈,使追随者的纳什均衡(NE)"更好"。本文重点研究一种特定类型的理性博弈者的 GBCS,在任何给定的初始状态和宏观调控下,其开环 NE 都是唯一的。我们讨论了对 NE 的两种规定:通过宏观调控实现 NE 之间的可达性,以及对 NE 的帕累托改进性,即在不损害其他任何人的情况下,使至少一个追随者受益。这些规定有助于减少个人理性与集体理性之间普遍存在的不一致性。此外,我们还提供了一些条件来确定网络的可控性和帕累托改进性。最后,介绍了一个关于舆论动态的例子,以证明所获得的理论结果的有效性。
{"title":"Controllability and Pareto improvability on Nash equilibriums in game-based control systems","authors":"","doi":"10.1016/j.automatica.2024.111893","DOIUrl":"10.1016/j.automatica.2024.111893","url":null,"abstract":"<div><p>To investigate objects driven by external input and players’ interests, the game-based control system (GBCS) was established. In this system, the high-level leader does not participate directly but regulates the low-level game to make followers’ Nash equilibrium(NE) “better”. This article focuses on a specific type of GBCSs with rational players, where the open-loop NE is unique under any given initial state and macro-regulation. We discuss two kinds of regulations on NEs: achieving reachability among NEs through macro-regulation and Pareto improvability on NEs that can benefit at least one follower without harming anyone else. These regulations help reduce the widespread inconsistency between individual and collective rationality. Moreover, conditions are provided to determine controllability and Pareto improvability on NEs. Finally, an example on opinion dynamics is presented to demonstrate the effectiveness of obtained theoretical results.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S000510982400387X/pdfft?md5=cef51a4d64d7ae2b1e84f254902d78bc&pid=1-s2.0-S000510982400387X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142162501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal control strategies in a generic class of bacterial growth models with multiple substrates 具有多种底物的一类细菌生长模型的最佳控制策略
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-07 DOI: 10.1016/j.automatica.2024.111881

Optimal control strategies are studied through the application of the Pontryagin’s Maximum Principle for a class of non-linear differential systems that are commonly used to describe resource allocation during bacterial growth. The approach is inspired by the optimality of numerous regulatory mechanisms in bacterial cells. In this context, we aim to predict natural feedback loops as optimal control solutions so as to gain insight on the behavior of microorganisms from a control-theoretical perspective. The problem is posed in terms of a control function u0(t) representing the fraction of the cell dedicated to protein synthesis, and n additional controls ui(t) modeling the fraction of the cell responsible for the consumption of the available nutrient sources in the medium. By studying the necessary conditions for optimality, it is possible to prove that the solutions follow a bang–singular–bang structure, and that they are characterized by a sequential uptake pattern known as diauxic growth, which prioritizes the consumption of richer substrates over poor nutrients. Numerical simulations obtained through an optimal control solver confirm the theoretical results. Finally, we provide an application to batch cultivation of E. coli growing on glucose and lactose. For that, we propose a state feedback law that is based on the optimal control, and we calibrate the obtained closed-loop model to experimental data.

通过对一类常用于描述细菌生长过程中资源分配的非线性微分系统应用庞特里亚金最大原则,研究了最优控制策略。这种方法的灵感来自细菌细胞中众多调节机制的最优性。在这种情况下,我们的目标是将自然反馈回路预测为最优控制方案,从而从控制理论的角度深入了解微生物的行为。该问题由一个控制函数 u0(t)和 n 个附加控制函数 uui(t)构成,前者代表细胞中专门用于蛋白质合成的部分,后者则代表细胞中负责消耗培养基中可用营养源的部分。通过研究最优化的必要条件,可以证明解遵循 "砰-砰-砰 "结构,其特征是一种被称为 "双氧生长 "的顺序吸收模式,即优先消耗较丰富的基质而不是较贫乏的营养物质。通过最优控制求解器获得的数值模拟证实了理论结果。最后,我们提供了在葡萄糖和乳糖上批量培养大肠杆菌的应用。为此,我们提出了基于最优控制的状态反馈法,并根据实验数据对所获得的闭环模型进行了校准。
{"title":"Optimal control strategies in a generic class of bacterial growth models with multiple substrates","authors":"","doi":"10.1016/j.automatica.2024.111881","DOIUrl":"10.1016/j.automatica.2024.111881","url":null,"abstract":"<div><p>Optimal control strategies are studied through the application of the Pontryagin’s Maximum Principle for a class of non-linear differential systems that are commonly used to describe resource allocation during bacterial growth. The approach is inspired by the optimality of numerous regulatory mechanisms in bacterial cells. In this context, we aim to predict natural feedback loops as optimal control solutions so as to gain insight on the behavior of microorganisms from a control-theoretical perspective. The problem is posed in terms of a control function <span><math><mrow><msub><mrow><mi>u</mi></mrow><mrow><mn>0</mn></mrow></msub><mrow><mo>(</mo><mi>t</mi><mo>)</mo></mrow></mrow></math></span> representing the fraction of the cell dedicated to protein synthesis, and <span><math><mi>n</mi></math></span> additional controls <span><math><mrow><msub><mrow><mi>u</mi></mrow><mrow><mi>i</mi></mrow></msub><mrow><mo>(</mo><mi>t</mi><mo>)</mo></mrow></mrow></math></span> modeling the fraction of the cell responsible for the consumption of the available nutrient sources in the medium. By studying the necessary conditions for optimality, it is possible to prove that the solutions follow a bang–singular–bang structure, and that they are characterized by a sequential uptake pattern known as diauxic growth, which prioritizes the consumption of richer substrates over poor nutrients. Numerical simulations obtained through an optimal control solver confirm the theoretical results. Finally, we provide an application to batch cultivation of E. coli growing on glucose and lactose. For that, we propose a state feedback law that is based on the optimal control, and we calibrate the obtained closed-loop model to experimental data.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0005109824003753/pdfft?md5=4b5a18dc60bfd905fc283f6a25f01d94&pid=1-s2.0-S0005109824003753-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142162577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-driven output prediction and control of stochastic systems: An innovation-based approach 数据驱动的随机系统输出预测和控制:基于创新的方法
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-07 DOI: 10.1016/j.automatica.2024.111897

Recent years have witnessed a booming interest in data-driven control of dynamical systems. However, the implicit data-driven output predictors are vulnerable to uncertainty such as process disturbance and measurement noise, causing unreliable predictions and unexpected control actions. In this brief, we put forward a new data-driven approach to output prediction of stochastic linear time-invariant (LTI) systems. By utilizing the innovation form, the uncertainty in stochastic LTI systems is recast as innovations that can be readily estimated from input–output data without knowing system matrices. In this way, by applying the fundamental lemma to the innovation form, we propose a new innovation-based data-driven output predictor (OP) of stochastic LTI systems, which bypasses the need for identifying state–space matrices explicitly and building a state estimator. The boundedness of the second moment of prediction errors in closed-loop is established under mild conditions. The proposed data-driven OP can be integrated into optimal control design for better performance. Numerical simulations demonstrate the outperformance of the proposed innovation-based methods in output prediction and control design over existing formulations.

近年来,人们对数据驱动的动态系统控制产生了浓厚的兴趣。然而,隐式数据驱动输出预测器容易受到过程干扰和测量噪声等不确定性的影响,导致预测不可靠和控制行动出乎意料。在本简介中,我们提出了一种新的数据驱动方法,用于随机线性时变(LTI)系统的输出预测。通过利用创新形式,随机 LTI 系统中的不确定性被重塑为创新,无需了解系统矩阵,即可从输入输出数据中轻松估算出创新。这样,通过将基本定理应用于创新形式,我们提出了一种新的基于创新的数据驱动的随机 LTI 系统输出预测器(OP),它绕过了明确识别状态空间矩阵和建立状态估计器的需要。在温和条件下,闭环预测误差第二矩的有界性得以确立。所提出的数据驱动 OP 可以集成到优化控制设计中,以获得更好的性能。数值模拟证明了所提出的基于创新的方法在输出预测和控制设计方面的性能优于现有公式。
{"title":"Data-driven output prediction and control of stochastic systems: An innovation-based approach","authors":"","doi":"10.1016/j.automatica.2024.111897","DOIUrl":"10.1016/j.automatica.2024.111897","url":null,"abstract":"<div><p>Recent years have witnessed a booming interest in data-driven control of dynamical systems. However, the implicit data-driven output predictors are vulnerable to uncertainty such as process disturbance and measurement noise, causing unreliable predictions and unexpected control actions. In this brief, we put forward a new data-driven approach to output prediction of stochastic linear time-invariant (LTI) systems. By utilizing the innovation form, the uncertainty in stochastic LTI systems is recast as innovations that can be readily estimated from input–output data without knowing system matrices. In this way, by applying the fundamental lemma to the innovation form, we propose a new innovation-based data-driven output predictor (OP) of stochastic LTI systems, which bypasses the need for identifying state–space matrices explicitly and building a state estimator. The boundedness of the second moment of prediction errors in closed-loop is established under mild conditions. The proposed data-driven OP can be integrated into optimal control design for better performance. Numerical simulations demonstrate the outperformance of the proposed innovation-based methods in output prediction and control design over existing formulations.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0005109824003911/pdfft?md5=d01ded28d040db1c663391881e4966c0&pid=1-s2.0-S0005109824003911-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142162503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online change points detection for linear dynamical systems with finite sample guarantees 有限样本保证线性动力系统的在线变化点检测
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-07 DOI: 10.1016/j.automatica.2024.111894

The problem of online change point detection is to detect abrupt changes in properties of time series, ideally as soon as possible after those changes occur. Existing work on online change point detection either assumes i.i.d. data, focuses on asymptotic analysis, does not present theoretical guarantees on the trade-off between detection accuracy and detection delay, or is only suitable for detecting single change points. In this work, we study the online change point detection problem for linear dynamical systems with unknown dynamics, where the data exhibits temporal correlations and the system could have multiple change points. We develop a data-dependent threshold that can be used in our test that allows one to achieve a pre-specified upper bound on the probability of making a false alarm. We further provide a finite-sample-based bound for the probability of detecting a change point. Our bound demonstrates how parameters used in our algorithm affect the detection probability and delay, and provides guidance on the minimum required time between changes to guarantee detection.

在线变化点检测的问题是检测时间序列属性的突然变化,最好是在这些变化发生后尽快检测。现有的在线变化点检测工作要么假定数据为 i.i.d.,要么侧重于渐近分析,要么没有提出检测精度和检测延迟之间权衡的理论保证,要么只适用于检测单个变化点。在这项工作中,我们研究了具有未知动态的线性动力系统的在线变化点检测问题,在这种情况下,数据具有时间相关性,系统可能有多个变化点。我们开发了一种与数据相关的阈值,可用于我们的检测,从而使误报概率达到预先设定的上限。我们进一步为检测到变化点的概率提供了一个基于有限样本的界限。我们的约束说明了我们算法中使用的参数如何影响检测概率和延迟,并为保证检测到变化所需的最小间隔时间提供了指导。
{"title":"Online change points detection for linear dynamical systems with finite sample guarantees","authors":"","doi":"10.1016/j.automatica.2024.111894","DOIUrl":"10.1016/j.automatica.2024.111894","url":null,"abstract":"<div><p>The problem of online change point detection is to detect abrupt changes in properties of time series, ideally as soon as possible after those changes occur. Existing work on online change point detection either assumes i.i.d. data, focuses on asymptotic analysis, does not present theoretical guarantees on the trade-off between detection accuracy and detection delay, or is only suitable for detecting single change points. In this work, we study the online change point detection problem for linear dynamical systems with unknown dynamics, where the data exhibits temporal correlations and the system could have multiple change points. We develop a data-dependent threshold that can be used in our test that allows one to achieve a pre-specified upper bound on the probability of making a false alarm. We further provide a finite-sample-based bound for the probability of detecting a change point. Our bound demonstrates how parameters used in our algorithm affect the detection probability and delay, and provides guidance on the minimum required time between changes to guarantee detection.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0005109824003881/pdfft?md5=253252efe9e72628a7809ea795bb855e&pid=1-s2.0-S0005109824003881-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142162579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed continuous-time time-varying optimization for networked Lagrangian systems with quadratic cost functions 具有二次成本函数的网络拉格朗日系统的分布式连续时间时变优化
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-07 DOI: 10.1016/j.automatica.2024.111882

In this paper, the distributed time-varying optimization problem is investigated for networked Lagrangian systems with parametric uncertainties. Usually, in the literature, to address some distributed control problems for nonlinear systems, a networked virtual system is constructed, and a tracking algorithm is designed such that the agents’ physical states track the virtual states. It is worth pointing out that such an idea requires the exchange of the virtual states and hence necessitates communication among the group. In addition, due to the complexities of the Lagrangian dynamics and the distributed time-varying optimization problem, there exist significant challenges. This paper proposes distributed time-varying optimization algorithms that achieve zero optimum-tracking errors for the networked Lagrangian agents without the communication requirement. The main idea behind the proposed algorithms is to construct a reference system for each agent to generate a reference velocity using absolute and relative physical state measurements with no exchange of virtual states needed, and to design adaptive controllers for Lagrangian systems such that the physical states are able to track the reference velocities and hence the optimal trajectory. The algorithms introduce mutual feedback between the reference systems and the local controllers via physical states/measurements and are amenable to implementation via local onboard sensing in a communication unfriendly environment. Specifically, first, a base algorithm is proposed to solve the distributed time-varying optimization problem for networked Lagrangian systems under switching graphs. Then, based on the base algorithm, a continuous function is introduced to approximate the signum function, forming a continuous distributed optimization algorithm and hence removing the chattering. Such a continuous algorithm is convergent with bounded ultimate optimum-tracking errors, which are proportion to approximation errors. Finally, numerical simulations are provided to illustrate the validity of the proposed algorithms.

本文研究了具有参数不确定性的网络拉格朗日系统的分布式时变优化问题。通常,文献中为了解决一些非线性系统的分布式控制问题,会构建一个网络虚拟系统,并设计一种跟踪算法,使代理的物理状态跟踪虚拟状态。值得指出的是,这种想法需要交换虚拟状态,因此需要小组之间进行通信。此外,由于拉格朗日动力学和分布式时变优化问题的复杂性,存在着巨大的挑战。本文提出了分布式时变优化算法,可在无需通信的情况下实现网络化拉格朗日代理的零最佳跟踪误差。所提算法的主要思想是为每个代理构建一个参考系统,利用绝对和相对物理状态测量值生成参考速度,无需交换虚拟状态,并为拉格朗日系统设计自适应控制器,使物理状态能够跟踪参考速度,从而实现最优轨迹。这些算法通过物理状态/测量值在参考系统和本地控制器之间引入了相互反馈,并可在通信不友好的环境中通过本地机载传感实现。具体来说,首先提出了一种基础算法,用于解决切换图下网络化拉格朗日系统的分布式时变优化问题。然后,在基础算法的基础上,引入一个连续函数来近似符号函数,形成一个连续的分布式优化算法,从而消除颤振。这种连续算法是收敛的,其终极最优跟踪误差是有界的,与近似误差成正比。最后,通过数值模拟说明了所提算法的有效性。
{"title":"Distributed continuous-time time-varying optimization for networked Lagrangian systems with quadratic cost functions","authors":"","doi":"10.1016/j.automatica.2024.111882","DOIUrl":"10.1016/j.automatica.2024.111882","url":null,"abstract":"<div><p>In this paper, the distributed time-varying optimization problem is investigated for networked Lagrangian systems with parametric uncertainties. Usually, in the literature, to address some distributed control problems for nonlinear systems, a networked virtual system is constructed, and a tracking algorithm is designed such that the agents’ physical states track the virtual states. It is worth pointing out that such an idea requires the exchange of the virtual states and hence necessitates communication among the group. In addition, due to the complexities of the Lagrangian dynamics and the distributed time-varying optimization problem, there exist significant challenges. This paper proposes distributed time-varying optimization algorithms that achieve zero optimum-tracking errors for the networked Lagrangian agents without the communication requirement. The main idea behind the proposed algorithms is to construct a reference system for each agent to generate a reference velocity using absolute and relative physical state measurements with no exchange of virtual states needed, and to design adaptive controllers for Lagrangian systems such that the physical states are able to track the reference velocities and hence the optimal trajectory. The algorithms introduce mutual feedback between the reference systems and the local controllers via physical states/measurements and are amenable to implementation via local onboard sensing in a communication unfriendly environment. Specifically, first, a base algorithm is proposed to solve the distributed time-varying optimization problem for networked Lagrangian systems under switching graphs. Then, based on the base algorithm, a continuous function is introduced to approximate the signum function, forming a continuous distributed optimization algorithm and hence removing the chattering. Such a continuous algorithm is convergent with bounded ultimate optimum-tracking errors, which are proportion to approximation errors. Finally, numerical simulations are provided to illustrate the validity of the proposed algorithms.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0005109824003765/pdfft?md5=268cd3ac8c9394f35de6171bde25b9be&pid=1-s2.0-S0005109824003765-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142162502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online estimated-based-state feedback control of n-qubit stochastic open quantum systems n 量子位随机开放量子系统的在线基于估计的状态反馈控制
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-06 DOI: 10.1016/j.automatica.2024.111883

For n-qubit stochastic open quantum systems, an online quantum state estimation (OQSE) algorithm and the associated online estimated-based-state feedback control (OQSE-FC) are proposed in this paper. The proposed OQSE algorithm integrates the online alternating direction multiplier method (OADM) to the continuous weak measurement (CWM). The quantum state feedback control laws are designed based on the Lyapunov stability theorem, and the states for feedback control are online estimated by OQSE algorithm. The convergence of OQSE algorithm and the asymptotic stability of the state feedback control laws are proved.

针对 n 量子位随机开放量子系统,本文提出了一种在线量子态估计(OQSE)算法和相关的基于估计的在线状态反馈控制(OQSE-FC)。所提出的 OQSE 算法将在线交替方向乘法器法(OADM)与连续弱测量法(CWM)相结合。基于 Lyapunov 稳定性定理设计量子态反馈控制律,并通过 OQSE 算法在线估计反馈控制的状态。证明了 OQSE 算法的收敛性和状态反馈控制律的渐近稳定性。
{"title":"Online estimated-based-state feedback control of n-qubit stochastic open quantum systems","authors":"","doi":"10.1016/j.automatica.2024.111883","DOIUrl":"10.1016/j.automatica.2024.111883","url":null,"abstract":"<div><p>For <span><math><mi>n</mi></math></span>-qubit stochastic open quantum systems, an online quantum state estimation (OQSE) algorithm and the associated online estimated-based-state feedback control (OQSE-FC) are proposed in this paper. The proposed OQSE algorithm integrates the online alternating direction multiplier method (OADM) to the continuous weak measurement (CWM). The quantum state feedback control laws are designed based on the Lyapunov stability theorem, and the states for feedback control are online estimated by OQSE algorithm. The convergence of OQSE algorithm and the asymptotic stability of the state feedback control laws are proved.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0005109824003777/pdfft?md5=e20a03851b75b5f56ef9ad5bc20ad096&pid=1-s2.0-S0005109824003777-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142162498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed online optimization for multi-agent optimal transport 多代理优化运输的分布式在线优化
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-05 DOI: 10.1016/j.automatica.2024.111880

We propose a scalable, distributed algorithm for the optimal transport of large-scale multi-agent systems. We formulate the problem as one of steering the collective towards a target probability measure while minimizing the total cost of transport, with the additional constraint of distributed implementation. Using optimal transport theory, we realize the solution as an iterative transport based on a stochastic proximal descent scheme. At each stage of the transport, the agents implement an online, distributed primal–dual algorithm to obtain local estimates of the Kantorovich potential for optimal transport from the current distribution of the collective to the target distribution. Using these estimates as their local objective functions, the agents then implement the transport by stochastic proximal descent. This two-step process is carried out recursively by the agents to converge asymptotically to the target distribution. We rigorously establish the underlying theoretical framework and convergence of the algorithm and test its behavior in numerical experiments.

我们提出了一种可扩展的分布式算法,用于优化大规模多代理系统的运输。我们将这一问题表述为:在使运输总成本最小化的同时,引导集体向目标概率度量前进,并附加分布式实施的约束条件。利用最优传输理论,我们以随机近似下降方案为基础,实现了迭代传输的解决方案。在运输的每个阶段,代理都会执行在线分布式初等二元算法,以获得从当前集体分布到目标分布的最优运输的康托洛维奇势的局部估计值。利用这些估计值作为局部目标函数,代理们就可以通过随机近似下降法实现传输。这两步过程由代理递归执行,以渐近收敛到目标分布。我们严格建立了算法的基础理论框架和收敛性,并在数值实验中对其行为进行了测试。
{"title":"Distributed online optimization for multi-agent optimal transport","authors":"","doi":"10.1016/j.automatica.2024.111880","DOIUrl":"10.1016/j.automatica.2024.111880","url":null,"abstract":"<div><p>We propose a scalable, distributed algorithm for the optimal transport of large-scale multi-agent systems. We formulate the problem as one of steering the collective towards a target probability measure while minimizing the total cost of transport, with the additional constraint of distributed implementation. Using optimal transport theory, we realize the solution as an iterative transport based on a stochastic proximal descent scheme. At each stage of the transport, the agents implement an online, distributed primal–dual algorithm to obtain local estimates of the Kantorovich potential for optimal transport from the current distribution of the collective to the target distribution. Using these estimates as their local objective functions, the agents then implement the transport by stochastic proximal descent. This two-step process is carried out recursively by the agents to converge asymptotically to the target distribution. We rigorously establish the underlying theoretical framework and convergence of the algorithm and test its behavior in numerical experiments.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0005109824003741/pdfft?md5=279afaa21d8e82d5ccefa238244d0e47&pid=1-s2.0-S0005109824003741-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142162505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A switching control strategy for policy selection in stochastic Dynamic Programming problems 随机动态编程问题中政策选择的切换控制策略
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-05 DOI: 10.1016/j.automatica.2024.111884

This paper presents a switching control strategy as a criterion for policy selection in stochastic Dynamic Programming problems over an infinite time horizon. In particular, the Bellman operator, applied iteratively to solve such problems, is generalized to the case of stochastic policies, and formulated as a discrete-time switched affine system. Then, a Lyapunov-based policy selection strategy is designed to ensure the practical convergence of the resulting closed-loop system trajectories towards an appropriately chosen reference value function. This way, it is possible to verify how the chosen reference value function can be approached by using a stabilizing switching signal, the latter defined on a given finite set of stationary stochastic policies. Finally, the presented method is applied to the Value Iteration algorithm, and an illustrative example of a recycling robot is provided to demonstrate its effectiveness in terms of convergence performance.

本文提出了一种开关控制策略,作为在无限时间范围内随机动态编程问题中选择策略的标准。具体而言,本文将用于迭代求解此类问题的贝尔曼算子推广到随机策略的情况中,并将其表述为离散时间切换仿射系统。然后,设计了一种基于 Lyapunov 的策略选择策略,以确保所得到的闭环系统轨迹朝着适当选择的参考值函数实际收敛。通过这种方法,可以验证如何利用稳定开关信号接近所选的参考值函数,后者定义在给定的有限静态随机策略集合上。最后,介绍的方法被应用于价值迭代算法,并提供了一个回收机器人的示例来证明其收敛性能的有效性。
{"title":"A switching control strategy for policy selection in stochastic Dynamic Programming problems","authors":"","doi":"10.1016/j.automatica.2024.111884","DOIUrl":"10.1016/j.automatica.2024.111884","url":null,"abstract":"<div><p>This paper presents a switching control strategy as a criterion for policy selection in stochastic Dynamic Programming problems over an infinite time horizon. In particular, the Bellman operator, applied iteratively to solve such problems, is generalized to the case of stochastic policies, and formulated as a discrete-time switched affine system. Then, a Lyapunov-based policy selection strategy is designed to ensure the practical convergence of the resulting closed-loop system trajectories towards an appropriately chosen reference value function. This way, it is possible to verify how the chosen reference value function can be approached by using a stabilizing switching signal, the latter defined on a given finite set of stationary stochastic policies. Finally, the presented method is applied to the Value Iteration algorithm, and an illustrative example of a recycling robot is provided to demonstrate its effectiveness in terms of convergence performance.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0005109824003789/pdfft?md5=6c3de146a3dd2eb4416dc66a045345cd&pid=1-s2.0-S0005109824003789-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142162500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantitatively nonblocking supervisory control of discrete-event systems 离散事件系统的定量无阻塞监督控制
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-31 DOI: 10.1016/j.automatica.2024.111879

In this paper, we propose a new property of quantitative nonblockingness of an automaton with respect to a given cover on its set of marker states. This property quantifies the standard nonblocking property by capturing the practical requirement that every subset (i.e. cell) of marker states can be reached within a prescribed number of steps from any reachable state and following any trajectory of the system. Accordingly, we formulate a new problem of quantitatively nonblocking supervisory control, and characterize its solvability in terms of a new concept of quantitative language completability. It is proven that there exists the unique supremal quantitatively completable sublanguage of a given language, and we develop an effective algorithm to compute the supremal sublanguage. Finally, combining with the algorithm of computing the supremal controllable sublanguage, we design an algorithm to compute the maximally permissive solution to the formulated quantitatively nonblocking supervisory control problem.

在本文中,我们提出了一种新的自动机定量无阻塞性属性,它与自动机标记状态集上的给定覆盖相关。这一属性量化了标准的非阻塞性,它捕捉到了这样一个实际要求,即每个标记状态子集(即单元)都能在规定的步数内从任意可到达状态到达,并遵循系统的任意轨迹。因此,我们提出了一个新的定量无阻塞监督控制问题,并用定量语言可完备性这一新概念来描述其可解性。我们证明了存在给定语言的唯一超定量可完备性子语言,并开发了计算超定量子语言的有效算法。最后,结合计算至上可控子语言的算法,我们设计了一种算法来计算所提出的定量无阻塞监督控制问题的最大允许解。
{"title":"Quantitatively nonblocking supervisory control of discrete-event systems","authors":"","doi":"10.1016/j.automatica.2024.111879","DOIUrl":"10.1016/j.automatica.2024.111879","url":null,"abstract":"<div><p>In this paper, we propose a new property of <em>quantitative nonblockingness</em> of an automaton with respect to a given cover on its set of marker states. This property <em>quantifies</em> the standard nonblocking property by capturing the practical requirement that every subset (i.e. cell) of marker states can be reached within a prescribed number of steps from any reachable state and following any trajectory of the system. Accordingly, we formulate a new problem of quantitatively nonblocking supervisory control, and characterize its solvability in terms of a new concept of quantitative language completability. It is proven that there exists the unique supremal quantitatively completable sublanguage of a given language, and we develop an effective algorithm to compute the supremal sublanguage. Finally, combining with the algorithm of computing the supremal controllable sublanguage, we design an algorithm to compute the maximally permissive solution to the formulated quantitatively nonblocking supervisory control problem.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142095236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An approach to mismatched disturbance rejection control for uncontrollable systems 不可控系统的不匹配干扰抑制控制方法
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-31 DOI: 10.1016/j.automatica.2024.111876

This study focuses on the problem of optimal mismatched disturbance rejection control for uncontrollable linear discrete-time systems. In contrast to previous studies, by introducing a quadratic performance index such that the regulated state can track a reference trajectory and minimize the effects of disturbances, mismatched disturbance rejection control is transformed into a linear quadratic tracking problem. The necessary and sufficient conditions for the solvability of this problem over a finite horizon and a disturbance rejection controller are derived by solving a forward–backward difference equation. In the case of an infinite horizon, a sufficient condition for the stabilization of the system is obtained under the detectable condition. Additionally, in combination with the generalized extended state observer, a controller design method is proposed, and the stability analysis of the system under this controller is presented. This paper details our novel approach to disturbance rejection. Finally, four examples are provided to demonstrate the effectiveness of the proposed method.

本研究的重点是不可控线性离散时间系统的最优错配干扰抑制控制问题。与以往研究不同的是,通过引入二次性能指标,使调节状态能跟踪参考轨迹并使干扰影响最小,错配干扰抑制控制被转化为线性二次跟踪问题。通过求解前向-后向差分方程,得出了该问题在有限视界和干扰抑制控制器上可解的必要条件和充分条件。在无限视界的情况下,在可探测条件下获得了系统稳定的充分条件。此外,结合广义扩展状态观测器,还提出了一种控制器设计方法,并对该控制器下的系统进行了稳定性分析。本文详细介绍了我们新颖的干扰抑制方法。最后,本文提供了四个实例来证明所提方法的有效性。
{"title":"An approach to mismatched disturbance rejection control for uncontrollable systems","authors":"","doi":"10.1016/j.automatica.2024.111876","DOIUrl":"10.1016/j.automatica.2024.111876","url":null,"abstract":"<div><p>This study focuses on the problem of optimal mismatched disturbance rejection control for uncontrollable linear discrete-time systems. In contrast to previous studies, by introducing a quadratic performance index such that the regulated state can track a reference trajectory and minimize the effects of disturbances, mismatched disturbance rejection control is transformed into a linear quadratic tracking problem. The necessary and sufficient conditions for the solvability of this problem over a finite horizon and a disturbance rejection controller are derived by solving a forward–backward difference equation. In the case of an infinite horizon, a sufficient condition for the stabilization of the system is obtained under the detectable condition. Additionally, in combination with the generalized extended state observer, a controller design method is proposed, and the stability analysis of the system under this controller is presented. This paper details our novel approach to disturbance rejection. Finally, four examples are provided to demonstrate the effectiveness of the proposed method.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142095237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Automatica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1