首页 > 最新文献

Automatica最新文献

英文 中文
Solving optimal predictor-feedback control using approximate dynamic programming 利用近似动态程序设计解决最优预测反馈控制问题
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-31 DOI: 10.1016/j.automatica.2024.111848

This paper is concerned with approximately solving the optimal predictor-feedback control problem of multiplicative-noise systems with input delay in infinite horizon. The optimal predictor-feedback control, provided by the analytical method, is determined by Riccati–ZXL equations and is hard to obtain in the case of unknown system dynamics. We aim to propose a policy iteration (PI) algorithm for solving the optimal solution by approximate dynamic programming. For convergence analysis of the algorithm, we first develop a necessary and sufficient stabilizing condition, in the form of several new Lyapunov-type equations, which parameterizes all predictor-feedback controllers and can be seen as an important addition to Lyapunov stability theory. We then propose an iterative scheme for the Riccati–ZXL equations computations, along with convergence analysis, based on the condition. Inspired by this scheme, a data-driven online PI algorithm, convergence implied in that of the iterative scheme, is proposed for the optimal predictor-feedback control problem without full system dynamics. Finally, a numerical example is used to evaluate the proposed PI algorithm.

本文主要研究在无限视距内近似求解具有输入延迟的乘噪声系统的最优预测-反馈控制问题。解析法提供的最优预测反馈控制由 Riccati-ZXL 方程决定,在系统动态未知的情况下很难获得。我们旨在提出一种策略迭代(PI)算法,通过近似动态编程求解最优解。为了分析该算法的收敛性,我们首先以几个新的 Lyapunov 型方程的形式提出了一个必要且充分的稳定条件,该条件涉及所有预测反馈控制器的参数,可视为对 Lyapunov 稳定性理论的重要补充。然后,我们提出了一种基于该条件的 Riccati-ZXL 方程计算迭代方案以及收敛性分析。受这一方案的启发,我们提出了一种数据驱动的在线 PI 算法,其收敛性隐含于迭代方案的收敛性,适用于无完整系统动态的最优预测器-反馈控制问题。最后,通过一个数值示例来评估所提出的 PI 算法。
{"title":"Solving optimal predictor-feedback control using approximate dynamic programming","authors":"","doi":"10.1016/j.automatica.2024.111848","DOIUrl":"10.1016/j.automatica.2024.111848","url":null,"abstract":"<div><p>This paper is concerned with approximately solving the optimal predictor-feedback control problem of multiplicative-noise systems with input delay in infinite horizon. The optimal predictor-feedback control, provided by the analytical method, is determined by Riccati–ZXL equations and is hard to obtain in the case of unknown system dynamics. We aim to propose a policy iteration (PI) algorithm for solving the optimal solution by approximate dynamic programming. For convergence analysis of the algorithm, we first develop a necessary and sufficient stabilizing condition, in the form of several new Lyapunov-type equations, which parameterizes all predictor-feedback controllers and can be seen as an important addition to Lyapunov stability theory. We then propose an iterative scheme for the Riccati–ZXL equations computations, along with convergence analysis, based on the condition. Inspired by this scheme, a data-driven online PI algorithm, convergence implied in that of the iterative scheme, is proposed for the optimal predictor-feedback control problem without full system dynamics. Finally, a numerical example is used to evaluate the proposed PI algorithm.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142095239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On a class of linear quadratic Gaussian quantilized mean field games 关于一类线性二次高斯量化均值场博弈
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-31 DOI: 10.1016/j.automatica.2024.111878

An energy provider faced with energy generation risks and a large homogeneous pool of customers designs its energy price as a time-varying function of a risk-related quantile of the total energy demand, which generalizes pricing through the mean of the total energy demand. In the infinite population limit, we model the pricing problem with a class of linear quadratic Gaussian quantilized mean field games. For these quantilized mean field games, we show existence and uniqueness of an equilibrium which reveals the price trajectory, as well as an approximate Nash property when the quantilized mean field game’s feedback control functions are applied to the large but finite game and the rate of convergence of the Nash deviation to zero as a function of the population size and the quantile is provided. Finally, the use of this class of quantilized mean field games is illustrated in the context of equivalent thermal parameter models for households heater and an energy provider using solar generation.

能源供应商面临着能源生产风险和大量同质客户,其能源价格的设计是总能源需求中与风险相关的量化值的时变函数,它通过总能源需求的均值进行一般化定价。在无限人口极限中,我们用一类线性二次高斯量化均值场博弈来模拟定价问题。对于这些量化均值场博弈,我们证明了揭示价格轨迹的均衡的存在性和唯一性,以及将量化均值场博弈的反馈控制函数应用于大型但有限博弈时的近似纳什特性,并提供了纳什偏差趋近于零的速率与人口数量和数量级的函数关系。最后,在家庭加热器和使用太阳能发电的能源提供商的等效热参数模型中说明了这一类量化均值场博弈的应用。
{"title":"On a class of linear quadratic Gaussian quantilized mean field games","authors":"","doi":"10.1016/j.automatica.2024.111878","DOIUrl":"10.1016/j.automatica.2024.111878","url":null,"abstract":"<div><p>An energy provider faced with energy generation risks and a large homogeneous pool of customers designs its energy price as a time-varying function of a risk-related quantile of the total energy demand, which generalizes pricing through the mean of the total energy demand. In the infinite population limit, we model the pricing problem with a class of linear quadratic Gaussian quantilized mean field games. For these quantilized mean field games, we show existence and uniqueness of an equilibrium which reveals the price trajectory, as well as an approximate Nash property when the quantilized mean field game’s feedback control functions are applied to the large but finite game and the rate of convergence of the Nash deviation to zero as a function of the population size and the quantile is provided. Finally, the use of this class of quantilized mean field games is illustrated in the context of equivalent thermal parameter models for households heater and an energy provider using solar generation.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142095238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Resilient bipartite consensus of high-order heterogeneous multi-agent systems under Byzantine attacks 拜占庭攻击下高阶异构多代理系统的弹性二方共识
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-28 DOI: 10.1016/j.automatica.2024.111834

This paper addresses a resilient bipartite output consensus issue for high-order heterogeneous multi-agent systems (MASs) with Byzantine attacks. Output regulator equations as well observers are introduced for bipartite leader-following issue with heterogeneous dynamics. For the security concerns, a multidimensional-bipartite-absolute-mean-subsequence-reduced (MBA-MSR) algorithm is developed for each component of received information from neighbors. This algorithm sorts and trims a set of structures, which judge the absolute value of the difference between the output of the current follower observer and that of the signed neighbor observer. It is able to exclude bounded paralyzed signals as the graph is strongly robust. Based on the filtered information via the algorithm, a resilient adaptive observer is designed to for individual follower. A resilient control strategy is then proposed for the MAS to achieve bipartite output consensus under f-local/total attack. For reduction of number of sort, a conditional MBA-MSR (CMBA-MSR) algorithm is also developed. Finally, simulation examples are given to illustrate the effectiveness of the theoretical results.

本文探讨了具有拜占庭攻击的高阶异构多代理系统(MAS)的弹性双向输出共识问题。针对具有异构动态的双向领导-跟随问题,引入了输出调节方程以及观测器。出于安全考虑,针对从邻居那里接收到的信息的每个组成部分,开发了一种多维-双方-绝对-均值-子序列缩减(MBA-MSR)算法。该算法对一组结构进行排序和修剪,判断当前跟随者观测器输出与已签名邻居观测器输出之间差值的绝对值。由于图具有很强的鲁棒性,因此它能排除有界瘫痪信号。基于通过该算法过滤后的信息,为单个跟随者设计了一个弹性自适应观测器。然后为 MAS 提出了一种弹性控制策略,以在 f 局部/总攻击下实现两端输出共识。为了减少排序次数,还开发了一种有条件的 MBA-MSR 算法(CMBA-MSR)。最后,还给出了仿真实例来说明理论结果的有效性。
{"title":"Resilient bipartite consensus of high-order heterogeneous multi-agent systems under Byzantine attacks","authors":"","doi":"10.1016/j.automatica.2024.111834","DOIUrl":"10.1016/j.automatica.2024.111834","url":null,"abstract":"<div><p>This paper addresses a resilient bipartite output consensus issue for high-order heterogeneous multi-agent systems (MASs) with Byzantine attacks. Output regulator equations as well observers are introduced for bipartite leader-following issue with heterogeneous dynamics. For the security concerns, a multidimensional-bipartite-absolute-mean-subsequence-reduced (MBA-MSR) algorithm is developed for each component of received information from neighbors. This algorithm sorts and trims a set of structures, which judge the absolute value of the difference between the output of the current follower observer and that of the signed neighbor observer. It is able to exclude bounded paralyzed signals as the graph is strongly robust. Based on the filtered information via the algorithm, a resilient adaptive observer is designed to for individual follower. A resilient control strategy is then proposed for the MAS to achieve bipartite output consensus under <span><math><mi>f</mi></math></span>-local/total attack. For reduction of number of sort, a conditional MBA-MSR (CMBA-MSR) algorithm is also developed. Finally, simulation examples are given to illustrate the effectiveness of the theoretical results.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142089672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed event-triggered algorithm for convex optimization with coupled constraints 具有耦合约束条件的凸优化分布式事件触发算法
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-24 DOI: 10.1016/j.automatica.2024.111877

This paper develops a distributed primal–dual algorithm via an event-triggered mechanism to solve a class of convex optimization problems subject to local set constraints, coupled equality and inequality constraints. Different from some existing distributed algorithms with the diminishing step-sizes, our algorithm uses the constant step-sizes, and is shown to achieve an exact convergence to an optimal solution with an ergodic convergence rate of O(1/k) for general convex objective functions, where k>0 is the iteration number. Based on the event-triggered communication mechanism, the proposed algorithm can effectively reduce the communication cost without sacrificing the convergence rate. Finally, a numerical example is presented to verify the effectiveness of the proposed algorithm.

本文通过事件触发机制开发了一种分布式原始二元算法,用于求解一类受局部集约束、耦合相等和不等式约束的凸优化问题。与现有的一些步长递减的分布式算法不同,我们的算法采用恒定步长,对于一般的凸目标函数(k>0 为迭代次数),我们的算法能以 O(1/k)的遍历收敛率精确收敛到最优解。基于事件触发的通信机制,所提出的算法可以在不牺牲收敛率的情况下有效降低通信成本。最后,通过一个数值实例验证了所提算法的有效性。
{"title":"Distributed event-triggered algorithm for convex optimization with coupled constraints","authors":"","doi":"10.1016/j.automatica.2024.111877","DOIUrl":"10.1016/j.automatica.2024.111877","url":null,"abstract":"<div><p>This paper develops a distributed primal–dual algorithm via an event-triggered mechanism to solve a class of convex optimization problems subject to local set constraints, coupled equality and inequality constraints. Different from some existing distributed algorithms with the diminishing step-sizes, our algorithm uses the constant step-sizes, and is shown to achieve an exact convergence to an optimal solution with an ergodic convergence rate of <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mn>1</mn><mo>/</mo><mi>k</mi><mo>)</mo></mrow></mrow></math></span> for general convex objective functions, where <span><math><mrow><mi>k</mi><mo>&gt;</mo><mn>0</mn></mrow></math></span> is the iteration number. Based on the event-triggered communication mechanism, the proposed algorithm can effectively reduce the communication cost without sacrificing the convergence rate. Finally, a numerical example is presented to verify the effectiveness of the proposed algorithm.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142048288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exact output tracking in prescribed finite time via funnel control 通过漏斗控制,在规定的有限时间内实现精确的输出跟踪
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-24 DOI: 10.1016/j.automatica.2024.111873

Output reference tracking of unknown nonlinear systems is considered. The control objective is exact tracking in predefined finite time, while in the transient phase the tracking error evolves within a prescribed boundary. To achieve this, a novel high-gain feedback controller is developed that is similar to, but extends, existing high-gain feedback controllers. Feasibility and functioning of the proposed controller is proven rigorously. Examples for the particular control objective under consideration are, for instance, linking up two train sections, or docking of spaceships.

本文考虑了未知非线性系统的输出参考跟踪问题。控制目标是在预定的有限时间内精确跟踪,而在瞬态阶段,跟踪误差在规定的边界内变化。为此,我们开发了一种新型高增益反馈控制器,它与现有的高增益反馈控制器相似,但有所扩展。提出的控制器的可行性和功能已得到严格验证。所考虑的特定控制目标的例子包括,例如,连接两节列车或飞船对接。
{"title":"Exact output tracking in prescribed finite time via funnel control","authors":"","doi":"10.1016/j.automatica.2024.111873","DOIUrl":"10.1016/j.automatica.2024.111873","url":null,"abstract":"<div><p>Output reference tracking of unknown nonlinear systems is considered. The control objective is exact tracking in predefined finite time, while in the transient phase the tracking error evolves within a prescribed boundary. To achieve this, a novel high-gain feedback controller is developed that is similar to, but extends, existing high-gain feedback controllers. Feasibility and functioning of the proposed controller is proven rigorously. Examples for the particular control objective under consideration are, for instance, linking up two train sections, or docking of spaceships.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0005109824003674/pdfft?md5=8b3a61d9d1362e26e9499375bd99056a&pid=1-s2.0-S0005109824003674-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142048287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Finite-time stability analysis of homogeneous systems with sector nonlinearities 具有扇形非线性的均质系统的有限时间稳定性分析
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-24 DOI: 10.1016/j.automatica.2024.111872

A class of homogeneous systems with sector nonlinearities is considered in the paper. Sufficient conditions of finite-time (input-to-state) stability are established with the use of new constructive modification of the Implicit Lyapunov Function approach. The proposed conditions are given in the form of linear matrix inequalities. The theoretical results are supported with numerical examples.

本文考虑了一类具有扇形非线性的均质系统。利用对隐式 Lyapunov 函数方法的新构造性修正,建立了有限时间(输入到状态)稳定性的充分条件。提出的条件以线性矩阵不等式的形式给出。理论结果得到了数值示例的支持。
{"title":"Finite-time stability analysis of homogeneous systems with sector nonlinearities","authors":"","doi":"10.1016/j.automatica.2024.111872","DOIUrl":"10.1016/j.automatica.2024.111872","url":null,"abstract":"<div><p>A class of homogeneous systems with sector nonlinearities is considered in the paper. Sufficient conditions of finite-time (input-to-state) stability are established with the use of new constructive modification of the Implicit Lyapunov Function approach. The proposed conditions are given in the form of linear matrix inequalities. The theoretical results are supported with numerical examples.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142048289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fixed-time Lyapunov criteria of stochastic nonlinear systems revisited and its applications 再论随机非线性系统的固定时间李雅普诺夫准则及其应用
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-24 DOI: 10.1016/j.automatica.2024.111866

In all the references on stochastic fixed-time stability, the customary treatment of the stochastic noise in the worst-case sense is that it is treated as the unfavorable factor for system stability. Consequently, stochastic fixed-time Lyapunov-type conditions are rather restrictive. Realizing this limitation, we revisit stochastic fixed-time stability and present a generalized fixed-time stability theorem for stochastic differential equations. This theorem completely removes the assumption in these references that the differential operator of the Lyapunov function must be strictly negative, and reveals a positive role of the stochastic noise in stochastic fixed-time stability. As the application of this theorem and its corollary, we solve the problem of fixed-time stabilization for stochastic nonlinear systems with high-order and low-order nonlinearities.

在所有关于随机定时稳定性的参考文献中,对最坏情况下随机噪声的习惯处理方法是将其视为系统稳定性的不利因素。因此,随机定时李亚普诺夫类型条件具有相当大的限制性。认识到这一局限性,我们重新审视了随机定时稳定性,并提出了随机微分方程的广义定时稳定性定理。该定理完全消除了这些参考文献中关于 Lyapunov 函数的微分算子必须严格为负的假设,并揭示了随机噪声在随机定时稳定性中的积极作用。应用该定理及其推论,我们解决了具有高阶和低阶非线性的随机非线性系统的定时稳定性问题。
{"title":"Fixed-time Lyapunov criteria of stochastic nonlinear systems revisited and its applications","authors":"","doi":"10.1016/j.automatica.2024.111866","DOIUrl":"10.1016/j.automatica.2024.111866","url":null,"abstract":"<div><p>In all the references on stochastic fixed-time stability, the customary treatment of the stochastic noise in the worst-case sense is that it is treated as the unfavorable factor for system stability. Consequently, stochastic fixed-time Lyapunov-type conditions are rather restrictive. Realizing this limitation, we revisit stochastic fixed-time stability and present a generalized fixed-time stability theorem for stochastic differential equations. This theorem completely removes the assumption in these references that the differential operator of the Lyapunov function must be strictly negative, and reveals a positive role of the stochastic noise in stochastic fixed-time stability. As the application of this theorem and its corollary, we solve the problem of fixed-time stabilization for stochastic nonlinear systems with high-order and low-order nonlinearities.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142048286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Marginal improvement procedures for top-m selection 顶层模型选择的边际改进程序
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-24 DOI: 10.1016/j.automatica.2024.111875

Given a fixed simulation budget, the problem of selecting the best and top-m alternatives among a finite set of alternatives have been studied separately in simulation optimization literature, because the existing sampling procedures are often dedicated to one problem. Under a Bayesian framework, we formulate the top-m selection into a stochastic dynamic program, and characterize the optimal sampling policy via Bellman equations. To determine sequential sampling decisions, we measure the expected marginal improvement from obtaining one additional simulation observation based on predictive distributions, and then develop two cheaply computational approximations to the improvement, thereby yielding two generic sampling procedures that are efficient in selecting top-m alternative(s). The two procedures are proved to be consistent, in a sense that the best and top-m alternatives can be correctly identified as the simulation budget goes to infinity. Numerical experiments on synthetic problems and a coronavirus transmission control application are conducted to demonstrate the efficiency and generality of our procedures.

在给定仿真预算的情况下,在有限的备选方案集中选择最佳和前m个备选方案的问题在仿真优化文献中被单独研究,因为现有的抽样程序通常只针对一个问题。在贝叶斯框架下,我们将前-m 项选择表述为一个随机动态程序,并通过贝尔曼方程表征了最优采样策略。为了确定顺序抽样决策,我们根据预测分布来衡量获得一个额外模拟观测值所带来的预期边际改进,然后开发出两个计算成本低廉的改进近似值,从而得到两个能有效选择前m备选方案的通用抽样程序。这两个程序被证明是一致的,即当模拟预算达到无穷大时,可以正确识别最佳和前 m 个备选方案。我们对合成问题和冠状病毒传播控制应用程序进行了数值实验,以证明我们程序的效率和通用性。
{"title":"Marginal improvement procedures for top-m selection","authors":"","doi":"10.1016/j.automatica.2024.111875","DOIUrl":"10.1016/j.automatica.2024.111875","url":null,"abstract":"<div><p>Given a fixed simulation budget, the problem of selecting the best and top-<span><math><mi>m</mi></math></span> alternatives among a finite set of alternatives have been studied separately in simulation optimization literature, because the existing sampling procedures are often dedicated to one problem. Under a Bayesian framework, we formulate the top-<span><math><mi>m</mi></math></span> selection into a stochastic dynamic program, and characterize the optimal sampling policy via Bellman equations. To determine sequential sampling decisions, we measure the expected marginal improvement from obtaining one additional simulation observation based on predictive distributions, and then develop two cheaply computational approximations to the improvement, thereby yielding two generic sampling procedures that are efficient in selecting top-<span><math><mi>m</mi></math></span> alternative(s). The two procedures are proved to be consistent, in a sense that the best and top-<span><math><mi>m</mi></math></span> alternatives can be correctly identified as the simulation budget goes to infinity. Numerical experiments on synthetic problems and a coronavirus transmission control application are conducted to demonstrate the efficiency and generality of our procedures.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142058183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online distributed nonconvex optimization with stochastic objective functions: High probability bound analysis of dynamic regrets 具有随机目标函数的在线分布式非凸优化:动态遗憾的高概率边界分析
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-24 DOI: 10.1016/j.automatica.2024.111863

In this paper, the problem of online distributed optimization with stochastic and nonconvex objective functions is studied by employing a multi-agent system. When making decisions, each agent only has access to a noisy gradient of its own objective function in the previous time and can only communicate with its immediate neighbors via a time-varying digraph. To handle this problem, an online distributed stochastic projection-free algorithm is proposed. Of particular interest is that the dynamic regrets are employed to measure the performance of the online algorithm. Existing works on online distributed algorithms involving stochastic gradients only provide the sublinearity results of regrets in expectation. Different from them, we study the high probability bounds of dynamic regrets, i.e., the sublinear bounds of dynamic regrets are characterized by the natural logarithm of the failure probability’s inverse. Under mild assumptions on the graph and objective functions, we prove that if the variations in both the objective function sequence and its gradient sequence grow within a certain rate, then the high probability bounds of the dynamic regrets grow sublinearly. Finally, a simulation example is carried out to demonstrate the effectiveness of our theoretical results.

本文利用多代理系统研究了具有随机和非凸目标函数的在线分布式优化问题。在做决策时,每个代理只能获得自己目标函数在前一时间的噪声梯度,并且只能通过时变数字图与近邻通信。为了解决这个问题,我们提出了一种在线分布式无随机投影算法。特别值得关注的是,该算法采用了动态遗憾来衡量在线算法的性能。涉及随机梯度的在线分布式算法的现有研究只提供了期望遗憾的亚线性结果。与之不同,我们研究的是动态遗憾的高概率边界,即动态遗憾的亚线性边界以失败概率的自然对数倒数为特征。在对图和目标函数的温和假设下,我们证明了如果目标函数序列及其梯度序列的变化都在一定速率内增长,那么动态遗憾的高概率边界就会呈亚线性增长。最后,我们通过一个仿真实例来证明我们理论结果的有效性。
{"title":"Online distributed nonconvex optimization with stochastic objective functions: High probability bound analysis of dynamic regrets","authors":"","doi":"10.1016/j.automatica.2024.111863","DOIUrl":"10.1016/j.automatica.2024.111863","url":null,"abstract":"<div><p>In this paper, the problem of online distributed optimization with stochastic and nonconvex objective functions is studied by employing a multi-agent system. When making decisions, each agent only has access to a noisy gradient of its own objective function in the previous time and can only communicate with its immediate neighbors via a time-varying digraph. To handle this problem, an online distributed stochastic projection-free algorithm is proposed. Of particular interest is that the dynamic regrets are employed to measure the performance of the online algorithm. Existing works on online distributed algorithms involving stochastic gradients only provide the sublinearity results of regrets in expectation. Different from them, we study the high probability bounds of dynamic regrets, i.e., the sublinear bounds of dynamic regrets are characterized by the natural logarithm of the failure probability’s inverse. Under mild assumptions on the graph and objective functions, we prove that if the variations in both the objective function sequence and its gradient sequence grow within a certain rate, then the high probability bounds of the dynamic regrets grow sublinearly. Finally, a simulation example is carried out to demonstrate the effectiveness of our theoretical results.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142048290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed adaptive cooperative optimal output regulation via integral reinforcement learning 通过积分强化学习实现分布式自适应合作优化输出调节
IF 4.8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-24 DOI: 10.1016/j.automatica.2024.111861

This paper studies the optimal cooperative output regulation problem for unknown linear multi-agent systems by the integral reinforcement learning technique. Existing results on this problem were obtained by a non-fully distributed learning process. In contrast, we propose a distributed learning algorithm over the jointly connected switching communication networks. Moreover, by modifying the existing algorithm, we reduce the computational cost and weaken the solvability conditions. Two numerical examples are used to illustrate the effectiveness of our approach.

本文通过积分强化学习技术研究了未知线性多代理系统的最优合作输出调节问题。关于该问题的现有结果是通过非完全分布式学习过程获得的。相比之下,我们提出了一种在共同连接的交换通信网络上的分布式学习算法。此外,通过修改现有算法,我们降低了计算成本,削弱了可解条件。我们用两个数值示例来说明我们方法的有效性。
{"title":"Distributed adaptive cooperative optimal output regulation via integral reinforcement learning","authors":"","doi":"10.1016/j.automatica.2024.111861","DOIUrl":"10.1016/j.automatica.2024.111861","url":null,"abstract":"<div><p>This paper studies the optimal cooperative output regulation problem for unknown linear multi-agent systems by the integral reinforcement learning technique. Existing results on this problem were obtained by a non-fully distributed learning process. In contrast, we propose a distributed learning algorithm over the jointly connected switching communication networks. Moreover, by modifying the existing algorithm, we reduce the computational cost and weaken the solvability conditions. Two numerical examples are used to illustrate the effectiveness of our approach.</p></div>","PeriodicalId":55413,"journal":{"name":"Automatica","volume":null,"pages":null},"PeriodicalIF":4.8,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142048285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Automatica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1