首页 > 最新文献

Annual Reviews in Control最新文献

英文 中文
Learning quadrotor dynamics for precise, safe, and agile flight control 学习四旋翼动力学精确,安全和灵活的飞行控制
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.03.009
Alessandro Saviolo, Giuseppe Loianno

This article reviews the state-of-the-art modeling and control techniques for aerial robots such as quadrotor systems and presents several future research directions in this area. The review starts by introducing the benefits and drawbacks of classic physic-based dynamic modeling and control techniques. Subsequently, the manuscript presents the key challenges to augment or replace classic techniques with data-driven approaches that can offer several key benefits in terms of flight precision, safety, adaptation, and agility.

本文综述了最先进的航空机器人建模和控制技术,如四旋翼系统,并提出了该领域未来的几个研究方向。综述首先介绍了经典的基于物理的动态建模和控制技术的优点和缺点。随后,该手稿提出了用数据驱动的方法来增强或取代经典技术的关键挑战,这些方法可以在飞行精度、安全性、适应性和灵活性方面提供几个关键优势。
{"title":"Learning quadrotor dynamics for precise, safe, and agile flight control","authors":"Alessandro Saviolo,&nbsp;Giuseppe Loianno","doi":"10.1016/j.arcontrol.2023.03.009","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.03.009","url":null,"abstract":"<div><p>This article reviews the state-of-the-art modeling and control techniques for aerial robots such as quadrotor systems and presents several future research directions in this area. The review starts by introducing the benefits and drawbacks of classic physic-based dynamic modeling and control techniques. Subsequently, the manuscript presents the key challenges to augment or replace classic techniques with data-driven approaches that can offer several key benefits in terms of flight precision, safety, adaptation, and agility.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 45-60"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49738908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Active queue management for alleviating Internet congestion via a nonlinear differential equation with a variable delay 基于可变延迟非线性微分方程的主动队列管理缓解网络拥塞
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.02.002
Hugues Mounier , Cédric Join , Emmanuel Delaleau , Michel Fliess

Active Queue Management (AQM) for mitigating Internet congestion has been addressed via various feedback control syntheses, especially P, PI, and PID regulators, by using a linear approximation where the “round trip time,” i.e., the delay, is assumed to be constant. This constraint is lifted here by using a nonlinear modeling with a variable delay, introduced more than 20 years ago. This delay, intimately linked to the congestion phenomenon, may be viewed as “ a flat output.” All other system variables, especially the control variable, i.e., the packet loss ratio, are expressed as a function of the delay and its derivatives: they are frozen if the delay is kept constant. This flatness-like property, which demonstrates the mathematical discrepancy of the linear approximation adopted until today, yields also our control strategy in two steps: Firstly, designing an open-loop control, thanks to straightforward Flatness-Based Control (FBC) techniques, and secondly, closing the loop via Model-Free Control (MFC) in order to take into account severe model mismatches, like, here, the number of TCP sessions. Several convincing computer simulations, which are easily implementable, are presented and discussed.

用于缓解互联网拥塞的主动队列管理(AQM)已通过各种反馈控制综合,特别是P、PI和PID调节器,通过使用线性近似来解决,其中“往返时间”(即延迟)假定为常数。通过使用20多年前引入的具有可变延迟的非线性建模,消除了这一限制。这种与拥塞现象密切相关的延迟可以被视为“平坦输出”。所有其他系统变量,尤其是控制变量,即丢包率,都被表示为延迟及其导数的函数:如果延迟保持不变,它们就会被冻结。这种类似平面度的特性证明了迄今为止所采用的线性近似的数学差异,也产生了我们分两步的控制策略:首先,由于直接的基于平面度的控制(FBC)技术,设计开环控制,其次,通过无模型控制(MFC)闭合回路,以考虑严重的模型失配,比如,这里,TCP会话的数量。介绍并讨论了几种易于实现的令人信服的计算机模拟。
{"title":"Active queue management for alleviating Internet congestion via a nonlinear differential equation with a variable delay","authors":"Hugues Mounier ,&nbsp;Cédric Join ,&nbsp;Emmanuel Delaleau ,&nbsp;Michel Fliess","doi":"10.1016/j.arcontrol.2023.02.002","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.02.002","url":null,"abstract":"<div><p><span>Active Queue Management (AQM) for mitigating Internet congestion has been addressed via various feedback control syntheses, especially P, PI, and PID regulators, by using a linear approximation where the “round trip time,” i.e., the delay, is assumed to be constant. This constraint is lifted here by using a nonlinear modeling with a variable delay, introduced more than 20 years ago. This delay, intimately linked to the congestion phenomenon, may be viewed as “ a flat output.” All other system variables, especially the control variable, i.e., the </span>packet loss ratio, are expressed as a function of the delay and its derivatives: they are frozen if the delay is kept constant. This flatness-like property, which demonstrates the mathematical discrepancy of the linear approximation adopted until today, yields also our control strategy in two steps: Firstly, designing an open-loop control, thanks to straightforward Flatness-Based Control (FBC) techniques, and secondly, closing the loop via Model-Free Control (MFC) in order to take into account severe model mismatches, like, here, the number of TCP sessions. Several convincing computer simulations, which are easily implementable, are presented and discussed.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 61-69"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49738910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Review of floating object manipulation by autonomous multi-vessel systems 自主多船系统的漂浮物操纵研究进展
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2022.10.003
Zhe Du, Rudy R. Negenborn, Vasso Reppa

The regulatory endorsement of the International Maritime Organization (IMO) and the support of pivotal shipping market players in recent years motivate the investigation of the potential role that autonomous vessels play in the shipping industry. As the complexity and scale of the envisioned applications increase, research works gradually transform the focus from single-vessel systems to multi-vessel systems. Thus, autonomous multi-vessel systems applied in the shipping industry are becoming a promising research direction. One of the typical research directions is floating object manipulation by multiple tugboats.

This paper offers a comprehensive literature review of the existing research on floating object manipulation by autonomous multi-vessel systems. Based on the prior knowledge of object manipulation problems in multi-robot systems, four typical ways of maritime object manipulation are summarized: attaching, caging, pushing, and towing. The advantages and disadvantages of each manipulation way are discussed, including its typical floating object and application scenarios. Moreover, the aspects of control objective, control architecture, collision avoidance operation, disturbances consideration, and role of each involved vessel are analyzed for gaining insight into the approaches for solving these problems. Finally, challenges and future directions are highlighted to give possible inspiration.

近年来,国际海事组织(海事组织)的监管认可和关键航运市场参与者的支持促使人们对自主船舶在航运业中发挥的潜在作用进行调查。随着所设想应用的复杂性和规模的增加,研究工作逐渐将重点从单船系统转移到多船系统。因此,自主多船系统在航运业中的应用正成为一个很有前途的研究方向。多艘拖船操纵漂浮物是一个典型的研究方向。本文对自主多船系统操纵漂浮物的现有研究进行了全面的文献综述。基于多机器人系统中物体操纵问题的先验知识,总结了四种典型的海上物体操纵方法:附着、锁定、推动和拖曳。讨论了每种操作方式的优缺点,包括其典型的浮动对象和应用场景。此外,还分析了控制目标、控制架构、防撞操作、干扰考虑和每艘相关船只的作用等方面,以深入了解解决这些问题的方法。最后,强调了挑战和未来的方向,以提供可能的灵感。
{"title":"Review of floating object manipulation by autonomous multi-vessel systems","authors":"Zhe Du,&nbsp;Rudy R. Negenborn,&nbsp;Vasso Reppa","doi":"10.1016/j.arcontrol.2022.10.003","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2022.10.003","url":null,"abstract":"<div><p>The regulatory endorsement of the International Maritime Organization (IMO) and the support of pivotal shipping market players in recent years motivate the investigation of the potential role that autonomous vessels play in the shipping industry. As the complexity and scale of the envisioned applications increase, research works gradually transform the focus from single-vessel systems to multi-vessel systems. Thus, autonomous multi-vessel systems applied in the shipping industry are becoming a promising research direction. One of the typical research directions is floating object manipulation by multiple tugboats.</p><p>This paper offers a comprehensive literature review of the existing research on floating object manipulation by autonomous multi-vessel systems. Based on the prior knowledge of object manipulation problems in multi-robot systems, four typical ways of maritime object manipulation are summarized: attaching, caging, pushing, and towing. The advantages and disadvantages of each manipulation way are discussed, including its typical floating object and application scenarios. Moreover, the aspects of control objective, control architecture, collision avoidance operation, disturbances consideration, and role of each involved vessel are analyzed for gaining insight into the approaches for solving these problems. Finally, challenges and future directions are highlighted to give possible inspiration.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 255-278"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49739029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Ergodic risk-sensitive control—A survey 遍历风险敏感控制——一项调查
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.03.001
Anup Biswas , Vivek S. Borkar

Risk-sensitive control has received considerable interest since the seminal work of Howard and Matheson (Howard and Matheson, 1971/72) because of its ability to account for fluctuations about the mean, its connection with H control, and its application to financial mathematics. In this article we attempt to put together a comprehensive survey on the research done on ergodic risk-sensitive control over the last four decades.

自Howard和Matheson(Howard and Matheson,1971/72)的开创性工作以来,风险敏感控制因其能够解释平均值的波动、与H∞控制的联系以及在金融数学中的应用而引起了人们的极大兴趣。在这篇文章中,我们试图对过去四十年来对遍历风险敏感控制的研究进行全面的调查。
{"title":"Ergodic risk-sensitive control—A survey","authors":"Anup Biswas ,&nbsp;Vivek S. Borkar","doi":"10.1016/j.arcontrol.2023.03.001","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.03.001","url":null,"abstract":"<div><p>Risk-sensitive control has received considerable interest since the seminal work of Howard and Matheson (Howard and Matheson, 1971/72) because of its ability to account for fluctuations about the mean, its connection with <span><math><msub><mrow><mi>H</mi></mrow><mrow><mi>∞</mi></mrow></msub></math></span> control, and its application to financial mathematics. In this article we attempt to put together a comprehensive survey on the research done on ergodic risk-sensitive control over the last four decades.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 118-141"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49739390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Modeling, analysis and control of robot–object nonsmooth underactuated Lagrangian systems: A tutorial overview and perspectives 机器人-物体非光滑欠驱动拉格朗日系统的建模、分析和控制:教程概述和展望
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2022.12.002
Bernard Brogliato

So-called robot–object Lagrangian systems consist of a class of nonsmooth underactuated complementarity Lagrangian systems, with a specific structure: an “object” and a “robot”. Only the robot is actuated. The object dynamics can thus be controlled only through the action of the contact Lagrange multipliers, which represent the interaction forces between the robot and the object. Juggling, walking, running, hopping machines, robotic systems that manipulate objects, tapping, pushing systems, kinematic chains with joint clearance, crawling, climbing robots, some cable-driven manipulators, and some circuits with set-valued nonsmooth components, belong this class. This article aims at presenting their main features, then many application examples which belong to the robot–object class, then reviewing the main tools and control strategies which have been proposed in the Automatic Control and in the Robotics literature. Some comments and open issues conclude the article.

所谓的机器人-物体拉格朗日系统由一类非光滑欠驱动互补拉格朗日系统组成,具有特定的结构:“物体”和“机器人”。只有机器人被启动。因此,物体动力学只能通过接触拉格朗日乘子的作用来控制,拉格朗日乘子表示机器人和物体之间的相互作用力。杂耍、行走、跑步、跳跃机、操纵物体的机器人系统、敲击、推动系统、具有关节间隙的运动链、爬行、攀爬机器人、一些电缆驱动的机械手以及一些具有集值非光滑组件的电路都属于这一类。本文旨在介绍它们的主要特征,然后介绍许多属于机器人-对象类的应用实例,然后回顾自动控制和机器人学文献中提出的主要工具和控制策略。一些评论和悬而未决的问题总结了这篇文章。
{"title":"Modeling, analysis and control of robot–object nonsmooth underactuated Lagrangian systems: A tutorial overview and perspectives","authors":"Bernard Brogliato","doi":"10.1016/j.arcontrol.2022.12.002","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2022.12.002","url":null,"abstract":"<div><p><span>So-called robot–object Lagrangian systems consist of a class of nonsmooth underactuated complementarity Lagrangian systems, with a specific structure: an “object” and a “robot”. Only the robot is actuated. The object dynamics can thus be controlled only through the action of the contact Lagrange multipliers, which represent the interaction forces between the robot and the object. Juggling, walking, running, hopping machines, </span>robotic systems<span><span><span> that manipulate objects, tapping, pushing systems, </span>kinematic chains<span> with joint clearance, crawling, </span></span>climbing robots, some cable-driven manipulators, and some circuits with set-valued nonsmooth components, belong this class. This article aims at presenting their main features, then many application examples which belong to the robot–object class, then reviewing the main tools and control strategies which have been proposed in the Automatic Control and in the Robotics literature. Some comments and open issues conclude the article.</span></p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 297-337"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49762809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Learning and forgetting in systems neuroscience: A control perspective 系统神经科学中的学习与遗忘:控制视角
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.100912
Erick Mejia Uzeda, Mohamed A. Hafez, Mireille E. Broucke

A longstanding open problem of systems neuroscience is to understand how the brain calibrates thousands of reflexes to achieve near instantaneous disturbance rejection. While reflexes typically act locally at the site of sensory measurements, the adaptation of reflex gains is the result of an ingenious architecture in which knowledge of disturbances is transferred from the cerebellum to the deep cerebellar nuclei or the brainstem. This paper investigates the use of control theory as the mathematical foundation to explain the mechanisms by which such forms of learning, as well as forgetting, manifest themselves in systems neuroscience. Particularly, we use adaptive control and averaging theory to model the computations performed in learning appropriate reflex gains. While forgetting is perceived as counter-productive to learning, we show that if incorporated correctly, it can endow the much needed robustness to train thousands of reflexes without interfering with their adaptation. This is accomplished using the μ-modification which achieves robustness of adaptive schemes through the estimation of exciting subspaces. Our techniques are combined in a comprehensive model, with simulations illustrating their effectiveness.

系统神经科学的一个长期悬而未决的问题是了解大脑如何校准成千上万的反射,以实现近乎瞬时的干扰抑制。条件反射通常在感觉测量部位局部起作用,而条件反射增益的适应则是一种巧妙结构的结果,在这种结构中,有关干扰的知识从小脑转移到小脑深核或脑干。本文研究了以控制论为数学基础,解释系统神经科学中这种形式的学习和遗忘的表现机制。特别是,我们使用自适应控制和平均理论来模拟在学习适当的反射增益时所进行的计算。虽然遗忘被认为会对学习产生反作用,但我们的研究表明,如果能正确地将遗忘纳入其中,就能赋予训练成千上万个条件反射所急需的稳健性,而不会干扰它们的适应性。我们利用μ修正来实现这一点,它通过估计令人兴奋的子空间来实现自适应方案的稳健性。我们将这些技术结合到一个综合模型中,并通过模拟说明了它们的有效性。
{"title":"Learning and forgetting in systems neuroscience: A control perspective","authors":"Erick Mejia Uzeda,&nbsp;Mohamed A. Hafez,&nbsp;Mireille E. Broucke","doi":"10.1016/j.arcontrol.2023.100912","DOIUrl":"10.1016/j.arcontrol.2023.100912","url":null,"abstract":"<div><p>A longstanding open problem of systems neuroscience is to understand how the brain calibrates thousands of reflexes to achieve near instantaneous disturbance rejection. While reflexes typically act locally at the site of sensory measurements, the adaptation of reflex gains is the result of an ingenious architecture in which knowledge of disturbances is transferred from the cerebellum to the deep cerebellar nuclei or the brainstem. This paper investigates the use of control theory as the mathematical foundation to explain the mechanisms by which such forms of learning, as well as forgetting, manifest themselves in systems neuroscience. Particularly, we use adaptive control and averaging theory to model the computations performed in learning appropriate reflex gains. While forgetting is perceived as counter-productive to learning, we show that if incorporated correctly, it can endow the much needed robustness to train thousands of reflexes without interfering with their adaptation. This is accomplished using the <span><math><mi>μ</mi></math></span>-modification which achieves robustness of adaptive schemes through the estimation of exciting subspaces. Our techniques are combined in a comprehensive model, with simulations illustrating their effectiveness.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"56 ","pages":"Article 100912"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135515912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Value-gradient iteration with quadratic approximate value functions 二次逼近函数的值梯度迭代
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.100917
Alan Yang, Stephen Boyd

We propose a method for designing policies for convex stochastic control problems characterized by random linear dynamics and convex stage cost. We consider policies that employ quadratic approximate value functions as a substitute for the true value function. Evaluating the associated control policy involves solving a convex problem, typically a quadratic program, which can be carried out reliably in real-time. Such policies often perform well even when the approximate value function is not a particularly good approximation of the true value function. We propose value-gradient iteration, which fits the gradient of value function, with regularization that can include constraints reflecting known bounds on the true value function. Our value-gradient iteration method can yield a good approximate value function with few samples, and little hyperparameter tuning. We find that the method can find a good policy with computational effort comparable to that required to just evaluate a control policy via simulation.

针对具有随机线性动力学和凸阶段代价的凸随机控制问题,提出了一种策略设计方法。我们考虑使用二次近似值函数代替真值函数的策略。评估相关的控制策略涉及求解一个凸问题,通常是一个二次规划,可以可靠地实时执行。即使近似值函数不是真实值函数的特别好的近似值,这种策略通常也会表现良好。我们提出了值梯度迭代,它适合值函数的梯度,正则化可以包括反映真值函数上已知边界的约束。我们的值梯度迭代方法可以在少量样本和少量超参数调优的情况下得到一个很好的近似值函数。我们发现,该方法可以找到一个好的策略,其计算量与仅通过仿真评估控制策略所需的计算量相当。
{"title":"Value-gradient iteration with quadratic approximate value functions","authors":"Alan Yang,&nbsp;Stephen Boyd","doi":"10.1016/j.arcontrol.2023.100917","DOIUrl":"10.1016/j.arcontrol.2023.100917","url":null,"abstract":"<div><p>We propose a method for designing policies for convex stochastic control problems characterized by random linear dynamics and convex stage cost. We consider policies that employ quadratic approximate value functions as a substitute for the true value function. Evaluating the associated control policy involves solving a convex problem<span>, typically a quadratic program, which can be carried out reliably in real-time. Such policies often perform well even when the approximate value function is not a particularly good approximation of the true value function. We propose value-gradient iteration, which fits the gradient of value function, with regularization that can include constraints reflecting known bounds on the true value function. Our value-gradient iteration method can yield a good approximate value function with few samples, and little hyperparameter tuning. We find that the method can find a good policy with computational effort comparable to that required to just evaluate a control policy via simulation.</span></p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"56 ","pages":"Article 100917"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135609020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Probabilistic feasibility in data-driven multi-agent non-convex optimization 数据驱动多智能体非凸优化的概率可行性
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.100925
Lucrezia Manieri, Alessandro Falsone, Maria Prandini

In this paper, we focus on the optimal operation of a multi-agent system affected by uncertainty. In particular, we consider a cooperative setting where agents jointly optimize a performance index compatibly with individual constraints on their discrete and continuous decision variables and with coupling global constraints. We assume that individual constraints are affected by uncertainty, which is known to each agent via a private set of data that cannot be shared with others. Exploiting tools from statistical learning theory, we provide data-based probabilistic feasibility guarantees for a (possibly sub-optimal) solution of the multi-agent problem that is obtained via a decentralized/distributed scheme that preserves the privacy of the local information. The generalization properties of the data-based solution are shown to depend on the size of each local dataset and on the complexity of the uncertain individual constraint sets. Explicit bounds are derived in the case of linear individual constraints. A comparative analysis with the cases of a common dataset and of local uncertainties that are independent is performed.

本文主要研究受不确定性影响的多智能体系统的最优运行问题。特别是,我们考虑了一种合作设置,其中智能体共同优化性能指标,该指标与对其离散和连续决策变量的单个约束以及耦合全局约束兼容。我们假设个体约束受到不确定性的影响,每个代理都通过一组私有数据知道不确定性,这些数据不能与其他代理共享。利用统计学习理论的工具,我们为通过分散/分布式方案获得的多代理问题(可能是次优的)解决方案提供基于数据的概率可行性保证,该方案保留了本地信息的隐私性。基于数据的解决方案的泛化特性取决于每个局部数据集的大小和不确定单个约束集的复杂性。在线性个体约束的情况下,导出了显式边界。对一个公共数据集和独立的局部不确定性进行了比较分析。
{"title":"Probabilistic feasibility in data-driven multi-agent non-convex optimization","authors":"Lucrezia Manieri,&nbsp;Alessandro Falsone,&nbsp;Maria Prandini","doi":"10.1016/j.arcontrol.2023.100925","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.100925","url":null,"abstract":"<div><p>In this paper, we focus on the optimal operation of a multi-agent system affected by uncertainty. In particular, we consider a cooperative setting where agents jointly optimize a performance index compatibly with individual constraints on their discrete and continuous decision variables and with coupling global constraints. We assume that individual constraints are affected by uncertainty, which is known to each agent via a private set of data that cannot be shared with others. Exploiting tools from statistical learning theory, we provide data-based probabilistic feasibility guarantees for a (possibly sub-optimal) solution of the multi-agent problem that is obtained via a decentralized/distributed scheme that preserves the privacy of the local information. The generalization properties of the data-based solution are shown to depend on the size of each local dataset and on the complexity of the uncertain individual constraint sets. Explicit bounds are derived in the case of linear individual constraints. A comparative analysis with the cases of a common dataset and of local uncertainties that are independent is performed.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"56 ","pages":"Article 100925"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1367578823000895/pdfft?md5=400ea0f3b2244fc3aeddd15afd97f00f&pid=1-s2.0-S1367578823000895-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136697008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Addendum: Predictive form of the FPM model 附录:FPM模型的预测形式
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.02.001
Alain Oustaloup , François Levron , Stéphane Victor , Luc Dugard
<div><p>The article Oustaloup et al. (2021) has shown that the Fractional Power Model (FPM), <span><math><mrow><mi>A</mi><mo>+</mo><mi>B</mi><msup><mrow><mi>t</mi></mrow><mrow><mi>m</mi></mrow></msup></mrow></math></span>, enables well representing the cumulated data of COVID infections, thanks to a nonlinear identification technique. Beyond this identification interval, the article has also shown that the model enables predicting the future values on an unusual prediction horizon as for its range. The objective of this addendum is to explain, via an autoregressive form, why this model intrinsically benefits from such a predictivity property, the idea being to show the interest of the FPM model by highlighting its <em>predictive specificity</em>, inherent to non-integer integration that conditions the model. More precisely, this addendum establishes a <em>predictive form with long memory</em><span> of the FPM model. This form corresponds to an autoregressive (AR) filter of infinite order. Taking into account the whole past through an indefinite linear combination of past values, a first predictive form, said to be with </span><em>long memory</em>, results from an approach using one of the formulations of non-integer differentiation. Actually, as this first predictive form is the one of the power-law, <span><math><msup><mrow><mi>t</mi></mrow><mrow><mi>m</mi></mrow></msup></math></span>, its adaptation to the FPM model, <span><math><mrow><mi>A</mi><mo>+</mo><mi>B</mi><msup><mrow><mi>t</mi></mrow><mrow><mi>m</mi></mrow></msup></mrow></math></span>, which generalizes the linear regression, <span><math><mrow><mi>A</mi><mo>+</mo><mi>B</mi><mi>t</mi></mrow></math></span>, is then straightforward: it leads to the <em>predictive form of the FPM model</em> that specifies the model in prediction. This predictive form with long memory shows that the <em>predictivity</em> of the FPM model is such that <em>any predicted value takes into account the whole past</em>, according to a weighted sum of all the past values. These values are taken into account through weighting coefficients, that, for <span><math><mrow><mi>m</mi><mo>></mo><mo>−</mo><mn>1</mn></mrow></math></span> and <em>a fortiori</em> for <span><math><mrow><mi>m</mi><mo>></mo><mn>0</mn></mrow></math></span>, correspond to an <em>attenuation of the past</em>, that the non-integer power, <span><math><mi>m</mi></math></span><span>, determines by itself. To confirm the specificity of the FPM model in considering the past, this model is compared with a model of another nature, also having three parameters, namely an exponential model (Liu et al. (2020); Sallahi et al. (2021)): whereas, for the FPM model, the past is taken into account </span><em>globally</em> through <em>all past instants</em>, for the exponential model, the past is taken into account only <em>locally</em> through <em>one single past instant</em>, the predictive form of the model having a <em>short memory</em> and corresponding to a
Oustaloup等人。(2021)已经表明,由于非线性识别技术,分数幂模型(FPM)A+Btm能够很好地表示新冠病毒感染的累积数据。除了这个识别区间,文章还表明,该模型能够在不寻常的预测范围内预测未来的值。本附录的目的是通过自回归形式解释为什么该模型本质上受益于这种预测性,其思想是通过强调其预测特异性来显示FPM模型的兴趣,这是制约模型的非整数积分所固有的。更准确地说,本附录建立了FPM模型的长记忆预测形式。这种形式对应于无限阶的自回归(AR)滤波器。通过对过去值的不确定线性组合来考虑整个过去,据说具有长记忆的第一种预测形式来自于使用非整数微分公式之一的方法。事实上,由于第一种预测形式是幂律的一种,tm,它对FPM模型A+Btm的适应,推广了线性回归A+Bt,因此是直接的:它导致了FPM模型的预测形式,在预测中指定了模型。这种具有长记忆的预测形式表明,FPM模型的预测性使得根据所有过去值的加权和,任何预测值都考虑了整个过去。通过加权系数来考虑这些值,即对于m>;−1,更进一步的是m>;0,对应于过去的衰减,该衰减由非整数幂m自己确定。为了证实FPM模型在考虑过去时的特异性,将该模型与另一种性质的模型进行了比较,该模型也有三个参数,即指数模型(Liu et al.(2020);Sallahi等人。(2021)):而对于FPM模型,过去通过所有过去瞬间被全局考虑,对于指数模型,过去仅通过一个过去瞬间被局部考虑,该模型的预测形式具有短记忆,对应于1阶AR滤波器。在这两个模型的预测中获得的比较结果显示了FPM模型的预测兴趣。
{"title":"Addendum: Predictive form of the FPM model","authors":"Alain Oustaloup ,&nbsp;François Levron ,&nbsp;Stéphane Victor ,&nbsp;Luc Dugard","doi":"10.1016/j.arcontrol.2023.02.001","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.02.001","url":null,"abstract":"&lt;div&gt;&lt;p&gt;The article Oustaloup et al. (2021) has shown that the Fractional Power Model (FPM), &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mi&gt;A&lt;/mi&gt;&lt;mo&gt;+&lt;/mo&gt;&lt;mi&gt;B&lt;/mi&gt;&lt;msup&gt;&lt;mrow&gt;&lt;mi&gt;t&lt;/mi&gt;&lt;/mrow&gt;&lt;mrow&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;/mrow&gt;&lt;/msup&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt;, enables well representing the cumulated data of COVID infections, thanks to a nonlinear identification technique. Beyond this identification interval, the article has also shown that the model enables predicting the future values on an unusual prediction horizon as for its range. The objective of this addendum is to explain, via an autoregressive form, why this model intrinsically benefits from such a predictivity property, the idea being to show the interest of the FPM model by highlighting its &lt;em&gt;predictive specificity&lt;/em&gt;, inherent to non-integer integration that conditions the model. More precisely, this addendum establishes a &lt;em&gt;predictive form with long memory&lt;/em&gt;&lt;span&gt; of the FPM model. This form corresponds to an autoregressive (AR) filter of infinite order. Taking into account the whole past through an indefinite linear combination of past values, a first predictive form, said to be with &lt;/span&gt;&lt;em&gt;long memory&lt;/em&gt;, results from an approach using one of the formulations of non-integer differentiation. Actually, as this first predictive form is the one of the power-law, &lt;span&gt;&lt;math&gt;&lt;msup&gt;&lt;mrow&gt;&lt;mi&gt;t&lt;/mi&gt;&lt;/mrow&gt;&lt;mrow&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;/mrow&gt;&lt;/msup&gt;&lt;/math&gt;&lt;/span&gt;, its adaptation to the FPM model, &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mi&gt;A&lt;/mi&gt;&lt;mo&gt;+&lt;/mo&gt;&lt;mi&gt;B&lt;/mi&gt;&lt;msup&gt;&lt;mrow&gt;&lt;mi&gt;t&lt;/mi&gt;&lt;/mrow&gt;&lt;mrow&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;/mrow&gt;&lt;/msup&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt;, which generalizes the linear regression, &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mi&gt;A&lt;/mi&gt;&lt;mo&gt;+&lt;/mo&gt;&lt;mi&gt;B&lt;/mi&gt;&lt;mi&gt;t&lt;/mi&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt;, is then straightforward: it leads to the &lt;em&gt;predictive form of the FPM model&lt;/em&gt; that specifies the model in prediction. This predictive form with long memory shows that the &lt;em&gt;predictivity&lt;/em&gt; of the FPM model is such that &lt;em&gt;any predicted value takes into account the whole past&lt;/em&gt;, according to a weighted sum of all the past values. These values are taken into account through weighting coefficients, that, for &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;mo&gt;&gt;&lt;/mo&gt;&lt;mo&gt;−&lt;/mo&gt;&lt;mn&gt;1&lt;/mn&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt; and &lt;em&gt;a fortiori&lt;/em&gt; for &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;mo&gt;&gt;&lt;/mo&gt;&lt;mn&gt;0&lt;/mn&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt;, correspond to an &lt;em&gt;attenuation of the past&lt;/em&gt;, that the non-integer power, &lt;span&gt;&lt;math&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;/math&gt;&lt;/span&gt;&lt;span&gt;, determines by itself. To confirm the specificity of the FPM model in considering the past, this model is compared with a model of another nature, also having three parameters, namely an exponential model (Liu et al. (2020); Sallahi et al. (2021)): whereas, for the FPM model, the past is taken into account &lt;/span&gt;&lt;em&gt;globally&lt;/em&gt; through &lt;em&gt;all past instants&lt;/em&gt;, for the exponential model, the past is taken into account only &lt;em&gt;locally&lt;/em&gt; through &lt;em&gt;one single past instant&lt;/em&gt;, the predictive form of the model having a &lt;em&gt;short memory&lt;/em&gt; and corresponding to a","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 291-296"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49762799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A survey of feedback particle filter and related controlled interacting particle systems (CIPS) 反馈粒子滤波器及相关受控相互作用粒子系统(CIPS)综述
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.03.006
Amirhossein Taghvaei , Prashant G. Mehta

In this survey, we describe controlled interacting particle systems (CIPS) to approximate the solution of the optimal filtering and the optimal control problems. Part I of the survey is focussed on the feedback particle filter (FPF) algorithm, its derivation based on optimal transportation theory, and its relationship to the ensemble Kalman filter (EnKF) and the conventional sequential importance sampling–resampling (SIR) particle filters. The central numerical problem of FPF—to approximate the solution of the Poisson equation—is described together with the main solution approaches. An analytical and numerical comparison with the SIR particle filter is given to illustrate the advantages of the CIPS approach. Part II of the survey is focussed on adapting these algorithms for the problem of reinforcement learning. The survey includes several remarks that describe extensions as well as open problems in this subject.

在这篇综述中,我们描述了受控相互作用粒子系统(CIPS)来近似最优滤波和最优控制问题的解。调查的第一部分集中于反馈粒子滤波器(FPF)算法,它基于最优传输理论的推导,以及它与集成卡尔曼滤波器(EnKF)和传统的顺序重要性采样-重采样(SIR)粒子滤波器的关系。FPF的中心数值问题——近似泊松方程的解——与主要的求解方法一起描述。通过与SIR粒子滤波器的分析和数值比较,说明了CIPS方法的优点。调查的第二部分集中于将这些算法用于强化学习问题。该调查包括几条评论,描述了该主题中的扩展以及悬而未决的问题。
{"title":"A survey of feedback particle filter and related controlled interacting particle systems (CIPS)","authors":"Amirhossein Taghvaei ,&nbsp;Prashant G. Mehta","doi":"10.1016/j.arcontrol.2023.03.006","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.03.006","url":null,"abstract":"<div><p><span>In this survey, we describe controlled interacting particle systems (CIPS) to approximate the solution of the optimal filtering and the optimal control problems. Part I of the survey is focussed on the feedback particle filter (FPF) algorithm, its derivation based on optimal transportation theory, and its relationship to the ensemble </span>Kalman filter (EnKF) and the conventional sequential importance sampling–resampling (SIR) particle filters. The central numerical problem of FPF—to approximate the solution of the Poisson equation—is described together with the main solution approaches. An analytical and numerical comparison with the SIR particle filter is given to illustrate the advantages of the CIPS approach. Part II of the survey is focussed on adapting these algorithms for the problem of reinforcement learning. The survey includes several remarks that describe extensions as well as open problems in this subject.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 356-378"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49762819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annual Reviews in Control
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1