首页 > 最新文献

Annual Reviews in Control最新文献

英文 中文
Ergodic risk-sensitive control—A survey 遍历风险敏感控制——一项调查
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.03.001
Anup Biswas , Vivek S. Borkar

Risk-sensitive control has received considerable interest since the seminal work of Howard and Matheson (Howard and Matheson, 1971/72) because of its ability to account for fluctuations about the mean, its connection with H control, and its application to financial mathematics. In this article we attempt to put together a comprehensive survey on the research done on ergodic risk-sensitive control over the last four decades.

自Howard和Matheson(Howard and Matheson,1971/72)的开创性工作以来,风险敏感控制因其能够解释平均值的波动、与H∞控制的联系以及在金融数学中的应用而引起了人们的极大兴趣。在这篇文章中,我们试图对过去四十年来对遍历风险敏感控制的研究进行全面的调查。
{"title":"Ergodic risk-sensitive control—A survey","authors":"Anup Biswas ,&nbsp;Vivek S. Borkar","doi":"10.1016/j.arcontrol.2023.03.001","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.03.001","url":null,"abstract":"<div><p>Risk-sensitive control has received considerable interest since the seminal work of Howard and Matheson (Howard and Matheson, 1971/72) because of its ability to account for fluctuations about the mean, its connection with <span><math><msub><mrow><mi>H</mi></mrow><mrow><mi>∞</mi></mrow></msub></math></span> control, and its application to financial mathematics. In this article we attempt to put together a comprehensive survey on the research done on ergodic risk-sensitive control over the last four decades.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 118-141"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49739390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Modeling, analysis and control of robot–object nonsmooth underactuated Lagrangian systems: A tutorial overview and perspectives 机器人-物体非光滑欠驱动拉格朗日系统的建模、分析和控制:教程概述和展望
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2022.12.002
Bernard Brogliato

So-called robot–object Lagrangian systems consist of a class of nonsmooth underactuated complementarity Lagrangian systems, with a specific structure: an “object” and a “robot”. Only the robot is actuated. The object dynamics can thus be controlled only through the action of the contact Lagrange multipliers, which represent the interaction forces between the robot and the object. Juggling, walking, running, hopping machines, robotic systems that manipulate objects, tapping, pushing systems, kinematic chains with joint clearance, crawling, climbing robots, some cable-driven manipulators, and some circuits with set-valued nonsmooth components, belong this class. This article aims at presenting their main features, then many application examples which belong to the robot–object class, then reviewing the main tools and control strategies which have been proposed in the Automatic Control and in the Robotics literature. Some comments and open issues conclude the article.

所谓的机器人-物体拉格朗日系统由一类非光滑欠驱动互补拉格朗日系统组成,具有特定的结构:“物体”和“机器人”。只有机器人被启动。因此,物体动力学只能通过接触拉格朗日乘子的作用来控制,拉格朗日乘子表示机器人和物体之间的相互作用力。杂耍、行走、跑步、跳跃机、操纵物体的机器人系统、敲击、推动系统、具有关节间隙的运动链、爬行、攀爬机器人、一些电缆驱动的机械手以及一些具有集值非光滑组件的电路都属于这一类。本文旨在介绍它们的主要特征,然后介绍许多属于机器人-对象类的应用实例,然后回顾自动控制和机器人学文献中提出的主要工具和控制策略。一些评论和悬而未决的问题总结了这篇文章。
{"title":"Modeling, analysis and control of robot–object nonsmooth underactuated Lagrangian systems: A tutorial overview and perspectives","authors":"Bernard Brogliato","doi":"10.1016/j.arcontrol.2022.12.002","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2022.12.002","url":null,"abstract":"<div><p><span>So-called robot–object Lagrangian systems consist of a class of nonsmooth underactuated complementarity Lagrangian systems, with a specific structure: an “object” and a “robot”. Only the robot is actuated. The object dynamics can thus be controlled only through the action of the contact Lagrange multipliers, which represent the interaction forces between the robot and the object. Juggling, walking, running, hopping machines, </span>robotic systems<span><span><span> that manipulate objects, tapping, pushing systems, </span>kinematic chains<span> with joint clearance, crawling, </span></span>climbing robots, some cable-driven manipulators, and some circuits with set-valued nonsmooth components, belong this class. This article aims at presenting their main features, then many application examples which belong to the robot–object class, then reviewing the main tools and control strategies which have been proposed in the Automatic Control and in the Robotics literature. Some comments and open issues conclude the article.</span></p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 297-337"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49762809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Learning and forgetting in systems neuroscience: A control perspective 系统神经科学中的学习与遗忘:控制视角
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.100912
Erick Mejia Uzeda, Mohamed A. Hafez, Mireille E. Broucke

A longstanding open problem of systems neuroscience is to understand how the brain calibrates thousands of reflexes to achieve near instantaneous disturbance rejection. While reflexes typically act locally at the site of sensory measurements, the adaptation of reflex gains is the result of an ingenious architecture in which knowledge of disturbances is transferred from the cerebellum to the deep cerebellar nuclei or the brainstem. This paper investigates the use of control theory as the mathematical foundation to explain the mechanisms by which such forms of learning, as well as forgetting, manifest themselves in systems neuroscience. Particularly, we use adaptive control and averaging theory to model the computations performed in learning appropriate reflex gains. While forgetting is perceived as counter-productive to learning, we show that if incorporated correctly, it can endow the much needed robustness to train thousands of reflexes without interfering with their adaptation. This is accomplished using the μ-modification which achieves robustness of adaptive schemes through the estimation of exciting subspaces. Our techniques are combined in a comprehensive model, with simulations illustrating their effectiveness.

系统神经科学的一个长期悬而未决的问题是了解大脑如何校准成千上万的反射,以实现近乎瞬时的干扰抑制。条件反射通常在感觉测量部位局部起作用,而条件反射增益的适应则是一种巧妙结构的结果,在这种结构中,有关干扰的知识从小脑转移到小脑深核或脑干。本文研究了以控制论为数学基础,解释系统神经科学中这种形式的学习和遗忘的表现机制。特别是,我们使用自适应控制和平均理论来模拟在学习适当的反射增益时所进行的计算。虽然遗忘被认为会对学习产生反作用,但我们的研究表明,如果能正确地将遗忘纳入其中,就能赋予训练成千上万个条件反射所急需的稳健性,而不会干扰它们的适应性。我们利用μ修正来实现这一点,它通过估计令人兴奋的子空间来实现自适应方案的稳健性。我们将这些技术结合到一个综合模型中,并通过模拟说明了它们的有效性。
{"title":"Learning and forgetting in systems neuroscience: A control perspective","authors":"Erick Mejia Uzeda,&nbsp;Mohamed A. Hafez,&nbsp;Mireille E. Broucke","doi":"10.1016/j.arcontrol.2023.100912","DOIUrl":"10.1016/j.arcontrol.2023.100912","url":null,"abstract":"<div><p>A longstanding open problem of systems neuroscience is to understand how the brain calibrates thousands of reflexes to achieve near instantaneous disturbance rejection. While reflexes typically act locally at the site of sensory measurements, the adaptation of reflex gains is the result of an ingenious architecture in which knowledge of disturbances is transferred from the cerebellum to the deep cerebellar nuclei or the brainstem. This paper investigates the use of control theory as the mathematical foundation to explain the mechanisms by which such forms of learning, as well as forgetting, manifest themselves in systems neuroscience. Particularly, we use adaptive control and averaging theory to model the computations performed in learning appropriate reflex gains. While forgetting is perceived as counter-productive to learning, we show that if incorporated correctly, it can endow the much needed robustness to train thousands of reflexes without interfering with their adaptation. This is accomplished using the <span><math><mi>μ</mi></math></span>-modification which achieves robustness of adaptive schemes through the estimation of exciting subspaces. Our techniques are combined in a comprehensive model, with simulations illustrating their effectiveness.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"56 ","pages":"Article 100912"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135515912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Value-gradient iteration with quadratic approximate value functions 二次逼近函数的值梯度迭代
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.100917
Alan Yang, Stephen Boyd

We propose a method for designing policies for convex stochastic control problems characterized by random linear dynamics and convex stage cost. We consider policies that employ quadratic approximate value functions as a substitute for the true value function. Evaluating the associated control policy involves solving a convex problem, typically a quadratic program, which can be carried out reliably in real-time. Such policies often perform well even when the approximate value function is not a particularly good approximation of the true value function. We propose value-gradient iteration, which fits the gradient of value function, with regularization that can include constraints reflecting known bounds on the true value function. Our value-gradient iteration method can yield a good approximate value function with few samples, and little hyperparameter tuning. We find that the method can find a good policy with computational effort comparable to that required to just evaluate a control policy via simulation.

针对具有随机线性动力学和凸阶段代价的凸随机控制问题,提出了一种策略设计方法。我们考虑使用二次近似值函数代替真值函数的策略。评估相关的控制策略涉及求解一个凸问题,通常是一个二次规划,可以可靠地实时执行。即使近似值函数不是真实值函数的特别好的近似值,这种策略通常也会表现良好。我们提出了值梯度迭代,它适合值函数的梯度,正则化可以包括反映真值函数上已知边界的约束。我们的值梯度迭代方法可以在少量样本和少量超参数调优的情况下得到一个很好的近似值函数。我们发现,该方法可以找到一个好的策略,其计算量与仅通过仿真评估控制策略所需的计算量相当。
{"title":"Value-gradient iteration with quadratic approximate value functions","authors":"Alan Yang,&nbsp;Stephen Boyd","doi":"10.1016/j.arcontrol.2023.100917","DOIUrl":"10.1016/j.arcontrol.2023.100917","url":null,"abstract":"<div><p>We propose a method for designing policies for convex stochastic control problems characterized by random linear dynamics and convex stage cost. We consider policies that employ quadratic approximate value functions as a substitute for the true value function. Evaluating the associated control policy involves solving a convex problem<span>, typically a quadratic program, which can be carried out reliably in real-time. Such policies often perform well even when the approximate value function is not a particularly good approximation of the true value function. We propose value-gradient iteration, which fits the gradient of value function, with regularization that can include constraints reflecting known bounds on the true value function. Our value-gradient iteration method can yield a good approximate value function with few samples, and little hyperparameter tuning. We find that the method can find a good policy with computational effort comparable to that required to just evaluate a control policy via simulation.</span></p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"56 ","pages":"Article 100917"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135609020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Probabilistic feasibility in data-driven multi-agent non-convex optimization 数据驱动多智能体非凸优化的概率可行性
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.100925
Lucrezia Manieri, Alessandro Falsone, Maria Prandini

In this paper, we focus on the optimal operation of a multi-agent system affected by uncertainty. In particular, we consider a cooperative setting where agents jointly optimize a performance index compatibly with individual constraints on their discrete and continuous decision variables and with coupling global constraints. We assume that individual constraints are affected by uncertainty, which is known to each agent via a private set of data that cannot be shared with others. Exploiting tools from statistical learning theory, we provide data-based probabilistic feasibility guarantees for a (possibly sub-optimal) solution of the multi-agent problem that is obtained via a decentralized/distributed scheme that preserves the privacy of the local information. The generalization properties of the data-based solution are shown to depend on the size of each local dataset and on the complexity of the uncertain individual constraint sets. Explicit bounds are derived in the case of linear individual constraints. A comparative analysis with the cases of a common dataset and of local uncertainties that are independent is performed.

本文主要研究受不确定性影响的多智能体系统的最优运行问题。特别是,我们考虑了一种合作设置,其中智能体共同优化性能指标,该指标与对其离散和连续决策变量的单个约束以及耦合全局约束兼容。我们假设个体约束受到不确定性的影响,每个代理都通过一组私有数据知道不确定性,这些数据不能与其他代理共享。利用统计学习理论的工具,我们为通过分散/分布式方案获得的多代理问题(可能是次优的)解决方案提供基于数据的概率可行性保证,该方案保留了本地信息的隐私性。基于数据的解决方案的泛化特性取决于每个局部数据集的大小和不确定单个约束集的复杂性。在线性个体约束的情况下,导出了显式边界。对一个公共数据集和独立的局部不确定性进行了比较分析。
{"title":"Probabilistic feasibility in data-driven multi-agent non-convex optimization","authors":"Lucrezia Manieri,&nbsp;Alessandro Falsone,&nbsp;Maria Prandini","doi":"10.1016/j.arcontrol.2023.100925","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.100925","url":null,"abstract":"<div><p>In this paper, we focus on the optimal operation of a multi-agent system affected by uncertainty. In particular, we consider a cooperative setting where agents jointly optimize a performance index compatibly with individual constraints on their discrete and continuous decision variables and with coupling global constraints. We assume that individual constraints are affected by uncertainty, which is known to each agent via a private set of data that cannot be shared with others. Exploiting tools from statistical learning theory, we provide data-based probabilistic feasibility guarantees for a (possibly sub-optimal) solution of the multi-agent problem that is obtained via a decentralized/distributed scheme that preserves the privacy of the local information. The generalization properties of the data-based solution are shown to depend on the size of each local dataset and on the complexity of the uncertain individual constraint sets. Explicit bounds are derived in the case of linear individual constraints. A comparative analysis with the cases of a common dataset and of local uncertainties that are independent is performed.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"56 ","pages":"Article 100925"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1367578823000895/pdfft?md5=400ea0f3b2244fc3aeddd15afd97f00f&pid=1-s2.0-S1367578823000895-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136697008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Addendum: Predictive form of the FPM model 附录:FPM模型的预测形式
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.02.001
Alain Oustaloup , François Levron , Stéphane Victor , Luc Dugard
<div><p>The article Oustaloup et al. (2021) has shown that the Fractional Power Model (FPM), <span><math><mrow><mi>A</mi><mo>+</mo><mi>B</mi><msup><mrow><mi>t</mi></mrow><mrow><mi>m</mi></mrow></msup></mrow></math></span>, enables well representing the cumulated data of COVID infections, thanks to a nonlinear identification technique. Beyond this identification interval, the article has also shown that the model enables predicting the future values on an unusual prediction horizon as for its range. The objective of this addendum is to explain, via an autoregressive form, why this model intrinsically benefits from such a predictivity property, the idea being to show the interest of the FPM model by highlighting its <em>predictive specificity</em>, inherent to non-integer integration that conditions the model. More precisely, this addendum establishes a <em>predictive form with long memory</em><span> of the FPM model. This form corresponds to an autoregressive (AR) filter of infinite order. Taking into account the whole past through an indefinite linear combination of past values, a first predictive form, said to be with </span><em>long memory</em>, results from an approach using one of the formulations of non-integer differentiation. Actually, as this first predictive form is the one of the power-law, <span><math><msup><mrow><mi>t</mi></mrow><mrow><mi>m</mi></mrow></msup></math></span>, its adaptation to the FPM model, <span><math><mrow><mi>A</mi><mo>+</mo><mi>B</mi><msup><mrow><mi>t</mi></mrow><mrow><mi>m</mi></mrow></msup></mrow></math></span>, which generalizes the linear regression, <span><math><mrow><mi>A</mi><mo>+</mo><mi>B</mi><mi>t</mi></mrow></math></span>, is then straightforward: it leads to the <em>predictive form of the FPM model</em> that specifies the model in prediction. This predictive form with long memory shows that the <em>predictivity</em> of the FPM model is such that <em>any predicted value takes into account the whole past</em>, according to a weighted sum of all the past values. These values are taken into account through weighting coefficients, that, for <span><math><mrow><mi>m</mi><mo>></mo><mo>−</mo><mn>1</mn></mrow></math></span> and <em>a fortiori</em> for <span><math><mrow><mi>m</mi><mo>></mo><mn>0</mn></mrow></math></span>, correspond to an <em>attenuation of the past</em>, that the non-integer power, <span><math><mi>m</mi></math></span><span>, determines by itself. To confirm the specificity of the FPM model in considering the past, this model is compared with a model of another nature, also having three parameters, namely an exponential model (Liu et al. (2020); Sallahi et al. (2021)): whereas, for the FPM model, the past is taken into account </span><em>globally</em> through <em>all past instants</em>, for the exponential model, the past is taken into account only <em>locally</em> through <em>one single past instant</em>, the predictive form of the model having a <em>short memory</em> and corresponding to a
Oustaloup等人。(2021)已经表明,由于非线性识别技术,分数幂模型(FPM)A+Btm能够很好地表示新冠病毒感染的累积数据。除了这个识别区间,文章还表明,该模型能够在不寻常的预测范围内预测未来的值。本附录的目的是通过自回归形式解释为什么该模型本质上受益于这种预测性,其思想是通过强调其预测特异性来显示FPM模型的兴趣,这是制约模型的非整数积分所固有的。更准确地说,本附录建立了FPM模型的长记忆预测形式。这种形式对应于无限阶的自回归(AR)滤波器。通过对过去值的不确定线性组合来考虑整个过去,据说具有长记忆的第一种预测形式来自于使用非整数微分公式之一的方法。事实上,由于第一种预测形式是幂律的一种,tm,它对FPM模型A+Btm的适应,推广了线性回归A+Bt,因此是直接的:它导致了FPM模型的预测形式,在预测中指定了模型。这种具有长记忆的预测形式表明,FPM模型的预测性使得根据所有过去值的加权和,任何预测值都考虑了整个过去。通过加权系数来考虑这些值,即对于m>;−1,更进一步的是m>;0,对应于过去的衰减,该衰减由非整数幂m自己确定。为了证实FPM模型在考虑过去时的特异性,将该模型与另一种性质的模型进行了比较,该模型也有三个参数,即指数模型(Liu et al.(2020);Sallahi等人。(2021)):而对于FPM模型,过去通过所有过去瞬间被全局考虑,对于指数模型,过去仅通过一个过去瞬间被局部考虑,该模型的预测形式具有短记忆,对应于1阶AR滤波器。在这两个模型的预测中获得的比较结果显示了FPM模型的预测兴趣。
{"title":"Addendum: Predictive form of the FPM model","authors":"Alain Oustaloup ,&nbsp;François Levron ,&nbsp;Stéphane Victor ,&nbsp;Luc Dugard","doi":"10.1016/j.arcontrol.2023.02.001","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.02.001","url":null,"abstract":"&lt;div&gt;&lt;p&gt;The article Oustaloup et al. (2021) has shown that the Fractional Power Model (FPM), &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mi&gt;A&lt;/mi&gt;&lt;mo&gt;+&lt;/mo&gt;&lt;mi&gt;B&lt;/mi&gt;&lt;msup&gt;&lt;mrow&gt;&lt;mi&gt;t&lt;/mi&gt;&lt;/mrow&gt;&lt;mrow&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;/mrow&gt;&lt;/msup&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt;, enables well representing the cumulated data of COVID infections, thanks to a nonlinear identification technique. Beyond this identification interval, the article has also shown that the model enables predicting the future values on an unusual prediction horizon as for its range. The objective of this addendum is to explain, via an autoregressive form, why this model intrinsically benefits from such a predictivity property, the idea being to show the interest of the FPM model by highlighting its &lt;em&gt;predictive specificity&lt;/em&gt;, inherent to non-integer integration that conditions the model. More precisely, this addendum establishes a &lt;em&gt;predictive form with long memory&lt;/em&gt;&lt;span&gt; of the FPM model. This form corresponds to an autoregressive (AR) filter of infinite order. Taking into account the whole past through an indefinite linear combination of past values, a first predictive form, said to be with &lt;/span&gt;&lt;em&gt;long memory&lt;/em&gt;, results from an approach using one of the formulations of non-integer differentiation. Actually, as this first predictive form is the one of the power-law, &lt;span&gt;&lt;math&gt;&lt;msup&gt;&lt;mrow&gt;&lt;mi&gt;t&lt;/mi&gt;&lt;/mrow&gt;&lt;mrow&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;/mrow&gt;&lt;/msup&gt;&lt;/math&gt;&lt;/span&gt;, its adaptation to the FPM model, &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mi&gt;A&lt;/mi&gt;&lt;mo&gt;+&lt;/mo&gt;&lt;mi&gt;B&lt;/mi&gt;&lt;msup&gt;&lt;mrow&gt;&lt;mi&gt;t&lt;/mi&gt;&lt;/mrow&gt;&lt;mrow&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;/mrow&gt;&lt;/msup&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt;, which generalizes the linear regression, &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mi&gt;A&lt;/mi&gt;&lt;mo&gt;+&lt;/mo&gt;&lt;mi&gt;B&lt;/mi&gt;&lt;mi&gt;t&lt;/mi&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt;, is then straightforward: it leads to the &lt;em&gt;predictive form of the FPM model&lt;/em&gt; that specifies the model in prediction. This predictive form with long memory shows that the &lt;em&gt;predictivity&lt;/em&gt; of the FPM model is such that &lt;em&gt;any predicted value takes into account the whole past&lt;/em&gt;, according to a weighted sum of all the past values. These values are taken into account through weighting coefficients, that, for &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;mo&gt;&gt;&lt;/mo&gt;&lt;mo&gt;−&lt;/mo&gt;&lt;mn&gt;1&lt;/mn&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt; and &lt;em&gt;a fortiori&lt;/em&gt; for &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;mo&gt;&gt;&lt;/mo&gt;&lt;mn&gt;0&lt;/mn&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt;, correspond to an &lt;em&gt;attenuation of the past&lt;/em&gt;, that the non-integer power, &lt;span&gt;&lt;math&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;/math&gt;&lt;/span&gt;&lt;span&gt;, determines by itself. To confirm the specificity of the FPM model in considering the past, this model is compared with a model of another nature, also having three parameters, namely an exponential model (Liu et al. (2020); Sallahi et al. (2021)): whereas, for the FPM model, the past is taken into account &lt;/span&gt;&lt;em&gt;globally&lt;/em&gt; through &lt;em&gt;all past instants&lt;/em&gt;, for the exponential model, the past is taken into account only &lt;em&gt;locally&lt;/em&gt; through &lt;em&gt;one single past instant&lt;/em&gt;, the predictive form of the model having a &lt;em&gt;short memory&lt;/em&gt; and corresponding to a","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 291-296"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49762799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A survey of feedback particle filter and related controlled interacting particle systems (CIPS) 反馈粒子滤波器及相关受控相互作用粒子系统(CIPS)综述
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.03.006
Amirhossein Taghvaei , Prashant G. Mehta

In this survey, we describe controlled interacting particle systems (CIPS) to approximate the solution of the optimal filtering and the optimal control problems. Part I of the survey is focussed on the feedback particle filter (FPF) algorithm, its derivation based on optimal transportation theory, and its relationship to the ensemble Kalman filter (EnKF) and the conventional sequential importance sampling–resampling (SIR) particle filters. The central numerical problem of FPF—to approximate the solution of the Poisson equation—is described together with the main solution approaches. An analytical and numerical comparison with the SIR particle filter is given to illustrate the advantages of the CIPS approach. Part II of the survey is focussed on adapting these algorithms for the problem of reinforcement learning. The survey includes several remarks that describe extensions as well as open problems in this subject.

在这篇综述中,我们描述了受控相互作用粒子系统(CIPS)来近似最优滤波和最优控制问题的解。调查的第一部分集中于反馈粒子滤波器(FPF)算法,它基于最优传输理论的推导,以及它与集成卡尔曼滤波器(EnKF)和传统的顺序重要性采样-重采样(SIR)粒子滤波器的关系。FPF的中心数值问题——近似泊松方程的解——与主要的求解方法一起描述。通过与SIR粒子滤波器的分析和数值比较,说明了CIPS方法的优点。调查的第二部分集中于将这些算法用于强化学习问题。该调查包括几条评论,描述了该主题中的扩展以及悬而未决的问题。
{"title":"A survey of feedback particle filter and related controlled interacting particle systems (CIPS)","authors":"Amirhossein Taghvaei ,&nbsp;Prashant G. Mehta","doi":"10.1016/j.arcontrol.2023.03.006","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.03.006","url":null,"abstract":"<div><p><span>In this survey, we describe controlled interacting particle systems (CIPS) to approximate the solution of the optimal filtering and the optimal control problems. Part I of the survey is focussed on the feedback particle filter (FPF) algorithm, its derivation based on optimal transportation theory, and its relationship to the ensemble </span>Kalman filter (EnKF) and the conventional sequential importance sampling–resampling (SIR) particle filters. The central numerical problem of FPF—to approximate the solution of the Poisson equation—is described together with the main solution approaches. An analytical and numerical comparison with the SIR particle filter is given to illustrate the advantages of the CIPS approach. Part II of the survey is focussed on adapting these algorithms for the problem of reinforcement learning. The survey includes several remarks that describe extensions as well as open problems in this subject.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 356-378"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49762819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Control education for societal-scale challenges: A community roadmap 社会规模挑战的控制教育:社区路线图
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.03.007
John Anthony Rossiter , Christos G. Cassandras , João Hespanha , Sebastian Dormido , Luis de la Torre , Gireeja Ranade , Antonio Visioli , John Hedengren , Richard M. Murray , Panos Antsaklis , Francoise Lamnabhi-Lagarrigue , Thomas Parisini

This article focuses on extending, disseminating and interpreting the findings of an IEEE Control Systems Society working group looking at the role of control theory and engineering in solving some of the many current and future societal challenges. The findings are interpreted in a manner designed to give focus and direction to both future education and research work in the general control theory and engineering arena, interpreted in the broadest sense. The paper is intended to promote discussion in the community and also provide a useful starting point for colleagues wishing to re-imagine the design and delivery of control-related topics in our education systems, especially at the tertiary level and beyond.

本文的重点是扩展、传播和解释IEEE控制系统协会工作组的研究结果,该工作组着眼于控制理论和工程在解决当前和未来许多社会挑战中的作用。这些发现的解释方式旨在为通用控制理论和工程领域的未来教育和研究工作提供重点和方向,并从最广泛的意义上进行解释。该文件旨在促进社区的讨论,并为希望重新设想我们教育系统中控制相关主题的设计和实施的同事们提供一个有用的起点,尤其是在高等教育及以后。
{"title":"Control education for societal-scale challenges: A community roadmap","authors":"John Anthony Rossiter ,&nbsp;Christos G. Cassandras ,&nbsp;João Hespanha ,&nbsp;Sebastian Dormido ,&nbsp;Luis de la Torre ,&nbsp;Gireeja Ranade ,&nbsp;Antonio Visioli ,&nbsp;John Hedengren ,&nbsp;Richard M. Murray ,&nbsp;Panos Antsaklis ,&nbsp;Francoise Lamnabhi-Lagarrigue ,&nbsp;Thomas Parisini","doi":"10.1016/j.arcontrol.2023.03.007","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.03.007","url":null,"abstract":"<div><p>This article focuses on extending, disseminating and interpreting the findings of an IEEE Control Systems Society working group looking at the role of control theory and engineering in solving some of the many current and future societal challenges. The findings are interpreted in a manner designed to give focus and direction to both future education and research work in the general control theory and engineering arena, interpreted in the broadest sense. The paper is intended to promote discussion in the community and also provide a useful starting point for colleagues wishing to re-imagine the design and delivery of control-related topics in our education systems, especially at the tertiary level and beyond.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 1-17"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49739179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Probabilistic prediction methods for nonlinear systems with application to stochastic model predictive control 非线性系统的概率预测方法及其在随机模型预测控制中的应用
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.100905
Daniel Landgraf , Andreas Völz , Felix Berkel , Kevin Schmidt , Thomas Specker , Knut Graichen
{"title":"Probabilistic prediction methods for nonlinear systems with application to stochastic model predictive control","authors":"Daniel Landgraf ,&nbsp;Andreas Völz ,&nbsp;Felix Berkel ,&nbsp;Kevin Schmidt ,&nbsp;Thomas Specker ,&nbsp;Knut Graichen","doi":"10.1016/j.arcontrol.2023.100905","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.100905","url":null,"abstract":"","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"56 ","pages":"100905"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49740292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A brief survey on encrypted control: From the first to the second generation and beyond 加密控制概述:从第一代到第二代及以后
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.100913
Nils Schlüter, Philipp Binfet, Moritz Schulze Darup

This article provides a comprehensive and illustrative presentation of the young field of encrypted control. In particular, we survey the evolution of encrypted controllers from their first appearance in 2015 until 2023 and derive a categorization into two generations mainly characterized by the utilized cryptographic methods. We further envision future developments and challenges of encrypted control. Throughout our presentation, we build less on technicalities but rather on intuitive tutorial-style explanations. This way, we intend to build a bridge from control engineering to cryptography and to make the interdisciplinary field of encrypted control more accessible.

本文对加密控制这一新兴领域进行了全面和说明性的介绍。特别是,我们调查了加密控制器从2015年首次出现到2023年的演变,并将其分为两代,主要以所使用的加密方法为特征。我们进一步展望了加密控制的未来发展和挑战。在整个演示过程中,我们较少使用技术细节,而是使用直观的教程式解释。通过这种方式,我们打算建立一个从控制工程到密码学的桥梁,并使加密控制的跨学科领域更容易访问。
{"title":"A brief survey on encrypted control: From the first to the second generation and beyond","authors":"Nils Schlüter,&nbsp;Philipp Binfet,&nbsp;Moritz Schulze Darup","doi":"10.1016/j.arcontrol.2023.100913","DOIUrl":"10.1016/j.arcontrol.2023.100913","url":null,"abstract":"<div><p>This article provides a comprehensive and illustrative presentation of the young field of encrypted control. In particular, we survey the evolution of encrypted controllers from their first appearance in 2015 until 2023 and derive a categorization into two generations mainly characterized by the utilized cryptographic methods. We further envision future developments and challenges of encrypted control. Throughout our presentation, we build less on technicalities but rather on intuitive tutorial-style explanations. This way, we intend to build a bridge from control engineering to cryptography and to make the interdisciplinary field of encrypted control more accessible.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"56 ","pages":"Article 100913"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138536647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annual Reviews in Control
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1