首页 > 最新文献

Annual Reviews in Control最新文献

英文 中文
A unified concurrent-composition method to state/event inference and concealment in labeled finite-state automata as discrete-event systems 标记有限状态自动机离散事件系统状态/事件推理与隐藏的统一并发组合方法
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.100902
Kuize Zhang
{"title":"A unified concurrent-composition method to state/event inference and concealment in labeled finite-state automata as discrete-event systems","authors":"Kuize Zhang","doi":"10.1016/j.arcontrol.2023.100902","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.100902","url":null,"abstract":"","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"56 ","pages":"100902"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49763042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multi-time scale control and optimization via averaging and singular perturbation theory: From ODEs to hybrid dynamical systems 基于平均和奇异摄动理论的多时间尺度控制和优化:从ode到混合动力系统
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.100926
Mahmoud Abdelgalil, Daniel E. Ochoa, Jorge I. Poveda

Multi-time scale techniques based on singular perturbations and averaging theory are among the most powerful tools developed for the synthesis and analysis of feedback control algorithms. This paper introduces some of the recent advances in singular perturbation theory and averaging theory for continuous-time dynamical systems modeled as ordinary differential equations (ODEs), as well as for hybrid dynamical systems that combine continuous-time dynamics and discrete-time dynamics. Novel multi-time scale analytical tools based on higher-order averaging and singular perturbation theory are also discussed and illustrated via different examples. In the context of hybrid dynamical systems, a class of sufficient Lyapunov-based conditions for global stability results are also presented. The analytical tools are illustrated through various new architectures and algorithms within the context of adaptive and extremum-seeking systems. These tools are suitable for the study of model-free optimization and stabilization problems that require the synergistic use of continuous-time and discrete-time feedback. The paper aims to acquaint the reader with a range of modern tools for studying multi-time scale phenomena in optimization and control systems, providing some guidelines for future research in this field.

基于奇异摄动和平均理论的多时间尺度技术是为综合和分析反馈控制算法而开发的最强大的工具之一。本文介绍了用常微分方程(ODEs)建模的连续动力系统的奇异摄动理论和平均理论,以及连续动力和离散动力相结合的混合动力系统的一些最新进展。本文还讨论了基于高阶平均和奇异摄动理论的新型多时间尺度分析工具,并通过不同的实例进行了说明。在混合动力系统中,给出了全局稳定性结果的一类充分lyapunov条件。分析工具通过各种新的架构和算法在自适应和极值搜索系统的背景下进行说明。这些工具适用于需要协同使用连续时间和离散时间反馈的无模型优化和镇定问题的研究。本文旨在向读者介绍一系列用于研究优化和控制系统中多时间尺度现象的现代工具,为该领域的未来研究提供一些指导。
{"title":"Multi-time scale control and optimization via averaging and singular perturbation theory: From ODEs to hybrid dynamical systems","authors":"Mahmoud Abdelgalil,&nbsp;Daniel E. Ochoa,&nbsp;Jorge I. Poveda","doi":"10.1016/j.arcontrol.2023.100926","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.100926","url":null,"abstract":"<div><p>Multi-time scale techniques based on singular perturbations and averaging theory are among the most powerful tools developed for the synthesis and analysis of feedback control algorithms. This paper introduces some of the recent advances in singular perturbation theory and averaging theory for continuous-time dynamical systems modeled as ordinary differential equations (ODEs), as well as for hybrid dynamical systems that combine continuous-time dynamics and discrete-time dynamics. Novel multi-time scale analytical tools based on higher-order averaging and singular perturbation theory are also discussed and illustrated via different examples. In the context of hybrid dynamical systems, a class of sufficient Lyapunov-based conditions for global stability results are also presented. The analytical tools are illustrated through various new architectures and algorithms within the context of adaptive and extremum-seeking systems. These tools are suitable for the study of model-free optimization and stabilization problems that require the synergistic use of continuous-time and discrete-time feedback. The paper aims to acquaint the reader with a range of modern tools for studying multi-time scale phenomena in optimization and control systems, providing some guidelines for future research in this field.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"56 ","pages":"Article 100926"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1367578823000901/pdfft?md5=bf2297a434b63cf7c9074cea71bd8782&pid=1-s2.0-S1367578823000901-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138436737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PID control of quadrotor UAVs: A survey 四旋翼无人机的PID控制综述
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.100900
Ivan Lopez-Sanchez, Javier Moreno-Valenzuela
{"title":"PID control of quadrotor UAVs: A survey","authors":"Ivan Lopez-Sanchez,&nbsp;Javier Moreno-Valenzuela","doi":"10.1016/j.arcontrol.2023.100900","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.100900","url":null,"abstract":"","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"56 ","pages":"100900"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49739949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Learning quadrotor dynamics for precise, safe, and agile flight control 学习四旋翼动力学精确,安全和灵活的飞行控制
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.03.009
Alessandro Saviolo, Giuseppe Loianno

This article reviews the state-of-the-art modeling and control techniques for aerial robots such as quadrotor systems and presents several future research directions in this area. The review starts by introducing the benefits and drawbacks of classic physic-based dynamic modeling and control techniques. Subsequently, the manuscript presents the key challenges to augment or replace classic techniques with data-driven approaches that can offer several key benefits in terms of flight precision, safety, adaptation, and agility.

本文综述了最先进的航空机器人建模和控制技术,如四旋翼系统,并提出了该领域未来的几个研究方向。综述首先介绍了经典的基于物理的动态建模和控制技术的优点和缺点。随后,该手稿提出了用数据驱动的方法来增强或取代经典技术的关键挑战,这些方法可以在飞行精度、安全性、适应性和灵活性方面提供几个关键优势。
{"title":"Learning quadrotor dynamics for precise, safe, and agile flight control","authors":"Alessandro Saviolo,&nbsp;Giuseppe Loianno","doi":"10.1016/j.arcontrol.2023.03.009","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.03.009","url":null,"abstract":"<div><p>This article reviews the state-of-the-art modeling and control techniques for aerial robots such as quadrotor systems and presents several future research directions in this area. The review starts by introducing the benefits and drawbacks of classic physic-based dynamic modeling and control techniques. Subsequently, the manuscript presents the key challenges to augment or replace classic techniques with data-driven approaches that can offer several key benefits in terms of flight precision, safety, adaptation, and agility.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 45-60"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49738908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Active queue management for alleviating Internet congestion via a nonlinear differential equation with a variable delay 基于可变延迟非线性微分方程的主动队列管理缓解网络拥塞
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.02.002
Hugues Mounier , Cédric Join , Emmanuel Delaleau , Michel Fliess

Active Queue Management (AQM) for mitigating Internet congestion has been addressed via various feedback control syntheses, especially P, PI, and PID regulators, by using a linear approximation where the “round trip time,” i.e., the delay, is assumed to be constant. This constraint is lifted here by using a nonlinear modeling with a variable delay, introduced more than 20 years ago. This delay, intimately linked to the congestion phenomenon, may be viewed as “ a flat output.” All other system variables, especially the control variable, i.e., the packet loss ratio, are expressed as a function of the delay and its derivatives: they are frozen if the delay is kept constant. This flatness-like property, which demonstrates the mathematical discrepancy of the linear approximation adopted until today, yields also our control strategy in two steps: Firstly, designing an open-loop control, thanks to straightforward Flatness-Based Control (FBC) techniques, and secondly, closing the loop via Model-Free Control (MFC) in order to take into account severe model mismatches, like, here, the number of TCP sessions. Several convincing computer simulations, which are easily implementable, are presented and discussed.

用于缓解互联网拥塞的主动队列管理(AQM)已通过各种反馈控制综合,特别是P、PI和PID调节器,通过使用线性近似来解决,其中“往返时间”(即延迟)假定为常数。通过使用20多年前引入的具有可变延迟的非线性建模,消除了这一限制。这种与拥塞现象密切相关的延迟可以被视为“平坦输出”。所有其他系统变量,尤其是控制变量,即丢包率,都被表示为延迟及其导数的函数:如果延迟保持不变,它们就会被冻结。这种类似平面度的特性证明了迄今为止所采用的线性近似的数学差异,也产生了我们分两步的控制策略:首先,由于直接的基于平面度的控制(FBC)技术,设计开环控制,其次,通过无模型控制(MFC)闭合回路,以考虑严重的模型失配,比如,这里,TCP会话的数量。介绍并讨论了几种易于实现的令人信服的计算机模拟。
{"title":"Active queue management for alleviating Internet congestion via a nonlinear differential equation with a variable delay","authors":"Hugues Mounier ,&nbsp;Cédric Join ,&nbsp;Emmanuel Delaleau ,&nbsp;Michel Fliess","doi":"10.1016/j.arcontrol.2023.02.002","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.02.002","url":null,"abstract":"<div><p><span>Active Queue Management (AQM) for mitigating Internet congestion has been addressed via various feedback control syntheses, especially P, PI, and PID regulators, by using a linear approximation where the “round trip time,” i.e., the delay, is assumed to be constant. This constraint is lifted here by using a nonlinear modeling with a variable delay, introduced more than 20 years ago. This delay, intimately linked to the congestion phenomenon, may be viewed as “ a flat output.” All other system variables, especially the control variable, i.e., the </span>packet loss ratio, are expressed as a function of the delay and its derivatives: they are frozen if the delay is kept constant. This flatness-like property, which demonstrates the mathematical discrepancy of the linear approximation adopted until today, yields also our control strategy in two steps: Firstly, designing an open-loop control, thanks to straightforward Flatness-Based Control (FBC) techniques, and secondly, closing the loop via Model-Free Control (MFC) in order to take into account severe model mismatches, like, here, the number of TCP sessions. Several convincing computer simulations, which are easily implementable, are presented and discussed.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 61-69"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49738910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Review of floating object manipulation by autonomous multi-vessel systems 自主多船系统的漂浮物操纵研究进展
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2022.10.003
Zhe Du, Rudy R. Negenborn, Vasso Reppa

The regulatory endorsement of the International Maritime Organization (IMO) and the support of pivotal shipping market players in recent years motivate the investigation of the potential role that autonomous vessels play in the shipping industry. As the complexity and scale of the envisioned applications increase, research works gradually transform the focus from single-vessel systems to multi-vessel systems. Thus, autonomous multi-vessel systems applied in the shipping industry are becoming a promising research direction. One of the typical research directions is floating object manipulation by multiple tugboats.

This paper offers a comprehensive literature review of the existing research on floating object manipulation by autonomous multi-vessel systems. Based on the prior knowledge of object manipulation problems in multi-robot systems, four typical ways of maritime object manipulation are summarized: attaching, caging, pushing, and towing. The advantages and disadvantages of each manipulation way are discussed, including its typical floating object and application scenarios. Moreover, the aspects of control objective, control architecture, collision avoidance operation, disturbances consideration, and role of each involved vessel are analyzed for gaining insight into the approaches for solving these problems. Finally, challenges and future directions are highlighted to give possible inspiration.

近年来,国际海事组织(海事组织)的监管认可和关键航运市场参与者的支持促使人们对自主船舶在航运业中发挥的潜在作用进行调查。随着所设想应用的复杂性和规模的增加,研究工作逐渐将重点从单船系统转移到多船系统。因此,自主多船系统在航运业中的应用正成为一个很有前途的研究方向。多艘拖船操纵漂浮物是一个典型的研究方向。本文对自主多船系统操纵漂浮物的现有研究进行了全面的文献综述。基于多机器人系统中物体操纵问题的先验知识,总结了四种典型的海上物体操纵方法:附着、锁定、推动和拖曳。讨论了每种操作方式的优缺点,包括其典型的浮动对象和应用场景。此外,还分析了控制目标、控制架构、防撞操作、干扰考虑和每艘相关船只的作用等方面,以深入了解解决这些问题的方法。最后,强调了挑战和未来的方向,以提供可能的灵感。
{"title":"Review of floating object manipulation by autonomous multi-vessel systems","authors":"Zhe Du,&nbsp;Rudy R. Negenborn,&nbsp;Vasso Reppa","doi":"10.1016/j.arcontrol.2022.10.003","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2022.10.003","url":null,"abstract":"<div><p>The regulatory endorsement of the International Maritime Organization (IMO) and the support of pivotal shipping market players in recent years motivate the investigation of the potential role that autonomous vessels play in the shipping industry. As the complexity and scale of the envisioned applications increase, research works gradually transform the focus from single-vessel systems to multi-vessel systems. Thus, autonomous multi-vessel systems applied in the shipping industry are becoming a promising research direction. One of the typical research directions is floating object manipulation by multiple tugboats.</p><p>This paper offers a comprehensive literature review of the existing research on floating object manipulation by autonomous multi-vessel systems. Based on the prior knowledge of object manipulation problems in multi-robot systems, four typical ways of maritime object manipulation are summarized: attaching, caging, pushing, and towing. The advantages and disadvantages of each manipulation way are discussed, including its typical floating object and application scenarios. Moreover, the aspects of control objective, control architecture, collision avoidance operation, disturbances consideration, and role of each involved vessel are analyzed for gaining insight into the approaches for solving these problems. Finally, challenges and future directions are highlighted to give possible inspiration.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 255-278"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49739029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Ergodic risk-sensitive control—A survey 遍历风险敏感控制——一项调查
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.03.001
Anup Biswas , Vivek S. Borkar

Risk-sensitive control has received considerable interest since the seminal work of Howard and Matheson (Howard and Matheson, 1971/72) because of its ability to account for fluctuations about the mean, its connection with H control, and its application to financial mathematics. In this article we attempt to put together a comprehensive survey on the research done on ergodic risk-sensitive control over the last four decades.

自Howard和Matheson(Howard and Matheson,1971/72)的开创性工作以来,风险敏感控制因其能够解释平均值的波动、与H∞控制的联系以及在金融数学中的应用而引起了人们的极大兴趣。在这篇文章中,我们试图对过去四十年来对遍历风险敏感控制的研究进行全面的调查。
{"title":"Ergodic risk-sensitive control—A survey","authors":"Anup Biswas ,&nbsp;Vivek S. Borkar","doi":"10.1016/j.arcontrol.2023.03.001","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2023.03.001","url":null,"abstract":"<div><p>Risk-sensitive control has received considerable interest since the seminal work of Howard and Matheson (Howard and Matheson, 1971/72) because of its ability to account for fluctuations about the mean, its connection with <span><math><msub><mrow><mi>H</mi></mrow><mrow><mi>∞</mi></mrow></msub></math></span> control, and its application to financial mathematics. In this article we attempt to put together a comprehensive survey on the research done on ergodic risk-sensitive control over the last four decades.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 118-141"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49739390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Modeling, analysis and control of robot–object nonsmooth underactuated Lagrangian systems: A tutorial overview and perspectives 机器人-物体非光滑欠驱动拉格朗日系统的建模、分析和控制:教程概述和展望
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2022.12.002
Bernard Brogliato

So-called robot–object Lagrangian systems consist of a class of nonsmooth underactuated complementarity Lagrangian systems, with a specific structure: an “object” and a “robot”. Only the robot is actuated. The object dynamics can thus be controlled only through the action of the contact Lagrange multipliers, which represent the interaction forces between the robot and the object. Juggling, walking, running, hopping machines, robotic systems that manipulate objects, tapping, pushing systems, kinematic chains with joint clearance, crawling, climbing robots, some cable-driven manipulators, and some circuits with set-valued nonsmooth components, belong this class. This article aims at presenting their main features, then many application examples which belong to the robot–object class, then reviewing the main tools and control strategies which have been proposed in the Automatic Control and in the Robotics literature. Some comments and open issues conclude the article.

所谓的机器人-物体拉格朗日系统由一类非光滑欠驱动互补拉格朗日系统组成,具有特定的结构:“物体”和“机器人”。只有机器人被启动。因此,物体动力学只能通过接触拉格朗日乘子的作用来控制,拉格朗日乘子表示机器人和物体之间的相互作用力。杂耍、行走、跑步、跳跃机、操纵物体的机器人系统、敲击、推动系统、具有关节间隙的运动链、爬行、攀爬机器人、一些电缆驱动的机械手以及一些具有集值非光滑组件的电路都属于这一类。本文旨在介绍它们的主要特征,然后介绍许多属于机器人-对象类的应用实例,然后回顾自动控制和机器人学文献中提出的主要工具和控制策略。一些评论和悬而未决的问题总结了这篇文章。
{"title":"Modeling, analysis and control of robot–object nonsmooth underactuated Lagrangian systems: A tutorial overview and perspectives","authors":"Bernard Brogliato","doi":"10.1016/j.arcontrol.2022.12.002","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2022.12.002","url":null,"abstract":"<div><p><span>So-called robot–object Lagrangian systems consist of a class of nonsmooth underactuated complementarity Lagrangian systems, with a specific structure: an “object” and a “robot”. Only the robot is actuated. The object dynamics can thus be controlled only through the action of the contact Lagrange multipliers, which represent the interaction forces between the robot and the object. Juggling, walking, running, hopping machines, </span>robotic systems<span><span><span> that manipulate objects, tapping, pushing systems, </span>kinematic chains<span> with joint clearance, crawling, </span></span>climbing robots, some cable-driven manipulators, and some circuits with set-valued nonsmooth components, belong this class. This article aims at presenting their main features, then many application examples which belong to the robot–object class, then reviewing the main tools and control strategies which have been proposed in the Automatic Control and in the Robotics literature. Some comments and open issues conclude the article.</span></p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"55 ","pages":"Pages 297-337"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49762809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Learning and forgetting in systems neuroscience: A control perspective 系统神经科学中的学习与遗忘:控制视角
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.100912
Erick Mejia Uzeda, Mohamed A. Hafez, Mireille E. Broucke

A longstanding open problem of systems neuroscience is to understand how the brain calibrates thousands of reflexes to achieve near instantaneous disturbance rejection. While reflexes typically act locally at the site of sensory measurements, the adaptation of reflex gains is the result of an ingenious architecture in which knowledge of disturbances is transferred from the cerebellum to the deep cerebellar nuclei or the brainstem. This paper investigates the use of control theory as the mathematical foundation to explain the mechanisms by which such forms of learning, as well as forgetting, manifest themselves in systems neuroscience. Particularly, we use adaptive control and averaging theory to model the computations performed in learning appropriate reflex gains. While forgetting is perceived as counter-productive to learning, we show that if incorporated correctly, it can endow the much needed robustness to train thousands of reflexes without interfering with their adaptation. This is accomplished using the μ-modification which achieves robustness of adaptive schemes through the estimation of exciting subspaces. Our techniques are combined in a comprehensive model, with simulations illustrating their effectiveness.

系统神经科学的一个长期悬而未决的问题是了解大脑如何校准成千上万的反射,以实现近乎瞬时的干扰抑制。条件反射通常在感觉测量部位局部起作用,而条件反射增益的适应则是一种巧妙结构的结果,在这种结构中,有关干扰的知识从小脑转移到小脑深核或脑干。本文研究了以控制论为数学基础,解释系统神经科学中这种形式的学习和遗忘的表现机制。特别是,我们使用自适应控制和平均理论来模拟在学习适当的反射增益时所进行的计算。虽然遗忘被认为会对学习产生反作用,但我们的研究表明,如果能正确地将遗忘纳入其中,就能赋予训练成千上万个条件反射所急需的稳健性,而不会干扰它们的适应性。我们利用μ修正来实现这一点,它通过估计令人兴奋的子空间来实现自适应方案的稳健性。我们将这些技术结合到一个综合模型中,并通过模拟说明了它们的有效性。
{"title":"Learning and forgetting in systems neuroscience: A control perspective","authors":"Erick Mejia Uzeda,&nbsp;Mohamed A. Hafez,&nbsp;Mireille E. Broucke","doi":"10.1016/j.arcontrol.2023.100912","DOIUrl":"10.1016/j.arcontrol.2023.100912","url":null,"abstract":"<div><p>A longstanding open problem of systems neuroscience is to understand how the brain calibrates thousands of reflexes to achieve near instantaneous disturbance rejection. While reflexes typically act locally at the site of sensory measurements, the adaptation of reflex gains is the result of an ingenious architecture in which knowledge of disturbances is transferred from the cerebellum to the deep cerebellar nuclei or the brainstem. This paper investigates the use of control theory as the mathematical foundation to explain the mechanisms by which such forms of learning, as well as forgetting, manifest themselves in systems neuroscience. Particularly, we use adaptive control and averaging theory to model the computations performed in learning appropriate reflex gains. While forgetting is perceived as counter-productive to learning, we show that if incorporated correctly, it can endow the much needed robustness to train thousands of reflexes without interfering with their adaptation. This is accomplished using the <span><math><mi>μ</mi></math></span>-modification which achieves robustness of adaptive schemes through the estimation of exciting subspaces. Our techniques are combined in a comprehensive model, with simulations illustrating their effectiveness.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"56 ","pages":"Article 100912"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135515912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Value-gradient iteration with quadratic approximate value functions 二次逼近函数的值梯度迭代
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2023-01-01 DOI: 10.1016/j.arcontrol.2023.100917
Alan Yang, Stephen Boyd

We propose a method for designing policies for convex stochastic control problems characterized by random linear dynamics and convex stage cost. We consider policies that employ quadratic approximate value functions as a substitute for the true value function. Evaluating the associated control policy involves solving a convex problem, typically a quadratic program, which can be carried out reliably in real-time. Such policies often perform well even when the approximate value function is not a particularly good approximation of the true value function. We propose value-gradient iteration, which fits the gradient of value function, with regularization that can include constraints reflecting known bounds on the true value function. Our value-gradient iteration method can yield a good approximate value function with few samples, and little hyperparameter tuning. We find that the method can find a good policy with computational effort comparable to that required to just evaluate a control policy via simulation.

针对具有随机线性动力学和凸阶段代价的凸随机控制问题,提出了一种策略设计方法。我们考虑使用二次近似值函数代替真值函数的策略。评估相关的控制策略涉及求解一个凸问题,通常是一个二次规划,可以可靠地实时执行。即使近似值函数不是真实值函数的特别好的近似值,这种策略通常也会表现良好。我们提出了值梯度迭代,它适合值函数的梯度,正则化可以包括反映真值函数上已知边界的约束。我们的值梯度迭代方法可以在少量样本和少量超参数调优的情况下得到一个很好的近似值函数。我们发现,该方法可以找到一个好的策略,其计算量与仅通过仿真评估控制策略所需的计算量相当。
{"title":"Value-gradient iteration with quadratic approximate value functions","authors":"Alan Yang,&nbsp;Stephen Boyd","doi":"10.1016/j.arcontrol.2023.100917","DOIUrl":"10.1016/j.arcontrol.2023.100917","url":null,"abstract":"<div><p>We propose a method for designing policies for convex stochastic control problems characterized by random linear dynamics and convex stage cost. We consider policies that employ quadratic approximate value functions as a substitute for the true value function. Evaluating the associated control policy involves solving a convex problem<span>, typically a quadratic program, which can be carried out reliably in real-time. Such policies often perform well even when the approximate value function is not a particularly good approximation of the true value function. We propose value-gradient iteration, which fits the gradient of value function, with regularization that can include constraints reflecting known bounds on the true value function. Our value-gradient iteration method can yield a good approximate value function with few samples, and little hyperparameter tuning. We find that the method can find a good policy with computational effort comparable to that required to just evaluate a control policy via simulation.</span></p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"56 ","pages":"Article 100917"},"PeriodicalIF":9.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135609020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annual Reviews in Control
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1