首页 > 最新文献

Annual Reviews in Control最新文献

英文 中文
Parametrizations of all stable closed-loop responses: From theory to neural network control design 全稳定闭环响应的参数化:从理论到神经网络控制设计
IF 10.7 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-01-01 DOI: 10.1016/j.arcontrol.2025.101012
Clara Lucía Galimberti, Luca Furieri, Giancarlo Ferrari-Trecate
The complexity of modern control systems necessitates architectures that achieve high performance while ensuring robust stability, particularly for nonlinear systems. In this work, we tackle the challenge of designing output-feedback controllers to boost the performance of p-stable discrete-time nonlinear systems while preserving closed-loop stability from external disturbances to input and output channels. Leveraging operator theory and neural network representations, we parametrize the achievable closed-loop maps for a given system and propose novel parametrizations of all p-stabilizing controllers, unifying frameworks such as nonlinear Youla parametrization and internal model control. Contributing to a rapidly growing research line, our approach enables unconstrained optimization exclusively over stabilizing controllers and provides sufficient conditions to ensure robustness against model mismatch. Additionally, our methods reveal that stronger notions of stability can be imposed on the closed-loop maps if disturbance realizations are available after one time step. Last, our approaches are compatible with the design of nonlinear distributed controllers. Numerical experiments on cooperative robotics demonstrate the flexibility of the proposed framework, allowing cost functions to be freely designed for achieving complex behaviors while preserving stability.
现代控制系统的复杂性需要在确保鲁棒稳定性的同时实现高性能的体系结构,特别是对于非线性系统。在这项工作中,我们解决了设计输出反馈控制器的挑战,以提高p稳定离散时间非线性系统的性能,同时保持输入和输出通道免受外部干扰的闭环稳定性。利用算子理论和神经网络表示,我们对给定系统的可实现闭环映射进行了参数化,并提出了所有稳定控制器的新参数化,统一了非线性Youla参数化和内模控制等框架。我们的方法促进了快速发展的研究方向,使稳定控制器的无约束优化成为可能,并提供了足够的条件来确保对模型不匹配的鲁棒性。此外,我们的方法表明,如果扰动实现在一个时间步后可用,则可以对闭环映射施加更强的稳定性概念。最后,我们的方法适用于非线性分布式控制器的设计。协作机器人的数值实验证明了该框架的灵活性,允许在保持稳定性的同时自由设计成本函数以实现复杂的行为。
{"title":"Parametrizations of all stable closed-loop responses: From theory to neural network control design","authors":"Clara Lucía Galimberti,&nbsp;Luca Furieri,&nbsp;Giancarlo Ferrari-Trecate","doi":"10.1016/j.arcontrol.2025.101012","DOIUrl":"10.1016/j.arcontrol.2025.101012","url":null,"abstract":"<div><div>The complexity of modern control systems necessitates architectures that achieve high performance while ensuring robust stability, particularly for nonlinear systems. In this work, we tackle the challenge of designing output-feedback controllers to boost the performance of <span><math><msub><mrow><mi>ℓ</mi></mrow><mrow><mi>p</mi></mrow></msub></math></span>-stable discrete-time nonlinear systems while preserving closed-loop stability from external disturbances to input and output channels. Leveraging operator theory and neural network representations, we parametrize the achievable closed-loop maps for a given system and propose novel parametrizations of all <span><math><msub><mrow><mi>ℓ</mi></mrow><mrow><mi>p</mi></mrow></msub></math></span>-stabilizing controllers, unifying frameworks such as nonlinear Youla parametrization and internal model control. Contributing to a rapidly growing research line, our approach enables unconstrained optimization exclusively over stabilizing controllers and provides sufficient conditions to ensure robustness against model mismatch. Additionally, our methods reveal that stronger notions of stability can be imposed on the closed-loop maps if disturbance realizations are available after one time step. Last, our approaches are compatible with the design of nonlinear distributed controllers. Numerical experiments on cooperative robotics demonstrate the flexibility of the proposed framework, allowing cost functions to be freely designed for achieving complex behaviors while preserving stability.</div></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"60 ","pages":"Article 101012"},"PeriodicalIF":10.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144888884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bio-inspired control strategies in wearable robotics: A comprehensive review of CPGs and DMPs 可穿戴机器人中的仿生控制策略:cpg和dmp的综合综述
IF 7.3 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-01-01 DOI: 10.1016/j.arcontrol.2025.100991
Joana F. Almeida , Cristina P. Santos
Wearable robotic devices such as exoskeletons and orthoses have undergone significant advancements over the past two decades, aiming to support human mobility in rehabilitation, daily life, and industrial settings. Central to their effectiveness is the implementation of control strategies that generate smooth, adaptive, and user-synchronized movements. Among these, bio-inspired approaches that emulate neural and motor mechanisms of human locomotion have gained increasing attention.
This review presents a comprehensive analysis of two prominent bio-inspired control frameworks – Central Pattern Generators (CPGs) and Dynamic Movement Primitives (DMPs) – implemented in wearable lower-limb robotic systems. A total of 45 articles were systematically analysed to identify trends and challenges in their application.
The review examines the purposes of these controllers, the joints and degrees of freedom addressed, the sensors employed, the structural characteristics of each approach, the integration of sensory feedback and intention decoding, the tracking controllers used, and the validation methodologies adopted.
The findings reveal that CPGs and DMPs are primarily adopted for generating adaptive joint trajectories, enabling stable, rhythmic, and responsive locomotion. Their flexibility allows for encoding motion patterns that adapt to user-specific and task-specific requirements. However, challenges such as parameter tuning, integration of sensory feedback, real-time intention decoding, and validation robustness remain open issues.
This work highlights the potential of CPG- and DMP-based strategies to enhance the autonomy, safety, and personalization of wearable robots and provides future research directions to address their current limitations and improve their practical applicability.
在过去的二十年里,外骨骼和矫形器等可穿戴机器人设备取得了重大进展,旨在支持人类在康复、日常生活和工业环境中的移动能力。其有效性的核心是控制策略的实施,产生平滑,自适应和用户同步的运动。其中,模仿人类运动的神经和运动机制的生物启发方法已获得越来越多的关注。本文综述了在可穿戴下肢机器人系统中实现的两个突出的仿生控制框架-中央模式发生器(CPGs)和动态运动原语(dmp)的综合分析。共系统分析了45篇文章,以确定其应用中的趋势和挑战。本文考察了这些控制器的用途、关节和自由度、所采用的传感器、每种方法的结构特征、感官反馈和意图解码的集成、所使用的跟踪控制器以及所采用的验证方法。研究结果表明,cpg和dmp主要用于产生自适应关节轨迹,从而实现稳定、有节奏和响应性的运动。它们的灵活性允许对运动模式进行编码,以适应特定于用户和特定于任务的需求。然而,诸如参数调整、感官反馈集成、实时意图解码和验证鲁棒性等挑战仍然存在。这项工作强调了基于CPG和dmp的策略在增强可穿戴机器人的自主性、安全性和个性化方面的潜力,并提供了未来的研究方向,以解决它们目前的局限性并提高它们的实际适用性。
{"title":"Bio-inspired control strategies in wearable robotics: A comprehensive review of CPGs and DMPs","authors":"Joana F. Almeida ,&nbsp;Cristina P. Santos","doi":"10.1016/j.arcontrol.2025.100991","DOIUrl":"10.1016/j.arcontrol.2025.100991","url":null,"abstract":"<div><div>Wearable robotic devices such as exoskeletons and orthoses have undergone significant advancements over the past two decades, aiming to support human mobility in rehabilitation, daily life, and industrial settings. Central to their effectiveness is the implementation of control strategies that generate smooth, adaptive, and user-synchronized movements. Among these, bio-inspired approaches that emulate neural and motor mechanisms of human locomotion have gained increasing attention.</div><div>This review presents a comprehensive analysis of two prominent bio-inspired control frameworks – Central Pattern Generators (CPGs) and Dynamic Movement Primitives (DMPs) – implemented in wearable lower-limb robotic systems. A total of 45 articles were systematically analysed to identify trends and challenges in their application.</div><div>The review examines the purposes of these controllers, the joints and degrees of freedom addressed, the sensors employed, the structural characteristics of each approach, the integration of sensory feedback and intention decoding, the tracking controllers used, and the validation methodologies adopted.</div><div>The findings reveal that CPGs and DMPs are primarily adopted for generating adaptive joint trajectories, enabling stable, rhythmic, and responsive locomotion. Their flexibility allows for encoding motion patterns that adapt to user-specific and task-specific requirements. However, challenges such as parameter tuning, integration of sensory feedback, real-time intention decoding, and validation robustness remain open issues.</div><div>This work highlights the potential of CPG- and DMP-based strategies to enhance the autonomy, safety, and personalization of wearable robots and provides future research directions to address their current limitations and improve their practical applicability.</div></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"60 ","pages":"Article 100991"},"PeriodicalIF":7.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144106345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling, analysis and experimental observations of fat volume properties in compartmental models for drug pharmacokinetics 药物药代动力学室室模型中脂肪体积特性的建模、分析和实验观察
IF 7.3 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-01-01 DOI: 10.1016/j.arcontrol.2025.100996
Amani R. Ynineb , Bouchra Khoumeri , Erhan Yumuk , Bora Ayvaz , Ghada Ben Othman , Clara M. Ionescu , Dana Copot
Drug related dynamic uptake, clearance and anomalous diffusion patterns are dependent of biological tissue composition. Of particular importance is the fat volume, where drug trapping occurs in long term usage and related to lipophilic properties of drugs and non-homogeneous mixing. This paper introduces three probable drug pathways from fat volumes to augmented compartments, herein called trap volumes. In silico validation by using patient simulators with body mass index larger than 25. A non-homogeneous drug diffusion in biological tissue was considered using the partial-Caputo fractional-order compartmental model. To understand the effects in potential monitoring and sensing data information systems, an experiment has been proposed for in vitro impedance spectroscopy of fat samples. Variability with fat volume in observed data confirms the fractional-order characteristics that support theoretical model assumptions. A discussion section provides the implications for control of drug delivery systems.
药物相关的动态摄取、清除和异常扩散模式依赖于生物组织组成。特别重要的是脂肪体积,在长期使用中会发生药物捕获,并与药物的亲脂性和非均匀混合有关。本文介绍了三种可能的药物途径,从脂肪体积到增强室,这里称为陷阱体积。使用身体质量指数大于25的病人模拟器进行计算机验证。采用部分- caputo分数阶区室模型考虑药物在生物组织中的非均匀扩散。为了了解潜在的监测和传感数据信息系统的影响,提出了脂肪样品的体外阻抗谱实验。观测数据中脂肪体积的变化证实了支持理论模型假设的分数阶特征。讨论部分提供了对药物输送系统控制的影响。
{"title":"Modeling, analysis and experimental observations of fat volume properties in compartmental models for drug pharmacokinetics","authors":"Amani R. Ynineb ,&nbsp;Bouchra Khoumeri ,&nbsp;Erhan Yumuk ,&nbsp;Bora Ayvaz ,&nbsp;Ghada Ben Othman ,&nbsp;Clara M. Ionescu ,&nbsp;Dana Copot","doi":"10.1016/j.arcontrol.2025.100996","DOIUrl":"10.1016/j.arcontrol.2025.100996","url":null,"abstract":"<div><div>Drug related dynamic uptake, clearance and anomalous diffusion patterns are dependent of biological tissue composition. Of particular importance is the fat volume, where drug trapping occurs in long term usage and related to lipophilic properties of drugs and non-homogeneous mixing. This paper introduces three probable drug pathways from fat volumes to augmented compartments, herein called trap volumes. In silico validation by using patient simulators with body mass index larger than 25. A non-homogeneous drug diffusion in biological tissue was considered using the partial-Caputo fractional-order compartmental model. To understand the effects in potential monitoring and sensing data information systems, an experiment has been proposed for in vitro impedance spectroscopy of fat samples. Variability with fat volume in observed data confirms the fractional-order characteristics that support theoretical model assumptions. A discussion section provides the implications for control of drug delivery systems.</div></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"60 ","pages":"Article 100996"},"PeriodicalIF":7.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144239336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IF 10.7 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-01-01
{"title":"","authors":"","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"60 ","pages":"Article 101006"},"PeriodicalIF":10.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147162744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IF 10.7 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-01-01
{"title":"","authors":"","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"60 ","pages":"Article 101039"},"PeriodicalIF":10.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147162747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IF 10.7 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-01-01
{"title":"","authors":"","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"60 ","pages":"Article 101016"},"PeriodicalIF":10.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147162748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IF 10.7 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-01-01
{"title":"","authors":"","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"60 ","pages":"Article 101024"},"PeriodicalIF":10.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147162756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IF 10.7 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-01-01
{"title":"","authors":"","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"60 ","pages":"Article 101033"},"PeriodicalIF":10.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147162763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Survey of distributed algorithms for resource allocation over multi-agent systems 多智能体系统资源分配的分布式算法综述
IF 7.3 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-11-29 DOI: 10.1016/j.arcontrol.2024.100983
Mohammadreza Doostmohammadian , Alireza Aghasi , Mohammad Pirani , Ehsan Nekouei , Houman Zarrabi , Reza Keypour , Apostolos I. Rikos , Karl H. Johansson
Resource allocation and scheduling in multi-agent systems present challenges due to complex interactions and decentralization. This survey paper provides a comprehensive analysis of distributed algorithms for addressing the distributed resource allocation (DRA) problem over multi-agent systems. It covers a significant area of research at the intersection of optimization, multi-agent systems, and distributed consensus-based computing. The paper begins by presenting a mathematical formulation of the DRA problem, establishing a solid foundation for further exploration. Real-world applications of DRA in various domains are examined to underscore the importance of efficient resource allocation, and relevant distributed optimization formulations are presented. The survey then delves into existing solutions for DRA, encompassing linear, nonlinear, primal-based, and dual-formulation-based approaches. Furthermore, this paper evaluates the features and properties of DRA algorithms, addressing key aspects such as feasibility, convergence rate, and network reliability. The analysis of mathematical foundations, diverse applications, existing solutions, and algorithmic properties contributes to a broader comprehension of the challenges and potential solutions for this domain.
多智能体系统中的资源分配和调度由于复杂的交互和分散性而面临挑战。本文全面分析了多智能体系统中用于解决分布式资源分配问题的分布式算法。它涵盖了优化、多智能体系统和基于共识的分布式计算交叉研究的重要领域。本文首先给出了DRA问题的数学公式,为进一步的探索奠定了坚实的基础。研究了DRA在各个领域的实际应用,强调了有效分配资源的重要性,并提出了相关的分布式优化公式。然后,调查深入研究了现有的DRA解决方案,包括线性、非线性、基于原始和基于双公式的方法。此外,本文评估了DRA算法的特征和性质,解决了可行性、收敛速度和网络可靠性等关键方面的问题。对数学基础、各种应用、现有解决方案和算法属性的分析有助于更广泛地理解该领域的挑战和潜在解决方案。
{"title":"Survey of distributed algorithms for resource allocation over multi-agent systems","authors":"Mohammadreza Doostmohammadian ,&nbsp;Alireza Aghasi ,&nbsp;Mohammad Pirani ,&nbsp;Ehsan Nekouei ,&nbsp;Houman Zarrabi ,&nbsp;Reza Keypour ,&nbsp;Apostolos I. Rikos ,&nbsp;Karl H. Johansson","doi":"10.1016/j.arcontrol.2024.100983","DOIUrl":"10.1016/j.arcontrol.2024.100983","url":null,"abstract":"<div><div>Resource allocation and scheduling in multi-agent systems present challenges due to complex interactions and decentralization. This survey paper provides a comprehensive analysis of distributed algorithms for addressing the distributed resource allocation (DRA) problem over multi-agent systems. It covers a significant area of research at the intersection of optimization, multi-agent systems, and distributed consensus-based computing. The paper begins by presenting a mathematical formulation of the DRA problem, establishing a solid foundation for further exploration. Real-world applications of DRA in various domains are examined to underscore the importance of efficient resource allocation, and relevant distributed optimization formulations are presented. The survey then delves into existing solutions for DRA, encompassing linear, nonlinear, primal-based, and dual-formulation-based approaches. Furthermore, this paper evaluates the features and properties of DRA algorithms, addressing key aspects such as feasibility, convergence rate, and network reliability. The analysis of mathematical foundations, diverse applications, existing solutions, and algorithmic properties contributes to a broader comprehension of the challenges and potential solutions for this domain.</div></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"59 ","pages":"Article 100983"},"PeriodicalIF":7.3,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142746294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning safe control for multi-robot systems: Methods, verification, and open challenges 学习多机器人系统的安全控制:方法、验证和公开挑战
IF 9.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-01-01 DOI: 10.1016/j.arcontrol.2024.100948
Kunal Garg, Songyuan Zhang, Oswin So, Charles Dawson, Chuchu Fan

In this survey, we review the recent advances in control design methods for robotic multi-agent systems (MAS), focusing on learning-based methods with safety considerations. We start by reviewing various notions of safety and liveness properties, and modeling frameworks used for problem formulation of MAS. Then we provide a comprehensive review of learning-based methods for safe control design for multi-robot systems. We start with various shielding-based methods, such as safety certificates, predictive filters, and reachability tools. Then, we review the current state of control barrier certificate learning in both a centralized and distributed manner, followed by a comprehensive review of multi-agent reinforcement learning with a particular focus on safety. Next, we discuss the state-of-the-art verification tools for the correctness of learning-based methods. Based on the capabilities and the limitations of the state-of-the-art methods in learning and verification for MAS, we identify various broad themes for open challenges: how to design methods that can achieve good performance along with safety guarantees; how to decompose single-agent-based centralized methods for MAS; how to account for communication-related practical issues; and how to assess transfer of theoretical guarantees to practice.

在本调查报告中,我们回顾了机器人多代理系统(MAS)控制设计方法的最新进展,重点是基于学习并考虑安全因素的方法。首先,我们回顾了安全和有效性的各种概念,以及用于 MAS 问题表述的建模框架。然后,我们全面回顾了基于学习的多机器人系统安全控制设计方法。我们首先介绍各种基于屏蔽的方法,如安全证书、预测过滤器和可达性工具。然后,我们回顾了集中式和分布式控制屏障证书学习的现状,接着全面回顾了多机器人强化学习,并特别关注安全性。接下来,我们讨论了基于学习的方法正确性的最新验证工具。基于最先进的 MAS 学习和验证方法的能力和局限性,我们确定了各种开放挑战的广泛主题:如何设计既能实现良好性能又能保证安全的方法;如何分解基于单个代理的 MAS 集中式方法;如何考虑与通信相关的实际问题;以及如何评估理论保证向实践的转移。
{"title":"Learning safe control for multi-robot systems: Methods, verification, and open challenges","authors":"Kunal Garg,&nbsp;Songyuan Zhang,&nbsp;Oswin So,&nbsp;Charles Dawson,&nbsp;Chuchu Fan","doi":"10.1016/j.arcontrol.2024.100948","DOIUrl":"https://doi.org/10.1016/j.arcontrol.2024.100948","url":null,"abstract":"<div><p>In this survey, we review the recent advances in control design methods for robotic multi-agent systems (MAS), focusing on learning-based methods with safety considerations. We start by reviewing various notions of safety and liveness properties, and modeling frameworks used for problem formulation of MAS. Then we provide a comprehensive review of learning-based methods for safe control design for multi-robot systems. We start with various shielding-based methods, such as safety certificates, predictive filters, and reachability tools. Then, we review the current state of control barrier certificate learning in both a centralized and distributed manner, followed by a comprehensive review of multi-agent reinforcement learning with a particular focus on safety. Next, we discuss the state-of-the-art verification tools for the correctness of learning-based methods. Based on the capabilities and the limitations of the state-of-the-art methods in learning and verification for MAS, we identify various broad themes for open challenges: how to design methods that can achieve good performance along with safety guarantees; how to decompose single-agent-based centralized methods for MAS; how to account for communication-related practical issues; and how to assess transfer of theoretical guarantees to practice.</p></div>","PeriodicalId":50750,"journal":{"name":"Annual Reviews in Control","volume":"57 ","pages":"Article 100948"},"PeriodicalIF":9.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140187916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annual Reviews in Control
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1