Concurrent-Learning-Based Adaptive Critic Formation for Multirobots Under Safety Constraints

IF 8.9 1区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Internet of Things Journal Pub Date : 2024-11-14 DOI:10.1109/JIOT.2024.3497979

Yunjie Cheng;Xingling Shao;Jiangmiao Li;Jun Liu;Qingzhen Zhang

{"title":"Concurrent-Learning-Based Adaptive Critic Formation for Multirobots Under Safety Constraints","authors":"Yunjie Cheng;Xingling Shao;Jiangmiao Li;Jun Liu;Qingzhen Zhang","doi":"10.1109/JIOT.2024.3497979","DOIUrl":null,"url":null,"abstract":"This article presents a concurrent learning-based adaptive critic formation for multirobots under safety constraints, which comprises of an initial formation consensus item and a collision-free adaptive critic policy. First, based on directed graph communication, an initial formation consensus item is designed to maintain the velocity agreement under a leader-follower setting. Particularly, a collision-free adaptive critic policy is developed that enables robots to preserve formation configuration with the minimum cost while excluding collisions caused by inter-robots and static/moving obstacles, wherein safety constraints encoded by an elegantly devised penalty function are enforced by converting constrained optimal control into unconstrained optimal control issue. Furthermore, by revisiting real-time and historical information, a concurrent weight learning rule is elaborated under a critic-only adaptive dynamic programming, improving the weight convergence without demanding the persistence excitation conditions. The remarkable benefits outperforming existing outcomes are safety-critical coordination with energy-saving performances is assured under a computationally efficient optimal learning paradigm. Involved errors are theoretically proved to be convergent. Finally, the values and superiorities are verified through extensive simulations on 2-D and 3-D multirobots.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 6","pages":"7610-7621"},"PeriodicalIF":8.9000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10753353/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

This article presents a concurrent learning-based adaptive critic formation for multirobots under safety constraints, which comprises of an initial formation consensus item and a collision-free adaptive critic policy. First, based on directed graph communication, an initial formation consensus item is designed to maintain the velocity agreement under a leader-follower setting. Particularly, a collision-free adaptive critic policy is developed that enables robots to preserve formation configuration with the minimum cost while excluding collisions caused by inter-robots and static/moving obstacles, wherein safety constraints encoded by an elegantly devised penalty function are enforced by converting constrained optimal control into unconstrained optimal control issue. Furthermore, by revisiting real-time and historical information, a concurrent weight learning rule is elaborated under a critic-only adaptive dynamic programming, improving the weight convergence without demanding the persistence excitation conditions. The remarkable benefits outperforming existing outcomes are safety-critical coordination with energy-saving performances is assured under a computationally efficient optimal learning paradigm. Involved errors are theoretically proved to be convergent. Finally, the values and superiorities are verified through extensive simulations on 2-D and 3-D multirobots.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

安全约束条件下基于并发学习的多机器人自适应批判性编队

本文提出了一种基于并发学习的安全约束下多机器人自适应评论形成方法，该方法由初始形成共识项和无碰撞自适应评论策略组成。首先，在有向图通信的基础上，设计了一个初始的队形共识项，以保持在领导者-追随者设置下的速度一致性。特别是，开发了一种无碰撞自适应批评策略，使机器人能够以最小的成本保持队形，同时排除机器人间和静态/移动障碍物引起的碰撞，其中由设计优雅的惩罚函数编码的安全约束通过将约束最优控制转换为无约束最优控制问题来强制执行。此外，通过重访实时和历史信息，在仅限临界的自适应动态规划下制定了一种并发权值学习规则，在不要求持续激励条件的情况下提高了权值的收敛性。卓越的效益优于现有的结果是安全关键协调与节能性能是在一个计算高效的最佳学习范式下得到保证。从理论上证明了相关误差是收敛的。最后，通过对二维和三维多机器人的大量仿真验证了该方法的价值和优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Internet of Things Journal Computer Science-Information Systems

CiteScore

17.60

自引率

13.20%

发文量

1982

期刊介绍： The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.