Concurrent-Learning-Based Adaptive Critic Formation for Multirobots Under Safety Constraints

IF 8.9 1区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Internet of Things Journal Pub Date : 2024-11-14 DOI:10.1109/JIOT.2024.3497979
Yunjie Cheng;Xingling Shao;Jiangmiao Li;Jun Liu;Qingzhen Zhang
{"title":"Concurrent-Learning-Based Adaptive Critic Formation for Multirobots Under Safety Constraints","authors":"Yunjie Cheng;Xingling Shao;Jiangmiao Li;Jun Liu;Qingzhen Zhang","doi":"10.1109/JIOT.2024.3497979","DOIUrl":null,"url":null,"abstract":"This article presents a concurrent learning-based adaptive critic formation for multirobots under safety constraints, which comprises of an initial formation consensus item and a collision-free adaptive critic policy. First, based on directed graph communication, an initial formation consensus item is designed to maintain the velocity agreement under a leader-follower setting. Particularly, a collision-free adaptive critic policy is developed that enables robots to preserve formation configuration with the minimum cost while excluding collisions caused by inter-robots and static/moving obstacles, wherein safety constraints encoded by an elegantly devised penalty function are enforced by converting constrained optimal control into unconstrained optimal control issue. Furthermore, by revisiting real-time and historical information, a concurrent weight learning rule is elaborated under a critic-only adaptive dynamic programming, improving the weight convergence without demanding the persistence excitation conditions. The remarkable benefits outperforming existing outcomes are safety-critical coordination with energy-saving performances is assured under a computationally efficient optimal learning paradigm. Involved errors are theoretically proved to be convergent. Finally, the values and superiorities are verified through extensive simulations on 2-D and 3-D multirobots.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 6","pages":"7610-7621"},"PeriodicalIF":8.9000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10753353/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

This article presents a concurrent learning-based adaptive critic formation for multirobots under safety constraints, which comprises of an initial formation consensus item and a collision-free adaptive critic policy. First, based on directed graph communication, an initial formation consensus item is designed to maintain the velocity agreement under a leader-follower setting. Particularly, a collision-free adaptive critic policy is developed that enables robots to preserve formation configuration with the minimum cost while excluding collisions caused by inter-robots and static/moving obstacles, wherein safety constraints encoded by an elegantly devised penalty function are enforced by converting constrained optimal control into unconstrained optimal control issue. Furthermore, by revisiting real-time and historical information, a concurrent weight learning rule is elaborated under a critic-only adaptive dynamic programming, improving the weight convergence without demanding the persistence excitation conditions. The remarkable benefits outperforming existing outcomes are safety-critical coordination with energy-saving performances is assured under a computationally efficient optimal learning paradigm. Involved errors are theoretically proved to be convergent. Finally, the values and superiorities are verified through extensive simulations on 2-D and 3-D multirobots.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
安全约束条件下基于并发学习的多机器人自适应批判性编队
本文提出了一种基于并发学习的安全约束下多机器人自适应评论形成方法,该方法由初始形成共识项和无碰撞自适应评论策略组成。首先,在有向图通信的基础上,设计了一个初始的队形共识项,以保持在领导者-追随者设置下的速度一致性。特别是,开发了一种无碰撞自适应批评策略,使机器人能够以最小的成本保持队形,同时排除机器人间和静态/移动障碍物引起的碰撞,其中由设计优雅的惩罚函数编码的安全约束通过将约束最优控制转换为无约束最优控制问题来强制执行。此外,通过重访实时和历史信息,在仅限临界的自适应动态规划下制定了一种并发权值学习规则,在不要求持续激励条件的情况下提高了权值的收敛性。卓越的效益优于现有的结果是安全关键协调与节能性能是在一个计算高效的最佳学习范式下得到保证。从理论上证明了相关误差是收敛的。最后,通过对二维和三维多机器人的大量仿真验证了该方法的价值和优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Internet of Things Journal
IEEE Internet of Things Journal Computer Science-Information Systems
CiteScore
17.60
自引率
13.20%
发文量
1982
期刊介绍: The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.
期刊最新文献
AI for AIoT as a Service: AI to Configure Models, Capacities, and Tasks SPIoT: An Adaptive Federated Sparse Framework for Intrusion Detection in IoT KoopShield: A Koopman based Online Data-Driven Safety Framework for Truck Platoons Resilient to Communication Delays Identifying Critical Nodes in Smart Grid IoT Infrastructure: A Graph Convolutional Network Approach Enabling the 6G and IoT-Verse: Non-Radiative Dielectric (NRD) Waveguides for Millimeter-Wave Communications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1