Hierarchical Decentralized Deep Reinforcement Learning Architecture for a Simulated Four-Legged Agent

W. Z. E. Amri, L. Hermes, M. Schilling
{"title":"Hierarchical Decentralized Deep Reinforcement Learning Architecture for a Simulated Four-Legged Agent","authors":"W. Z. E. Amri, L. Hermes, M. Schilling","doi":"10.48550/arXiv.2210.08003","DOIUrl":null,"url":null,"abstract":"Legged locomotion is widespread in nature and has inspired the design of current robots. The controller of these legged robots is often realized as one centralized instance. However, in nature, control of movement happens in a hierarchical and decentralized fashion. Introducing these biological design principles into robotic control systems has motivated this work. We tackle the question whether decentralized and hierarchical control is beneficial for legged robots and present a novel decentral, hierarchical architecture to control a simulated legged agent. Three different tasks varying in complexity are designed to benchmark five architectures (centralized, decentralized, hierarchical and two different combinations of hierarchical decentralized architectures). The results demonstrate that decentralizing the different levels of the hierarchical architectures facilitates learning of the agent, ensures more energy efficient movements as well as robustness towards new unseen environments. Furthermore, this comparison sheds light on the importance of modularity in hierarchical architectures to solve complex goal-directed tasks. We provide an open-source code implementation of our architecture (https://github.com/wzaielamri/hddrl).","PeriodicalId":432112,"journal":{"name":"International Conference on Machine Learning, Optimization, and Data Science","volume":"481 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Machine Learning, Optimization, and Data Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2210.08003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Legged locomotion is widespread in nature and has inspired the design of current robots. The controller of these legged robots is often realized as one centralized instance. However, in nature, control of movement happens in a hierarchical and decentralized fashion. Introducing these biological design principles into robotic control systems has motivated this work. We tackle the question whether decentralized and hierarchical control is beneficial for legged robots and present a novel decentral, hierarchical architecture to control a simulated legged agent. Three different tasks varying in complexity are designed to benchmark five architectures (centralized, decentralized, hierarchical and two different combinations of hierarchical decentralized architectures). The results demonstrate that decentralizing the different levels of the hierarchical architectures facilitates learning of the agent, ensures more energy efficient movements as well as robustness towards new unseen environments. Furthermore, this comparison sheds light on the importance of modularity in hierarchical architectures to solve complex goal-directed tasks. We provide an open-source code implementation of our architecture (https://github.com/wzaielamri/hddrl).
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
模拟四足智能体的分层分散深度强化学习体系结构
腿式运动在自然界中广泛存在,并启发了当前机器人的设计。这些足式机器人的控制器通常作为一个集中实例来实现。然而,在自然界中,对运动的控制是以分层和分散的方式发生的。将这些生物设计原理引入机器人控制系统激发了这项工作。我们解决了去中心化和分层控制是否对有腿机器人有益的问题,并提出了一种新的去中心化、分层结构来控制模拟的有腿机器人。设计了复杂度不同的三种不同任务,对五种架构(集中式、分散式、分层式和分层式分散式架构的两种不同组合)进行基准测试。结果表明,分散层次结构的不同层次有助于智能体的学习,确保更节能的运动以及对新的未知环境的鲁棒性。此外,这种比较揭示了模块化在分层体系结构中解决复杂目标导向任务的重要性。我们提供了架构的开源代码实现(https://github.com/wzaielamri/hddrl)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Comparison of SVM against Pre-trained Language Models (PLMs) for Text Classification Tasks Hierarchical Decentralized Deep Reinforcement Learning Architecture for a Simulated Four-Legged Agent On the utility and protection of optimization with differential privacy and classic regularization techniques Adaptive Zeroth-Order Optimisation of Nonconvex Composite Objectives Robust PCA for Anomaly Detection and Data Imputation in Seasonal Time Series
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1