通过步态强化学习实现经济的四足多步态运动

IF 4.9 3区 计算机科学 Q1 ENGINEERING, MULTIDISCIPLINARY Journal of Bionic Engineering Pub Date : 2024-05-18 DOI:10.1007/s42235-024-00517-3
Lang Wei, Jinzhou Zou, Xi Yu, Liangyu Liu, Jianbin Liao, Wei Wang, Tong Zhang
{"title":"通过步态强化学习实现经济的四足多步态运动","authors":"Lang Wei,&nbsp;Jinzhou Zou,&nbsp;Xi Yu,&nbsp;Liangyu Liu,&nbsp;Jianbin Liao,&nbsp;Wei Wang,&nbsp;Tong Zhang","doi":"10.1007/s42235-024-00517-3","DOIUrl":null,"url":null,"abstract":"<div><p>In order to strike a balance between achieving desired velocities and minimizing energy consumption, legged animals have the ability to adopt the appropriate gait pattern and seamlessly transition to another if needed. This ability makes them more versatile and efficient when traversing natural terrains, and more suitable for long treks. In the same way, it is meaningful and important for quadruped robots to master this ability. To achieve this goal, we propose an effective gait-heuristic reinforcement learning framework in which multiple gait locomotion and smooth gait transitions automatically emerge to reach target velocities while minimizing energy consumption. We incorporate a novel trajectory generator with explicit gait information as a memory mechanism into the deep reinforcement learning framework. This allows the quadruped robot to adopt reliable and distinct gait patterns while benefiting from a warm start provided by the trajectory generator. Furthermore, we investigate the key factors contributing to the emergence of multiple gait locomotion. We tested our framework on a closed-chain quadruped robot and demonstrated that the robot can change its gait patterns, such as standing, walking, and trotting, to adopt the most energy-efficient gait at a given speed. Lastly, we deploy our learned controller to a quadruped robot and demonstrate the energy efficiency and robustness of our method.</p></div>","PeriodicalId":614,"journal":{"name":"Journal of Bionic Engineering","volume":"21 4","pages":"1720 - 1732"},"PeriodicalIF":4.9000,"publicationDate":"2024-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Economical Quadrupedal Multi-Gait Locomotion via Gait-Heuristic Reinforcement Learning\",\"authors\":\"Lang Wei,&nbsp;Jinzhou Zou,&nbsp;Xi Yu,&nbsp;Liangyu Liu,&nbsp;Jianbin Liao,&nbsp;Wei Wang,&nbsp;Tong Zhang\",\"doi\":\"10.1007/s42235-024-00517-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In order to strike a balance between achieving desired velocities and minimizing energy consumption, legged animals have the ability to adopt the appropriate gait pattern and seamlessly transition to another if needed. This ability makes them more versatile and efficient when traversing natural terrains, and more suitable for long treks. In the same way, it is meaningful and important for quadruped robots to master this ability. To achieve this goal, we propose an effective gait-heuristic reinforcement learning framework in which multiple gait locomotion and smooth gait transitions automatically emerge to reach target velocities while minimizing energy consumption. We incorporate a novel trajectory generator with explicit gait information as a memory mechanism into the deep reinforcement learning framework. This allows the quadruped robot to adopt reliable and distinct gait patterns while benefiting from a warm start provided by the trajectory generator. Furthermore, we investigate the key factors contributing to the emergence of multiple gait locomotion. We tested our framework on a closed-chain quadruped robot and demonstrated that the robot can change its gait patterns, such as standing, walking, and trotting, to adopt the most energy-efficient gait at a given speed. Lastly, we deploy our learned controller to a quadruped robot and demonstrate the energy efficiency and robustness of our method.</p></div>\",\"PeriodicalId\":614,\"journal\":{\"name\":\"Journal of Bionic Engineering\",\"volume\":\"21 4\",\"pages\":\"1720 - 1732\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2024-05-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Bionic Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s42235-024-00517-3\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Bionic Engineering","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s42235-024-00517-3","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

为了在实现理想速度和最大限度减少能量消耗之间取得平衡,有腿动物能够采用适当的步态,并在需要时无缝过渡到另一种步态。这种能力使它们在穿越自然地形时更加灵活高效,也更适合长途跋涉。同样,对于四足机器人来说,掌握这种能力既有意义又很重要。为了实现这一目标,我们提出了一种有效的步态启发式强化学习框架,在该框架中,多种步态运动和平滑步态转换会自动出现,以达到目标速度,同时将能耗降至最低。我们将具有明确步态信息的新型轨迹生成器作为记忆机制纳入深度强化学习框架。这使得四足机器人能够采用可靠而独特的步态模式,同时受益于轨迹生成器提供的热启动。此外,我们还研究了促成多种步态运动出现的关键因素。我们在一个闭链四足机器人上测试了我们的框架,并证明机器人可以改变其步态模式,如站立、行走和小跑,以在给定速度下采用最节能的步态。最后,我们将学习到的控制器部署到四足机器人上,展示了我们方法的能效和鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Economical Quadrupedal Multi-Gait Locomotion via Gait-Heuristic Reinforcement Learning

In order to strike a balance between achieving desired velocities and minimizing energy consumption, legged animals have the ability to adopt the appropriate gait pattern and seamlessly transition to another if needed. This ability makes them more versatile and efficient when traversing natural terrains, and more suitable for long treks. In the same way, it is meaningful and important for quadruped robots to master this ability. To achieve this goal, we propose an effective gait-heuristic reinforcement learning framework in which multiple gait locomotion and smooth gait transitions automatically emerge to reach target velocities while minimizing energy consumption. We incorporate a novel trajectory generator with explicit gait information as a memory mechanism into the deep reinforcement learning framework. This allows the quadruped robot to adopt reliable and distinct gait patterns while benefiting from a warm start provided by the trajectory generator. Furthermore, we investigate the key factors contributing to the emergence of multiple gait locomotion. We tested our framework on a closed-chain quadruped robot and demonstrated that the robot can change its gait patterns, such as standing, walking, and trotting, to adopt the most energy-efficient gait at a given speed. Lastly, we deploy our learned controller to a quadruped robot and demonstrate the energy efficiency and robustness of our method.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Bionic Engineering
Journal of Bionic Engineering 工程技术-材料科学:生物材料
CiteScore
7.10
自引率
10.00%
发文量
162
审稿时长
10.0 months
期刊介绍: The Journal of Bionic Engineering (JBE) is a peer-reviewed journal that publishes original research papers and reviews that apply the knowledge learned from nature and biological systems to solve concrete engineering problems. The topics that JBE covers include but are not limited to: Mechanisms, kinematical mechanics and control of animal locomotion, development of mobile robots with walking (running and crawling), swimming or flying abilities inspired by animal locomotion. Structures, morphologies, composition and physical properties of natural and biomaterials; fabrication of new materials mimicking the properties and functions of natural and biomaterials. Biomedical materials, artificial organs and tissue engineering for medical applications; rehabilitation equipment and devices. Development of bioinspired computation methods and artificial intelligence for engineering applications.
期刊最新文献
Sandwich-Structured Solar Cells with Accelerated Conversion Efficiency by Self-Cooling and Self-Cleaning Design From Perception to Action: Brain-to-Brain Information Transmission of Pigeons Design and Motion Characteristics of a Ray-Inspired Micro-Robot Made of Magnetic Film Bionic Jumping of Humanoid Robot via Online Centroid Trajectory Optimization and High Dynamic Motion Controller Multi-Sensor Fusion for State Estimation and Control of Cable-Driven Soft Robots
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1