Learning obstacle avoidance and predation in complex reef environments with deep reinforcement learning.

IF 3.1 3区 计算机科学 Q1 ENGINEERING, MULTIDISCIPLINARY Bioinspiration & Biomimetics Pub Date : 2024-08-07 DOI:10.1088/1748-3190/ad6544
Ji Hou, Changling He, Tao Li, Chunze Zhang, Qin Zhou
{"title":"Learning obstacle avoidance and predation in complex reef environments with deep reinforcement learning.","authors":"Ji Hou, Changling He, Tao Li, Chunze Zhang, Qin Zhou","doi":"10.1088/1748-3190/ad6544","DOIUrl":null,"url":null,"abstract":"<p><p>The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets within reef environments within complex water flow, all while evading obstacles and maintaining stable postures, has remained a challenging and prominent subject in the realms of fish behavior, ecology, and biomimetics alike. An integrated simulation framework is used to investigate fish predation problems within intricate environments, combining deep reinforcement learning algorithms (DRL) with high-precision fluid-structure interaction numerical methods-immersed boundary lattice Boltzmann method (lB-LBM). The Soft Actor-Critic (SAC) algorithm is used to improve the intelligent fish's capacity for random exploration, tackling the multi-objective sparse reward challenge inherent in real-world scenarios. Additionally, a reward shaping method tailored to its action purposes has been developed, capable of capturing outcomes and trend characteristics effectively. The convergence and robustness advantages of the method elucidated in this paper are showcased through two case studies: one addressing fish capturing randomly moving targets in hydrostatic flow field, and the other focusing on fish counter-current foraging in reef environments to capture drifting food. A comprehensive analysis was conducted of the influence and significance of various reward types on the decision-making processes of intelligent fish within intricate environments.</p>","PeriodicalId":55377,"journal":{"name":"Bioinspiration & Biomimetics","volume":" ","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinspiration & Biomimetics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1088/1748-3190/ad6544","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets within reef environments within complex water flow, all while evading obstacles and maintaining stable postures, has remained a challenging and prominent subject in the realms of fish behavior, ecology, and biomimetics alike. An integrated simulation framework is used to investigate fish predation problems within intricate environments, combining deep reinforcement learning algorithms (DRL) with high-precision fluid-structure interaction numerical methods-immersed boundary lattice Boltzmann method (lB-LBM). The Soft Actor-Critic (SAC) algorithm is used to improve the intelligent fish's capacity for random exploration, tackling the multi-objective sparse reward challenge inherent in real-world scenarios. Additionally, a reward shaping method tailored to its action purposes has been developed, capable of capturing outcomes and trend characteristics effectively. The convergence and robustness advantages of the method elucidated in this paper are showcased through two case studies: one addressing fish capturing randomly moving targets in hydrostatic flow field, and the other focusing on fish counter-current foraging in reef environments to capture drifting food. A comprehensive analysis was conducted of the influence and significance of various reward types on the decision-making processes of intelligent fish within intricate environments.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用深度强化学习在复杂的珊瑚礁环境中学习避障和捕食。
珊瑚礁生态系统作为游泳能力有限的鱼类的栖息地发挥着至关重要的作用,它不仅是鱼类的避难所和食物来源,还影响着鱼类的行为倾向。在复杂的水流中,鱼类通过复杂的机制在珊瑚礁环境中巧妙地导航移动目标,同时躲避障碍物并保持稳定的姿态,这在鱼类行为学、生态学和生物仿生学领域一直是一个具有挑战性的重要课题。该研究采用了一个综合模拟框架,将深度强化学习算法(DRL)与高精度流固耦合数值方法--浸没边界晶格玻尔兹曼法(lB-LBM)相结合,研究复杂环境中的鱼类捕食问题。软行为批判(SAC)算法用于提高智能鱼的随机探索能力,以应对现实世界场景中固有的多目标稀疏奖励挑战。此外,还开发了一种针对其行动目的的奖励塑造方法,能够有效捕捉结果和趋势特征。本文通过两个案例研究展示了该方法的收敛性和鲁棒性优势:一个案例研究了鱼类在静态流场中捕捉随机移动目标的情况,另一个案例研究了鱼类在珊瑚礁环境中逆流觅食捕捉漂流食物的情况。本文全面分析了各种奖励类型对复杂环境中智能鱼类决策过程的影响和意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Bioinspiration & Biomimetics
Bioinspiration & Biomimetics 工程技术-材料科学:生物材料
CiteScore
5.90
自引率
14.70%
发文量
132
审稿时长
3 months
期刊介绍: Bioinspiration & Biomimetics publishes research involving the study and distillation of principles and functions found in biological systems that have been developed through evolution, and application of this knowledge to produce novel and exciting basic technologies and new approaches to solving scientific problems. It provides a forum for interdisciplinary research which acts as a pipeline, facilitating the two-way flow of ideas and understanding between the extensive bodies of knowledge of the different disciplines. It has two principal aims: to draw on biology to enrich engineering and to draw from engineering to enrich biology. The journal aims to include input from across all intersecting areas of both fields. In biology, this would include work in all fields from physiology to ecology, with either zoological or botanical focus. In engineering, this would include both design and practical application of biomimetic or bioinspired devices and systems. Typical areas of interest include: Systems, designs and structure Communication and navigation Cooperative behaviour Self-organizing biological systems Self-healing and self-assembly Aerial locomotion and aerospace applications of biomimetics Biomorphic surface and subsurface systems Marine dynamics: swimming and underwater dynamics Applications of novel materials Biomechanics; including movement, locomotion, fluidics Cellular behaviour Sensors and senses Biomimetic or bioinformed approaches to geological exploration.
期刊最新文献
Stability and agility trade-offs in spring-wing systems. Genetic algorithm-based optimal design for fluidic artificial muscle (FAM) bundles. Touch-down condition control for the bipedal spring-mass model in walking. Predictive uncertainty in state-estimation drives active sensing. Analysis and actuation design of a novel at-scale 3-DOF biomimetic flapping-wing mechanism inspired by flying insects.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1