Simulating fish autonomous swimming behaviours using deep reinforcement learning based on Kolmogorov-Arnold Networks.

IF 3.1 3区计算机科学 Q1 ENGINEERING, MULTIDISCIPLINARY Bioinspiration & Biomimetics Pub Date : 2025-01-16 DOI:10.1088/1748-3190/ada59c

Tao Li, Chunze Zhang, Guibin Zhang, Qin Zhou, Ji Hou, Wei Diao, Wanwan Meng, Xujin Zhang

{"title":"Simulating fish autonomous swimming behaviours using deep reinforcement learning based on Kolmogorov-Arnold Networks.","authors":"Tao Li, Chunze Zhang, Guibin Zhang, Qin Zhou, Ji Hou, Wei Diao, Wanwan Meng, Xujin Zhang","doi":"10.1088/1748-3190/ada59c","DOIUrl":null,"url":null,"abstract":"<p><p>The study of fish swimming behaviours and locomotion mechanisms holds significant scientific and engineering value. With the rapid advancements in artificial intelligence, a new method combining deep reinforcement learning (DRL) with computational fluid dynamics has emerged and been applied to simulate the fish's adaptive swimming behaviour, where the complex fish behaviour is decoupled to focus on the fish's response to the hydrodynamic field, and the simulation is driven by reward-based objectives to model the fish's swimming behaviour. However, the scale of this cross-disciplinary method is directly affected by the efficiency of the DRL model. To promote it to more general application scenarios, there is a pressing need for further research on more efficient and economical network architectures to address the challenge of approximating state-value function in high-dimensional, dynamic, and uncertain environments. Building upon a previously proposed computational platform for the simulation of fish autonomous swimming behaviour, we integrated Kolmogorov-Arnold Networks(KANs) and tested their performance in point-to-point swimming and Kármán gait swimming environments. Experimental results demonstrated that, compared to long short-term memory Networks(LSTMs) and multilayer perceptron networks(MLPs), the introduction of KANs significantly enhanced the perception and decision-making abilities of the intelligent fish in complex fluid environments. With a smaller network scale, in the point-to-point swimming case, KANs effectively approximated the state-value function, achieving average reward improvements of up to 88.0% and 94.1% over MLPs and LSTMs networks, respectively, and increased by 766.7% and 105.6% in the Kármán gait swimming case. Under comparable network sizes, the intelligent fish with KANs exhibited faster learning capabilities and more stable swimming performance in complex fluid settings.</p>","PeriodicalId":55377,"journal":{"name":"Bioinspiration & Biomimetics","volume":" ","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinspiration & Biomimetics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1088/1748-3190/ada59c","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

The study of fish swimming behaviours and locomotion mechanisms holds significant scientific and engineering value. With the rapid advancements in artificial intelligence, a new method combining deep reinforcement learning (DRL) with computational fluid dynamics has emerged and been applied to simulate the fish's adaptive swimming behaviour, where the complex fish behaviour is decoupled to focus on the fish's response to the hydrodynamic field, and the simulation is driven by reward-based objectives to model the fish's swimming behaviour. However, the scale of this cross-disciplinary method is directly affected by the efficiency of the DRL model. To promote it to more general application scenarios, there is a pressing need for further research on more efficient and economical network architectures to address the challenge of approximating state-value function in high-dimensional, dynamic, and uncertain environments. Building upon a previously proposed computational platform for the simulation of fish autonomous swimming behaviour, we integrated Kolmogorov-Arnold Networks(KANs) and tested their performance in point-to-point swimming and Kármán gait swimming environments. Experimental results demonstrated that, compared to long short-term memory Networks(LSTMs) and multilayer perceptron networks(MLPs), the introduction of KANs significantly enhanced the perception and decision-making abilities of the intelligent fish in complex fluid environments. With a smaller network scale, in the point-to-point swimming case, KANs effectively approximated the state-value function, achieving average reward improvements of up to 88.0% and 94.1% over MLPs and LSTMs networks, respectively, and increased by 766.7% and 105.6% in the Kármán gait swimming case. Under comparable network sizes, the intelligent fish with KANs exhibited faster learning capabilities and more stable swimming performance in complex fluid settings.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于Kolmogorov-Arnold网络的深度强化学习模拟鱼类自主游泳行为。

鱼类游泳行为和运动机制的研究具有重要的科学和工程价值。随着人工智能的快速发展，一种将深度强化学习（DRL）与计算流体动力学（CFD）相结合的新方法已经出现，并被应用于模拟鱼类等高等生物的自主行为。然而，这种跨学科方法的规模直接受到DRL模型效率的影响。为了将其推广到更广泛的应用场景，迫切需要进一步研究更高效、更经济的网络架构，以解决在高维、动态和不确定环境中逼近状态值函数的挑战。基于先前提出的模拟鱼类自主游泳行为的计算平台，我们集成了KANs并测试了它们在点对点游泳和Kármán步态游泳环境中的性能。实验结果表明，与lstm和mlp网络相比，KANs的引入显著提高了智能鱼在复杂流体环境中的感知和决策能力。在更小的网络规模下，在点对点游泳情况下，KANs有效地逼近了状态值函数，比mlp和LSTMs网络分别获得了高达88.0%和94.1%的平均奖励改进，在Kármán步态游泳情况下分别增加了766.7%和105.6%。在相同的网络规模下，具有KANs的智能鱼在复杂的流体环境中表现出更快的学习能力和更稳定的游泳表现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Bioinspiration & Biomimetics 工程技术-材料科学：生物材料

CiteScore

5.90

自引率

14.70%

发文量

132

审稿时长

3 months

期刊介绍： Bioinspiration & Biomimetics publishes research involving the study and distillation of principles and functions found in biological systems that have been developed through evolution, and application of this knowledge to produce novel and exciting basic technologies and new approaches to solving scientific problems. It provides a forum for interdisciplinary research which acts as a pipeline, facilitating the two-way flow of ideas and understanding between the extensive bodies of knowledge of the different disciplines. It has two principal aims: to draw on biology to enrich engineering and to draw from engineering to enrich biology. The journal aims to include input from across all intersecting areas of both fields. In biology, this would include work in all fields from physiology to ecology, with either zoological or botanical focus. In engineering, this would include both design and practical application of biomimetic or bioinspired devices and systems. Typical areas of interest include: Systems, designs and structure Communication and navigation Cooperative behaviour Self-organizing biological systems Self-healing and self-assembly Aerial locomotion and aerospace applications of biomimetics Biomorphic surface and subsurface systems Marine dynamics: swimming and underwater dynamics Applications of novel materials Biomechanics; including movement, locomotion, fluidics Cellular behaviour Sensors and senses Biomimetic or bioinformed approaches to geological exploration.