Quoc Tuan Vu , Van Tu Duong , Huy Hung Nguyen , Tan Tien Nguyen
{"title":"利用多代理深度确定性策略梯度优化细长起伏鳍的游泳模式","authors":"Quoc Tuan Vu , Van Tu Duong , Huy Hung Nguyen , Tan Tien Nguyen","doi":"10.1016/j.jestch.2024.101783","DOIUrl":null,"url":null,"abstract":"<div><p>Optimizing speed and propulsive efficiency are the most crucial survival skills for biomimetic robots. This paper investigates a swimming mode controller inspired by the black Knifefish to govern the fast-swimming gait with high propulsive efficiency for an elongated undulating fin robot. The proposed swimming mode controller is composed of a couple of Hopf oscillator-based central pattern generators (CPG) to generate the moving gait of robotic fish and a novel variant of Reinforcement Learning (RL) known as Multi-Agent Deep Deterministic Policy Gradient (MA-DDPG) for optimizing the propulsive efficiency. The proposed swimming controller facilitates the autonomous optimization of the oscillatory amplitude of the robotic fish to improve its propulsive efficiency. The proposed MA-DDPG demonstrates an aptitude for functioning within mixed cooperative-competitive environments. Furthermore, it effectively mitigates the drawback of zero amplitude in the updating process of conventional reinforcement learning (RL) methodologies. These findings highlight the potential utility of the MA-DDPG in optimizing the performance of multi-agent systems in dynamic, real-world scenarios. The simulation results show that the undulating fin robot reaches a maximum thrust of 0.9 N with a propulsive efficiency of 12.48 %, which is higher than that of traditional reinforcement learning methods.</p></div>","PeriodicalId":48609,"journal":{"name":"Engineering Science and Technology-An International Journal-Jestech","volume":"56 ","pages":"Article 101783"},"PeriodicalIF":5.1000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2215098624001691/pdfft?md5=0eefdece4f42d090ad74fa2ac96cee63&pid=1-s2.0-S2215098624001691-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Optimization of swimming mode for elongated undulating fin using multi-agent deep deterministic policy gradient\",\"authors\":\"Quoc Tuan Vu , Van Tu Duong , Huy Hung Nguyen , Tan Tien Nguyen\",\"doi\":\"10.1016/j.jestch.2024.101783\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Optimizing speed and propulsive efficiency are the most crucial survival skills for biomimetic robots. This paper investigates a swimming mode controller inspired by the black Knifefish to govern the fast-swimming gait with high propulsive efficiency for an elongated undulating fin robot. The proposed swimming mode controller is composed of a couple of Hopf oscillator-based central pattern generators (CPG) to generate the moving gait of robotic fish and a novel variant of Reinforcement Learning (RL) known as Multi-Agent Deep Deterministic Policy Gradient (MA-DDPG) for optimizing the propulsive efficiency. The proposed swimming controller facilitates the autonomous optimization of the oscillatory amplitude of the robotic fish to improve its propulsive efficiency. The proposed MA-DDPG demonstrates an aptitude for functioning within mixed cooperative-competitive environments. Furthermore, it effectively mitigates the drawback of zero amplitude in the updating process of conventional reinforcement learning (RL) methodologies. These findings highlight the potential utility of the MA-DDPG in optimizing the performance of multi-agent systems in dynamic, real-world scenarios. The simulation results show that the undulating fin robot reaches a maximum thrust of 0.9 N with a propulsive efficiency of 12.48 %, which is higher than that of traditional reinforcement learning methods.</p></div>\",\"PeriodicalId\":48609,\"journal\":{\"name\":\"Engineering Science and Technology-An International Journal-Jestech\",\"volume\":\"56 \",\"pages\":\"Article 101783\"},\"PeriodicalIF\":5.1000,\"publicationDate\":\"2024-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2215098624001691/pdfft?md5=0eefdece4f42d090ad74fa2ac96cee63&pid=1-s2.0-S2215098624001691-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Science and Technology-An International Journal-Jestech\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2215098624001691\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Science and Technology-An International Journal-Jestech","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2215098624001691","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
Optimization of swimming mode for elongated undulating fin using multi-agent deep deterministic policy gradient
Optimizing speed and propulsive efficiency are the most crucial survival skills for biomimetic robots. This paper investigates a swimming mode controller inspired by the black Knifefish to govern the fast-swimming gait with high propulsive efficiency for an elongated undulating fin robot. The proposed swimming mode controller is composed of a couple of Hopf oscillator-based central pattern generators (CPG) to generate the moving gait of robotic fish and a novel variant of Reinforcement Learning (RL) known as Multi-Agent Deep Deterministic Policy Gradient (MA-DDPG) for optimizing the propulsive efficiency. The proposed swimming controller facilitates the autonomous optimization of the oscillatory amplitude of the robotic fish to improve its propulsive efficiency. The proposed MA-DDPG demonstrates an aptitude for functioning within mixed cooperative-competitive environments. Furthermore, it effectively mitigates the drawback of zero amplitude in the updating process of conventional reinforcement learning (RL) methodologies. These findings highlight the potential utility of the MA-DDPG in optimizing the performance of multi-agent systems in dynamic, real-world scenarios. The simulation results show that the undulating fin robot reaches a maximum thrust of 0.9 N with a propulsive efficiency of 12.48 %, which is higher than that of traditional reinforcement learning methods.
期刊介绍:
Engineering Science and Technology, an International Journal (JESTECH) (formerly Technology), a peer-reviewed quarterly engineering journal, publishes both theoretical and experimental high quality papers of permanent interest, not previously published in journals, in the field of engineering and applied science which aims to promote the theory and practice of technology and engineering. In addition to peer-reviewed original research papers, the Editorial Board welcomes original research reports, state-of-the-art reviews and communications in the broadly defined field of engineering science and technology.
The scope of JESTECH includes a wide spectrum of subjects including:
-Electrical/Electronics and Computer Engineering (Biomedical Engineering and Instrumentation; Coding, Cryptography, and Information Protection; Communications, Networks, Mobile Computing and Distributed Systems; Compilers and Operating Systems; Computer Architecture, Parallel Processing, and Dependability; Computer Vision and Robotics; Control Theory; Electromagnetic Waves, Microwave Techniques and Antennas; Embedded Systems; Integrated Circuits, VLSI Design, Testing, and CAD; Microelectromechanical Systems; Microelectronics, and Electronic Devices and Circuits; Power, Energy and Energy Conversion Systems; Signal, Image, and Speech Processing)
-Mechanical and Civil Engineering (Automotive Technologies; Biomechanics; Construction Materials; Design and Manufacturing; Dynamics and Control; Energy Generation, Utilization, Conversion, and Storage; Fluid Mechanics and Hydraulics; Heat and Mass Transfer; Micro-Nano Sciences; Renewable and Sustainable Energy Technologies; Robotics and Mechatronics; Solid Mechanics and Structure; Thermal Sciences)
-Metallurgical and Materials Engineering (Advanced Materials Science; Biomaterials; Ceramic and Inorgnanic Materials; Electronic-Magnetic Materials; Energy and Environment; Materials Characterizastion; Metallurgy; Polymers and Nanocomposites)