Autonomous unmanned surface vehicle docking using large language model guide reinforcement learning

IF 4.6 2区工程技术 Q1 ENGINEERING, CIVIL Ocean Engineering Pub Date : 2025-02-13 DOI:10.1016/j.oceaneng.2025.120608

Chenhang Xu , Yijie Chu , Qizhong Gao , Ziniu Wu , Jia Wang , Yong Yue , Wojtczak Dominik , Xiaohui Zhu

{"title":"Autonomous unmanned surface vehicle docking using large language model guide reinforcement learning","authors":"Chenhang Xu , Yijie Chu , Qizhong Gao , Ziniu Wu , Jia Wang , Yong Yue , Wojtczak Dominik , Xiaohui Zhu","doi":"10.1016/j.oceaneng.2025.120608","DOIUrl":null,"url":null,"abstract":"<div><div>Autonomous docking of unmanned surface vehicles (USVs) represents the critical ”last mile” of intelligent navigation, presenting two main challenges: traditional control methods lack robustness in dynamic environments with disturbances such as wind and currents, while reinforcement learning (RL) methods suffer from low efficiency and often fail to transfer effectively from simulation to real-world applications. To tackle these issues, we propose LLM4SAC, a novel algorithm that integrates Large Language Models (LLMs) with the Soft Actor–Critic (SAC) framework to achieve USV autonomous docking tasks. LLM4SAC addresses these issues by leveraging the advanced contextual understanding and adaptive decision-making capabilities of LLMs. By providing high-level, context-specific guidance, LLMs enhance the RL agent’s ability to interpret complex environmental data and adjust strategies in real time. This reduces the reliance on extensive simulated training datasets and increases the robustness and accuracy of the system under actual operating conditions. The dynamic request policy further refines the system’s efficiency, querying LLMs only when necessary to minimize computational demands and interaction costs. Experiments in both simulation and real-world environments show that LLM4SAC significantly improves docking success rates, reduces computational costs, and enhances adaptability to dynamic conditions. Full implementation and resources are available on GitHub: <span><span>https://github.com/RyanXu0428/LLM4SAC</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"323 ","pages":"Article 120608"},"PeriodicalIF":4.6000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801825003233","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}

引用次数: 0

Abstract

Autonomous docking of unmanned surface vehicles (USVs) represents the critical ”last mile” of intelligent navigation, presenting two main challenges: traditional control methods lack robustness in dynamic environments with disturbances such as wind and currents, while reinforcement learning (RL) methods suffer from low efficiency and often fail to transfer effectively from simulation to real-world applications. To tackle these issues, we propose LLM4SAC, a novel algorithm that integrates Large Language Models (LLMs) with the Soft Actor–Critic (SAC) framework to achieve USV autonomous docking tasks. LLM4SAC addresses these issues by leveraging the advanced contextual understanding and adaptive decision-making capabilities of LLMs. By providing high-level, context-specific guidance, LLMs enhance the RL agent’s ability to interpret complex environmental data and adjust strategies in real time. This reduces the reliance on extensive simulated training datasets and increases the robustness and accuracy of the system under actual operating conditions. The dynamic request policy further refines the system’s efficiency, querying LLMs only when necessary to minimize computational demands and interaction costs. Experiments in both simulation and real-world environments show that LLM4SAC significantly improves docking success rates, reduces computational costs, and enhances adaptability to dynamic conditions. Full implementation and resources are available on GitHub: https://github.com/RyanXu0428/LLM4SAC.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

求助全文

约1分钟内获得全文去求助

来源期刊

Ocean Engineering 工程技术-工程：大洋

CiteScore

7.30

自引率

34.00%

发文量

2379

审稿时长

8.1 months

期刊介绍： Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.