Autonomous unmanned surface vehicle docking using large language model guide reinforcement learning

IF 5.5 2区 工程技术 Q1 ENGINEERING, CIVIL Ocean Engineering Pub Date : 2025-04-15 Epub Date: 2025-02-13 DOI:10.1016/j.oceaneng.2025.120608
Chenhang Xu , Yijie Chu , Qizhong Gao , Ziniu Wu , Jia Wang , Yong Yue , Wojtczak Dominik , Xiaohui Zhu
{"title":"Autonomous unmanned surface vehicle docking using large language model guide reinforcement learning","authors":"Chenhang Xu ,&nbsp;Yijie Chu ,&nbsp;Qizhong Gao ,&nbsp;Ziniu Wu ,&nbsp;Jia Wang ,&nbsp;Yong Yue ,&nbsp;Wojtczak Dominik ,&nbsp;Xiaohui Zhu","doi":"10.1016/j.oceaneng.2025.120608","DOIUrl":null,"url":null,"abstract":"<div><div>Autonomous docking of unmanned surface vehicles (USVs) represents the critical ”last mile” of intelligent navigation, presenting two main challenges: traditional control methods lack robustness in dynamic environments with disturbances such as wind and currents, while reinforcement learning (RL) methods suffer from low efficiency and often fail to transfer effectively from simulation to real-world applications. To tackle these issues, we propose LLM4SAC, a novel algorithm that integrates Large Language Models (LLMs) with the Soft Actor–Critic (SAC) framework to achieve USV autonomous docking tasks. LLM4SAC addresses these issues by leveraging the advanced contextual understanding and adaptive decision-making capabilities of LLMs. By providing high-level, context-specific guidance, LLMs enhance the RL agent’s ability to interpret complex environmental data and adjust strategies in real time. This reduces the reliance on extensive simulated training datasets and increases the robustness and accuracy of the system under actual operating conditions. The dynamic request policy further refines the system’s efficiency, querying LLMs only when necessary to minimize computational demands and interaction costs. Experiments in both simulation and real-world environments show that LLM4SAC significantly improves docking success rates, reduces computational costs, and enhances adaptability to dynamic conditions. Full implementation and resources are available on GitHub: <span><span>https://github.com/RyanXu0428/LLM4SAC</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"323 ","pages":"Article 120608"},"PeriodicalIF":5.5000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801825003233","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/13 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

Abstract

Autonomous docking of unmanned surface vehicles (USVs) represents the critical ”last mile” of intelligent navigation, presenting two main challenges: traditional control methods lack robustness in dynamic environments with disturbances such as wind and currents, while reinforcement learning (RL) methods suffer from low efficiency and often fail to transfer effectively from simulation to real-world applications. To tackle these issues, we propose LLM4SAC, a novel algorithm that integrates Large Language Models (LLMs) with the Soft Actor–Critic (SAC) framework to achieve USV autonomous docking tasks. LLM4SAC addresses these issues by leveraging the advanced contextual understanding and adaptive decision-making capabilities of LLMs. By providing high-level, context-specific guidance, LLMs enhance the RL agent’s ability to interpret complex environmental data and adjust strategies in real time. This reduces the reliance on extensive simulated training datasets and increases the robustness and accuracy of the system under actual operating conditions. The dynamic request policy further refines the system’s efficiency, querying LLMs only when necessary to minimize computational demands and interaction costs. Experiments in both simulation and real-world environments show that LLM4SAC significantly improves docking success rates, reduces computational costs, and enhances adaptability to dynamic conditions. Full implementation and resources are available on GitHub: https://github.com/RyanXu0428/LLM4SAC.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
自主无人水面飞行器对接采用大语言模型引导强化学习
无人水面车辆(usv)的自主对接代表了智能导航的关键“最后一英里”,提出了两个主要挑战:传统的控制方法在具有风和电流等干扰的动态环境中缺乏鲁棒性,而强化学习(RL)方法效率低,并且经常无法有效地从模拟转移到实际应用。为了解决这些问题,我们提出了LLM4SAC,这是一种将大型语言模型(llm)与软Actor-Critic (SAC)框架集成在一起的新算法,以实现USV自主对接任务。LLM4SAC通过利用llm的高级上下文理解和自适应决策能力来解决这些问题。通过提供高层次的、特定于情境的指导,llm增强了RL代理解释复杂环境数据和实时调整策略的能力。这减少了对大量模拟训练数据集的依赖,提高了系统在实际操作条件下的鲁棒性和准确性。动态请求策略进一步优化了系统的效率,仅在必要时查询llm,以最小化计算需求和交互成本。仿真和现实环境的实验表明,LLM4SAC显著提高了对接成功率,降低了计算成本,增强了对动态条件的适应性。完整的实现和资源可在GitHub: https://github.com/RyanXu0428/LLM4SAC。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Ocean Engineering
Ocean Engineering 工程技术-工程:大洋
CiteScore
7.30
自引率
34.00%
发文量
2379
审稿时长
8.1 months
期刊介绍: Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.
期刊最新文献
Effect of flow conditions on solid-liquid two-phase flow characteristics in the multistage deep-sea mining pump Performance enhancement of a vertical-axis wind turbine using an optimised trailing-edge wedge flap: CFD and experimental investigation Experimental investigation on the dynamic response of approach bridge and wharf structures under vessel collision Design and lifetime cost optimization of ship energy systems including weather-driven speed profile variability Data-driven fault diagnosis for autonomous underwater vehicles using lag-aware spatio-temporal dynamic graph networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1