Autonomous underwater vehicle link alignment control in unknown environments using reinforcement learning

IF 5.2 2区计算机科学 Q2 ROBOTICS Journal of Field Robotics Pub Date : 2024-04-23 DOI:10.1002/rob.22348

Yang Weng, Sehwa Chun, Masaki Ohashi, Takumi Matsuda, Yuki Sekimori, Joni Pajarinen, Jan Peters, Toshihiro Maki

{"title":"Autonomous underwater vehicle link alignment control in unknown environments using reinforcement learning","authors":"Yang Weng, Sehwa Chun, Masaki Ohashi, Takumi Matsuda, Yuki Sekimori, Joni Pajarinen, Jan Peters, Toshihiro Maki","doi":"10.1002/rob.22348","DOIUrl":null,"url":null,"abstract":"<p>High-speed underwater wireless optical communication holds immense promise in ocean monitoring and surveys, providing crucial support for the real-time sharing of observational data collected by autonomous underwater vehicles (AUVs). However, due to inaccurate target information and external interference in unknown environments, link alignment is challenging and needs to be addressed. In response to these challenges, we propose a reinforcement learning-based alignment method to control the AUV to establish an optical link and maintain alignment. Our alignment control system utilizes a combination of sensors, including a depth sensor, Doppler velocity log (DVL), gyroscope, ultra-short baseline device, and acoustic modem. These sensors are used in conjunction with a particle filter to observe the environment and estimate the AUV's state accurately. The soft actor-critic algorithm is used to train a reinforcement learning-based controller in a simulated environment to reduce pointing errors and energy consumption in alignment. After experimental validation in simulation, we deployed the controller on an actual AUV called Tri-TON. In experiments at sea, Tri-TON maintained the link and angular pointing errors within 1 m and <span></span><math>\n <semantics>\n <mrow>\n \n <mrow>\n <mn>1</mn>\n \n <msup>\n <mn>0</mn>\n \n <mo>∘</mo>\n </msup>\n </mrow>\n </mrow>\n <annotation> $1{0}^{\\circ }$</annotation>\n </semantics></math>, respectively. Experimental results demonstrate that the proposed alignment control method can establish underwater optical communication between AUV fleets, thus improving the efficiency of marine surveys.</p>","PeriodicalId":192,"journal":{"name":"Journal of Field Robotics","volume":"41 6","pages":"1724-1743"},"PeriodicalIF":5.2000,"publicationDate":"2024-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/rob.22348","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Field Robotics","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/rob.22348","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}

引用次数: 0

Abstract

High-speed underwater wireless optical communication holds immense promise in ocean monitoring and surveys, providing crucial support for the real-time sharing of observational data collected by autonomous underwater vehicles (AUVs). However, due to inaccurate target information and external interference in unknown environments, link alignment is challenging and needs to be addressed. In response to these challenges, we propose a reinforcement learning-based alignment method to control the AUV to establish an optical link and maintain alignment. Our alignment control system utilizes a combination of sensors, including a depth sensor, Doppler velocity log (DVL), gyroscope, ultra-short baseline device, and acoustic modem. These sensors are used in conjunction with a particle filter to observe the environment and estimate the AUV's state accurately. The soft actor-critic algorithm is used to train a reinforcement learning-based controller in a simulated environment to reduce pointing errors and energy consumption in alignment. After experimental validation in simulation, we deployed the controller on an actual AUV called Tri-TON. In experiments at sea, Tri-TON maintained the link and angular pointing errors within 1 m and $1 0^{\circ}$ , respectively. Experimental results demonstrate that the proposed alignment control method can establish underwater optical communication between AUV fleets, thus improving the efficiency of marine surveys.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用强化学习在未知环境中进行自主水下航行器链接排列控制

高速水下无线光通信在海洋监测和勘测方面前景广阔，为实时共享自主潜水器（AUV）收集的观测数据提供了重要支持。然而，在未知环境中，由于目标信息不准确和外部干扰，链路对准具有挑战性，亟待解决。针对这些挑战，我们提出了一种基于强化学习的对准方法，以控制自动潜航器建立光链路并保持对准。我们的对准控制系统综合利用了多种传感器，包括深度传感器、多普勒速度记录仪（DVL）、陀螺仪、超短基线装置和声学调制解调器。这些传感器与粒子滤波器结合使用，可观测环境并准确估计 AUV 的状态。软演员批评算法用于在模拟环境中训练基于强化学习的控制器，以减少对准过程中的指向误差和能耗。经过模拟实验验证后，我们在名为 Tri-TON 的实际 AUV 上部署了控制器。在海上实验中，Tri-TON 的链路和角度指向误差分别保持在 1 米和，以内。实验结果表明，所提出的对准控制方法可以在 AUV 船队之间建立水下光通信，从而提高海洋勘测的效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Field Robotics 工程技术-机器人学

CiteScore

15.00

自引率

3.60%

发文量

审稿时长

6 months

期刊介绍： The Journal of Field Robotics seeks to promote scholarly publications dealing with the fundamentals of robotics in unstructured and dynamic environments. The Journal focuses on experimental robotics and encourages publication of work that has both theoretical and practical significance.

期刊最新文献

Issue Information Issue Information Issue Information Cover Image, Volume 42, Number 8, December 2025 Cover Image, Volume 42, Number 7, October 2025