探索软约束整合对基于强化学习的自主船舶导航性能的影响:实验启示

IF 2.3 3区 工程技术 Q2 ENGINEERING, MARINE International Journal of Naval Architecture and Ocean Engineering Pub Date : 2024-01-01 DOI:10.1016/j.ijnaoe.2024.100609
Xin Jiang , Jiawen Li , Zhenkai Huang , Ji Huang , Ronghui Li
{"title":"探索软约束整合对基于强化学习的自主船舶导航性能的影响:实验启示","authors":"Xin Jiang ,&nbsp;Jiawen Li ,&nbsp;Zhenkai Huang ,&nbsp;Ji Huang ,&nbsp;Ronghui Li","doi":"10.1016/j.ijnaoe.2024.100609","DOIUrl":null,"url":null,"abstract":"<div><p>Reinforcement learning has shown promise in enabling autonomous ship navigation, allowing vessels to adapt and make informed decisions in complex marine environments. However, the integration of soft constraints and their impact on performance in RL-based autonomous vessel navigation research remain understudied. This research addresses this gap by investigating the implications of soft constraints in the context of the risk-averse ship navigation problem. Four distinct soft constraint functions are proposed, which are integrated with two widely used RL algorithms, resulting in the creation of eight risk-averse autonomous vessel navigation models. To ensure a comprehensive evaluation of their performance, comparative analyses are conducted across seven virtual digital channel environments. Additionally, a novel metric, known as Large Helm Momentum (LHM), is introduced to quantify the smoothness of autonomous vessel navigation. Through thorough experimentation, key considerations for the design of soft constraint functions in the domain of autonomous ship navigation are identified. A comprehensive understanding of how different soft constraint functions influence autonomous driving behavior has been achieved. Key considerations for designing soft constraint functions in the domain of autonomous ship navigation have also been identified. Five principles, namely the constraint association principle, dominance of hard constraints, reward-balance principle, mapping requirement principle, and iterative improvement principle, are proposed to optimize the design of soft constraint functions for autonomous ship navigation, providing valuable guidance and insights.</p></div>","PeriodicalId":14160,"journal":{"name":"International Journal of Naval Architecture and Ocean Engineering","volume":"16 ","pages":"Article 100609"},"PeriodicalIF":2.3000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2092678224000281/pdfft?md5=88e268ac351e4b2b9b7c1582b3e4b23b&pid=1-s2.0-S2092678224000281-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Exploring the performance impact of soft constraint integration on reinforcement learning-based autonomous vessel navigation: Experimental insights\",\"authors\":\"Xin Jiang ,&nbsp;Jiawen Li ,&nbsp;Zhenkai Huang ,&nbsp;Ji Huang ,&nbsp;Ronghui Li\",\"doi\":\"10.1016/j.ijnaoe.2024.100609\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Reinforcement learning has shown promise in enabling autonomous ship navigation, allowing vessels to adapt and make informed decisions in complex marine environments. However, the integration of soft constraints and their impact on performance in RL-based autonomous vessel navigation research remain understudied. This research addresses this gap by investigating the implications of soft constraints in the context of the risk-averse ship navigation problem. Four distinct soft constraint functions are proposed, which are integrated with two widely used RL algorithms, resulting in the creation of eight risk-averse autonomous vessel navigation models. To ensure a comprehensive evaluation of their performance, comparative analyses are conducted across seven virtual digital channel environments. Additionally, a novel metric, known as Large Helm Momentum (LHM), is introduced to quantify the smoothness of autonomous vessel navigation. Through thorough experimentation, key considerations for the design of soft constraint functions in the domain of autonomous ship navigation are identified. A comprehensive understanding of how different soft constraint functions influence autonomous driving behavior has been achieved. Key considerations for designing soft constraint functions in the domain of autonomous ship navigation have also been identified. Five principles, namely the constraint association principle, dominance of hard constraints, reward-balance principle, mapping requirement principle, and iterative improvement principle, are proposed to optimize the design of soft constraint functions for autonomous ship navigation, providing valuable guidance and insights.</p></div>\",\"PeriodicalId\":14160,\"journal\":{\"name\":\"International Journal of Naval Architecture and Ocean Engineering\",\"volume\":\"16 \",\"pages\":\"Article 100609\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2092678224000281/pdfft?md5=88e268ac351e4b2b9b7c1582b3e4b23b&pid=1-s2.0-S2092678224000281-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Naval Architecture and Ocean Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2092678224000281\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, MARINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Naval Architecture and Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2092678224000281","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MARINE","Score":null,"Total":0}
引用次数: 0

摘要

强化学习在实现船舶自主导航方面大有可为,它使船舶能够在复杂的海洋环境中适应环境并做出明智的决策。然而,在基于 RL 的自主船舶导航研究中,软约束的整合及其对性能的影响仍未得到充分研究。本研究针对这一空白,研究了软约束在规避风险的船舶导航问题中的影响。研究提出了四种不同的软约束函数,并将其与两种广泛使用的 RL 算法相结合,最终创建了八个规避风险的自主船舶导航模型。为确保对这些模型的性能进行全面评估,在七个虚拟数字航道环境中进行了比较分析。此外,还引入了一种称为大舵机动量(LHM)的新指标,用于量化自主船舶导航的平稳性。通过全面的实验,确定了自主船舶导航领域软约束函数设计的关键考虑因素。对不同软约束函数如何影响自主驾驶行为有了全面的了解。此外,还确定了在船舶自主导航领域设计软约束函数的关键考虑因素。提出了五个原则,即约束关联原则、硬约束主导原则、奖励平衡原则、映射要求原则和迭代改进原则,为优化船舶自主导航软约束函数的设计提供了有价值的指导和启示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Exploring the performance impact of soft constraint integration on reinforcement learning-based autonomous vessel navigation: Experimental insights

Reinforcement learning has shown promise in enabling autonomous ship navigation, allowing vessels to adapt and make informed decisions in complex marine environments. However, the integration of soft constraints and their impact on performance in RL-based autonomous vessel navigation research remain understudied. This research addresses this gap by investigating the implications of soft constraints in the context of the risk-averse ship navigation problem. Four distinct soft constraint functions are proposed, which are integrated with two widely used RL algorithms, resulting in the creation of eight risk-averse autonomous vessel navigation models. To ensure a comprehensive evaluation of their performance, comparative analyses are conducted across seven virtual digital channel environments. Additionally, a novel metric, known as Large Helm Momentum (LHM), is introduced to quantify the smoothness of autonomous vessel navigation. Through thorough experimentation, key considerations for the design of soft constraint functions in the domain of autonomous ship navigation are identified. A comprehensive understanding of how different soft constraint functions influence autonomous driving behavior has been achieved. Key considerations for designing soft constraint functions in the domain of autonomous ship navigation have also been identified. Five principles, namely the constraint association principle, dominance of hard constraints, reward-balance principle, mapping requirement principle, and iterative improvement principle, are proposed to optimize the design of soft constraint functions for autonomous ship navigation, providing valuable guidance and insights.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
4.90
自引率
4.50%
发文量
62
审稿时长
12 months
期刊介绍: International Journal of Naval Architecture and Ocean Engineering provides a forum for engineers and scientists from a wide range of disciplines to present and discuss various phenomena in the utilization and preservation of ocean environment. Without being limited by the traditional categorization, it is encouraged to present advanced technology development and scientific research, as long as they are aimed for more and better human engagement with ocean environment. Topics include, but not limited to: marine hydrodynamics; structural mechanics; marine propulsion system; design methodology & practice; production technology; system dynamics & control; marine equipment technology; materials science; underwater acoustics; ocean remote sensing; and information technology related to ship and marine systems; ocean energy systems; marine environmental engineering; maritime safety engineering; polar & arctic engineering; coastal & port engineering; subsea engineering; and specialized watercraft engineering.
期刊最新文献
A fundamental study on structural strength assessment of U-bolts for expanded application to shipbuilding and offshore piping systems A numerical study on the feasibility of predicting the resistance of a full-scale ship using a virtual fluid A novel formula for predicting the ultimate compressive strength of the cylindrically curved plates A numerical study of added resistance performance and hydrodynamics of KCS hull in oblique regular waves and estimation of resistance in short-crested irregular waves through spectral method Evaluation of subgrid scale models in turbulent large eddy simulations of pumpjet propulsor
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1