YOLO-MRS:基于深度学习的高效无人水面飞行器海洋物体检测方法

IF 4.3 2区 工程技术 Q1 ENGINEERING, OCEAN Applied Ocean Research Pub Date : 2024-09-28 DOI:10.1016/j.apor.2024.104240
{"title":"YOLO-MRS:基于深度学习的高效无人水面飞行器海洋物体检测方法","authors":"","doi":"10.1016/j.apor.2024.104240","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning-based object detection for an unmanned surface vehicle (USV) is an important way of visual perception. However, current methods perform poorly when performing complex maritime object detection tasks. It also lacks available datasets of complex maritime objects for visual perception system of USVs. In order to solve these problems, we propose an improved maritime object detection method, called YOLO-MRS, based on lightweight YOLOv8 model in this paper. Specifically, we introduce a multi-scale cross-axis attention (MCA) mechanism into the backbone network of the model to establish long-distance dependencies between pixels to capture global feature information. In addition, we introduce Simplified Spatial Pyramid Pooling-Fast (SimSPPF) to the backbone to enhance prediction accuracy. Also, considering computational efficiency, we replace the ordinary convolutional layers in the backbone network and neck network with refocused convolutional (RefConv) layers to reduce model parameters. Especially, we construct a maritime object detection dataset, termed MODD-13, which contains over 9000 precisely annotated images. The proposed MODD-13 sufficiently considers the characteristics of object categories (13 types), image diversity, sample independence, and background confusion, and can be used as a benchmark dataset for maritime object detection. The final experimental results show that compared with the representative YOLO series models, YOLO-MRS improves the average mAP50 accuracy by 1.8%–7% and mAP50-95 by 1.1%–11.5%, and effectively balances detection accuracy and computational efficiency, thereby effectively achieving fast and accurate detection of maritime objects.</div></div>","PeriodicalId":8261,"journal":{"name":"Applied Ocean Research","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2024-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"YOLO-MRS: An efficient deep learning-based maritime object detection method for unmanned surface vehicles\",\"authors\":\"\",\"doi\":\"10.1016/j.apor.2024.104240\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Deep learning-based object detection for an unmanned surface vehicle (USV) is an important way of visual perception. However, current methods perform poorly when performing complex maritime object detection tasks. It also lacks available datasets of complex maritime objects for visual perception system of USVs. In order to solve these problems, we propose an improved maritime object detection method, called YOLO-MRS, based on lightweight YOLOv8 model in this paper. Specifically, we introduce a multi-scale cross-axis attention (MCA) mechanism into the backbone network of the model to establish long-distance dependencies between pixels to capture global feature information. In addition, we introduce Simplified Spatial Pyramid Pooling-Fast (SimSPPF) to the backbone to enhance prediction accuracy. Also, considering computational efficiency, we replace the ordinary convolutional layers in the backbone network and neck network with refocused convolutional (RefConv) layers to reduce model parameters. Especially, we construct a maritime object detection dataset, termed MODD-13, which contains over 9000 precisely annotated images. The proposed MODD-13 sufficiently considers the characteristics of object categories (13 types), image diversity, sample independence, and background confusion, and can be used as a benchmark dataset for maritime object detection. The final experimental results show that compared with the representative YOLO series models, YOLO-MRS improves the average mAP50 accuracy by 1.8%–7% and mAP50-95 by 1.1%–11.5%, and effectively balances detection accuracy and computational efficiency, thereby effectively achieving fast and accurate detection of maritime objects.</div></div>\",\"PeriodicalId\":8261,\"journal\":{\"name\":\"Applied Ocean Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-09-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Ocean Research\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0141118724003614\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, OCEAN\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Ocean Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141118724003614","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, OCEAN","Score":null,"Total":0}
引用次数: 0

摘要

基于深度学习的无人水面航行器(USV)目标检测是视觉感知的一种重要方式。然而,目前的方法在执行复杂的海上物体检测任务时表现不佳。同时,USV 的视觉感知系统也缺乏可用的复杂海洋物体数据集。为了解决这些问题,我们在本文中提出了一种基于轻量级 YOLOv8 模型的改进型海洋物体检测方法,称为 YOLO-MRS。具体来说,我们在模型的骨干网络中引入了多尺度跨轴关注(MCA)机制,以建立像素间的远距离依赖关系,从而捕捉全局特征信息。此外,我们还在骨干网络中引入了简化空间金字塔池化-快速(SimSPPF)机制,以提高预测精度。同时,考虑到计算效率,我们将骨干网络和颈部网络中的普通卷积层替换为重新聚焦卷积层(RefConv),以减少模型参数。特别是,我们构建了一个包含 9000 多张精确注释图像的海洋物体检测数据集,称为 MODD-13。提出的 MODD-13 充分考虑了物体类别(13 种)、图像多样性、样本独立性和背景混淆等特点,可作为海上物体检测的基准数据集。最终实验结果表明,与具有代表性的 YOLO 系列模型相比,YOLO-MRS 的 mAP50 平均精度提高了 1.8%-7%,mAP50-95 平均精度提高了 1.1%-11.5%,有效平衡了检测精度和计算效率,从而有效实现了对海洋物体的快速准确检测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
YOLO-MRS: An efficient deep learning-based maritime object detection method for unmanned surface vehicles
Deep learning-based object detection for an unmanned surface vehicle (USV) is an important way of visual perception. However, current methods perform poorly when performing complex maritime object detection tasks. It also lacks available datasets of complex maritime objects for visual perception system of USVs. In order to solve these problems, we propose an improved maritime object detection method, called YOLO-MRS, based on lightweight YOLOv8 model in this paper. Specifically, we introduce a multi-scale cross-axis attention (MCA) mechanism into the backbone network of the model to establish long-distance dependencies between pixels to capture global feature information. In addition, we introduce Simplified Spatial Pyramid Pooling-Fast (SimSPPF) to the backbone to enhance prediction accuracy. Also, considering computational efficiency, we replace the ordinary convolutional layers in the backbone network and neck network with refocused convolutional (RefConv) layers to reduce model parameters. Especially, we construct a maritime object detection dataset, termed MODD-13, which contains over 9000 precisely annotated images. The proposed MODD-13 sufficiently considers the characteristics of object categories (13 types), image diversity, sample independence, and background confusion, and can be used as a benchmark dataset for maritime object detection. The final experimental results show that compared with the representative YOLO series models, YOLO-MRS improves the average mAP50 accuracy by 1.8%–7% and mAP50-95 by 1.1%–11.5%, and effectively balances detection accuracy and computational efficiency, thereby effectively achieving fast and accurate detection of maritime objects.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Applied Ocean Research
Applied Ocean Research 地学-工程:大洋
CiteScore
8.70
自引率
7.00%
发文量
316
审稿时长
59 days
期刊介绍: The aim of Applied Ocean Research is to encourage the submission of papers that advance the state of knowledge in a range of topics relevant to ocean engineering.
期刊最新文献
Investigation of morphodynamic response to the storm-induced currents and waves in the Bay of Bengal Wave Energy Potential and the Role of Extreme Events on South America's Coasts. A Regional Frequency Analysis Evaluation, sampling and testing methods for offshore disturbed sands with plastic fines: A case study Dynamic response of three different floating platform (OC4, BSS, GVA) using multi-segment mooring system Numerical study of underwater acoustic radiation and propagation induced by structural vibration in ocean environments using FEM-BMSBM
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1