{"title":"An efficient detector for maritime search and rescue object based on unmanned aerial vehicle images","authors":"Wanxuan Geng , Junfan Yi , Liang Cheng","doi":"10.1016/j.displa.2025.102994","DOIUrl":null,"url":null,"abstract":"<div><div>Unmanned aerial vehicle (UAV) remote sensing has the advantages of responsive and high image resolution, which can better serve the object detection of maritime search and rescue (SAR). However, there are still some obstacles in maritime SAR object detection based on UAV images, due to the lack of samples for training and the complexity background of the maritime images. In this study, we build a maritime search and rescue target dataset (MSRTD) based on UAV images and further propose an efficient multi-category detector named Maritime Search and Rescue-You Only Look Once network (MSR-YOLO). To eliminate the influence of objects scale and shooting angle, we introduce the deformable convolution network (DCN) to modules in backbone. The Coordinated Attention (CA) is added to the neck of network to extract the powerful features. We replace the original detection head with decoupled detection head to better complete the task of object recognition and localization. Finally, we use Wise-Intersection over Union loss (WIoU) during the training to reduce the influence of the samples quality and help model converges rapidly. The experiments on MSRTD confirm that the proposed MSR-YOLO achieves precision, recall, and mean average precision (mAP) (0.5) of 90.00%, 68.52%, and 79.98% respectively. Compared with other methods on public dataset, ours also performs well and provides an effective detector model for maritime SAR object detection based on UAV images.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"87 ","pages":"Article 102994"},"PeriodicalIF":3.4000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225000319","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/9 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Unmanned aerial vehicle (UAV) remote sensing has the advantages of responsive and high image resolution, which can better serve the object detection of maritime search and rescue (SAR). However, there are still some obstacles in maritime SAR object detection based on UAV images, due to the lack of samples for training and the complexity background of the maritime images. In this study, we build a maritime search and rescue target dataset (MSRTD) based on UAV images and further propose an efficient multi-category detector named Maritime Search and Rescue-You Only Look Once network (MSR-YOLO). To eliminate the influence of objects scale and shooting angle, we introduce the deformable convolution network (DCN) to modules in backbone. The Coordinated Attention (CA) is added to the neck of network to extract the powerful features. We replace the original detection head with decoupled detection head to better complete the task of object recognition and localization. Finally, we use Wise-Intersection over Union loss (WIoU) during the training to reduce the influence of the samples quality and help model converges rapidly. The experiments on MSRTD confirm that the proposed MSR-YOLO achieves precision, recall, and mean average precision (mAP) (0.5) of 90.00%, 68.52%, and 79.98% respectively. Compared with other methods on public dataset, ours also performs well and provides an effective detector model for maritime SAR object detection based on UAV images.
无人机遥感具有响应快、图像分辨率高的优点,可以更好地为海上搜救(SAR)的目标检测服务。然而,由于缺乏训练样本和海洋图像背景的复杂性,基于无人机图像的海上SAR目标检测仍然存在一些障碍。在本研究中,我们建立了基于无人机图像的海上搜救目标数据集(MSRTD),并进一步提出了一种高效的多类别探测器——海上搜救-你只看一次网络(MSR-YOLO)。为了消除目标尺度和拍摄角度的影响,我们在主干模块中引入了可变形卷积网络(DCN)。在网络颈部加入协调注意(CA)来提取强大的特征。我们用解耦的检测头代替原来的检测头,更好地完成了目标识别和定位的任务。最后,我们在训练过程中使用智能交叉优于联合损失(Wise-Intersection over Union loss, WIoU)来减少样本质量的影响,帮助模型快速收敛。在MSRTD上的实验证实,该方法的精密度、召回率和平均精密度(mAP)(0.5)分别达到90.00%、68.52%和79.98%。与其他公共数据集上的方法相比,我们的方法也表现良好,为基于无人机图像的海上SAR目标检测提供了一种有效的检测器模型。
期刊介绍:
Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface.
Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.