{"title":"An efficient detector for maritime search and rescue object based on unmanned aerial vehicle images","authors":"Wanxuan Geng , Junfan Yi , Liang Cheng","doi":"10.1016/j.displa.2025.102994","DOIUrl":null,"url":null,"abstract":"<div><div>Unmanned aerial vehicle (UAV) remote sensing has the advantages of responsive and high image resolution, which can better serve the object detection of maritime search and rescue (SAR). However, there are still some obstacles in maritime SAR object detection based on UAV images, due to the lack of samples for training and the complexity background of the maritime images. In this study, we build a maritime search and rescue target dataset (MSRTD) based on UAV images and further propose an efficient multi-category detector named Maritime Search and Rescue-You Only Look Once network (MSR-YOLO). To eliminate the influence of objects scale and shooting angle, we introduce the deformable convolution network (DCN) to modules in backbone. The Coordinated Attention (CA) is added to the neck of network to extract the powerful features. We replace the original detection head with decoupled detection head to better complete the task of object recognition and localization. Finally, we use Wise-Intersection over Union loss (WIoU) during the training to reduce the influence of the samples quality and help model converges rapidly. The experiments on MSRTD confirm that the proposed MSR-YOLO achieves precision, recall, and mean average precision (mAP) (0.5) of 90.00%, 68.52%, and 79.98% respectively. Compared with other methods on public dataset, ours also performs well and provides an effective detector model for maritime SAR object detection based on UAV images.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"87 ","pages":"Article 102994"},"PeriodicalIF":3.7000,"publicationDate":"2025-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225000319","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Unmanned aerial vehicle (UAV) remote sensing has the advantages of responsive and high image resolution, which can better serve the object detection of maritime search and rescue (SAR). However, there are still some obstacles in maritime SAR object detection based on UAV images, due to the lack of samples for training and the complexity background of the maritime images. In this study, we build a maritime search and rescue target dataset (MSRTD) based on UAV images and further propose an efficient multi-category detector named Maritime Search and Rescue-You Only Look Once network (MSR-YOLO). To eliminate the influence of objects scale and shooting angle, we introduce the deformable convolution network (DCN) to modules in backbone. The Coordinated Attention (CA) is added to the neck of network to extract the powerful features. We replace the original detection head with decoupled detection head to better complete the task of object recognition and localization. Finally, we use Wise-Intersection over Union loss (WIoU) during the training to reduce the influence of the samples quality and help model converges rapidly. The experiments on MSRTD confirm that the proposed MSR-YOLO achieves precision, recall, and mean average precision (mAP) (0.5) of 90.00%, 68.52%, and 79.98% respectively. Compared with other methods on public dataset, ours also performs well and provides an effective detector model for maritime SAR object detection based on UAV images.
期刊介绍:
Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface.
Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.