{"title":"Pas: a scale-invariant approach to maritime search and rescue object detection using preprocessing and attention scaling","authors":"Shibao Li, Chen Li, Zhaoyu Wang, Zekun Jia, Jinze Zhu, Xuerong Cui, Jianhang Liu","doi":"10.1007/s11370-024-00526-5","DOIUrl":null,"url":null,"abstract":"<p>Object detection is a primary means of unmanned aerial vehicle (UAV) maritime search and rescue. The problem of scale variation caused by UAV flight height changes, shooting angle changes, and giant waves seriously affects the detection performance. However, most work does not explicitly consider the effects of these factors. In this work, we propose an algorithm called Preprocessing and Attention Scaling, which explicitly considers the scale variation problem caused by height, angle changes, and giant waves for the first time and solves it through Preprocessing Scaling and Attention Scaling. The Preprocessing Scaling module scales and perspective changes the images according to each photograph’s recorded flight altitude and shooting angle and crops them to the appropriate size, significantly improving the detection accuracy and shortening the inference time. At the same time, the scale variation caused by the up and down of the object due to the vast swells cannot be solved by the Preprocessing Scaling module, so we designed the Attention Scaling module again to quickly capture the area that needs further scale change by fusing the horizontal attention and vertical attention, and then transform it to the appropriate scale by the affine transformation, further improving detection accuracy. We extensively tested PAS on the well-known SeaDronesSee-DET and the SeaDronesSee-DET v2 (S-ODv2) datasets, significantly improving the detection accuracy. In addition, we successfully tested our method on a height-angle transfer task, where we trained on some height-angle intervals and tested on different height-angle intervals, achieving good results.</p>","PeriodicalId":48813,"journal":{"name":"Intelligent Service Robotics","volume":"102 1","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2024-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent Service Robotics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11370-024-00526-5","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ROBOTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Object detection is a primary means of unmanned aerial vehicle (UAV) maritime search and rescue. The problem of scale variation caused by UAV flight height changes, shooting angle changes, and giant waves seriously affects the detection performance. However, most work does not explicitly consider the effects of these factors. In this work, we propose an algorithm called Preprocessing and Attention Scaling, which explicitly considers the scale variation problem caused by height, angle changes, and giant waves for the first time and solves it through Preprocessing Scaling and Attention Scaling. The Preprocessing Scaling module scales and perspective changes the images according to each photograph’s recorded flight altitude and shooting angle and crops them to the appropriate size, significantly improving the detection accuracy and shortening the inference time. At the same time, the scale variation caused by the up and down of the object due to the vast swells cannot be solved by the Preprocessing Scaling module, so we designed the Attention Scaling module again to quickly capture the area that needs further scale change by fusing the horizontal attention and vertical attention, and then transform it to the appropriate scale by the affine transformation, further improving detection accuracy. We extensively tested PAS on the well-known SeaDronesSee-DET and the SeaDronesSee-DET v2 (S-ODv2) datasets, significantly improving the detection accuracy. In addition, we successfully tested our method on a height-angle transfer task, where we trained on some height-angle intervals and tested on different height-angle intervals, achieving good results.
期刊介绍:
The journal directs special attention to the emerging significance of integrating robotics with information technology and cognitive science (such as ubiquitous and adaptive computing,information integration in a distributed environment, and cognitive modelling for human-robot interaction), which spurs innovation toward a new multi-dimensional robotic service to humans. The journal intends to capture and archive this emerging yet significant advancement in the field of intelligent service robotics. The journal will publish original papers of innovative ideas and concepts, new discoveries and improvements, as well as novel applications and business models which are related to the field of intelligent service robotics described above and are proven to be of high quality. The areas that the Journal will cover include, but are not limited to: Intelligent robots serving humans in daily life or in a hazardous environment, such as home or personal service robots, entertainment robots, education robots, medical robots, healthcare and rehabilitation robots, and rescue robots (Service Robotics); Intelligent robotic functions in the form of embedded systems for applications to, for example, intelligent space, intelligent vehicles and transportation systems, intelligent manufacturing systems, and intelligent medical facilities (Embedded Robotics); The integration of robotics with network technologies, generating such services and solutions as distributed robots, distance robotic education-aides, and virtual laboratories or museums (Networked Robotics).