{"title":"GWS: Rotation object detection in aerial remote sensing images based on Gauss–Wasserstein scattering","authors":"Ling Gan, Xiaodong Tan, Liuhui Hu","doi":"10.3233/aic-230135","DOIUrl":null,"url":null,"abstract":"The majority of existing rotating target detectors inherit the horizontal detection paradigm and design the rotational regression loss based on the inductive paradigm. But the loss design limitation of the inductive paradigm makes these detectors hardly detect effectively tiny targets with high accuracy, particularly for large-aspect-ratio objects. Therefore, in view of the fact that horizontal detection is a special scenario of rotating target detection and based on the relationship between rotational and horizontal detection, we shift from an inductive to a deductive paradigm of design to develop a new regression loss function named Gauss–Wasserstein scattering (GWS). First, the rotating bounding box is transformed into a two-dimensional Gaussian distribution, and then the regression losses between Gaussian distributions are calculated as the Wasserstein scatter; By analyzing the gradient of centroid regression, centroid regression is shown to be able to adjust gradients dynamically based on object characteristics, and small targets requiring high accuracy detection rely on this mechanism, and more importantly, it is further demonstrated that GWS is scale-invariant while possessing an explicit regression logic. The method is performed on a large public remote sensing dataset DOTA and two popular detectors and achieves a large accuracy improvement in both large aspect ratio targets and small targets detection compared to similar methods.","PeriodicalId":50835,"journal":{"name":"AI Communications","volume":"14 2","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2023-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AI Communications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.3233/aic-230135","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The majority of existing rotating target detectors inherit the horizontal detection paradigm and design the rotational regression loss based on the inductive paradigm. But the loss design limitation of the inductive paradigm makes these detectors hardly detect effectively tiny targets with high accuracy, particularly for large-aspect-ratio objects. Therefore, in view of the fact that horizontal detection is a special scenario of rotating target detection and based on the relationship between rotational and horizontal detection, we shift from an inductive to a deductive paradigm of design to develop a new regression loss function named Gauss–Wasserstein scattering (GWS). First, the rotating bounding box is transformed into a two-dimensional Gaussian distribution, and then the regression losses between Gaussian distributions are calculated as the Wasserstein scatter; By analyzing the gradient of centroid regression, centroid regression is shown to be able to adjust gradients dynamically based on object characteristics, and small targets requiring high accuracy detection rely on this mechanism, and more importantly, it is further demonstrated that GWS is scale-invariant while possessing an explicit regression logic. The method is performed on a large public remote sensing dataset DOTA and two popular detectors and achieves a large accuracy improvement in both large aspect ratio targets and small targets detection compared to similar methods.
期刊介绍:
AI Communications is a journal on artificial intelligence (AI) which has a close relationship to EurAI (European Association for Artificial Intelligence, formerly ECCAI). It covers the whole AI community: Scientific institutions as well as commercial and industrial companies.
AI Communications aims to enhance contacts and information exchange between AI researchers and developers, and to provide supranational information to those concerned with AI and advanced information processing. AI Communications publishes refereed articles concerning scientific and technical AI procedures, provided they are of sufficient interest to a large readership of both scientific and practical background. In addition it contains high-level background material, both at the technical level as well as the level of opinions, policies and news.