Min Dang , Gang Liu , Chao Chen , Di Wang , Xike Li , Quan Wang
{"title":"Adaptive spatial and scale label assignment for anchor-free object detection","authors":"Min Dang , Gang Liu , Chao Chen , Di Wang , Xike Li , Quan Wang","doi":"10.1016/j.patcog.2025.111549","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, anchor-free object detection has attracted widespread attention due to its simplicity and efficiency. The mainstream anchor-free object detectors allocate positive/negative candidate samples through prior guidance at a fixed spatial position and assign positive/negative samples according to predefined scale constraints. However, artificially designing assignment strategies according to prior data distribution may hinder further optimization of label assignment. To this end, this paper proposes Adaptive Spatial and Scale Label Assignment (ASS-LA) to improve the performance of anchor-free object detection. Positive/negative samples are distributed from different pyramid levels using spatial and scale constraints. Specifically, an adaptive Intersection-over-Union (IoU) space assignment is designed to select candidate positive sample points. The membership degree is introduced at each pyramid level to adaptively fuzzy the scale assignment range so that the detector selects the final positive sample from the candidate sample points. Furthermore, a reference box is introduced to design the predicted IoU branch of coupled regression. In the inference stage, the predicted IoU and classification scores are combined as the confidence of the regression bounding box to alleviate the inconsistency between classification and regression. Extensive experiments show that our method achieves comparable performance to other existing label assignment schemes. With the introduction of ASS-LA, the anchor-free object detector has significant performance improvements without introducing other overhead.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"164 ","pages":"Article 111549"},"PeriodicalIF":7.5000,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325002092","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, anchor-free object detection has attracted widespread attention due to its simplicity and efficiency. The mainstream anchor-free object detectors allocate positive/negative candidate samples through prior guidance at a fixed spatial position and assign positive/negative samples according to predefined scale constraints. However, artificially designing assignment strategies according to prior data distribution may hinder further optimization of label assignment. To this end, this paper proposes Adaptive Spatial and Scale Label Assignment (ASS-LA) to improve the performance of anchor-free object detection. Positive/negative samples are distributed from different pyramid levels using spatial and scale constraints. Specifically, an adaptive Intersection-over-Union (IoU) space assignment is designed to select candidate positive sample points. The membership degree is introduced at each pyramid level to adaptively fuzzy the scale assignment range so that the detector selects the final positive sample from the candidate sample points. Furthermore, a reference box is introduced to design the predicted IoU branch of coupled regression. In the inference stage, the predicted IoU and classification scores are combined as the confidence of the regression bounding box to alleviate the inconsistency between classification and regression. Extensive experiments show that our method achieves comparable performance to other existing label assignment schemes. With the introduction of ASS-LA, the anchor-free object detector has significant performance improvements without introducing other overhead.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.