Dynamic Anchor: Density Map Guided Small Object Detector for Tiny Persons

IF 3.5, Zone 3 (Computer Science), Q2 (COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE), Computer Vision and Image Understanding, Pub Date: 2025-04-01, Epub Date: 2025-03-05, DOI: 10.1016/j.cviu.2025.104325

Xingzhou Xu, Zhaoyong Mao, Xin Wang, Qinhao Tu, Junge Shen

Computer Vision and Image Understanding, volume 255, Article 104325. https://www.sciencedirect.com/science/article/pii/S1077314225000487

Citations: 0

Abstract

With the growing use of aerial and space-based equipment, such as drones in search and rescue, there is increasing demand for detecting small and even tiny human targets. However, most existing detectors rely on generating smaller and denser anchors for small-target detection, which introduces a large number of redundant negative anchor samples. To alleviate this issue, we propose a novel density-map-guided tiny person detector with dynamic anchors. Specifically, we design an Anchor Proposals Mask (APM) module that eliminates negative anchor samples and adaptively adjusts the anchor distribution under the guidance of density maps produced by a Density Map Generator (DMG). To improve the quality of the density map, we develop a Multi-Scale Feature Distillation (MSFD) module and incorporate the Focal Inverse Distance Transform (FIDT) map to conduct knowledge distillation for the DMG with the assistance of a crowd counting network. Extensive experiments on the TinyPerson and VisDrone datasets demonstrate that our method significantly enhances the performance of two-stage detectors in terms of average precision (AP) and average recall (AR) while effectively reducing the impact of negative anchor boxes.
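The idea of using a density map to discard anchors in empty regions can be illustrated with a small NumPy sketch. This is not the paper's APM module: the sum-pooling step, the fixed `threshold`, and the function names `anchor_centers` and `density_anchor_mask` are all assumptions made for illustration. It only shows how a per-pixel density map might be reduced to a boolean mask over anchor locations on a coarser feature grid.

```python
import numpy as np

def anchor_centers(feat_h, feat_w, stride):
    """Hypothetical helper: (feat_h * feat_w, 2) anchor centres as (x, y) pixels."""
    ys, xs = np.meshgrid(np.arange(feat_h), np.arange(feat_w), indexing="ij")
    centers = np.stack([xs, ys], axis=-1).reshape(-1, 2)
    return (centers + 0.5) * stride

def density_anchor_mask(density, feat_h, feat_w, threshold):
    """Hypothetical helper: keep grid cells whose summed density exceeds threshold.

    density is an (H, W) map; H and W are assumed to be at least
    feat_h and feat_w, and any remainder rows/columns are cropped.
    """
    H, W = density.shape
    kh, kw = H // feat_h, W // feat_w
    cropped = density[: feat_h * kh, : feat_w * kw]
    # Sum-pool each (kh, kw) block down to one value per feature-grid cell.
    pooled = cropped.reshape(feat_h, kh, feat_w, kw).sum(axis=(1, 3))
    return pooled > threshold

# Toy input: an 8x8 density map with a single person at (x=5, y=2).
density = np.zeros((8, 8))
density[2, 5] = 1.0

mask = density_anchor_mask(density, feat_h=4, feat_w=4, threshold=0.5)
centers = anchor_centers(feat_h=4, feat_w=4, stride=2)
kept = centers[mask.reshape(-1)]  # anchor centres that survive the mask
```

In this toy case only the grid cell containing the person survives, so anchors at the other fifteen locations would never be generated as negative samples. A learned detector would presumably use a predicted density map and a tuned or adaptive threshold rather than these fixed values.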

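Source journal

Computer Vision and Image Understanding (Engineering and Technology: Engineering, Electrical and Electronic)

CiteScore: 7.80

Self-citation rate: 4.40%

Articles published: 112

Review time: 79 days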

Journal description: The central focus of this journal is the computer analysis of pictorial information. Computer Vision and Image Understanding publishes papers covering all aspects of image analysis from the low-level, iconic processes of early vision to the high-level, symbolic processes of recognition and interpretation. A wide range of topics in the image understanding area is covered, including papers offering insights that differ from predominant views.

Research Areas Include:
• Theory
• Early vision
• Data structures and representations
• Shape
• Range
• Motion
• Matching and recognition
• Architecture and languages
• Vision systems