Xianhong Zhang, Tao Lu, Jiaming Wang, Shichang Fu, Fangqun Gao
{"title":"Small object detection by Edge-aware Neural Network","authors":"Xianhong Zhang, Tao Lu, Jiaming Wang, Shichang Fu, Fangqun Gao","doi":"10.1016/j.engappai.2024.109406","DOIUrl":null,"url":null,"abstract":"<div><div>The object detection method is widely applied in industrial inspections. However, many detectors face challenges in accurately capturing the blurred edge details of small objects, resulting in inaccurate bounding box predictions. To address this, we propose an Edge-aware Neural Network (EANN) for small object detection. Firstly, we introduce a Channel and Spatial Attention Fusion Module (CSAFM) to enhance the edge features of small objects, enabling the network to extract more discriminative information. Next, we propose a Multiple Aggregation Feature Pyramid (MAFP) to integrate multi-scale deep features into shallow features. This fusion enriches the shallow features with abundant semantic information, thereby aiding in the detection of small objects. Additionally, we propose a Side and Center Point Aligned Intersection over Union loss (SCPAIoULoss) to enhance the bounding box regression when there is minimal overlap between predicted and ground truth boxes. SCPAIoULoss combines Side Ratio (SR) loss, Center Point Distance (CPD) loss, and Intersection over Union (IoU) loss. The utilization of SR Loss directly constrains the width and height regression of bounding boxes, while CPD loss introduces stricter constraints to facilitate bounding box regression. Furthermore, IoU loss promotes the overall regression of predicted boxes. We extensively experiment on Tiny CityPersons, WiderFace, and our proposed dataset of base station data centers to validate the effectiveness of our method. The results indicate that our method surpasses several State-of-The-Art (SOTA) approaches in small object detection and can be effectively applied to the task of inspecting base station data centers.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624015641","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The object detection method is widely applied in industrial inspections. However, many detectors face challenges in accurately capturing the blurred edge details of small objects, resulting in inaccurate bounding box predictions. To address this, we propose an Edge-aware Neural Network (EANN) for small object detection. Firstly, we introduce a Channel and Spatial Attention Fusion Module (CSAFM) to enhance the edge features of small objects, enabling the network to extract more discriminative information. Next, we propose a Multiple Aggregation Feature Pyramid (MAFP) to integrate multi-scale deep features into shallow features. This fusion enriches the shallow features with abundant semantic information, thereby aiding in the detection of small objects. Additionally, we propose a Side and Center Point Aligned Intersection over Union loss (SCPAIoULoss) to enhance the bounding box regression when there is minimal overlap between predicted and ground truth boxes. SCPAIoULoss combines Side Ratio (SR) loss, Center Point Distance (CPD) loss, and Intersection over Union (IoU) loss. The utilization of SR Loss directly constrains the width and height regression of bounding boxes, while CPD loss introduces stricter constraints to facilitate bounding box regression. Furthermore, IoU loss promotes the overall regression of predicted boxes. We extensively experiment on Tiny CityPersons, WiderFace, and our proposed dataset of base station data centers to validate the effectiveness of our method. The results indicate that our method surpasses several State-of-The-Art (SOTA) approaches in small object detection and can be effectively applied to the task of inspecting base station data centers.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.