{"title":"CB-YOLOv5 Algorithm for Small Target Detection in Aerial Images","authors":"Yingjie Li, Yitian Wang, Huaici Zhao","doi":"10.1109/INCET57972.2023.10170142","DOIUrl":null,"url":null,"abstract":"Aerial images are often plagued by background interference, and small targets with indistinct features, leading to low accuracy, high false detection rates, and high miss detection rates. To address these challenges, a small target detection algorithm based on YOLOv5, Coordinate-attention and Bidirectional-feature-pyramid-network YOLOv5 (CB-YOLOv5), is proposed in this paper. Considering the small number of pixels occupied by small targets and their indistinct features, a fourth target detection layer is added by concatenating the feature map from quadruple down-sampling during feature extraction with the feature map output from 8-fold up-sampling during feature fusion. Additionally, a coordinate attention mechanism is introduced during the feature extraction stage to improve small target localization and enhance detection accuracy. Finally, the original Path Aggregation Networks (PANet) structure is replaced with a weighted Bidirectional Feature Pyramid Network (BiFPN) structure during the feature fusion stage to improve the network’s ability to fuse feature maps of different scales. The simulation results demonstrate that the CB-YOLOv5 improves mAP50 by 9.4%, mAP75 by 9.7%, and mAP50:95 by 7.8% compared to the original YOLOv5s model. Thus, the effectiveness of the CB-YOLOv5 algorithm for detecting small targets in aerial images is validated.","PeriodicalId":403008,"journal":{"name":"2023 4th International Conference for Emerging Technology (INCET)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 4th International Conference for Emerging Technology (INCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INCET57972.2023.10170142","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Aerial images are often plagued by background interference, and small targets with indistinct features, leading to low accuracy, high false detection rates, and high miss detection rates. To address these challenges, a small target detection algorithm based on YOLOv5, Coordinate-attention and Bidirectional-feature-pyramid-network YOLOv5 (CB-YOLOv5), is proposed in this paper. Considering the small number of pixels occupied by small targets and their indistinct features, a fourth target detection layer is added by concatenating the feature map from quadruple down-sampling during feature extraction with the feature map output from 8-fold up-sampling during feature fusion. Additionally, a coordinate attention mechanism is introduced during the feature extraction stage to improve small target localization and enhance detection accuracy. Finally, the original Path Aggregation Networks (PANet) structure is replaced with a weighted Bidirectional Feature Pyramid Network (BiFPN) structure during the feature fusion stage to improve the network’s ability to fuse feature maps of different scales. The simulation results demonstrate that the CB-YOLOv5 improves mAP50 by 9.4%, mAP75 by 9.7%, and mAP50:95 by 7.8% compared to the original YOLOv5s model. Thus, the effectiveness of the CB-YOLOv5 algorithm for detecting small targets in aerial images is validated.