Enhancing Small Object Detection in Aerial Images: A Novel Approach with PCSG Model

IF 4.7 Q2 MATERIALS SCIENCE, BIOMATERIALS ACS Applied Bio Materials Pub Date : 2024-05-14 DOI:10.3390/aerospace11050392
Kang An, Huiping Duanmu, Zhiyang Wu, Yuqiang Liu, Jingzhen Qiao, Qianqian Shangguan, Yaqing Song, Xiaonong Xu
{"title":"Enhancing Small Object Detection in Aerial Images: A Novel Approach with PCSG Model","authors":"Kang An, Huiping Duanmu, Zhiyang Wu, Yuqiang Liu, Jingzhen Qiao, Qianqian Shangguan, Yaqing Song, Xiaonong Xu","doi":"10.3390/aerospace11050392","DOIUrl":null,"url":null,"abstract":"Generalized target detection algorithms perform well for large- and medium-sized targets but struggle with small ones. However, with the growing importance of aerial images in urban transportation and environmental monitoring, detecting small targets in such imagery has been a promising research hotspot. The challenge in small object detection lies in the limited pixel proportion and the complexity of feature extraction. Moreover, current mainstream detection algorithms tend to be overly complex, leading to structural redundancy for small objects. To cope with these challenges, this paper recommends the PCSG model based on yolov5, which optimizes both the detection head and backbone networks. (1) An enhanced detection header is introduced, featuring a new structure that enhances the feature pyramid network and the path aggregation network. This enhancement bolsters the model’s shallow feature reuse capability and introduces a dedicated detection layer for smaller objects. Additionally, redundant structures in the network are pruned, and the lightweight and versatile upsampling operator CARAFE is used to optimize the upsampling algorithm. (2) The paper proposes the module named SPD-Conv to replace the strided convolution operation and pooling structures in yolov5, thereby enhancing the backbone’s feature extraction capability. Furthermore, Ghost convolution is utilized to optimize the parameter count, ensuring that the backbone meets the real-time needs of aerial image detection. The experimental results from the RSOD dataset show that the PCSG model exhibits superior detection performance. The value of mAP increases from 97.1% to 97.8%, while the number of model parameters decreases by 22.3%, from 1,761,871 to 1,368,823. These findings unequivocally highlight the effectiveness of this approach.","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":"18 4","pages":""},"PeriodicalIF":4.7000,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3390/aerospace11050392","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0

Abstract

Generalized target detection algorithms perform well for large- and medium-sized targets but struggle with small ones. However, with the growing importance of aerial images in urban transportation and environmental monitoring, detecting small targets in such imagery has been a promising research hotspot. The challenge in small object detection lies in the limited pixel proportion and the complexity of feature extraction. Moreover, current mainstream detection algorithms tend to be overly complex, leading to structural redundancy for small objects. To cope with these challenges, this paper recommends the PCSG model based on yolov5, which optimizes both the detection head and backbone networks. (1) An enhanced detection header is introduced, featuring a new structure that enhances the feature pyramid network and the path aggregation network. This enhancement bolsters the model’s shallow feature reuse capability and introduces a dedicated detection layer for smaller objects. Additionally, redundant structures in the network are pruned, and the lightweight and versatile upsampling operator CARAFE is used to optimize the upsampling algorithm. (2) The paper proposes the module named SPD-Conv to replace the strided convolution operation and pooling structures in yolov5, thereby enhancing the backbone’s feature extraction capability. Furthermore, Ghost convolution is utilized to optimize the parameter count, ensuring that the backbone meets the real-time needs of aerial image detection. The experimental results from the RSOD dataset show that the PCSG model exhibits superior detection performance. The value of mAP increases from 97.1% to 97.8%, while the number of model parameters decreases by 22.3%, from 1,761,871 to 1,368,823. These findings unequivocally highlight the effectiveness of this approach.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
增强航空图像中的小物体检测:利用 PCSG 模型的新方法
通用目标检测算法在检测大中型目标时表现出色,但在检测小型目标时却举步维艰。然而,随着航空图像在城市交通和环境监测中的重要性日益凸显,在此类图像中检测小型目标已成为一个前景广阔的研究热点。小目标检测的难点在于像素比例有限和特征提取的复杂性。此外,目前主流的检测算法往往过于复杂,导致小目标的结构冗余。为了应对这些挑战,本文推荐基于 yolov5 的 PCSG 模型,该模型优化了检测头和主干网络。(1) 引入了增强型检测头,其新结构增强了特征金字塔网络和路径聚合网络。这一改进增强了模型的浅层特征重用能力,并为较小的对象引入了专用检测层。此外,还对网络中的冗余结构进行了修剪,并使用轻量级和通用的上采样算子 CARAFE 来优化上采样算法。(2) 本文提出了名为 SPD-Conv 的模块,以取代 yolov5 中的步进卷积运算和池化结构,从而增强骨干网的特征提取能力。此外,还利用幽灵卷积来优化参数数量,确保骨干网满足航空图像检测的实时需求。RSOD 数据集的实验结果表明,PCSG 模型表现出卓越的检测性能。mAP 值从 97.1% 增加到 97.8%,而模型参数数则减少了 22.3%,从 1,761,871 减少到 1,368,823。这些发现清楚地表明了这种方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
ACS Applied Bio Materials
ACS Applied Bio Materials Chemistry-Chemistry (all)
CiteScore
9.40
自引率
2.10%
发文量
464
期刊介绍: ACS Applied Bio Materials is an interdisciplinary journal publishing original research covering all aspects of biomaterials and biointerfaces including and beyond the traditional biosensing, biomedical and therapeutic applications. The journal is devoted to reports of new and original experimental and theoretical research of an applied nature that integrates knowledge in the areas of materials, engineering, physics, bioscience, and chemistry into important bio applications. The journal is specifically interested in work that addresses the relationship between structure and function and assesses the stability and degradation of materials under relevant environmental and biological conditions.
期刊最新文献
Hierarchical Logic Control via DNA Polymerase-Driven Molecular Circuits. Silk Protein-Based Materials for Photothermal Therapy: From Morphologies to Multifunctional Applications. 3D Bioprinting of Continuous Nanofibrous Yarn-Reinforced Cell-Laden Constructs. β-Carboline-Based Fluorescent Probes Sense and Stabilize G-quadruplex DNA Structures. Macrophage-Targeted Fullerene Potentiates Redox Homeostasis Regulation and Reprograms Macrophage Polarization to Ameliorate Hepatic Steatosis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1