Small object detection by Edge-aware Neural Network

IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Engineering Applications of Artificial Intelligence Pub Date : 2024-10-21 DOI:10.1016/j.engappai.2024.109406
Xianhong Zhang, Tao Lu, Jiaming Wang, Shichang Fu, Fangqun Gao
{"title":"Small object detection by Edge-aware Neural Network","authors":"Xianhong Zhang,&nbsp;Tao Lu,&nbsp;Jiaming Wang,&nbsp;Shichang Fu,&nbsp;Fangqun Gao","doi":"10.1016/j.engappai.2024.109406","DOIUrl":null,"url":null,"abstract":"<div><div>The object detection method is widely applied in industrial inspections. However, many detectors face challenges in accurately capturing the blurred edge details of small objects, resulting in inaccurate bounding box predictions. To address this, we propose an Edge-aware Neural Network (EANN) for small object detection. Firstly, we introduce a Channel and Spatial Attention Fusion Module (CSAFM) to enhance the edge features of small objects, enabling the network to extract more discriminative information. Next, we propose a Multiple Aggregation Feature Pyramid (MAFP) to integrate multi-scale deep features into shallow features. This fusion enriches the shallow features with abundant semantic information, thereby aiding in the detection of small objects. Additionally, we propose a Side and Center Point Aligned Intersection over Union loss (SCPAIoULoss) to enhance the bounding box regression when there is minimal overlap between predicted and ground truth boxes. SCPAIoULoss combines Side Ratio (SR) loss, Center Point Distance (CPD) loss, and Intersection over Union (IoU) loss. The utilization of SR Loss directly constrains the width and height regression of bounding boxes, while CPD loss introduces stricter constraints to facilitate bounding box regression. Furthermore, IoU loss promotes the overall regression of predicted boxes. We extensively experiment on Tiny CityPersons, WiderFace, and our proposed dataset of base station data centers to validate the effectiveness of our method. The results indicate that our method surpasses several State-of-The-Art (SOTA) approaches in small object detection and can be effectively applied to the task of inspecting base station data centers.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624015641","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

The object detection method is widely applied in industrial inspections. However, many detectors face challenges in accurately capturing the blurred edge details of small objects, resulting in inaccurate bounding box predictions. To address this, we propose an Edge-aware Neural Network (EANN) for small object detection. Firstly, we introduce a Channel and Spatial Attention Fusion Module (CSAFM) to enhance the edge features of small objects, enabling the network to extract more discriminative information. Next, we propose a Multiple Aggregation Feature Pyramid (MAFP) to integrate multi-scale deep features into shallow features. This fusion enriches the shallow features with abundant semantic information, thereby aiding in the detection of small objects. Additionally, we propose a Side and Center Point Aligned Intersection over Union loss (SCPAIoULoss) to enhance the bounding box regression when there is minimal overlap between predicted and ground truth boxes. SCPAIoULoss combines Side Ratio (SR) loss, Center Point Distance (CPD) loss, and Intersection over Union (IoU) loss. The utilization of SR Loss directly constrains the width and height regression of bounding boxes, while CPD loss introduces stricter constraints to facilitate bounding box regression. Furthermore, IoU loss promotes the overall regression of predicted boxes. We extensively experiment on Tiny CityPersons, WiderFace, and our proposed dataset of base station data centers to validate the effectiveness of our method. The results indicate that our method surpasses several State-of-The-Art (SOTA) approaches in small object detection and can be effectively applied to the task of inspecting base station data centers.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用边缘感知神经网络检测小物体
物体检测方法被广泛应用于工业检测领域。然而,许多检测器在准确捕捉小物体模糊边缘细节方面面临挑战,导致边界框预测不准确。针对这一问题,我们提出了一种边缘感知神经网络(EANN)用于小物体检测。首先,我们引入了通道和空间注意力融合模块(CSAFM)来增强小物体的边缘特征,使网络能够提取更多的判别信息。接下来,我们提出了多重聚合特征金字塔(MAFP),将多尺度深度特征整合到浅层特征中。这种融合为浅层特征提供了丰富的语义信息,从而有助于小物体的检测。此外,我们还提出了边点和中心点对齐交集联合损失(SCPAIoULoss),用于在预测框和地面实况框重叠最小的情况下增强边界框回归。SCPAIoULoss 结合了边距损失 (SR)、中心点距离损失 (CPD) 和联合交叉损失 (IoU)。SR 损失的使用直接限制了边界框的宽度和高度回归,而 CPD 损失则引入了更严格的限制以促进边界框回归。此外,IoU 损失还能促进预测框的整体回归。我们在 Tiny CityPersons、WiderFace 和我们提出的基站数据中心数据集上进行了广泛实验,以验证我们方法的有效性。结果表明,我们的方法在小物体检测方面超越了几种最新方法(SOTA),可以有效地应用于基站数据中心的检测任务。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
期刊最新文献
Constrained multi-objective optimization assisted by convergence and diversity auxiliary tasks A deep sequence-to-sequence model for power swing blocking of distance protection in power transmission lines A Chinese named entity recognition method for landslide geological disasters based on deep learning A deep learning ensemble approach for malware detection in Internet of Things utilizing Explainable Artificial Intelligence Evaluating the financial credibility of third-party logistic providers through a novel frank operators-driven group decision-making model with dual hesitant linguistic q-rung orthopair fuzzy information
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1