一种基于扩展卷积特征自适应的航拍图像高纵横比目标检测方法

IF 0.9 4区 计算机科学 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING International Journal of Wavelets Multiresolution and Information Processing Pub Date : 2023-11-03 DOI:10.1142/s0219691323500480
Shaobo Liu, Tian Xia, Xiaodong Chen, Hui Li, Guanghui Yuan, Dong Yang
{"title":"一种基于扩展卷积特征自适应的航拍图像高纵横比目标检测方法","authors":"Shaobo Liu, Tian Xia, Xiaodong Chen, Hui Li, Guanghui Yuan, Dong Yang","doi":"10.1142/s0219691323500480","DOIUrl":null,"url":null,"abstract":"In real scenarios, objects with high aspect ratios are actually very common, and such objects hold significant importance in the field of object detection. However, most of the existing object detection algorithms tend to overlook this specific type of object. After analyzing the statistical data, we observed a substantial decrease in mAP (mean Average Precision) for classical object detection algorithms when they are tasked with detecting only high aspect ratio objects. Therefore, we conducted an analysis of the factors that influence the detection performance of these objects and made the following improvements: (1) We introduced large-kernel attention convolution between the backbone network layers. This addition allows each position feature to have a larger receptive field, facilitating better feature learning; (2) By incorporating multiple sets of deformable convolutions for feature-adaptive processing, we were able to enhance the learning of characteristic information specific to the object itself. This approach also promotes network convergence. The proposed method yielded a significant improvement in accuracy, approximately 5[Formula: see text] higher than the baseline, when evaluated on the FGSD2021 dataset. Furthermore, our method outperformed the current best method by approximately 0.5[Formula: see text].","PeriodicalId":50282,"journal":{"name":"International Journal of Wavelets Multiresolution and Information Processing","volume":null,"pages":null},"PeriodicalIF":0.9000,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Dilated Convolution-Based Feature Adaptation Method for Detection of High Aspect Ratio Objects in Aerial Images\",\"authors\":\"Shaobo Liu, Tian Xia, Xiaodong Chen, Hui Li, Guanghui Yuan, Dong Yang\",\"doi\":\"10.1142/s0219691323500480\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In real scenarios, objects with high aspect ratios are actually very common, and such objects hold significant importance in the field of object detection. However, most of the existing object detection algorithms tend to overlook this specific type of object. After analyzing the statistical data, we observed a substantial decrease in mAP (mean Average Precision) for classical object detection algorithms when they are tasked with detecting only high aspect ratio objects. Therefore, we conducted an analysis of the factors that influence the detection performance of these objects and made the following improvements: (1) We introduced large-kernel attention convolution between the backbone network layers. This addition allows each position feature to have a larger receptive field, facilitating better feature learning; (2) By incorporating multiple sets of deformable convolutions for feature-adaptive processing, we were able to enhance the learning of characteristic information specific to the object itself. This approach also promotes network convergence. The proposed method yielded a significant improvement in accuracy, approximately 5[Formula: see text] higher than the baseline, when evaluated on the FGSD2021 dataset. Furthermore, our method outperformed the current best method by approximately 0.5[Formula: see text].\",\"PeriodicalId\":50282,\"journal\":{\"name\":\"International Journal of Wavelets Multiresolution and Information Processing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2023-11-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Wavelets Multiresolution and Information Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1142/s0219691323500480\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Wavelets Multiresolution and Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s0219691323500480","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

在实际场景中,具有高长宽比的物体其实是非常常见的,这类物体在物体检测领域有着重要的意义。然而,大多数现有的目标检测算法往往忽略了这一特定类型的目标。在分析统计数据后,我们观察到当经典目标检测算法只检测高宽高比目标时,mAP(平均平均精度)显著降低。因此,我们对影响这些目标检测性能的因素进行了分析,并做了以下改进:(1)在骨干网层之间引入了大核注意卷积。这种添加允许每个位置特征有更大的接受域,促进更好的特征学习;(2)通过结合多组可变形卷积进行特征自适应处理,我们能够增强对特定于对象本身的特征信息的学习。这种方式也促进了网络的融合。当在FGSD2021数据集上进行评估时,所提出的方法在准确性方面取得了显着提高,比基线高出约5[公式:见文本]。此外,我们的方法比目前最好的方法高出约0.5[公式:见文本]。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Dilated Convolution-Based Feature Adaptation Method for Detection of High Aspect Ratio Objects in Aerial Images
In real scenarios, objects with high aspect ratios are actually very common, and such objects hold significant importance in the field of object detection. However, most of the existing object detection algorithms tend to overlook this specific type of object. After analyzing the statistical data, we observed a substantial decrease in mAP (mean Average Precision) for classical object detection algorithms when they are tasked with detecting only high aspect ratio objects. Therefore, we conducted an analysis of the factors that influence the detection performance of these objects and made the following improvements: (1) We introduced large-kernel attention convolution between the backbone network layers. This addition allows each position feature to have a larger receptive field, facilitating better feature learning; (2) By incorporating multiple sets of deformable convolutions for feature-adaptive processing, we were able to enhance the learning of characteristic information specific to the object itself. This approach also promotes network convergence. The proposed method yielded a significant improvement in accuracy, approximately 5[Formula: see text] higher than the baseline, when evaluated on the FGSD2021 dataset. Furthermore, our method outperformed the current best method by approximately 0.5[Formula: see text].
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.60
自引率
7.10%
发文量
52
审稿时长
2.7 months
期刊介绍: International Journal of Wavelets, Multiresolution and Information Processing (hereafter referred to as IJWMIP) is a bi-monthly publication for theoretical and applied papers on the current state-of-the-art results of wavelet analysis, multiresolution and information processing. Papers related to the IJWMIP theme are especially solicited, including theories, methodologies, algorithms and emerging applications. Topics of interest of the IJWMIP include, but are not limited to: 1. Wavelets: Wavelets and operator theory Frame and applications Time-frequency analysis and applications Sparse representation and approximation Sampling theory and compressive sensing Wavelet based algorithms and applications 2. Multiresolution: Multiresolution analysis Multiscale approximation Multiresolution image processing and signal processing Multiresolution representations Deep learning and neural networks Machine learning theory, algorithms and applications High dimensional data analysis 3. Information Processing: Data sciences Big data and applications Information theory Information systems and technology Information security Information learning and processing Artificial intelligence and pattern recognition Image/signal processing.
期刊最新文献
Piecewise Scalable Frames in Hilbert Spaces A novel occluded face detection approach using Enhanced ORB and optimized GAN A Dilated Convolution-Based Feature Adaptation Method for Detection of High Aspect Ratio Objects in Aerial Images Fully Symmetric Frame Scaling Functions and derived Framelets Author index (Vol. 21)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1