一种基于扩展卷积特征自适应的航拍图像高纵横比目标检测方法

IF 0.9 4区计算机科学 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING International Journal of Wavelets Multiresolution and Information Processing Pub Date : 2023-11-03 DOI:10.1142/s0219691323500480

Shaobo Liu, Tian Xia, Xiaodong Chen, Hui Li, Guanghui Yuan, Dong Yang

{"title":"一种基于扩展卷积特征自适应的航拍图像高纵横比目标检测方法","authors":"Shaobo Liu, Tian Xia, Xiaodong Chen, Hui Li, Guanghui Yuan, Dong Yang","doi":"10.1142/s0219691323500480","DOIUrl":null,"url":null,"abstract":"In real scenarios, objects with high aspect ratios are actually very common, and such objects hold significant importance in the field of object detection. However, most of the existing object detection algorithms tend to overlook this specific type of object. After analyzing the statistical data, we observed a substantial decrease in mAP (mean Average Precision) for classical object detection algorithms when they are tasked with detecting only high aspect ratio objects. Therefore, we conducted an analysis of the factors that influence the detection performance of these objects and made the following improvements: (1) We introduced large-kernel attention convolution between the backbone network layers. This addition allows each position feature to have a larger receptive field, facilitating better feature learning; (2) By incorporating multiple sets of deformable convolutions for feature-adaptive processing, we were able to enhance the learning of characteristic information specific to the object itself. This approach also promotes network convergence. The proposed method yielded a significant improvement in accuracy, approximately 5[Formula: see text] higher than the baseline, when evaluated on the FGSD2021 dataset. Furthermore, our method outperformed the current best method by approximately 0.5[Formula: see text].","PeriodicalId":50282,"journal":{"name":"International Journal of Wavelets Multiresolution and Information Processing","volume":"184 S491","pages":"0"},"PeriodicalIF":0.9000,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Dilated Convolution-Based Feature Adaptation Method for Detection of High Aspect Ratio Objects in Aerial Images\",\"authors\":\"Shaobo Liu, Tian Xia, Xiaodong Chen, Hui Li, Guanghui Yuan, Dong Yang\",\"doi\":\"10.1142/s0219691323500480\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In real scenarios, objects with high aspect ratios are actually very common, and such objects hold significant importance in the field of object detection. However, most of the existing object detection algorithms tend to overlook this specific type of object. After analyzing the statistical data, we observed a substantial decrease in mAP (mean Average Precision) for classical object detection algorithms when they are tasked with detecting only high aspect ratio objects. Therefore, we conducted an analysis of the factors that influence the detection performance of these objects and made the following improvements: (1) We introduced large-kernel attention convolution between the backbone network layers. This addition allows each position feature to have a larger receptive field, facilitating better feature learning; (2) By incorporating multiple sets of deformable convolutions for feature-adaptive processing, we were able to enhance the learning of characteristic information specific to the object itself. This approach also promotes network convergence. The proposed method yielded a significant improvement in accuracy, approximately 5[Formula: see text] higher than the baseline, when evaluated on the FGSD2021 dataset. Furthermore, our method outperformed the current best method by approximately 0.5[Formula: see text].\",\"PeriodicalId\":50282,\"journal\":{\"name\":\"International Journal of Wavelets Multiresolution and Information Processing\",\"volume\":\"184 S491\",\"pages\":\"0\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2023-11-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Wavelets Multiresolution and Information Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1142/s0219691323500480\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Wavelets Multiresolution and Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s0219691323500480","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

摘要

在实际场景中，具有高长宽比的物体其实是非常常见的，这类物体在物体检测领域有着重要的意义。然而，大多数现有的目标检测算法往往忽略了这一特定类型的目标。在分析统计数据后，我们观察到当经典目标检测算法只检测高宽高比目标时，mAP(平均平均精度)显著降低。因此，我们对影响这些目标检测性能的因素进行了分析，并做了以下改进:(1)在骨干网层之间引入了大核注意卷积。这种添加允许每个位置特征有更大的接受域，促进更好的特征学习;(2)通过结合多组可变形卷积进行特征自适应处理，我们能够增强对特定于对象本身的特征信息的学习。这种方式也促进了网络的融合。当在FGSD2021数据集上进行评估时，所提出的方法在准确性方面取得了显着提高，比基线高出约5[公式:见文本]。此外，我们的方法比目前最好的方法高出约0.5[公式:见文本]。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A Dilated Convolution-Based Feature Adaptation Method for Detection of High Aspect Ratio Objects in Aerial Images

In real scenarios, objects with high aspect ratios are actually very common, and such objects hold significant importance in the field of object detection. However, most of the existing object detection algorithms tend to overlook this specific type of object. After analyzing the statistical data, we observed a substantial decrease in mAP (mean Average Precision) for classical object detection algorithms when they are tasked with detecting only high aspect ratio objects. Therefore, we conducted an analysis of the factors that influence the detection performance of these objects and made the following improvements: (1) We introduced large-kernel attention convolution between the backbone network layers. This addition allows each position feature to have a larger receptive field, facilitating better feature learning; (2) By incorporating multiple sets of deformable convolutions for feature-adaptive processing, we were able to enhance the learning of characteristic information specific to the object itself. This approach also promotes network convergence. The proposed method yielded a significant improvement in accuracy, approximately 5[Formula: see text] higher than the baseline, when evaluated on the FGSD2021 dataset. Furthermore, our method outperformed the current best method by approximately 0.5[Formula: see text].

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Wavelets Multiresolution and Information Processing 工程技术-计算机：软件工程

CiteScore

2.60

自引率

7.10%

发文量

审稿时长

2.7 months

期刊介绍： International Journal of Wavelets, Multiresolution and Information Processing (hereafter referred to as IJWMIP) is a bi-monthly publication for theoretical and applied papers on the current state-of-the-art results of wavelet analysis, multiresolution and information processing. Papers related to the IJWMIP theme are especially solicited, including theories, methodologies, algorithms and emerging applications. Topics of interest of the IJWMIP include, but are not limited to: 1. Wavelets: Wavelets and operator theory Frame and applications Time-frequency analysis and applications Sparse representation and approximation Sampling theory and compressive sensing Wavelet based algorithms and applications 2. Multiresolution: Multiresolution analysis Multiscale approximation Multiresolution image processing and signal processing Multiresolution representations Deep learning and neural networks Machine learning theory, algorithms and applications High dimensional data analysis 3. Information Processing: Data sciences Big data and applications Information theory Information systems and technology Information security Information learning and processing Artificial intelligence and pattern recognition Image/signal processing.

期刊最新文献

Piecewise Scalable Frames in Hilbert Spaces A novel occluded face detection approach using Enhanced ORB and optimized GAN A Dilated Convolution-Based Feature Adaptation Method for Detection of High Aspect Ratio Objects in Aerial Images Fully Symmetric Frame Scaling Functions and derived Framelets Author index (Vol. 21)