Scale-Aware Pruning Framework for Remote Sensing Object Detection via Multifeature Representation

IF 8.6 1区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2025-02-25 DOI:10.1109/TGRS.2025.3545416
Zhuping Hu;Maoguo Gong;Yue Zhao;Mingyang Zhang;Yiheng Lu;Jianzhao Li;Yan Pu;Zhao Wang
{"title":"Scale-Aware Pruning Framework for Remote Sensing Object Detection via Multifeature Representation","authors":"Zhuping Hu;Maoguo Gong;Yue Zhao;Mingyang Zhang;Yiheng Lu;Jianzhao Li;Yan Pu;Zhao Wang","doi":"10.1109/TGRS.2025.3545416","DOIUrl":null,"url":null,"abstract":"With the rapid advancements in computer vision, high-resolution remote sensing imagery has become a crucial data source for object detection. Nevertheless, effectively utilizing limited computational resources and reducing the burden on satellite edge devices remains a significant challenge. To effectively reduce model complexity while maintaining its representational capacity, this article proposes a scale-aware pruning framework (SAPF) to enhance remote sensing object detection ability. First, this article classifies the convolutional layers in object detection models into two categories: layers with a single-scale feature representation and layers with a multiscale feature representation. For convolutional layers with single-scale features, we utilize singular value decomposition (SVD) to quantify feature importance and assess filter redundancy to enhance model efficiency. By removing less critical filters, this pruning criteria aims to reduce the model size and computational load without compromising performance. However, convolutional layers with multiscale features are crucial for optimizing feature extraction and balancing information capture across various scales. To address this, this article evaluates the similarity between convolutional layers with different scales to determine the contribution of various scale features in multiscale fusion. Surprisingly, the SAPF can reduce the FLOPs and parameters, as well as ensure the representational ability obviously when the YOLO v5s and Faster-RCNN are adopted to classify the NWPU VHR-10, RSOD, and SIMD datasets. This means we can save the training computation resources for the model. Additionally, SAPF can significantly improve the efficiency of the model in object detection to ensure its real-time performance.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-15"},"PeriodicalIF":8.6000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10902564/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

With the rapid advancements in computer vision, high-resolution remote sensing imagery has become a crucial data source for object detection. Nevertheless, effectively utilizing limited computational resources and reducing the burden on satellite edge devices remains a significant challenge. To effectively reduce model complexity while maintaining its representational capacity, this article proposes a scale-aware pruning framework (SAPF) to enhance remote sensing object detection ability. First, this article classifies the convolutional layers in object detection models into two categories: layers with a single-scale feature representation and layers with a multiscale feature representation. For convolutional layers with single-scale features, we utilize singular value decomposition (SVD) to quantify feature importance and assess filter redundancy to enhance model efficiency. By removing less critical filters, this pruning criteria aims to reduce the model size and computational load without compromising performance. However, convolutional layers with multiscale features are crucial for optimizing feature extraction and balancing information capture across various scales. To address this, this article evaluates the similarity between convolutional layers with different scales to determine the contribution of various scale features in multiscale fusion. Surprisingly, the SAPF can reduce the FLOPs and parameters, as well as ensure the representational ability obviously when the YOLO v5s and Faster-RCNN are adopted to classify the NWPU VHR-10, RSOD, and SIMD datasets. This means we can save the training computation resources for the model. Additionally, SAPF can significantly improve the efficiency of the model in object detection to ensure its real-time performance.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于多特征表示的遥感目标检测尺度感知剪枝框架
随着计算机视觉技术的飞速发展,高分辨率遥感图像已成为目标检测的重要数据源。然而,有效利用有限的计算资源和减轻卫星边缘设备的负担仍然是一个重大挑战。为了在保持模型表征能力的同时有效降低模型复杂度,本文提出了一种尺度感知剪叶框架(SAPF)来增强遥感目标检测能力。首先,本文将目标检测模型中的卷积层分为两类:具有单尺度特征表示的层和具有多尺度特征表示的层。对于具有单尺度特征的卷积层,我们利用奇异值分解(SVD)来量化特征重要性和评估滤波器冗余,以提高模型效率。通过去除不太关键的过滤器,该修剪标准旨在减少模型大小和计算负载,而不影响性能。然而,具有多尺度特征的卷积层对于优化特征提取和平衡不同尺度的信息捕获至关重要。为了解决这个问题,本文评估了不同尺度卷积层之间的相似性,以确定各种尺度特征在多尺度融合中的贡献。令人惊讶的是,当采用YOLO v5s和Faster-RCNN对NWPU VHR-10、RSOD和SIMD数据集进行分类时,SAPF可以明显降低FLOPs和参数,并保证表征能力。这意味着我们可以节省模型的训练计算资源。此外,SAPF可以显著提高模型在目标检测方面的效率,保证模型的实时性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Geoscience and Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理
CiteScore
11.50
自引率
28.00%
发文量
1912
审稿时长
4.0 months
期刊介绍: IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.
期刊最新文献
Occlusion-Resistant Object Detection for Dual-View Airborne Platforms A Novel Real-Time Method for 3D Localization and Magnetic Moment Estimation of Multiple Targets MDCA-Net: Remote sensing image classification based on multi-scale dynamic convolution and attention enhancement PI-SAM: Physics-Informed Segment Anything for Infrared Small Target Detection Source-Driven Global-Local Prototype Contrastive Learning for Hyperspectral Domain Adaptation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1