Scale-Aware Pruning Framework for Remote Sensing Object Detection via Multifeature Representation

IF 8.6 1区地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2025-02-25 DOI:10.1109/TGRS.2025.3545416

Zhuping Hu;Maoguo Gong;Yue Zhao;Mingyang Zhang;Yiheng Lu;Jianzhao Li;Yan Pu;Zhao Wang

{"title":"Scale-Aware Pruning Framework for Remote Sensing Object Detection via Multifeature Representation","authors":"Zhuping Hu;Maoguo Gong;Yue Zhao;Mingyang Zhang;Yiheng Lu;Jianzhao Li;Yan Pu;Zhao Wang","doi":"10.1109/TGRS.2025.3545416","DOIUrl":null,"url":null,"abstract":"With the rapid advancements in computer vision, high-resolution remote sensing imagery has become a crucial data source for object detection. Nevertheless, effectively utilizing limited computational resources and reducing the burden on satellite edge devices remains a significant challenge. To effectively reduce model complexity while maintaining its representational capacity, this article proposes a scale-aware pruning framework (SAPF) to enhance remote sensing object detection ability. First, this article classifies the convolutional layers in object detection models into two categories: layers with a single-scale feature representation and layers with a multiscale feature representation. For convolutional layers with single-scale features, we utilize singular value decomposition (SVD) to quantify feature importance and assess filter redundancy to enhance model efficiency. By removing less critical filters, this pruning criteria aims to reduce the model size and computational load without compromising performance. However, convolutional layers with multiscale features are crucial for optimizing feature extraction and balancing information capture across various scales. To address this, this article evaluates the similarity between convolutional layers with different scales to determine the contribution of various scale features in multiscale fusion. Surprisingly, the SAPF can reduce the FLOPs and parameters, as well as ensure the representational ability obviously when the YOLO v5s and Faster-RCNN are adopted to classify the NWPU VHR-10, RSOD, and SIMD datasets. This means we can save the training computation resources for the model. Additionally, SAPF can significantly improve the efficiency of the model in object detection to ensure its real-time performance.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-15"},"PeriodicalIF":8.6000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10902564/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

With the rapid advancements in computer vision, high-resolution remote sensing imagery has become a crucial data source for object detection. Nevertheless, effectively utilizing limited computational resources and reducing the burden on satellite edge devices remains a significant challenge. To effectively reduce model complexity while maintaining its representational capacity, this article proposes a scale-aware pruning framework (SAPF) to enhance remote sensing object detection ability. First, this article classifies the convolutional layers in object detection models into two categories: layers with a single-scale feature representation and layers with a multiscale feature representation. For convolutional layers with single-scale features, we utilize singular value decomposition (SVD) to quantify feature importance and assess filter redundancy to enhance model efficiency. By removing less critical filters, this pruning criteria aims to reduce the model size and computational load without compromising performance. However, convolutional layers with multiscale features are crucial for optimizing feature extraction and balancing information capture across various scales. To address this, this article evaluates the similarity between convolutional layers with different scales to determine the contribution of various scale features in multiscale fusion. Surprisingly, the SAPF can reduce the FLOPs and parameters, as well as ensure the representational ability obviously when the YOLO v5s and Faster-RCNN are adopted to classify the NWPU VHR-10, RSOD, and SIMD datasets. This means we can save the training computation resources for the model. Additionally, SAPF can significantly improve the efficiency of the model in object detection to ensure its real-time performance.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于多特征表示的遥感目标检测尺度感知剪枝框架

随着计算机视觉技术的飞速发展，高分辨率遥感图像已成为目标检测的重要数据源。然而，有效利用有限的计算资源和减轻卫星边缘设备的负担仍然是一个重大挑战。为了在保持模型表征能力的同时有效降低模型复杂度，本文提出了一种尺度感知剪叶框架（SAPF）来增强遥感目标检测能力。首先，本文将目标检测模型中的卷积层分为两类：具有单尺度特征表示的层和具有多尺度特征表示的层。对于具有单尺度特征的卷积层，我们利用奇异值分解（SVD）来量化特征重要性和评估滤波器冗余，以提高模型效率。通过去除不太关键的过滤器，该修剪标准旨在减少模型大小和计算负载，而不影响性能。然而，具有多尺度特征的卷积层对于优化特征提取和平衡不同尺度的信息捕获至关重要。为了解决这个问题，本文评估了不同尺度卷积层之间的相似性，以确定各种尺度特征在多尺度融合中的贡献。令人惊讶的是，当采用YOLO v5s和Faster-RCNN对NWPU VHR-10、RSOD和SIMD数据集进行分类时，SAPF可以明显降低FLOPs和参数，并保证表征能力。这意味着我们可以节省模型的训练计算资源。此外，SAPF可以显著提高模型在目标检测方面的效率，保证模型的实时性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理

CiteScore

11.50

自引率

28.00%

发文量

1912

审稿时长

4.0 months

期刊介绍： IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.