Simple is best: A single-CNN method for classifying remote sensing images

IF 1.2 4区数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Networks and Heterogeneous Media Pub Date : 2023-01-01 DOI:10.3934/nhm.2023070

Huaxiang Song, Yong Zhou

{"title":"Simple is best: A single-CNN method for classifying remote sensing images","authors":"Huaxiang Song, Yong Zhou","doi":"10.3934/nhm.2023070","DOIUrl":null,"url":null,"abstract":"Recently, researchers have proposed a lot of methods to boost the performance of convolutional neural networks (CNNs) for classifying remote sensing images (RSI). However, the methods' performance improvements were insignificant, while time and hardware costs increased dramatically due to re-modeling. To tackle this problem, this study sought a simple, lightweight, yet more accurate solution for RSI semantic classification (RSI-SC). At first, we proposed a set of mathematical derivations to analyze and identify the best way among different technical roadmaps. Afterward, we selected a simple route that can significantly boost a single CNN's performance while maintaining simplicity and reducing costs in time and hardware. The proposed method, called RE-EfficientNet, only consists of a lightweight EfficientNet-B3 and a concise training algorithm named RE-CNN. The novelty of RE-EfficientNet and RE-CNN includes the following: First, EfficientNet-B3 employs transfer learning from ImageNet-1K and excludes any complicated re-modeling. It can adequately utilize the easily accessible pre-trained weights for time savings and avoid the pre-training effect being weakened due to re-modeling. Second, RE-CNN includes an effective combination of data augmentation (DA) transformations and two modified training tricks (TTs). It can alleviate the data distribution shift from DA-processed training sets and make the TTs more effective through modification according to the inherent nature of RSI. Extensive experimental results on two RSI sets prove that RE-EfficientNet can surpass all 30 cutting-edge methods published before 2023. It gives a remarkable improvement of 0.50% to 0.75% in overall accuracy (OA) and a 75% or more reduction in parameters. The ablation experiment also reveals that RE-CNN can improve CNN OA by 0.55% to 1.10%. All the results indicate that RE-EfficientNet is a simple, lightweight and more accurate solution for RSI-SC. In addition, we argue that the ideas proposed in this work about how to choose an appropriate model and training algorithm can help us find more efficient approaches in the future.","PeriodicalId":54732,"journal":{"name":"Networks and Heterogeneous Media","volume":null,"pages":null},"PeriodicalIF":1.2000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Networks and Heterogeneous Media","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.3934/nhm.2023070","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 2

Abstract

Recently, researchers have proposed a lot of methods to boost the performance of convolutional neural networks (CNNs) for classifying remote sensing images (RSI). However, the methods' performance improvements were insignificant, while time and hardware costs increased dramatically due to re-modeling. To tackle this problem, this study sought a simple, lightweight, yet more accurate solution for RSI semantic classification (RSI-SC). At first, we proposed a set of mathematical derivations to analyze and identify the best way among different technical roadmaps. Afterward, we selected a simple route that can significantly boost a single CNN's performance while maintaining simplicity and reducing costs in time and hardware. The proposed method, called RE-EfficientNet, only consists of a lightweight EfficientNet-B3 and a concise training algorithm named RE-CNN. The novelty of RE-EfficientNet and RE-CNN includes the following: First, EfficientNet-B3 employs transfer learning from ImageNet-1K and excludes any complicated re-modeling. It can adequately utilize the easily accessible pre-trained weights for time savings and avoid the pre-training effect being weakened due to re-modeling. Second, RE-CNN includes an effective combination of data augmentation (DA) transformations and two modified training tricks (TTs). It can alleviate the data distribution shift from DA-processed training sets and make the TTs more effective through modification according to the inherent nature of RSI. Extensive experimental results on two RSI sets prove that RE-EfficientNet can surpass all 30 cutting-edge methods published before 2023. It gives a remarkable improvement of 0.50% to 0.75% in overall accuracy (OA) and a 75% or more reduction in parameters. The ablation experiment also reveals that RE-CNN can improve CNN OA by 0.55% to 1.10%. All the results indicate that RE-EfficientNet is a simple, lightweight and more accurate solution for RSI-SC. In addition, we argue that the ideas proposed in this work about how to choose an appropriate model and training algorithm can help us find more efficient approaches in the future.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

简单是最好的:一个单一的cnn方法来分类遥感图像

近年来，研究人员提出了许多方法来提高卷积神经网络(cnn)在遥感图像分类中的性能。然而，这些方法的性能改进并不显著，而时间和硬件成本由于重新建模而急剧增加。为了解决这个问题，本研究寻求一种简单、轻量级但更准确的RSI语义分类(RSI- sc)解决方案。首先，我们提出了一套数学推导来分析和确定不同技术路线图的最佳方式。之后，我们选择了一条简单的路线，它可以显著提高单个CNN的性能，同时保持简单性并降低时间和硬件成本。所提出的方法称为RE-EfficientNet，仅由轻量级的EfficientNet-B3和一个名为RE-CNN的简明训练算法组成。RE-EfficientNet和RE-CNN的新颖性包括:首先，EfficientNet-B3采用了ImageNet-1K的迁移学习，排除了任何复杂的重新建模。它可以充分利用容易获取的预训练权值，节省时间，避免预训练效果因重新建模而减弱。其次，RE-CNN包括数据增强(DA)转换和两种改进的训练技巧(tt)的有效组合。根据RSI的固有性质对训练集进行修正，可以缓解da处理训练集的数据分布偏移，使训练集更有效。在两个RSI集上的大量实验结果证明，RE-EfficientNet可以超越2023年之前发表的所有30种前沿方法。它在总体精度(OA)方面提供了0.50%至0.75%的显着改进，参数减少了75%或更多。烧蚀实验也表明RE-CNN能将CNN的OA提高0.55% ~ 1.10%。所有结果表明，RE-EfficientNet是一种简单、轻量级和更准确的RSI-SC解决方案。此外，我们认为在这项工作中提出的关于如何选择合适的模型和训练算法的想法可以帮助我们在未来找到更有效的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Networks and Heterogeneous Media 数学-数学跨学科应用

CiteScore

1.80

自引率

0.00%

发文量

审稿时长

6-12 weeks

期刊介绍： NHM offers a strong combination of three features: Interdisciplinary character, specific focus, and deep mathematical content. Also, the journal aims to create a link between the discrete and the continuous communities, which distinguishes it from other journals with strong PDE orientation. NHM publishes original contributions of high quality in networks, heterogeneous media and related fields. NHM is thus devoted to research work on complex media arising in mathematical, physical, engineering, socio-economical and bio-medical problems.