基于监督对比学习的遥感图像多尺度遮挡鲁棒场景分类

IF 4.4 IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society Pub Date : 2025-01-30 DOI:10.1109/LGRS.2025.3537104

Zhao Yang;Jia Chen;Jun Li;Xiangan Zheng

{"title":"基于监督对比学习的遥感图像多尺度遮挡鲁棒场景分类","authors":"Zhao Yang;Jia Chen;Jun Li;Xiangan Zheng","doi":"10.1109/LGRS.2025.3537104","DOIUrl":null,"url":null,"abstract":"Scene classification of remote sensing images plays a vital role in Earth observation applications. Among various challenges, occlusion is a prevalent and critical issue in practical applications, particularly when dealing with large-area occlusions caused by clouds, shadows, and man-made structures. Current methods, whether based on occlusion recovery or occlusion-robust feature extraction, generally show limited performance when processing extensive occluded regions due to ignoring the inconsistency in feature representation caused by multiscale occlusions. To address the occlusion challenge, this letter proposes a novel contrastive learning-based multiscale occlusion framework with three key components: 1) a pretext task module that distinguishes between small and large occlusions to enable occlusion-invariant feature learning; 2) a multibranch feature extraction network based on ResNet-50’s shared-weight convolutional layers for consistent feature extraction across occlusion levels; and 3) a joint loss function that adaptively balances contrastive feature learning and classification. Extensive experiments were evaluated on the DIOR-Occ and LEVIR-Occ benchmark datasets, demonstrating significant improvements in classification accuracy across different occlusion scenarios. Compared with existing approaches, the proposed framework achieves superior robustness and generalization capabilities, with notable advantages in the analysis of highly occluded data. Future research will explore the adaptation of this framework to detection and segmentation tasks.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2025-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multiscale Occlusion-Robust Scene Classification in Remote Sensing Images via Supervised Contrastive Learning\",\"authors\":\"Zhao Yang;Jia Chen;Jun Li;Xiangan Zheng\",\"doi\":\"10.1109/LGRS.2025.3537104\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Scene classification of remote sensing images plays a vital role in Earth observation applications. Among various challenges, occlusion is a prevalent and critical issue in practical applications, particularly when dealing with large-area occlusions caused by clouds, shadows, and man-made structures. Current methods, whether based on occlusion recovery or occlusion-robust feature extraction, generally show limited performance when processing extensive occluded regions due to ignoring the inconsistency in feature representation caused by multiscale occlusions. To address the occlusion challenge, this letter proposes a novel contrastive learning-based multiscale occlusion framework with three key components: 1) a pretext task module that distinguishes between small and large occlusions to enable occlusion-invariant feature learning; 2) a multibranch feature extraction network based on ResNet-50’s shared-weight convolutional layers for consistent feature extraction across occlusion levels; and 3) a joint loss function that adaptively balances contrastive feature learning and classification. Extensive experiments were evaluated on the DIOR-Occ and LEVIR-Occ benchmark datasets, demonstrating significant improvements in classification accuracy across different occlusion scenarios. Compared with existing approaches, the proposed framework achieves superior robustness and generalization capabilities, with notable advantages in the analysis of highly occluded data. Future research will explore the adaptation of this framework to detection and segmentation tasks.\",\"PeriodicalId\":91017,\"journal\":{\"name\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"volume\":\"22 \",\"pages\":\"1-5\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-01-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10858743/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10858743/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

遥感影像的场景分类在对地观测应用中起着至关重要的作用。在各种挑战中，遮挡在实际应用中是一个普遍而关键的问题，特别是在处理由云、阴影和人造结构引起的大面积遮挡时。目前的方法，无论是基于遮挡恢复还是基于遮挡鲁棒特征提取，由于忽略了多尺度遮挡引起的特征表示的不一致性，在处理大范围遮挡区域时，通常表现出有限的性能。为了解决遮挡挑战，本文提出了一种新的基于对比学习的多尺度遮挡框架，该框架包含三个关键组件：1)区分小遮挡和大遮挡的借口任务模块，以实现遮挡不变特征学习；2)基于ResNet-50的共享权重卷积层的多分支特征提取网络，实现跨遮挡水平的一致特征提取；3)自适应平衡对比特征学习和分类的联合损失函数。在DIOR-Occ和LEVIR-Occ基准数据集上进行了大量的实验评估，表明在不同遮挡场景下分类精度有显著提高。与现有方法相比，该框架具有较强的鲁棒性和泛化能力，在高度遮挡数据分析中具有显著优势。未来的研究将探索该框架对检测和分割任务的适应性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Multiscale Occlusion-Robust Scene Classification in Remote Sensing Images via Supervised Contrastive Learning

Scene classification of remote sensing images plays a vital role in Earth observation applications. Among various challenges, occlusion is a prevalent and critical issue in practical applications, particularly when dealing with large-area occlusions caused by clouds, shadows, and man-made structures. Current methods, whether based on occlusion recovery or occlusion-robust feature extraction, generally show limited performance when processing extensive occluded regions due to ignoring the inconsistency in feature representation caused by multiscale occlusions. To address the occlusion challenge, this letter proposes a novel contrastive learning-based multiscale occlusion framework with three key components: 1) a pretext task module that distinguishes between small and large occlusions to enable occlusion-invariant feature learning; 2) a multibranch feature extraction network based on ResNet-50’s shared-weight convolutional layers for consistent feature extraction across occlusion levels; and 3) a joint loss function that adaptively balances contrastive feature learning and classification. Extensive experiments were evaluated on the DIOR-Occ and LEVIR-Occ benchmark datasets, demonstrating significant improvements in classification accuracy across different occlusion scenarios. Compared with existing approaches, the proposed framework achieves superior robustness and generalization capabilities, with notable advantages in the analysis of highly occluded data. Future research will explore the adaptation of this framework to detection and segmentation tasks.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society

自引率

0.00%

发文量