Semantically-Aware Contrastive Learning for multispectral remote sensing images

IF 12.2 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL ISPRS Journal of Photogrammetry and Remote Sensing Pub Date : 2025-05-01 Epub Date: 2025-03-18 DOI:10.1016/j.isprsjprs.2025.02.024
Leandro Stival , Ricardo da Silva Torres , Helio Pedrini
{"title":"Semantically-Aware Contrastive Learning for multispectral remote sensing images","authors":"Leandro Stival ,&nbsp;Ricardo da Silva Torres ,&nbsp;Helio Pedrini","doi":"10.1016/j.isprsjprs.2025.02.024","DOIUrl":null,"url":null,"abstract":"<div><div>Satellites continuously capture vast amounts of data daily, including multispectral remote sensing images (MSRSI), which facilitate the analysis of planetary processes and changes. New machine-learning techniques are employed to develop models to identify regions with significant changes, predict land-use conditions, and segment areas of interest. However, these methods often require large volumes of labeled data for effective training, limiting the utilization of captured data in practice. According to current literature, self-supervised learning (SSL) can be effectively applied to learn how to represent MSRSI. This work introduces Semantically-Aware Contrastive Learning (SACo+), a novel method for training a model using SSL for MSRSI. Relevant known band combinations are utilized to extract semantic information from the MSRSI and texture-based representations, serving as anchors for constructing a feature space. This approach is resilient against changes and yields semantically informative results using contrastive techniques based on sample visual properties, their categories, and their changes over time. This enables training the model using classic SSL contrastive frameworks, such as MoCo and its remote sensing version, SeCo, while also leveraging intrinsic semantic information. SACo+ generates features for each semantic group (band combination), highlighting regions in the images (such as vegetation, urban areas, and water bodies), and explores texture properties encoded based on Local Binary Pattern (LBP). To demonstrate the efficacy of our approach, we trained ResNet models with MSRSI using the semantic band combinations in SSL frameworks. Subsequently, we compared these models on three distinct tasks: land cover classification task using the EuroSAT dataset, change detection using the OSCD dataset, and semantic segmentation using the PASTIS and GID datasets. Our results demonstrate that leveraging semantic and texture features enhances the quality of the feature space, leading to improved performance in all benchmark tasks. The model implementation and weights are available at <span><span>https://github.com/lstival/SACo</span><svg><path></path></svg></span> — As of Jan. 2025.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"223 ","pages":"Pages 173-187"},"PeriodicalIF":12.2000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISPRS Journal of Photogrammetry and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0924271625000826","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/18 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"GEOGRAPHY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Satellites continuously capture vast amounts of data daily, including multispectral remote sensing images (MSRSI), which facilitate the analysis of planetary processes and changes. New machine-learning techniques are employed to develop models to identify regions with significant changes, predict land-use conditions, and segment areas of interest. However, these methods often require large volumes of labeled data for effective training, limiting the utilization of captured data in practice. According to current literature, self-supervised learning (SSL) can be effectively applied to learn how to represent MSRSI. This work introduces Semantically-Aware Contrastive Learning (SACo+), a novel method for training a model using SSL for MSRSI. Relevant known band combinations are utilized to extract semantic information from the MSRSI and texture-based representations, serving as anchors for constructing a feature space. This approach is resilient against changes and yields semantically informative results using contrastive techniques based on sample visual properties, their categories, and their changes over time. This enables training the model using classic SSL contrastive frameworks, such as MoCo and its remote sensing version, SeCo, while also leveraging intrinsic semantic information. SACo+ generates features for each semantic group (band combination), highlighting regions in the images (such as vegetation, urban areas, and water bodies), and explores texture properties encoded based on Local Binary Pattern (LBP). To demonstrate the efficacy of our approach, we trained ResNet models with MSRSI using the semantic band combinations in SSL frameworks. Subsequently, we compared these models on three distinct tasks: land cover classification task using the EuroSAT dataset, change detection using the OSCD dataset, and semantic segmentation using the PASTIS and GID datasets. Our results demonstrate that leveraging semantic and texture features enhances the quality of the feature space, leading to improved performance in all benchmark tasks. The model implementation and weights are available at https://github.com/lstival/SACo — As of Jan. 2025.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
多光谱遥感图像的语义感知对比学习
卫星每天不断捕获大量数据,包括多光谱遥感图像(MSRSI),这有助于分析行星的过程和变化。新的机器学习技术用于开发模型,以识别具有重大变化的区域,预测土地利用状况,并分割感兴趣的区域。然而,这些方法通常需要大量的标记数据来进行有效的训练,这限制了在实践中对捕获数据的利用。根据目前的文献,自监督学习(self-supervised learning, SSL)可以有效地用于学习如何表示MSRSI。本文介绍了语义感知对比学习(SACo+),这是一种使用SSL对MSRSI进行模型训练的新方法。利用相关的已知频带组合从MSRSI和基于纹理的表示中提取语义信息,作为构建特征空间的锚点。这种方法对变化具有弹性,并使用基于示例视觉属性、它们的类别及其随时间变化的对比技术产生语义信息丰富的结果。这使得可以使用经典的SSL对比框架(如MoCo及其遥感版本SeCo)来训练模型,同时还可以利用固有的语义信息。SACo+为每个语义组(波段组合)生成特征,突出显示图像中的区域(如植被、城市地区和水体),并探索基于局部二值模式(Local Binary Pattern, LBP)编码的纹理属性。为了证明我们方法的有效性,我们使用SSL框架中的语义频带组合来训练带有MSRSI的ResNet模型。随后,我们在三个不同的任务上比较了这些模型:使用EuroSAT数据集的土地覆盖分类任务,使用OSCD数据集的变化检测任务,以及使用PASTIS和GID数据集的语义分割任务。我们的结果表明,利用语义和纹理特征增强了特征空间的质量,从而提高了所有基准任务的性能。模型实现和权重可在https://github.com/lstival/SACo上获得-截至2025年1月。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
ISPRS Journal of Photogrammetry and Remote Sensing
ISPRS Journal of Photogrammetry and Remote Sensing 工程技术-成像科学与照相技术
CiteScore
21.00
自引率
6.30%
发文量
273
审稿时长
40 days
期刊介绍: The ISPRS Journal of Photogrammetry and Remote Sensing (P&RS) serves as the official journal of the International Society for Photogrammetry and Remote Sensing (ISPRS). It acts as a platform for scientists and professionals worldwide who are involved in various disciplines that utilize photogrammetry, remote sensing, spatial information systems, computer vision, and related fields. The journal aims to facilitate communication and dissemination of advancements in these disciplines, while also acting as a comprehensive source of reference and archive. P&RS endeavors to publish high-quality, peer-reviewed research papers that are preferably original and have not been published before. These papers can cover scientific/research, technological development, or application/practical aspects. Additionally, the journal welcomes papers that are based on presentations from ISPRS meetings, as long as they are considered significant contributions to the aforementioned fields. In particular, P&RS encourages the submission of papers that are of broad scientific interest, showcase innovative applications (especially in emerging fields), have an interdisciplinary focus, discuss topics that have received limited attention in P&RS or related journals, or explore new directions in scientific or professional realms. It is preferred that theoretical papers include practical applications, while papers focusing on systems and applications should include a theoretical background.
期刊最新文献
SuperSTF: A latent diffusion model for cloud-free spatiotemporal remote sensing image fusion Label-free mangrove mapping from temporally consistent PlanetScope imagery with interpretable deep unfolding network A geographically weighted XGBoost framework for Pixel-Level modeling of vegetation responses using Multi-Source Earth Observation data How does missing-modality affect cross-domain remote sensing image classification: A synergistic inter-modal proxy fine-tuning solution Causally regularized full-resolution channel attention and frequency refinement for cloud removal in satellite imagery
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1