CDMamba: Incorporating Local Clues Into Mamba for Remote Sensing Image Binary Change Detection

IF 8.6 1区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2025-02-25 DOI:10.1109/TGRS.2025.3545012
Haotian Zhang;Keyan Chen;Chenyang Liu;Hao Chen;Zhengxia Zou;Zhenwei Shi
{"title":"CDMamba: Incorporating Local Clues Into Mamba for Remote Sensing Image Binary Change Detection","authors":"Haotian Zhang;Keyan Chen;Chenyang Liu;Hao Chen;Zhengxia Zou;Zhenwei Shi","doi":"10.1109/TGRS.2025.3545012","DOIUrl":null,"url":null,"abstract":"Recently, the Mamba architecture based on state-space models has demonstrated remarkable performance in a series of natural language processing tasks and has been rapidly applied to remote sensing change detection (CD) tasks. However, most methods enhance the global receptive field by directly modifying the scanning mode of Mamba, neglecting the crucial role that local information plays in dense prediction tasks (e.g., binary CD). In this article, we propose a model called CDMamba, which effectively combines global and local features for handling binary CD tasks. Specifically, the scaled residual ConvMamba (SRCM) block is proposed to utilize the ability of Mamba to extract global features and convolution to enhance the local details, to alleviate the issue that current Mamba-based methods lack detailed clues and are difficult to achieve fine detection in dense prediction tasks. Furthermore, considering the characteristics of bi-temporal feature interaction required for CD, the adaptive global–local guided fusion (AGLGF) block is proposed to dynamically facilitate the bi-temporal interaction guided by other temporal global/local features. Our intuition is that more discriminative change features can be acquired with the guidance of other temporal features. Extensive experiments on five datasets demonstrate that our proposed CDMamba is comparable to the current methods (such as the F1/intersection over union (IoU) scores are improved by 2.10%/3.00%, 2.44%/2.91%, on LEVIR+CD and CLCD, respectively). Our code is open-sourced at <uri>https://github.com/zmoka-zht/CDMamba</uri>.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-16"},"PeriodicalIF":8.6000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10902569/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Recently, the Mamba architecture based on state-space models has demonstrated remarkable performance in a series of natural language processing tasks and has been rapidly applied to remote sensing change detection (CD) tasks. However, most methods enhance the global receptive field by directly modifying the scanning mode of Mamba, neglecting the crucial role that local information plays in dense prediction tasks (e.g., binary CD). In this article, we propose a model called CDMamba, which effectively combines global and local features for handling binary CD tasks. Specifically, the scaled residual ConvMamba (SRCM) block is proposed to utilize the ability of Mamba to extract global features and convolution to enhance the local details, to alleviate the issue that current Mamba-based methods lack detailed clues and are difficult to achieve fine detection in dense prediction tasks. Furthermore, considering the characteristics of bi-temporal feature interaction required for CD, the adaptive global–local guided fusion (AGLGF) block is proposed to dynamically facilitate the bi-temporal interaction guided by other temporal global/local features. Our intuition is that more discriminative change features can be acquired with the guidance of other temporal features. Extensive experiments on five datasets demonstrate that our proposed CDMamba is comparable to the current methods (such as the F1/intersection over union (IoU) scores are improved by 2.10%/3.00%, 2.44%/2.91%, on LEVIR+CD and CLCD, respectively). Our code is open-sourced at https://github.com/zmoka-zht/CDMamba.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
CDMamba:结合局部线索到Mamba的遥感图像二值变化检测
近年来,基于状态空间模型的Mamba体系结构在一系列自然语言处理任务中表现出了显著的性能,并迅速应用于遥感变化检测任务中。然而,大多数方法通过直接修改曼巴的扫描模式来增强全局接受野,而忽略了局部信息在密集预测任务(如二进制CD)中所起的关键作用。在本文中,我们提出了一个名为CDMamba的模型,它有效地结合了全局和本地特性来处理二进制CD任务。具体而言,提出了缩放残差ConvMamba (SRCM)块,利用Mamba提取全局特征和卷积增强局部细节的能力,解决了目前基于Mamba的方法在密集预测任务中缺乏细节线索、难以实现精细检测的问题。在此基础上,考虑CD所需的双时相特征交互特性,提出了自适应全局-局部引导融合(AGLGF)块,以动态地促进由其他时相全局/局部特征引导的双时相交互。我们的直觉是,在其他时间特征的指导下,可以获得更多的判别变化特征。在5个数据集上的大量实验表明,我们提出的CDMamba方法与现有方法相当(例如F1/intersection over union (IoU)分数在LEVIR+CD和CLCD上分别提高了2.10%/3.00%、2.44%/2.91%)。我们的代码在https://github.com/zmoka-zht/CDMamba上是开源的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Geoscience and Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理
CiteScore
11.50
自引率
28.00%
发文量
1912
审稿时长
4.0 months
期刊介绍: IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.
期刊最新文献
Distribution-Aware Infrared Small Target Detection Based on Multi-Scale Convolutional Decoder and Hypergraph Attention Detecting Weak Underwater Targets in Hyperspectral Imagery via Physics-aware Residual Reasoning Faint Bottom Echo Detection for Airborne LiDAR Bathymetry Based on a Constrained Waveform Stacking Model First Cooperative Formaldehyde Monitoring with Chinese Morning and Afternoon Satellites: Revealing Global Multi-Temporal Concentration Dynamics Fast Anchor Graph Regularized Relaxation Linear Regression for Classification of Hyperspectral Images
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1