Haotian Zhang;Keyan Chen;Chenyang Liu;Hao Chen;Zhengxia Zou;Zhenwei Shi
{"title":"CDMamba: Incorporating Local Clues Into Mamba for Remote Sensing Image Binary Change Detection","authors":"Haotian Zhang;Keyan Chen;Chenyang Liu;Hao Chen;Zhengxia Zou;Zhenwei Shi","doi":"10.1109/TGRS.2025.3545012","DOIUrl":null,"url":null,"abstract":"Recently, the Mamba architecture based on state-space models has demonstrated remarkable performance in a series of natural language processing tasks and has been rapidly applied to remote sensing change detection (CD) tasks. However, most methods enhance the global receptive field by directly modifying the scanning mode of Mamba, neglecting the crucial role that local information plays in dense prediction tasks (e.g., binary CD). In this article, we propose a model called CDMamba, which effectively combines global and local features for handling binary CD tasks. Specifically, the scaled residual ConvMamba (SRCM) block is proposed to utilize the ability of Mamba to extract global features and convolution to enhance the local details, to alleviate the issue that current Mamba-based methods lack detailed clues and are difficult to achieve fine detection in dense prediction tasks. Furthermore, considering the characteristics of bi-temporal feature interaction required for CD, the adaptive global–local guided fusion (AGLGF) block is proposed to dynamically facilitate the bi-temporal interaction guided by other temporal global/local features. Our intuition is that more discriminative change features can be acquired with the guidance of other temporal features. Extensive experiments on five datasets demonstrate that our proposed CDMamba is comparable to the current methods (such as the F1/intersection over union (IoU) scores are improved by 2.10%/3.00%, 2.44%/2.91%, on LEVIR+CD and CLCD, respectively). Our code is open-sourced at <uri>https://github.com/zmoka-zht/CDMamba</uri>.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-16"},"PeriodicalIF":8.6000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10902569/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Recently, the Mamba architecture based on state-space models has demonstrated remarkable performance in a series of natural language processing tasks and has been rapidly applied to remote sensing change detection (CD) tasks. However, most methods enhance the global receptive field by directly modifying the scanning mode of Mamba, neglecting the crucial role that local information plays in dense prediction tasks (e.g., binary CD). In this article, we propose a model called CDMamba, which effectively combines global and local features for handling binary CD tasks. Specifically, the scaled residual ConvMamba (SRCM) block is proposed to utilize the ability of Mamba to extract global features and convolution to enhance the local details, to alleviate the issue that current Mamba-based methods lack detailed clues and are difficult to achieve fine detection in dense prediction tasks. Furthermore, considering the characteristics of bi-temporal feature interaction required for CD, the adaptive global–local guided fusion (AGLGF) block is proposed to dynamically facilitate the bi-temporal interaction guided by other temporal global/local features. Our intuition is that more discriminative change features can be acquired with the guidance of other temporal features. Extensive experiments on five datasets demonstrate that our proposed CDMamba is comparable to the current methods (such as the F1/intersection over union (IoU) scores are improved by 2.10%/3.00%, 2.44%/2.91%, on LEVIR+CD and CLCD, respectively). Our code is open-sourced at https://github.com/zmoka-zht/CDMamba.
近年来,基于状态空间模型的Mamba体系结构在一系列自然语言处理任务中表现出了显著的性能,并迅速应用于遥感变化检测任务中。然而,大多数方法通过直接修改曼巴的扫描模式来增强全局接受野,而忽略了局部信息在密集预测任务(如二进制CD)中所起的关键作用。在本文中,我们提出了一个名为CDMamba的模型,它有效地结合了全局和本地特性来处理二进制CD任务。具体而言,提出了缩放残差ConvMamba (SRCM)块,利用Mamba提取全局特征和卷积增强局部细节的能力,解决了目前基于Mamba的方法在密集预测任务中缺乏细节线索、难以实现精细检测的问题。在此基础上,考虑CD所需的双时相特征交互特性,提出了自适应全局-局部引导融合(AGLGF)块,以动态地促进由其他时相全局/局部特征引导的双时相交互。我们的直觉是,在其他时间特征的指导下,可以获得更多的判别变化特征。在5个数据集上的大量实验表明,我们提出的CDMamba方法与现有方法相当(例如F1/intersection over union (IoU)分数在LEVIR+CD和CLCD上分别提高了2.10%/3.00%、2.44%/2.91%)。我们的代码在https://github.com/zmoka-zht/CDMamba上是开源的。
期刊介绍:
IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.