BT-HRSCD: High-Resolution Feature Is What You Need for a Semantic Change Detection Network With a Triple-Decoding Branch

IF 8.6 1区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2024-11-18 DOI:10.1109/TGRS.2024.3500790
Sheng Fang;Wen Li;Shuqi Yang;Zhe Li;Jianli Zhao;Xiaoxin Wang
{"title":"BT-HRSCD: High-Resolution Feature Is What You Need for a Semantic Change Detection Network With a Triple-Decoding Branch","authors":"Sheng Fang;Wen Li;Shuqi Yang;Zhe Li;Jianli Zhao;Xiaoxin Wang","doi":"10.1109/TGRS.2024.3500790","DOIUrl":null,"url":null,"abstract":"In recent years, semantic change detection (SCD) has emerged as a pivotal field within the remote sensing (RS) research community, underscored by its essential contribution to various Earth observation undertakings. Conventional SCD methodologies typically adopt a multitask network architecture, fusing a binary change detection (BCD) sub-task with dual semantic segmentation (SS) sub-tasks. These strategies frequently rely on the encoder’s low-resolution yet semantically dense features, derived from multiple down-sampling stages, as the inputs for the decoding heads. Departing from this traditional path and targeting the nuanced characteristics of the multisubtasks, this study pioneers a novel methodology that harnesses the potential of the encoding phase’s high-resolution features. By integrating HRNet as the encoder structure, we introduce the BT-HRSCD framework, featuring two simple and effective modules. The first, bidirectional shallow and deep features aggregation module (BiFAM), seeks to imbue features with richer semantic insights through bidirectional feature fusion that spans from shallow-to-deep as well as deep-to-shallow layers. The second module, high-resolution difference extraction (HRDE), utilizes the encoder’s highest spatial resolution features, evaluating their differences to enhance the precision in identifying change areas. BiFAM is devised to boost the SS sub-tasks’ effectiveness, whereas HRDE aims to elevate the accuracy of the BCD sub-task. Experimental results reveal that our method outperforms state-of-the-art performances relative to previous SCD efforts. Our source code is released at \n<uri>https://github.com/iridescent524/BT-HRSCD</uri>\n.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"62 ","pages":"1-14"},"PeriodicalIF":8.6000,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10755134/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

In recent years, semantic change detection (SCD) has emerged as a pivotal field within the remote sensing (RS) research community, underscored by its essential contribution to various Earth observation undertakings. Conventional SCD methodologies typically adopt a multitask network architecture, fusing a binary change detection (BCD) sub-task with dual semantic segmentation (SS) sub-tasks. These strategies frequently rely on the encoder’s low-resolution yet semantically dense features, derived from multiple down-sampling stages, as the inputs for the decoding heads. Departing from this traditional path and targeting the nuanced characteristics of the multisubtasks, this study pioneers a novel methodology that harnesses the potential of the encoding phase’s high-resolution features. By integrating HRNet as the encoder structure, we introduce the BT-HRSCD framework, featuring two simple and effective modules. The first, bidirectional shallow and deep features aggregation module (BiFAM), seeks to imbue features with richer semantic insights through bidirectional feature fusion that spans from shallow-to-deep as well as deep-to-shallow layers. The second module, high-resolution difference extraction (HRDE), utilizes the encoder’s highest spatial resolution features, evaluating their differences to enhance the precision in identifying change areas. BiFAM is devised to boost the SS sub-tasks’ effectiveness, whereas HRDE aims to elevate the accuracy of the BCD sub-task. Experimental results reveal that our method outperforms state-of-the-art performances relative to previous SCD efforts. Our source code is released at https://github.com/iridescent524/BT-HRSCD .
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
BT-HRSCD:具有三重解码分支的语义变化检测网络需要高分辨率特征
近年来,语义变化检测(SCD)已成为遥感(RS)研究界的一个重要领域,其对各种地球观测工作的重要贡献凸显了这一点。传统的 SCD 方法通常采用多任务网络架构,将二进制变化检测(BCD)子任务与双语义分割(SS)子任务融合在一起。这些策略通常依赖于编码器的低分辨率但语义密集的特征,这些特征来自多个向下采样阶段,作为解码头的输入。本研究偏离了这一传统路径,针对多子项任务的细微特征,开创了一种利用编码阶段高分辨率特征潜力的新方法。通过整合 HRNet 作为编码器结构,我们引入了 BT-HRSCD 框架,该框架具有两个简单而有效的模块。第一个模块是双向浅层和深层特征聚合模块(BiFAM),旨在通过从浅层到深层以及从深层到浅层的双向特征融合,使特征具有更丰富的语义洞察力。第二个模块是高分辨率差异提取 (HRDE),它利用编码器的最高空间分辨率特征,评估它们之间的差异,以提高识别变化区域的精度。BiFAM 的设计旨在提高 SS 子任务的效率,而 HRDE 则旨在提高 BCD 子任务的精度。实验结果表明,相对于以前的 SCD 工作,我们的方法在性能上优于最先进的方法。我们的源代码发布于 https://github.com/iridescent524/BT-HRSCD。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Geoscience and Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理
CiteScore
11.50
自引率
28.00%
发文量
1912
审稿时长
4.0 months
期刊介绍: IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.
期刊最新文献
Mamba-MPSE: Multi-Pattern State Evolution Based on the Mamba Model for Intra-Class Heterogeneous Wetland Classification with UAV Hyperspectral Imagery Physics-Constrained Adapter-Tuning of Meteorological Foundation Models for Global SST Forecasting ReflectGAN: Modeling Vegetation Effects for Soil Carbon Estimation from Satellite Imagery Synergizing Smoke and Hotspot: A Visible-Infrared Co-Learning Framework with Dataset for Large-Scale Wildfire Detection Probabilistic Fusion Framework Based on Fully Convolutional Networks and Graphical Models for Burnt Area Detection from Multiresolution Satellite and UAV Imagery
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1