Fully Transformer Network for Change Detection of Remote Sensing Images

Computer vision - ACCV ... : ... Asian Conference on Computer Vision : proceedings. Asian Conference on Computer Vision Pub Date : 2022-10-03 DOI:10.48550/arXiv.2210.00757

Tianyu Yan, Zifu Wan, Pingping Zhang

{"title":"Fully Transformer Network for Change Detection of Remote Sensing Images","authors":"Tianyu Yan, Zifu Wan, Pingping Zhang","doi":"10.48550/arXiv.2210.00757","DOIUrl":null,"url":null,"abstract":"Recently, change detection (CD) of remote sensing images have achieved great progress with the advances of deep learning. However, current methods generally deliver incomplete CD regions and irregular CD boundaries due to the limited representation ability of the extracted visual features. To relieve these issues, in this work we propose a novel learning framework named Fully Transformer Network (FTN) for remote sensing image CD, which improves the feature extraction from a global view and combines multi-level visual features in a pyramid manner. More specifically, the proposed framework first utilizes the advantages of Transformers in long-range dependency modeling. It can help to learn more discriminative global-level features and obtain complete CD regions. Then, we introduce a pyramid structure to aggregate multi-level visual features from Transformers for feature enhancement. The pyramid structure grafted with a Progressive Attention Module (PAM) can improve the feature representation ability with additional interdependencies through channel attentions. Finally, to better train the framework, we utilize the deeply-supervised learning with multiple boundaryaware loss functions. Extensive experiments demonstrate that our proposed method achieves a new state-of-the-art performance on four public CD benchmarks. For model reproduction, the source code is released at https://github.com/AI-Zhpp/FTN.","PeriodicalId":87238,"journal":{"name":"Computer vision - ACCV ... : ... Asian Conference on Computer Vision : proceedings. Asian Conference on Computer Vision","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer vision - ACCV ... : ... Asian Conference on Computer Vision : proceedings. Asian Conference on Computer Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2210.00757","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

Recently, change detection (CD) of remote sensing images have achieved great progress with the advances of deep learning. However, current methods generally deliver incomplete CD regions and irregular CD boundaries due to the limited representation ability of the extracted visual features. To relieve these issues, in this work we propose a novel learning framework named Fully Transformer Network (FTN) for remote sensing image CD, which improves the feature extraction from a global view and combines multi-level visual features in a pyramid manner. More specifically, the proposed framework first utilizes the advantages of Transformers in long-range dependency modeling. It can help to learn more discriminative global-level features and obtain complete CD regions. Then, we introduce a pyramid structure to aggregate multi-level visual features from Transformers for feature enhancement. The pyramid structure grafted with a Progressive Attention Module (PAM) can improve the feature representation ability with additional interdependencies through channel attentions. Finally, to better train the framework, we utilize the deeply-supervised learning with multiple boundaryaware loss functions. Extensive experiments demonstrate that our proposed method achieves a new state-of-the-art performance on four public CD benchmarks. For model reproduction, the source code is released at https://github.com/AI-Zhpp/FTN.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

遥感图像变化检测的全变压器网络

近年来，随着深度学习的发展，遥感图像的变化检测取得了很大的进展。然而，由于提取的视觉特征的表达能力有限，目前的方法通常提供不完整的CD区域和不规则的CD边界。为了解决这些问题，本文提出了一种新的遥感图像CD学习框架——全变形网络(FTN)，该框架改进了从全局视图提取特征，并以金字塔的方式组合了多层次的视觉特征。更具体地说，提出的框架首先利用了transformer在远程依赖关系建模方面的优势。它有助于学习更多的判别性全局特征，获得完整的CD区域。然后，我们引入金字塔结构来聚合变形金刚的多层次视觉特征，进行特征增强。接枝渐进式注意模块(PAM)的金字塔结构可以通过通道注意增加相互依赖的特征表示能力。最后，为了更好地训练框架，我们利用了具有多个边界感知损失函数的深度监督学习。大量的实验表明，我们提出的方法在四个公共CD基准上实现了新的最先进的性能。要复制模型，源代码可在https://github.com/AI-Zhpp/FTN上发布。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Computer vision - ACCV ... : ... Asian Conference on Computer Vision : proceedings. Asian Conference on Computer Vision

自引率

0.00%

发文量