MABDT: Multi-scale attention boosted deformable transformer for remote sensing image dehazing

IF 3.4 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Signal Processing Pub Date : 2024-11-07 DOI:10.1016/j.sigpro.2024.109768
Jin Ning, Jie Yin, Fei Deng, Lianbin Xie
{"title":"MABDT: Multi-scale attention boosted deformable transformer for remote sensing image dehazing","authors":"Jin Ning,&nbsp;Jie Yin,&nbsp;Fei Deng,&nbsp;Lianbin Xie","doi":"10.1016/j.sigpro.2024.109768","DOIUrl":null,"url":null,"abstract":"<div><div>Owing to the heterogeneous spatial distribution and non-uniform morphological characteristics of haze in remote sensing images (RSIs), conventional dehazing algorithms struggle to precisely recover the fine-grained details of terrestrial objects. To address this issue, a novel multi-scale attention boosted deformable Transformer (MABDT) tailored for RSI dehazing is proposed. This framework synergizes the multi-receptive field features elicited by convolutional neural network (CNN) with the long-term dependency features derived from Transformer, which facilitates a more adept restitution of texture and intricate detail information within RSIs. Firstly, spatial attention deformable convolution is introduced for computation of multi-head self-attention in the Transformer block, particularly in addressing complex haze scenarios encountered in RSIs. Subsequently, a multi-scale attention feature enhancement (MAFE) block is designed, tailored to capture local and multi-level detailed information features using multi-receptive field convolution operations, thereby accommodating non-uniform haze. Finally, a multi-level feature complementary fusion (MFCF) block is proposed, leveraging both shallow and deep features acquired from all encoding layers to augment each level of reconstructed image. The dehazing performance is evaluated on 6 open-source datasets, and quantitative and qualitative experimental results demonstrate the advancements of the proposed method in both metrical scores and visual quality. The source code is available at <span><span>https://github.com/ningjin00/MABDT</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49523,"journal":{"name":"Signal Processing","volume":"229 ","pages":"Article 109768"},"PeriodicalIF":3.4000,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165168424003888","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Owing to the heterogeneous spatial distribution and non-uniform morphological characteristics of haze in remote sensing images (RSIs), conventional dehazing algorithms struggle to precisely recover the fine-grained details of terrestrial objects. To address this issue, a novel multi-scale attention boosted deformable Transformer (MABDT) tailored for RSI dehazing is proposed. This framework synergizes the multi-receptive field features elicited by convolutional neural network (CNN) with the long-term dependency features derived from Transformer, which facilitates a more adept restitution of texture and intricate detail information within RSIs. Firstly, spatial attention deformable convolution is introduced for computation of multi-head self-attention in the Transformer block, particularly in addressing complex haze scenarios encountered in RSIs. Subsequently, a multi-scale attention feature enhancement (MAFE) block is designed, tailored to capture local and multi-level detailed information features using multi-receptive field convolution operations, thereby accommodating non-uniform haze. Finally, a multi-level feature complementary fusion (MFCF) block is proposed, leveraging both shallow and deep features acquired from all encoding layers to augment each level of reconstructed image. The dehazing performance is evaluated on 6 open-source datasets, and quantitative and qualitative experimental results demonstrate the advancements of the proposed method in both metrical scores and visual quality. The source code is available at https://github.com/ningjin00/MABDT.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
MABDT:用于遥感图像去毛刺的多尺度注意力增强可变形变换器
由于遥感图像(RSI)中雾霾的空间分布不均匀且形态特征不一致,传统的去雾算法难以精确恢复地面物体的细微细节。为解决这一问题,我们提出了一种为 RSI 去雾量身定制的新型多尺度注意力增强可变形变换器(MABDT)。该框架将卷积神经网络(CNN)激发的多感知场特征与变形器产生的长期依赖性特征协同作用,从而更巧妙地还原 RSI 中的纹理和复杂细节信息。首先,在 Transformer 模块中引入了空间注意力可变形卷积,用于计算多头自我注意力,特别是在处理 RSI 中遇到的复杂雾霾场景时。随后,设计了多尺度注意力特征增强(MAFE)区块,利用多感受野卷积操作捕捉局部和多层次的详细信息特征,从而适应非均匀雾度。最后,提出了多层次特征互补融合(MFCF)模块,利用从所有编码层获取的浅层和深层特征来增强重建图像的每个层次。在 6 个开源数据集上对去毛刺性能进行了评估,定量和定性实验结果表明了所提方法在度量分数和视觉质量方面的进步。源代码见 https://github.com/ningjin00/MABDT。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Signal Processing
Signal Processing 工程技术-工程:电子与电气
CiteScore
9.20
自引率
9.10%
发文量
309
审稿时长
41 days
期刊介绍: Signal Processing incorporates all aspects of the theory and practice of signal processing. It features original research work, tutorial and review articles, and accounts of practical developments. It is intended for a rapid dissemination of knowledge and experience to engineers and scientists working in the research, development or practical application of signal processing. Subject areas covered by the journal include: Signal Theory; Stochastic Processes; Detection and Estimation; Spectral Analysis; Filtering; Signal Processing Systems; Software Developments; Image Processing; Pattern Recognition; Optical Signal Processing; Digital Signal Processing; Multi-dimensional Signal Processing; Communication Signal Processing; Biomedical Signal Processing; Geophysical and Astrophysical Signal Processing; Earth Resources Signal Processing; Acoustic and Vibration Signal Processing; Data Processing; Remote Sensing; Signal Processing Technology; Radar Signal Processing; Sonar Signal Processing; Industrial Applications; New Applications.
期刊最新文献
Distributed filtering with time-varying topology: A temporal-difference learning approach in dual games Editorial Board MABDT: Multi-scale attention boosted deformable transformer for remote sensing image dehazing A new method for judging thermal image quality with applications Learning feature-weighted regularization discriminative correlation filters for real-time UAV tracking
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1