状态空间模型与用于高光谱图像分类的变换器相遇

IF 3.4 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Signal Processing Pub Date : 2024-08-22 DOI:10.1016/j.sigpro.2024.109669
Xuefei Shi , Yisi Zhang , Kecheng Liu , Zhaokun Wen , Wenxuan Wang , Tianxiang Zhang , Jiangyun Li
{"title":"状态空间模型与用于高光谱图像分类的变换器相遇","authors":"Xuefei Shi ,&nbsp;Yisi Zhang ,&nbsp;Kecheng Liu ,&nbsp;Zhaokun Wen ,&nbsp;Wenxuan Wang ,&nbsp;Tianxiang Zhang ,&nbsp;Jiangyun Li","doi":"10.1016/j.sigpro.2024.109669","DOIUrl":null,"url":null,"abstract":"<div><p>In recent years, convolutional neural networks and vision transformers have emerged as predominant models for hyperspectral remote sensing image classification task, leveraging staked convolution layers and self-attention mechanisms with high computation costs, respectively. Recent studies, such as the Mamba model, have showcased the ability of state space model (SSM) with efficient hardware-aware designs in efficiently modeling sequences and extracting implicit features along tokens, which is precisely needed for accurate hyperspectral image (HSI) classification. Thus making SSM-based model potentially a new architecture for remote sensing HSI classification task. However, SSM encounters challenges in modeling HSI due to the insensitivity of spatial information and redundant spectral characteristics. Given SSM-based methods rarely explored in HSI classification, in this work, we present the first exploration of SSM-based models for HSI classification task. Our proposed method MamTrans effectively leverages the capacity of transformer for capturing spatial tokens relationships and Mamba for extracting implicit features along tokens. Besides, we propose a Bidirectional Mamba Module to enhance SSM’s spatial perception ability of extracting spatial features in HSI. Our proposed MamTrans obtains a new state-of-the-art performance across five commonly employed HSI classification benchmarks, demonstrating the robust generalization of MamTrans and effectiveness of SSM-based structure for HSI classification task. Our codes could be found at <span><span>https://github.com/PPPPPsanG/MamTrans</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":49523,"journal":{"name":"Signal Processing","volume":"226 ","pages":"Article 109669"},"PeriodicalIF":3.4000,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0165168424002895/pdfft?md5=4a576501dea0e507ceb0f2a46f4619c0&pid=1-s2.0-S0165168424002895-main.pdf","citationCount":"0","resultStr":"{\"title\":\"State space models meet transformers for hyperspectral image classification\",\"authors\":\"Xuefei Shi ,&nbsp;Yisi Zhang ,&nbsp;Kecheng Liu ,&nbsp;Zhaokun Wen ,&nbsp;Wenxuan Wang ,&nbsp;Tianxiang Zhang ,&nbsp;Jiangyun Li\",\"doi\":\"10.1016/j.sigpro.2024.109669\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In recent years, convolutional neural networks and vision transformers have emerged as predominant models for hyperspectral remote sensing image classification task, leveraging staked convolution layers and self-attention mechanisms with high computation costs, respectively. Recent studies, such as the Mamba model, have showcased the ability of state space model (SSM) with efficient hardware-aware designs in efficiently modeling sequences and extracting implicit features along tokens, which is precisely needed for accurate hyperspectral image (HSI) classification. Thus making SSM-based model potentially a new architecture for remote sensing HSI classification task. However, SSM encounters challenges in modeling HSI due to the insensitivity of spatial information and redundant spectral characteristics. Given SSM-based methods rarely explored in HSI classification, in this work, we present the first exploration of SSM-based models for HSI classification task. Our proposed method MamTrans effectively leverages the capacity of transformer for capturing spatial tokens relationships and Mamba for extracting implicit features along tokens. Besides, we propose a Bidirectional Mamba Module to enhance SSM’s spatial perception ability of extracting spatial features in HSI. Our proposed MamTrans obtains a new state-of-the-art performance across five commonly employed HSI classification benchmarks, demonstrating the robust generalization of MamTrans and effectiveness of SSM-based structure for HSI classification task. Our codes could be found at <span><span>https://github.com/PPPPPsanG/MamTrans</span><svg><path></path></svg></span>.</p></div>\",\"PeriodicalId\":49523,\"journal\":{\"name\":\"Signal Processing\",\"volume\":\"226 \",\"pages\":\"Article 109669\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2024-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S0165168424002895/pdfft?md5=4a576501dea0e507ceb0f2a46f4619c0&pid=1-s2.0-S0165168424002895-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0165168424002895\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165168424002895","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

近年来,卷积神经网络和视觉变换器已成为高光谱遥感图像分类任务的主要模型,它们分别利用了计算成本较高的堆积卷积层和自注意机制。最近的研究,如 Mamba 模型,展示了具有高效硬件感知设计的状态空间模型(SSM)在高效建模序列和提取标记隐含特征方面的能力,而这正是准确的高光谱图像(HSI)分类所需要的。因此,基于 SSM 的模型有可能成为遥感高光谱图像分类任务的新架构。然而,由于空间信息和冗余光谱特征的不敏感性,SSM 在对 HSI 进行建模时遇到了挑战。鉴于基于 SSM 的方法在 HSI 分类中鲜有探索,在这项工作中,我们首次探索了基于 SSM 的 HSI 分类模型。我们提出的 MamTrans 方法有效地利用了转换器捕捉空间标记关系的能力和 Mamba 提取标记隐含特征的能力。此外,我们还提出了双向 Mamba 模块,以增强 SSM 在提取人机交互信息中空间特征的空间感知能力。我们提出的 MamTrans 在五个常用的人机交互分类基准中取得了新的一流性能,证明了 MamTrans 的强大泛化能力和基于 SSM 结构的人机交互分类任务的有效性。我们的代码见 https://github.com/PPPPPsanG/MamTrans。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
State space models meet transformers for hyperspectral image classification

In recent years, convolutional neural networks and vision transformers have emerged as predominant models for hyperspectral remote sensing image classification task, leveraging staked convolution layers and self-attention mechanisms with high computation costs, respectively. Recent studies, such as the Mamba model, have showcased the ability of state space model (SSM) with efficient hardware-aware designs in efficiently modeling sequences and extracting implicit features along tokens, which is precisely needed for accurate hyperspectral image (HSI) classification. Thus making SSM-based model potentially a new architecture for remote sensing HSI classification task. However, SSM encounters challenges in modeling HSI due to the insensitivity of spatial information and redundant spectral characteristics. Given SSM-based methods rarely explored in HSI classification, in this work, we present the first exploration of SSM-based models for HSI classification task. Our proposed method MamTrans effectively leverages the capacity of transformer for capturing spatial tokens relationships and Mamba for extracting implicit features along tokens. Besides, we propose a Bidirectional Mamba Module to enhance SSM’s spatial perception ability of extracting spatial features in HSI. Our proposed MamTrans obtains a new state-of-the-art performance across five commonly employed HSI classification benchmarks, demonstrating the robust generalization of MamTrans and effectiveness of SSM-based structure for HSI classification task. Our codes could be found at https://github.com/PPPPPsanG/MamTrans.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Signal Processing
Signal Processing 工程技术-工程:电子与电气
CiteScore
9.20
自引率
9.10%
发文量
309
审稿时长
41 days
期刊介绍: Signal Processing incorporates all aspects of the theory and practice of signal processing. It features original research work, tutorial and review articles, and accounts of practical developments. It is intended for a rapid dissemination of knowledge and experience to engineers and scientists working in the research, development or practical application of signal processing. Subject areas covered by the journal include: Signal Theory; Stochastic Processes; Detection and Estimation; Spectral Analysis; Filtering; Signal Processing Systems; Software Developments; Image Processing; Pattern Recognition; Optical Signal Processing; Digital Signal Processing; Multi-dimensional Signal Processing; Communication Signal Processing; Biomedical Signal Processing; Geophysical and Astrophysical Signal Processing; Earth Resources Signal Processing; Acoustic and Vibration Signal Processing; Data Processing; Remote Sensing; Signal Processing Technology; Radar Signal Processing; Sonar Signal Processing; Industrial Applications; New Applications.
期刊最新文献
PIPO-Net: A Penalty-based Independent Parameters Optimization deep unfolding Network An interference power allocation method against multi-objective radars based on optimized proximal policy optimization Codesign of transmit waveform and reflective beamforming for active reconfigurable intelligent surface-aided MIMO ISAC system Adaptive three-dimensional histogram modification for JPEG reversible data hiding Euclidean direction search algorithm with maximum correntropy criterion for active noise control system
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1