SAG-Net: Spectrum Adaptive Gate Network for Learning Feature Representation From Multispectral Imagery

Yong Li;Bohan Li;Zhongqun Chen;Yixuan Li;Guohan Zhang
{"title":"SAG-Net: Spectrum Adaptive Gate Network for Learning Feature Representation From Multispectral Imagery","authors":"Yong Li;Bohan Li;Zhongqun Chen;Yixuan Li;Guohan Zhang","doi":"10.1109/LGRS.2025.3535635","DOIUrl":null,"url":null,"abstract":"Feature representation plays a key role in matching keypoints, especially for the multispectral images of large spectral difference. On such image pairs, existing methods typically use the two images only, but it is challenging to directly learn spectrum-invariant feature representation due to the complex nonlinear distortion between them. To address this issue, this letter proposes using intermediate-band images to facilitate learning spectrum-invariant feature representation. For this purpose, this work designs a spectrum adaptive gate network (SAG-Net) that consists of a SPectral gate (SPeG) module and a deep feature extractor. The SPeG module selectively activates the spectrum-invariant features according to input image content on-the-fly. It hence allows for training on the images of over two bands simultaneously with a single network without the need of an individual branch per band. To investigate the SPeG module, we also constructed a Landsat 9 Multi-Spectral Images (L9-MSI) dataset including 3167 scenes of aligned images across five spectral bands (visible, B5, B6, B7, and B10) from the Landsat 9 imagery. The experimental results demonstrate the SPeG module can learn common feature representation for varying-band images, and the intermediate B5, B6, and B7 images are useful for the SAG-Net to learn the common feature between visible and B10. On the L9-MSI dataset, the SAG-Net significantly improved the number of correct matches and the matching score (MS). Our dataset will be released at <uri>https://github.com/bohanlee/L9MSI-Dataset.git</uri>.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10856180/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Feature representation plays a key role in matching keypoints, especially for the multispectral images of large spectral difference. On such image pairs, existing methods typically use the two images only, but it is challenging to directly learn spectrum-invariant feature representation due to the complex nonlinear distortion between them. To address this issue, this letter proposes using intermediate-band images to facilitate learning spectrum-invariant feature representation. For this purpose, this work designs a spectrum adaptive gate network (SAG-Net) that consists of a SPectral gate (SPeG) module and a deep feature extractor. The SPeG module selectively activates the spectrum-invariant features according to input image content on-the-fly. It hence allows for training on the images of over two bands simultaneously with a single network without the need of an individual branch per band. To investigate the SPeG module, we also constructed a Landsat 9 Multi-Spectral Images (L9-MSI) dataset including 3167 scenes of aligned images across five spectral bands (visible, B5, B6, B7, and B10) from the Landsat 9 imagery. The experimental results demonstrate the SPeG module can learn common feature representation for varying-band images, and the intermediate B5, B6, and B7 images are useful for the SAG-Net to learn the common feature between visible and B10. On the L9-MSI dataset, the SAG-Net significantly improved the number of correct matches and the matching score (MS). Our dataset will be released at https://github.com/bohanlee/L9MSI-Dataset.git.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
多光谱图像特征表示学习的光谱自适应门网络
特征表示在关键点匹配中起着关键作用,特别是对于光谱差异较大的多光谱图像。对于这样的图像对,现有的方法通常只使用两幅图像,但由于它们之间复杂的非线性失真,直接学习光谱不变特征表示是一项挑战。为了解决这个问题,这封信建议使用中间波段图像来促进学习光谱不变特征表示。为此,本文设计了一个频谱自适应门网络(SAG-Net),该网络由一个频谱门模块和一个深度特征提取器组成。SPeG模块根据输入的图像内容动态地选择性地激活光谱不变性特征。因此,它允许对两个波段以上的图像同时进行训练,而不需要每个波段的单独分支。为了研究SPeG模块,我们还构建了一个Landsat 9多光谱图像(L9-MSI)数据集,其中包括来自Landsat 9图像的5个光谱带(可见光、B5、B6、B7和B10)的3167个场景。实验结果表明,SPeG模块可以学习到不同波段图像的共同特征表示,并且中间的B5、B6和B7图像有助于SAG-Net学习到可见和B10之间的共同特征。在L9-MSI数据集上,SAG-Net显著提高了正确匹配次数和匹配分数(MS)。我们的数据集将在https://github.com/bohanlee/L9MSI-Dataset.git上发布。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Multiclass Training Dataset and Hybrid Neural Network for Simultaneous Karst and Channel Detection An Improved Ground-Based GNSS-R Soil Moisture Retrieval Algorithm Incorporating Precipitation Effects MCD-YOLO: An Improved YOLOv11 Framework for Manhole Cover Detection in UAV Imagery Robust Recognition of Anomalous Distribution From Electrical Resistivity Tomography Dip-Guided Poststack Inversion via Structure-Tensor Regularization
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1