Embedding Generalized Semantic Knowledge Into Few-Shot Remote Sensing Segmentation

IF 8.6 1区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2024-12-18 DOI:10.1109/TGRS.2024.3519772
Qi Wang;Yuyu Jia;Wei Huang;Junyu Gao;Qiang Li
{"title":"Embedding Generalized Semantic Knowledge Into Few-Shot Remote Sensing Segmentation","authors":"Qi Wang;Yuyu Jia;Wei Huang;Junyu Gao;Qiang Li","doi":"10.1109/TGRS.2024.3519772","DOIUrl":null,"url":null,"abstract":"Few-shot segmentation (FSS) for remote sensing (RS) imagery leverages supporting information from limited annotated samples to achieve query segmentation of novel classes. Previous efforts are dedicated to mining segmentation-guiding visual cues from a constrained set of support samples. However, they still struggle to address the pronounced intra-class differences in RS images, as sparse visual cues make it challenging to establish robust class-specific representations. In this article, we propose a holistic semantic embedding (HSE) approach that effectively harnesses general semantic knowledge, i.e., class description (CD) embeddings. Instead of the naive combination of CD embeddings and visual features for segmentation decoding, we investigate embedding the general semantic knowledge during the feature extraction stage. Specifically, in HSE, a spatial dense interaction (SDI) module allows the interaction of visual support features with CD embeddings along the spatial dimension via self-attention. Furthermore, a global content modulation (GCM) module efficiently augments the global information of the target category in both support and query features, thanks to the transformative fusion of visual features and CD embeddings. These two components holistically synergize CD embeddings and visual cues, constructing a robust class-specific representation. Through extensive experiments on the standard FSS benchmark, the proposed HSE approach demonstrates superior performance compared to peer work, setting a new state-of-the-art.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-13"},"PeriodicalIF":8.6000,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10806775/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Few-shot segmentation (FSS) for remote sensing (RS) imagery leverages supporting information from limited annotated samples to achieve query segmentation of novel classes. Previous efforts are dedicated to mining segmentation-guiding visual cues from a constrained set of support samples. However, they still struggle to address the pronounced intra-class differences in RS images, as sparse visual cues make it challenging to establish robust class-specific representations. In this article, we propose a holistic semantic embedding (HSE) approach that effectively harnesses general semantic knowledge, i.e., class description (CD) embeddings. Instead of the naive combination of CD embeddings and visual features for segmentation decoding, we investigate embedding the general semantic knowledge during the feature extraction stage. Specifically, in HSE, a spatial dense interaction (SDI) module allows the interaction of visual support features with CD embeddings along the spatial dimension via self-attention. Furthermore, a global content modulation (GCM) module efficiently augments the global information of the target category in both support and query features, thanks to the transformative fusion of visual features and CD embeddings. These two components holistically synergize CD embeddings and visual cues, constructing a robust class-specific representation. Through extensive experiments on the standard FSS benchmark, the proposed HSE approach demonstrates superior performance compared to peer work, setting a new state-of-the-art.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
将广义语义知识嵌入少镜头遥感分割中
遥感图像的少镜头分割(FSS)利用有限标注样本的支持信息来实现新类别的查询分割。以前的工作致力于从一组受限的支持样本中挖掘分割引导视觉线索。然而,他们仍然在努力解决RS图像中明显的类内差异,因为稀疏的视觉线索使得建立鲁棒的类特定表示具有挑战性。在本文中,我们提出了一种整体语义嵌入(HSE)方法,该方法有效地利用了一般语义知识,即类描述嵌入(CD)。我们研究了在特征提取阶段嵌入通用语义知识的方法,而不是单纯地将CD嵌入与视觉特征相结合进行分割解码。具体来说,在HSE中,空间密集交互(SDI)模块允许视觉支持功能通过自关注与CD嵌入沿着空间维度进行交互。此外,全局内容调制(GCM)模块通过视觉特征和CD嵌入的转换融合,在支持特征和查询特征方面有效地增强了目标类别的全局信息。这两个组件整体地协同CD嵌入和视觉提示,构建了一个健壮的特定于类的表示。通过在标准FSS基准上的大量实验,与同行相比,所提出的HSE方法表现出了卓越的性能,开创了新的技术水平。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Geoscience and Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理
CiteScore
11.50
自引率
28.00%
发文量
1912
审稿时长
4.0 months
期刊介绍: IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.
期刊最新文献
Thin Cloud Removal Method Based on Low-Frequency Residual Diffusion and High-Frequency Modulation Refinement by Laplacian Pyramid Decoupling A Robust Coal-Rock Interface Identification Model for GPR Data with Complex Interference and Low Signal-to-Noise Ratio RS-LLIC: A Lightweight Learned Image Compression Model with Knowledge Distillation for Onboard Remote Sensing PSTH-CDPM: A Physics-Guided Spatio-Temporal Hypergraph Conditional Denoising Diffusion Probabilistic Model for Invasive Alien Species Distribution Prediction PhyGroup-UNet: Dynamic Physically-Correlated Channel Grouping in a Lightweight UNet for Efficient Precipitation Nowcasting
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1