Insight Any Instance: Promptable Instance Segmentation for Remote Sensing Images

IF 8.6 1区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2025-02-19 DOI:10.1109/TGRS.2025.3543636
Xuexue Li;Wenhui Diao;Xinming Li;Xian Sun
{"title":"Insight Any Instance: Promptable Instance Segmentation for Remote Sensing Images","authors":"Xuexue Li;Wenhui Diao;Xinming Li;Xian Sun","doi":"10.1109/TGRS.2025.3543636","DOIUrl":null,"url":null,"abstract":"Instance segmentation of remote sensing images (RSIs) is an essential task for a wide range of applications such as land planning and intelligent transport. Instance segmentation of RSIs is constantly plagued by the unbalanced ratio of foreground and background and limited instance size. And most of the instance segmentation models are based on deep feature learning and contain operations such as multiple downsampling, which is harmful to instance segmentation of RSIs, and thus the performance is still limited. Inspired by the recent superior performance of prompt learning in visual tasks, we propose a new prompt paradigm to address the above issues. Based on the existing instance segmentation model, first, a local prompt module is designed to mine local prompt information from original local tokens for specific instances; second, a global-to-local prompt module is designed to model the contextual information from the global tokens to the local tokens where the instances are located for specific instances. Finally, a proposal’s area loss function (PAreaLoss) is designed to add a decoupling dimension for proposals on the scale to better exploit the potential of the above two prompt modules. It is worth mentioning that our proposed approach can extend the instance segmentation model to a promptable instance segmentation model, i.e., to segment the instances with the specific boxes’ prompt. The time consumption for each promptable instance segmentation process is only 40 ms. This article evaluates the effectiveness of our proposed approach based on several existing models in four instance segmentation datasets of RSIs, and thorough experiments prove that our proposed approach is effective for addressing the above issues and is a competitive model for instance segmentation of RSIs.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-15"},"PeriodicalIF":8.6000,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10892282/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Instance segmentation of remote sensing images (RSIs) is an essential task for a wide range of applications such as land planning and intelligent transport. Instance segmentation of RSIs is constantly plagued by the unbalanced ratio of foreground and background and limited instance size. And most of the instance segmentation models are based on deep feature learning and contain operations such as multiple downsampling, which is harmful to instance segmentation of RSIs, and thus the performance is still limited. Inspired by the recent superior performance of prompt learning in visual tasks, we propose a new prompt paradigm to address the above issues. Based on the existing instance segmentation model, first, a local prompt module is designed to mine local prompt information from original local tokens for specific instances; second, a global-to-local prompt module is designed to model the contextual information from the global tokens to the local tokens where the instances are located for specific instances. Finally, a proposal’s area loss function (PAreaLoss) is designed to add a decoupling dimension for proposals on the scale to better exploit the potential of the above two prompt modules. It is worth mentioning that our proposed approach can extend the instance segmentation model to a promptable instance segmentation model, i.e., to segment the instances with the specific boxes’ prompt. The time consumption for each promptable instance segmentation process is only 40 ms. This article evaluates the effectiveness of our proposed approach based on several existing models in four instance segmentation datasets of RSIs, and thorough experiments prove that our proposed approach is effective for addressing the above issues and is a competitive model for instance segmentation of RSIs.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
洞察任何实例:遥感图像的提示实例分割
遥感图像的实例分割是土地规划和智能交通等广泛应用的一项重要任务。rsi的实例分割一直受到前景和背景比例不平衡以及实例大小有限的困扰。而且大多数实例分割模型都是基于深度特征学习,包含多次降采样等操作,不利于rsi的实例分割,因此性能仍然有限。受提示学习在视觉任务中的优异表现的启发,我们提出了一种新的提示范式来解决上述问题。在现有实例分割模型的基础上,首先设计本地提示模块,从原始本地令牌中挖掘特定实例的本地提示信息;其次,设计一个全局到本地提示模块,对从全局令牌到特定实例所在的本地令牌的上下文信息进行建模。最后,设计了提案面积损失函数(PAreaLoss),在尺度上为提案增加解耦维度,以更好地发挥上述两个提示模块的潜力。值得一提的是,我们提出的方法可以将实例分割模型扩展为提示的实例分割模型,即使用特定框的提示符对实例进行分割。每个提示实例分割过程的时间消耗仅为40毫秒。本文在四个rsi实例分割数据集上,基于几种现有模型对本文方法的有效性进行了评估,实验证明本文方法有效解决了上述问题,是一种具有竞争力的rsi实例分割模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Geoscience and Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理
CiteScore
11.50
自引率
28.00%
发文量
1912
审稿时长
4.0 months
期刊介绍: IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.
期刊最新文献
Collaborative Carrier Magnetic Interference Compensation and Dynamic Estimation Method EQAnomalyNet: Unsupervised Seismic Phase Picking via Transformer Autoencoder LADAR: Language-Adaptive Dynamic Aqua Refinement for Underwater Image Enhancement RSRT-DETR: Hierarchical Polar Attention and Multi-Scale Hypergraph Networks for Dense Remote-Sensing Small-Object Detection A Position- and Scale-Aware Object-Agent Classifier via Deep Reinforcement Learning for Remote Sensing Imagery
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1