遥感中的少镜头目标检测:减轻标签不一致和导航类别变化

IF 3.4 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Access Pub Date : 2025-01-09 DOI:10.1109/ACCESS.2025.3527881
Tiancheng Si;Shenyu Kong
{"title":"遥感中的少镜头目标检测:减轻标签不一致和导航类别变化","authors":"Tiancheng Si;Shenyu Kong","doi":"10.1109/ACCESS.2025.3527881","DOIUrl":null,"url":null,"abstract":"Over recent years, the increasing expansion of remote sensing image (RSI) datasets has made annotation tasks more challenging and labor-intensive, drawing considerable attention toward few-shot object detection (FSOD). Nevertheless, current mainstream FSOD models are primarily designed for natural images and encounter two substantial challenges when applied to RSIs. 1) Inconsistent label assignment for novel instances between pre-training and fine-tuning confuses detectors, leading to diminished generalization performance. 2) Complex scenes within RSIs result in significant category variations, comprising high inter-class similarity and large intra-class variance, which impairs classification accuracy. Against the aforementioned challenges, we propose a novel FSOD approach in RSIs, termed EC-FSOD. Specifically, our approach introduces two key modules: Ensemble Class-free RPN (ECF-RPN) and Contrastive Prototype ETF Classifier (CPEC). The preceding module, ECF-RPN, generates proposals by integrating multiple dissimilar yet cooperative Class-free RPNs that perceive the shape and location of target objects, mitigating the confusion caused by label inconsistencies. Furthermore, the subsequent CPEC module combines two submodules, namely Contrastive Prototype Learning Network (CPLN) and Simplex ETF Classifier (SEC), to obtain a set of representative class prototypes and robust discriminative feature representations, which are employed to overcome the category variations and enhance the generalization performance of novel instances. Extensive experiments have revealed that our approach achieves top-2 results on the DIOR dataset and optimal performance on the NWPU VHR-10.v2 dataset.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"13 ","pages":"8169-8186"},"PeriodicalIF":3.4000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10835074","citationCount":"0","resultStr":"{\"title\":\"Few-Shot Object Detection in Remote Sensing: Mitigating Label Inconsistencies and Navigating Category Variations\",\"authors\":\"Tiancheng Si;Shenyu Kong\",\"doi\":\"10.1109/ACCESS.2025.3527881\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Over recent years, the increasing expansion of remote sensing image (RSI) datasets has made annotation tasks more challenging and labor-intensive, drawing considerable attention toward few-shot object detection (FSOD). Nevertheless, current mainstream FSOD models are primarily designed for natural images and encounter two substantial challenges when applied to RSIs. 1) Inconsistent label assignment for novel instances between pre-training and fine-tuning confuses detectors, leading to diminished generalization performance. 2) Complex scenes within RSIs result in significant category variations, comprising high inter-class similarity and large intra-class variance, which impairs classification accuracy. Against the aforementioned challenges, we propose a novel FSOD approach in RSIs, termed EC-FSOD. Specifically, our approach introduces two key modules: Ensemble Class-free RPN (ECF-RPN) and Contrastive Prototype ETF Classifier (CPEC). The preceding module, ECF-RPN, generates proposals by integrating multiple dissimilar yet cooperative Class-free RPNs that perceive the shape and location of target objects, mitigating the confusion caused by label inconsistencies. Furthermore, the subsequent CPEC module combines two submodules, namely Contrastive Prototype Learning Network (CPLN) and Simplex ETF Classifier (SEC), to obtain a set of representative class prototypes and robust discriminative feature representations, which are employed to overcome the category variations and enhance the generalization performance of novel instances. Extensive experiments have revealed that our approach achieves top-2 results on the DIOR dataset and optimal performance on the NWPU VHR-10.v2 dataset.\",\"PeriodicalId\":13079,\"journal\":{\"name\":\"IEEE Access\",\"volume\":\"13 \",\"pages\":\"8169-8186\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-01-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10835074\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Access\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10835074/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Access","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10835074/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

近年来,遥感图像(RSI)数据集的不断扩展使得标注任务更具挑战性和劳动强度,引起了人们对少射目标检测(FSOD)的广泛关注。然而,目前主流的FSOD模型主要是为自然图像设计的,在应用于rsi时遇到了两个重大挑战。1)在预训练和微调之间对新实例不一致的标签分配会混淆检测器,导致泛化性能下降。2) rsi内复杂场景导致类别差异显著,类间相似度高,类内方差大,影响分类精度。针对上述挑战,我们提出了一种新的rsi FSOD方法,称为EC-FSOD。具体来说,我们的方法引入了两个关键模块:集成无类RPN (ECF-RPN)和对比原型ETF分类器(CPEC)。前面的模块ECF-RPN通过整合多个不同但具有协作性的无类rpn来生成提案,这些rpn可以感知目标物体的形状和位置,从而减轻标签不一致造成的混淆。随后的CPEC模块结合对比原型学习网络(CPLN)和单纯形ETF分类器(SEC)两个子模块,获得一组具有代表性的类原型和鲁棒判别特征表示,用于克服类别变化,提高新实例的泛化性能。大量的实验表明,我们的方法在DIOR数据集上取得了前2名的结果,在NWPU VHR-10上取得了最佳性能。v2数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Few-Shot Object Detection in Remote Sensing: Mitigating Label Inconsistencies and Navigating Category Variations
Over recent years, the increasing expansion of remote sensing image (RSI) datasets has made annotation tasks more challenging and labor-intensive, drawing considerable attention toward few-shot object detection (FSOD). Nevertheless, current mainstream FSOD models are primarily designed for natural images and encounter two substantial challenges when applied to RSIs. 1) Inconsistent label assignment for novel instances between pre-training and fine-tuning confuses detectors, leading to diminished generalization performance. 2) Complex scenes within RSIs result in significant category variations, comprising high inter-class similarity and large intra-class variance, which impairs classification accuracy. Against the aforementioned challenges, we propose a novel FSOD approach in RSIs, termed EC-FSOD. Specifically, our approach introduces two key modules: Ensemble Class-free RPN (ECF-RPN) and Contrastive Prototype ETF Classifier (CPEC). The preceding module, ECF-RPN, generates proposals by integrating multiple dissimilar yet cooperative Class-free RPNs that perceive the shape and location of target objects, mitigating the confusion caused by label inconsistencies. Furthermore, the subsequent CPEC module combines two submodules, namely Contrastive Prototype Learning Network (CPLN) and Simplex ETF Classifier (SEC), to obtain a set of representative class prototypes and robust discriminative feature representations, which are employed to overcome the category variations and enhance the generalization performance of novel instances. Extensive experiments have revealed that our approach achieves top-2 results on the DIOR dataset and optimal performance on the NWPU VHR-10.v2 dataset.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Access
IEEE Access COMPUTER SCIENCE, INFORMATION SYSTEMSENGIN-ENGINEERING, ELECTRICAL & ELECTRONIC
CiteScore
9.80
自引率
7.70%
发文量
6673
审稿时长
6 weeks
期刊介绍: IEEE Access® is a multidisciplinary, open access (OA), applications-oriented, all-electronic archival journal that continuously presents the results of original research or development across all of IEEE''s fields of interest. IEEE Access will publish articles that are of high interest to readers, original, technically correct, and clearly presented. Supported by author publication charges (APC), its hallmarks are a rapid peer review and publication process with open access to all readers. Unlike IEEE''s traditional Transactions or Journals, reviews are "binary", in that reviewers will either Accept or Reject an article in the form it is submitted in order to achieve rapid turnaround. Especially encouraged are submissions on: Multidisciplinary topics, or applications-oriented articles and negative results that do not fit within the scope of IEEE''s traditional journals. Practical articles discussing new experiments or measurement techniques, interesting solutions to engineering. Development of new or improved fabrication or manufacturing techniques. Reviews or survey articles of new or evolving fields oriented to assist others in understanding the new area.
期刊最新文献
Editorial Board IEEE Access™ Editorial Board Corrections to “The Recent Technologies to Curb the Second-Wave of COVID-19 Pandemic” Corrections to “Decentralized Asynchronous Formation Planning of Multirotor Aerial Vehicles in Dynamic Environments Using Flexible Formation Graphs and Tight Trajectory Hulls” Study on the Motion Patterns of Nested Test Cabin and Its Shock Response Spectrum Analysis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1