Jie Chen;Ya Guo;Dengda Qin;Jingru Zhu;Zhenbo Gou;Geng Sun
{"title":"Multiscale Feature Knowledge Distillation and Implicit Object Discovery for Few-Shot Object Detection in Remote Sensing Images","authors":"Jie Chen;Ya Guo;Dengda Qin;Jingru Zhu;Zhenbo Gou;Geng Sun","doi":"10.1109/TGRS.2024.3520715","DOIUrl":null,"url":null,"abstract":"Dynamic or sudden changes in various scenes may give rise to new objects. These new objects with limited annotated samples are susceptible to overfitting in deep learning. While few-shot object detection (FSOD) is effective with limited samples, current FSOD methods for remote sensing images still face specific challenges. The “pretraining-transfer” paradigm tends to forget the feature representations of base classes, impacting the learning process for novel classes during few-shot training. Furthermore, the presence of implicit objects in sparsely labeled instances of remote sensing images introduces erroneous supervisory information. To address these challenges, we propose an FSOD method that incorporates multiscale feature knowledge distillation and implicit object discovery, named MFKDIOD, which preserves the performance of base classes and mitigates the impact of implicit objects. Specifically, we first design a multiscale feature knowledge distillation (MFKD) module, which transfers the knowledge of base classes from a teacher network to a student network, enabling the student network to better retain the base class feature representations. Second, we design an implicit object discovery (IOD) module that utilizes both the teacher and student networks to discover implicit objects within the few-shot training data and generate pseudolabels. The code will be available at \n<uri>https://github.com/RS-CSU/MFKDIOD</uri>\n.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-15"},"PeriodicalIF":8.6000,"publicationDate":"2024-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10810500/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Dynamic or sudden changes in various scenes may give rise to new objects. These new objects with limited annotated samples are susceptible to overfitting in deep learning. While few-shot object detection (FSOD) is effective with limited samples, current FSOD methods for remote sensing images still face specific challenges. The “pretraining-transfer” paradigm tends to forget the feature representations of base classes, impacting the learning process for novel classes during few-shot training. Furthermore, the presence of implicit objects in sparsely labeled instances of remote sensing images introduces erroneous supervisory information. To address these challenges, we propose an FSOD method that incorporates multiscale feature knowledge distillation and implicit object discovery, named MFKDIOD, which preserves the performance of base classes and mitigates the impact of implicit objects. Specifically, we first design a multiscale feature knowledge distillation (MFKD) module, which transfers the knowledge of base classes from a teacher network to a student network, enabling the student network to better retain the base class feature representations. Second, we design an implicit object discovery (IOD) module that utilizes both the teacher and student networks to discover implicit objects within the few-shot training data and generate pseudolabels. The code will be available at
https://github.com/RS-CSU/MFKDIOD
.
期刊介绍:
IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.