Dual Teacher: Improving the Reliability of Pseudo Labels for Semi-Supervised Oriented Object Detection

IF 8.6 1区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2024-12-17 DOI:10.1109/TGRS.2024.3519173
Zhenyu Fang;Jinchang Ren;Jiangbin Zheng;Rongjun Chen;Huimin Zhao
{"title":"Dual Teacher: Improving the Reliability of Pseudo Labels for Semi-Supervised Oriented Object Detection","authors":"Zhenyu Fang;Jinchang Ren;Jiangbin Zheng;Rongjun Chen;Huimin Zhao","doi":"10.1109/TGRS.2024.3519173","DOIUrl":null,"url":null,"abstract":"Oriented object detection in remote sensing is a critical task for accurately location and measurement of the interested targets. Despite of its success in object detection, deep learning-based detectors rely heavily on extensive data annotation. However, variations in object appearance significantly increase the difficulty and the cost of creating large-scale annotated datasets. Semi-supervised learning (SSL) aims to utilize unlabeled data to enhance object detectors. Among these, pseudo-label-based methods have shown promising results recently. Nonetheless, as training progresses, the accumulation of errors in pseudo labels leads to prediction bias without corrections. To tackle this particular challenge, we present a SSL pipeline, named “dual teacher,” for improving the reliability of pseudo labels in the semi-supervised oriented object detection. First, to mitigate the bias caused by limited annotated data, a global burn-in (GBI) strategy is introduced at the beginning of training, which guides the student detector to learn the feature extraction on a global scale. In addition, an online bounding box (bbox) correction module is proposed to decrease the occurrence of mislabeled instances and enhance the reliability of detection. These improvements are facilitated by an additional detector, instead of a single teacher model in the teacher-student architecture. Dual teacher reduces the dependency on the quality of pseudo labels related to the model complexity and combines the strengths of both the two-stage and one-stage detectors. With only 20% labeled data, dual teacher outperforms fully supervised rotated fully convolutional one-stage object detection (R-FCOS), you only look once X-small (YOLOX-s), and rotated region-based convolutional neural network (R-RCNN) by up to 2% on both a large-scale dataset for object detection in aerial images (DOTA) and SODA-A datasets. This reveals its potential in reducing labor-intensive tasks and enhancing robustness against environmental interference and noisy labels. The code is available at: \n<uri>https://github.com/ZYFFF-CV/DualTeacher-semisup.git</uri>\n.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-15"},"PeriodicalIF":8.6000,"publicationDate":"2024-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10804848/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Oriented object detection in remote sensing is a critical task for accurately location and measurement of the interested targets. Despite of its success in object detection, deep learning-based detectors rely heavily on extensive data annotation. However, variations in object appearance significantly increase the difficulty and the cost of creating large-scale annotated datasets. Semi-supervised learning (SSL) aims to utilize unlabeled data to enhance object detectors. Among these, pseudo-label-based methods have shown promising results recently. Nonetheless, as training progresses, the accumulation of errors in pseudo labels leads to prediction bias without corrections. To tackle this particular challenge, we present a SSL pipeline, named “dual teacher,” for improving the reliability of pseudo labels in the semi-supervised oriented object detection. First, to mitigate the bias caused by limited annotated data, a global burn-in (GBI) strategy is introduced at the beginning of training, which guides the student detector to learn the feature extraction on a global scale. In addition, an online bounding box (bbox) correction module is proposed to decrease the occurrence of mislabeled instances and enhance the reliability of detection. These improvements are facilitated by an additional detector, instead of a single teacher model in the teacher-student architecture. Dual teacher reduces the dependency on the quality of pseudo labels related to the model complexity and combines the strengths of both the two-stage and one-stage detectors. With only 20% labeled data, dual teacher outperforms fully supervised rotated fully convolutional one-stage object detection (R-FCOS), you only look once X-small (YOLOX-s), and rotated region-based convolutional neural network (R-RCNN) by up to 2% on both a large-scale dataset for object detection in aerial images (DOTA) and SODA-A datasets. This reveals its potential in reducing labor-intensive tasks and enhancing robustness against environmental interference and noisy labels. The code is available at: https://github.com/ZYFFF-CV/DualTeacher-semisup.git .
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
双教师:提高面向半监督对象检测的伪标签可靠性
遥感定向目标检测是对感兴趣目标进行精确定位和测量的一项关键任务。尽管在目标检测方面取得了成功,但基于深度学习的检测器严重依赖于大量的数据注释。然而,对象外观的变化显著增加了创建大规模带注释数据集的难度和成本。半监督学习(SSL)旨在利用未标记的数据来增强目标检测器。其中,基于伪标签的方法近年来取得了可喜的成果。然而,随着训练的进行,伪标签中错误的积累会导致预测偏差而不进行修正。为了解决这个特殊的挑战,我们提出了一个名为“双教师”的SSL管道,用于提高面向半监督对象检测中的伪标签的可靠性。首先,为了减轻标注数据有限造成的偏差,在训练开始时引入全局老化(global burn-in, GBI)策略,引导学生检测器在全局尺度上学习特征提取;此外,提出了在线边界框(bbox)校正模块,以减少误标实例的发生,提高检测的可靠性。这些改进是由一个额外的检测器促进的,而不是教师-学生体系结构中的单个教师模型。双教师减少了对与模型复杂性相关的伪标签质量的依赖,并结合了两阶段和一级检测器的优势。在只有20%标记数据的情况下,双教师在航空图像(DOTA)和SODA-A数据集的大规模目标检测数据集上,比完全监督旋转全卷积单阶段目标检测(R-FCOS)、只看一次X-small (YOLOX-s)和旋转基于区域的卷积神经网络(R-RCNN)的性能都高出2%。这揭示了它在减少劳动密集型任务和增强对环境干扰和噪声标签的鲁棒性方面的潜力。代码可从https://github.com/ZYFFF-CV/DualTeacher-semisup.git获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Geoscience and Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理
CiteScore
11.50
自引率
28.00%
发文量
1912
审稿时长
4.0 months
期刊介绍: IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.
期刊最新文献
HAM-CD: Hybrid Attention Mamba for Remote Sensing Change Detection Land Surface Temperature End-to-end Retrieval from Fengyun-4B AGRI Thermal Infrared Remote Sensing Data Considering the Emissivity Angular Effect PointCleaner: Dynamic Manifold Path Optimization for LiDAR Point Cloud Denoising See Hidden Insight From Transposition: Multi-Axis Feature Aggregation for Aerial Object Detection Harmonic-Assisted TDOA Localization Method for Saturation Interference Sources in SAR Satellite Systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1