Wholly-WOOD: Wholly Leveraging Diversified-Quality Labels for Weakly-Supervised Oriented Object Detection

IF 18.6 IEEE transactions on pattern analysis and machine intelligence Pub Date : 2025-02-17 DOI:10.1109/TPAMI.2025.3542542

Yi Yu;Xue Yang;Yansheng Li;Zhenjun Han;Feipeng Da;Junchi Yan

{"title":"Wholly-WOOD: Wholly Leveraging Diversified-Quality Labels for Weakly-Supervised Oriented Object Detection","authors":"Yi Yu;Xue Yang;Yansheng Li;Zhenjun Han;Feipeng Da;Junchi Yan","doi":"10.1109/TPAMI.2025.3542542","DOIUrl":null,"url":null,"abstract":"Accurately estimating the orientation of visual objects with compact rotated bounding boxes (RBoxes) has become a prominent demand, which challenges existing object detection paradigms that only use horizontal bounding boxes (HBoxes). To equip the detectors with orientation awareness, supervised regression/classification modules have been introduced at the high cost of rotation annotation. Meanwhile, some existing datasets with oriented objects are already annotated with horizontal boxes or even single points. It becomes attractive yet remains open for effectively utilizing weaker single point and horizontal annotations to train an oriented object detector (OOD). We develop Wholly-WOOD, a weakly-supervised OOD framework, capable of wholly leveraging various labeling forms (Points, HBoxes, RBoxes, and their combination) in a unified fashion. By only using HBox for training, our Wholly-WOOD achieves performance very close to that of the RBox-trained counterpart on remote sensing and other areas, significantly reducing the tedious efforts on labor-intensive annotation for oriented objects.","PeriodicalId":94034,"journal":{"name":"IEEE transactions on pattern analysis and machine intelligence","volume":"47 6","pages":"4438-4454"},"PeriodicalIF":18.6000,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on pattern analysis and machine intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10891210/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Accurately estimating the orientation of visual objects with compact rotated bounding boxes (RBoxes) has become a prominent demand, which challenges existing object detection paradigms that only use horizontal bounding boxes (HBoxes). To equip the detectors with orientation awareness, supervised regression/classification modules have been introduced at the high cost of rotation annotation. Meanwhile, some existing datasets with oriented objects are already annotated with horizontal boxes or even single points. It becomes attractive yet remains open for effectively utilizing weaker single point and horizontal annotations to train an oriented object detector (OOD). We develop Wholly-WOOD, a weakly-supervised OOD framework, capable of wholly leveraging various labeling forms (Points, HBoxes, RBoxes, and their combination) in a unified fashion. By only using HBox for training, our Wholly-WOOD achieves performance very close to that of the RBox-trained counterpart on remote sensing and other areas, significantly reducing the tedious efforts on labor-intensive annotation for oriented objects.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

完全利用不同质量的标签进行弱监督导向的目标检测

利用紧凑旋转边界框（rbox）准确估计视觉对象的方向已经成为一个突出的需求，这对现有的仅使用水平边界框（hbox）的目标检测范式提出了挑战。为了使检测器具有方向感知，在旋转标注的高成本下引入了监督回归/分类模块。同时，一些现有的有方向对象的数据集已经用水平方框甚至单点进行了标注。有效利用较弱的单点和水平注释来训练面向对象检测器（OOD）变得有吸引力，但仍有开放的空间。我们开发了一个弱监督的OOD框架whole - wood，能够以统一的方式完全利用各种标签形式（point, HBoxes， RBoxes及其组合）。通过只使用HBox进行训练，我们的whole - wood在遥感和其他领域的性能非常接近rbox训练的同类，大大减少了面向对象的劳动密集型标注的繁琐工作。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on pattern analysis and machine intelligence

自引率

0.00%

发文量