A Tri-Layer Plugin to Improve Occluded Detection

BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference Pub Date : 2022-10-18 DOI:10.48550/arXiv.2210.10046

Guanqi Zhan, Weidi Xie, Andrew Zisserman

{"title":"A Tri-Layer Plugin to Improve Occluded Detection","authors":"Guanqi Zhan, Weidi Xie, Andrew Zisserman","doi":"10.48550/arXiv.2210.10046","DOIUrl":null,"url":null,"abstract":"Detecting occluded objects still remains a challenge for state-of-the-art object detectors. The objective of this work is to improve the detection for such objects, and thereby improve the overall performance of a modern object detector. To this end we make the following four contributions: (1) We propose a simple 'plugin' module for the detection head of two-stage object detectors to improve the recall of partially occluded objects. The module predicts a tri-layer of segmentation masks for the target object, the occluder and the occludee, and by doing so is able to better predict the mask of the target object. (2) We propose a scalable pipeline for generating training data for the module by using amodal completion of existing object detection and instance segmentation training datasets to establish occlusion relationships. (3) We also establish a COCO evaluation dataset to measure the recall performance of partially occluded and separated objects. (4) We show that the plugin module inserted into a two-stage detector can boost the performance significantly, by only fine-tuning the detection head, and with additional improvements if the entire architecture is fine-tuned. COCO results are reported for Mask R-CNN with Swin-T or Swin-S backbones, and Cascade Mask R-CNN with a Swin-B backbone.","PeriodicalId":72437,"journal":{"name":"BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference","volume":"101 1","pages":"250"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2210.10046","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

Detecting occluded objects still remains a challenge for state-of-the-art object detectors. The objective of this work is to improve the detection for such objects, and thereby improve the overall performance of a modern object detector. To this end we make the following four contributions: (1) We propose a simple 'plugin' module for the detection head of two-stage object detectors to improve the recall of partially occluded objects. The module predicts a tri-layer of segmentation masks for the target object, the occluder and the occludee, and by doing so is able to better predict the mask of the target object. (2) We propose a scalable pipeline for generating training data for the module by using amodal completion of existing object detection and instance segmentation training datasets to establish occlusion relationships. (3) We also establish a COCO evaluation dataset to measure the recall performance of partially occluded and separated objects. (4) We show that the plugin module inserted into a two-stage detector can boost the performance significantly, by only fine-tuning the detection head, and with additional improvements if the entire architecture is fine-tuned. COCO results are reported for Mask R-CNN with Swin-T or Swin-S backbones, and Cascade Mask R-CNN with a Swin-B backbone.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

改进遮挡检测的三层插件

对于最先进的目标探测器来说，检测被遮挡的物体仍然是一个挑战。这项工作的目的是提高对这些目标的检测，从而提高现代目标检测器的整体性能。为此，我们做出了以下四点贡献:(1)我们提出了一个简单的“插件”模块，用于两级目标检测器的检测头，以提高部分遮挡物体的召回率。该模块预测目标对象、遮挡者和被遮挡者的三层分割掩码，从而能够更好地预测目标对象的掩码。(2)我们提出了一个可扩展的管道，通过对现有的目标检测和实例分割训练数据集进行模态补全来建立遮挡关系，为模块生成训练数据。(3)我们还建立了一个COCO评价数据集来衡量部分遮挡和分离对象的召回性能。(4)我们表明，插入到两级检测器中的插件模块可以通过仅微调检测头来显着提高性能，并且如果对整个架构进行微调，则会有额外的改进。使用swan - t或swan - s骨干网的Mask R-CNN和使用swan - b骨干网的Cascade Mask R-CNN报告了COCO结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference

自引率

0.00%

发文量

期刊最新文献

Learning Anatomically Consistent Embedding for Chest Radiography. Single Pixel Spectral Color Constancy DiffSketching: Sketch Control Image Synthesis with Diffusion Models Defect Transfer GAN: Diverse Defect Synthesis for Data Augmentation Mitigating Bias in Visual Transformers via Targeted Alignment