PSRR-MaxpoolNMS++: Fast Non-Maximum Suppression With Discretization and Pooling

IEEE transactions on pattern analysis and machine intelligence Pub Date : 2024-10-28 DOI:10.1109/TPAMI.2024.3485898

Tianyi Zhang;Chunyun Chen;Yun Liu;Xue Geng;Mohamed M. Sabry Aly;Jie Lin

{"title":"PSRR-MaxpoolNMS++: Fast Non-Maximum Suppression With Discretization and Pooling","authors":"Tianyi Zhang;Chunyun Chen;Yun Liu;Xue Geng;Mohamed M. Sabry Aly;Jie Lin","doi":"10.1109/TPAMI.2024.3485898","DOIUrl":null,"url":null,"abstract":"Non-maximum suppression (NMS) is an essential post-processing step for object detection. The de-facto standard for NMS, namely GreedyNMS, is not parallelizable and could thus be the performance bottleneck in object detection pipelines. MaxpoolNMS is introduced as a fast and parallelizable alternative to GreedyNMS. However, MaxpoolNMS is only capable of replacing the GreedyNMS at the first stage of two-stage detectors like Faster R-CNN. To address this issue, we observe that MaxpoolNMS employs the process of \n<italic>box coordinate discretization\n followed by \n<italic>local score argmax calculation\n, to discard the nested-loop pipeline in GreedyNMS to enable parallelizable implementations. In this paper, we introduce a simple \n<italic>Relationship Recovery\n module and a \n<italic>Pyramid Shifted MaxpoolNMS\n module to improve the above two stages, respectively. With these two modules, our \n<bold>PSRR-MaxpoolNMS\n is a generic and parallelizable approach, which can completely replace GreedyNMS at all stages in all detectors. Furthermore, we extend PSRR-MaxpoolNMS to the more powerful \n<bold>PSRR-MaxpoolNMS++\n. As for \n<italic>box coordinate discretization\n, we propose \n<italic>Density-based Discretization\n for better adherence to the target density of the suppression. As for \n<italic>local score argmax calculation\n, we propose an \n<italic>Adjacent Scale Pooling\n scheme for mining out the duplicated box pairs more accurately and efficiently. Extensive experiments demonstrate that both our PSRR-MaxpoolNMS and PSRR-MaxpoolNMS++ outperform MaxpoolNMS by a large margin. Additionally, PSRR-MaxpoolNMS++ not only surpasses PSRR-MaxpoolNMS but also attains competitive accuracy and much better efficiency when compared with GreedyNMS. Therefore, PSRR-MaxpoolNMS++ is a parallelizable NMS solution that can effectively replace GreedyNMS at all stages in all detectors.","PeriodicalId":94034,"journal":{"name":"IEEE transactions on pattern analysis and machine intelligence","volume":"47 2","pages":"978-993"},"PeriodicalIF":0.0000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on pattern analysis and machine intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10736991/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Non-maximum suppression (NMS) is an essential post-processing step for object detection. The de-facto standard for NMS, namely GreedyNMS, is not parallelizable and could thus be the performance bottleneck in object detection pipelines. MaxpoolNMS is introduced as a fast and parallelizable alternative to GreedyNMS. However, MaxpoolNMS is only capable of replacing the GreedyNMS at the first stage of two-stage detectors like Faster R-CNN. To address this issue, we observe that MaxpoolNMS employs the process of box coordinate discretization followed by local score argmax calculation , to discard the nested-loop pipeline in GreedyNMS to enable parallelizable implementations. In this paper, we introduce a simple Relationship Recovery module and a Pyramid Shifted MaxpoolNMS module to improve the above two stages, respectively. With these two modules, our PSRR-MaxpoolNMS is a generic and parallelizable approach, which can completely replace GreedyNMS at all stages in all detectors. Furthermore, we extend PSRR-MaxpoolNMS to the more powerful PSRR-MaxpoolNMS++ . As for box coordinate discretization , we propose Density-based Discretization for better adherence to the target density of the suppression. As for local score argmax calculation , we propose an Adjacent Scale Pooling scheme for mining out the duplicated box pairs more accurately and efficiently. Extensive experiments demonstrate that both our PSRR-MaxpoolNMS and PSRR-MaxpoolNMS++ outperform MaxpoolNMS by a large margin. Additionally, PSRR-MaxpoolNMS++ not only surpasses PSRR-MaxpoolNMS but also attains competitive accuracy and much better efficiency when compared with GreedyNMS. Therefore, PSRR-MaxpoolNMS++ is a parallelizable NMS solution that can effectively replace GreedyNMS at all stages in all detectors.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

PSRR-MaxpoolNMS++：利用离散化和池化实现快速非最大值抑制

非最大抑制（NMS）是物体检测中必不可少的后处理步骤。NMS 的事实标准，即 GreedyNMS，是不可并行化的，因此可能成为物体检测管道的性能瓶颈。MaxpoolNMS 是作为 GreedyNMS 的快速、可并行化替代品而推出的。然而，MaxpoolNMS 只能在两阶段检测器（如 Faster R-CNN）的第一阶段替代 GreedyNMS。为了解决这个问题，我们发现 MaxpoolNMS 采用了先计算盒坐标离散化，再计算局部得分 argmax 的过程，从而摒弃了 GreedyNMS 中的嵌套循环流水线，实现了可并行化实现。本文引入了简单关系恢复模块和金字塔移动 MaxpoolNMS 模块，分别对上述两个阶段进行改进。有了这两个模块，我们的 PSRR-MaxpoolNMS 是一种通用的可并行化方法，可以在所有探测器的所有阶段完全取代 GreedyNMS。此外，我们还将 PSRR-MaxpoolNMS 扩展为功能更强大的 PSRR-MaxpoolNMS++。在盒坐标离散化方面，我们提出了基于密度的离散化，以更好地遵循抑制的目标密度。在局部得分 argmax 计算方面，我们提出了相邻规模池化方案，以更准确、更高效地挖掘出重复的盒对。大量实验证明，PSRR-MaxpoolNMS 和 PSRR-MaxpoolNMS++ 的性能远远优于 MaxpoolNMS。此外，与 GreedyNMS 相比，PSRR-MaxpoolNMS++ 不仅超越了 PSRR-MaxpoolNMS，而且在准确性和效率方面也更胜一筹。因此，PSRR-MaxpoolNMS++ 是一种可并行化的 NMS 解决方案，能在所有探测器的所有阶段有效取代 GreedyNMS。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on pattern analysis and machine intelligence

自引率

0.00%

发文量

期刊最新文献

FSD V2: Improving Fully Sparse 3D Object Detection With Virtual Voxels Online Learning Under a Separable Stochastic Approximation Framework Event-Enhanced Snapshot Compressive Videography at 10K FPS Fast and Functional Structured Data Generators Rooted in Out-of-Equilibrium Physics Estimating Information Theoretic Measures via Multidimensional Gaussianization