Object discriminability re-extraction for distractor-aware visual object tracking

IF 4.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Computer Vision and Image Understanding Pub Date : 2024-07-24 DOI:10.1016/j.cviu.2024.104075

{"title":"Object discriminability re-extraction for distractor-aware visual object tracking","authors":"","doi":"10.1016/j.cviu.2024.104075","DOIUrl":null,"url":null,"abstract":"<div><p>The similar distractor problem is one of the most difficult challenges for Siamese-based trackers. Since they formulate the visual tracking task as a similar matching problem, these trackers involve an essential problem that they are sensitive to the intra-class and inter-class instances with similar appearance confusion. To solve the problem, we propose an object discriminability re-extraction network (ODR-Net) for distractor-aware visual object tracking. The network first mines similar distractors from existing tracking information with a distractor capture module, and then re-extracts discriminative features to redetect the target from distractors with a discriminative feature re-extraction module. It solves the distractor problem in the decoding phase of a tracker and can be considered as a general block that applied to existing Siamese trackers to tackle the similar distractor problem. To demonstrate the effectiveness of the proposed method, extensive experiments and comparisons with state-of-the-art trackers are conducted on a variety of large-scale benchmark datasets, including GOT-10k, LaSOT, OTB-2015, TrackingNet, VOT2020, VOT2021, and VOT2022. Without bells and whistles, our ODR-Net achieves leading performance with a real-time speed.</p></div>","PeriodicalId":50633,"journal":{"name":"Computer Vision and Image Understanding","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Vision and Image Understanding","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1077314224001565","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

The similar distractor problem is one of the most difficult challenges for Siamese-based trackers. Since they formulate the visual tracking task as a similar matching problem, these trackers involve an essential problem that they are sensitive to the intra-class and inter-class instances with similar appearance confusion. To solve the problem, we propose an object discriminability re-extraction network (ODR-Net) for distractor-aware visual object tracking. The network first mines similar distractors from existing tracking information with a distractor capture module, and then re-extracts discriminative features to redetect the target from distractors with a discriminative feature re-extraction module. It solves the distractor problem in the decoding phase of a tracker and can be considered as a general block that applied to existing Siamese trackers to tackle the similar distractor problem. To demonstrate the effectiveness of the proposed method, extensive experiments and comparisons with state-of-the-art trackers are conducted on a variety of large-scale benchmark datasets, including GOT-10k, LaSOT, OTB-2015, TrackingNet, VOT2020, VOT2021, and VOT2022. Without bells and whistles, our ODR-Net achieves leading performance with a real-time speed.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于分心者感知视觉对象跟踪的对象可辨别性再提取

相似干扰物问题是基于连体图像的跟踪器所面临的最大挑战之一。由于它们将视觉跟踪任务表述为相似匹配问题，因此这些跟踪器涉及到一个基本问题，即它们对具有相似外观混淆的类内和类间实例非常敏感。为了解决这个问题，我们提出了一种对象可辨别性再提取网络（ODR-Net），用于分心点感知视觉对象跟踪。该网络首先利用分心点捕捉模块从现有的跟踪信息中挖掘类似的分心点，然后利用分辨特征再提取模块从分心点中重新提取分辨特征，以重新检测目标。它解决了跟踪器解码阶段的分心问题，可被视为一个通用模块，应用于现有的连体跟踪器，以解决相似分心问题。为了证明所提方法的有效性，我们在 GOT-10k、LaSOT、OTB-2015、TrackingNet、VOT2020、VOT2021 和 VOT2022 等各种大规模基准数据集上进行了大量实验，并与最先进的跟踪器进行了比较。我们的 ODR-Net 不需要繁琐的程序，就能以实时的速度实现领先的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Computer Vision and Image Understanding 工程技术-工程：电子与电气

CiteScore

7.80

自引率

4.40%

发文量

112

审稿时长

79 days

期刊介绍： The central focus of this journal is the computer analysis of pictorial information. Computer Vision and Image Understanding publishes papers covering all aspects of image analysis from the low-level, iconic processes of early vision to the high-level, symbolic processes of recognition and interpretation. A wide range of topics in the image understanding area is covered, including papers offering insights that differ from predominant views. Research Areas Include: • Theory • Early vision • Data structures and representations • Shape • Range • Motion • Matching and recognition • Architecture and languages • Vision systems