Object discriminability re-extraction for distractor-aware visual object tracking

IF 4.3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Computer Vision and Image Understanding Pub Date : 2024-07-24 DOI:10.1016/j.cviu.2024.104075
{"title":"Object discriminability re-extraction for distractor-aware visual object tracking","authors":"","doi":"10.1016/j.cviu.2024.104075","DOIUrl":null,"url":null,"abstract":"<div><p>The similar distractor problem is one of the most difficult challenges for Siamese-based trackers. Since they formulate the visual tracking task as a similar matching problem, these trackers involve an essential problem that they are sensitive to the intra-class and inter-class instances with similar appearance confusion. To solve the problem, we propose an object discriminability re-extraction network (ODR-Net) for distractor-aware visual object tracking. The network first mines similar distractors from existing tracking information with a distractor capture module, and then re-extracts discriminative features to redetect the target from distractors with a discriminative feature re-extraction module. It solves the distractor problem in the decoding phase of a tracker and can be considered as a general block that applied to existing Siamese trackers to tackle the similar distractor problem. To demonstrate the effectiveness of the proposed method, extensive experiments and comparisons with state-of-the-art trackers are conducted on a variety of large-scale benchmark datasets, including GOT-10k, LaSOT, OTB-2015, TrackingNet, VOT2020, VOT2021, and VOT2022. Without bells and whistles, our ODR-Net achieves leading performance with a real-time speed.</p></div>","PeriodicalId":50633,"journal":{"name":"Computer Vision and Image Understanding","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Vision and Image Understanding","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1077314224001565","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

The similar distractor problem is one of the most difficult challenges for Siamese-based trackers. Since they formulate the visual tracking task as a similar matching problem, these trackers involve an essential problem that they are sensitive to the intra-class and inter-class instances with similar appearance confusion. To solve the problem, we propose an object discriminability re-extraction network (ODR-Net) for distractor-aware visual object tracking. The network first mines similar distractors from existing tracking information with a distractor capture module, and then re-extracts discriminative features to redetect the target from distractors with a discriminative feature re-extraction module. It solves the distractor problem in the decoding phase of a tracker and can be considered as a general block that applied to existing Siamese trackers to tackle the similar distractor problem. To demonstrate the effectiveness of the proposed method, extensive experiments and comparisons with state-of-the-art trackers are conducted on a variety of large-scale benchmark datasets, including GOT-10k, LaSOT, OTB-2015, TrackingNet, VOT2020, VOT2021, and VOT2022. Without bells and whistles, our ODR-Net achieves leading performance with a real-time speed.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于分心者感知视觉对象跟踪的对象可辨别性再提取
相似干扰物问题是基于连体图像的跟踪器所面临的最大挑战之一。由于它们将视觉跟踪任务表述为相似匹配问题,因此这些跟踪器涉及到一个基本问题,即它们对具有相似外观混淆的类内和类间实例非常敏感。为了解决这个问题,我们提出了一种对象可辨别性再提取网络(ODR-Net),用于分心点感知视觉对象跟踪。该网络首先利用分心点捕捉模块从现有的跟踪信息中挖掘类似的分心点,然后利用分辨特征再提取模块从分心点中重新提取分辨特征,以重新检测目标。它解决了跟踪器解码阶段的分心问题,可被视为一个通用模块,应用于现有的连体跟踪器,以解决相似分心问题。为了证明所提方法的有效性,我们在 GOT-10k、LaSOT、OTB-2015、TrackingNet、VOT2020、VOT2021 和 VOT2022 等各种大规模基准数据集上进行了大量实验,并与最先进的跟踪器进行了比较。我们的 ODR-Net 不需要繁琐的程序,就能以实时的速度实现领先的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computer Vision and Image Understanding
Computer Vision and Image Understanding 工程技术-工程:电子与电气
CiteScore
7.80
自引率
4.40%
发文量
112
审稿时长
79 days
期刊介绍: The central focus of this journal is the computer analysis of pictorial information. Computer Vision and Image Understanding publishes papers covering all aspects of image analysis from the low-level, iconic processes of early vision to the high-level, symbolic processes of recognition and interpretation. A wide range of topics in the image understanding area is covered, including papers offering insights that differ from predominant views. Research Areas Include: • Theory • Early vision • Data structures and representations • Shape • Range • Motion • Matching and recognition • Architecture and languages • Vision systems
期刊最新文献
Deformable surface reconstruction via Riemannian metric preservation Estimating optical flow: A comprehensive review of the state of the art A lightweight convolutional neural network-based feature extractor for visible images LightSOD: Towards lightweight and efficient network for salient object detection Triple-Stream Commonsense Circulation Transformer Network for Image Captioning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1