Multi-Modal Fusion Object Tracking Based on Fully Convolutional Siamese Network

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning Pub Date : 2023-03-17 DOI:10.1145/3590003.3590084

Ke Qi, Liji Chen, Yicong Zhou, Yutao Qi

{"title":"Multi-Modal Fusion Object Tracking Based on Fully Convolutional Siamese Network","authors":"Ke Qi, Liji Chen, Yicong Zhou, Yutao Qi","doi":"10.1145/3590003.3590084","DOIUrl":null,"url":null,"abstract":"RGBT tracking incorporates thermal infrared data to achieve more accurate visual tracking. However, the efficiency of RGBT tracking may be diminished by some bottlenecks, such as thermal crossover, illumination variation and occlusion. To address the aforementioned problems, we propose a fully-convolutional Siamese-based Multi-modal Feature Fusion Network (SiamMFF) that integrates RGB and thermal features. In our work, visible and infrared images are initially processed by the Multi-Modal Feature Fusion framework (MFF) at the search and template sides, respectively. Then, the attribute-aware fusion module is introduced to conduct feature extraction and fusion for the major challenge attributes. In particular, we design a skip connections guidance module to prevent the propagation of noise and to enrich the feature information so that we can improve the tracker’s discriminative ability for modality-specific challenges. The proposed SiamMFF method has been evaluated in a great number of trials on two benchmark datasets GTOT and RGBT234, and the precision rate and success rate can reach 90.5%/73.6% and 81.2%/57.3%, respectively, demonstrating the superiority of our method over existing state-of-the-art methods.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3590003.3590084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

RGBT tracking incorporates thermal infrared data to achieve more accurate visual tracking. However, the efficiency of RGBT tracking may be diminished by some bottlenecks, such as thermal crossover, illumination variation and occlusion. To address the aforementioned problems, we propose a fully-convolutional Siamese-based Multi-modal Feature Fusion Network (SiamMFF) that integrates RGB and thermal features. In our work, visible and infrared images are initially processed by the Multi-Modal Feature Fusion framework (MFF) at the search and template sides, respectively. Then, the attribute-aware fusion module is introduced to conduct feature extraction and fusion for the major challenge attributes. In particular, we design a skip connections guidance module to prevent the propagation of noise and to enrich the feature information so that we can improve the tracker’s discriminative ability for modality-specific challenges. The proposed SiamMFF method has been evaluated in a great number of trials on two benchmark datasets GTOT and RGBT234, and the precision rate and success rate can reach 90.5%/73.6% and 81.2%/57.3%, respectively, demonstrating the superiority of our method over existing state-of-the-art methods.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于全卷积Siamese网络的多模态融合目标跟踪

RGBT跟踪结合热红外数据，实现更准确的视觉跟踪。然而，由于热交叉、光照变化和遮挡等瓶颈，RGBT跟踪的效率会受到影响。为了解决上述问题，我们提出了一个基于全卷积暹罗的多模态特征融合网络(SiamMFF)，该网络集成了RGB和热特征。在我们的工作中，可见光和红外图像分别在搜索端和模板端由多模态特征融合框架(MFF)进行初始处理。然后，引入属性感知融合模块，对主要挑战属性进行特征提取和融合;特别地，我们设计了一个跳跃连接引导模块，以防止噪声的传播，丰富特征信息，从而提高跟踪器对特定模态挑战的判别能力。本文方法在GTOT和RGBT234两个基准数据集上进行了大量试验，准确率和成功率分别达到90.5%/73.6%和81.2%/57.3%，证明了本文方法相对于现有先进方法的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

自引率

0.00%

发文量