{"title":"利用时间上均匀的相关滤波器实现跟踪中的变形松弛。","authors":"Yuanming Zhang, Huihui Pan, Jue Wang","doi":"10.1016/j.neunet.2024.106839","DOIUrl":null,"url":null,"abstract":"<p><p>Discriminative correlation filters with temporal regularization have recently attracted much attention in mobile video tracking, due to the challenges of target occlusion and background interference. However, rigidly penalizing the variability of templates between adjacent frames makes trackers lazy for target evolution, leading to inaccurate responses or even tracking failure when deformation occurs. In this paper, we address the problem of instant template learning when the target undergoes drastic variations in appearance and aspect ratio. We first propose a temporally even model featuring deformation slack, which theoretically supports the ability of the template to respond quickly to variations while suppressing disturbances. Then, an optimal derivation of our model is formulated, and the closed form solutions are deduced to facilitate the algorithm implementation. Further, we introduce a cyclic shift methodology for mirror factors to achieve scale estimation of varying aspect ratios, thereby dramatically improving the cross-area accuracy. Comprehensive comparisons on seven datasets demonstrate our excellent performance: DroneTB-70, VisDrone-SOT2019, VOT-2019, LaSOT, TC-128, UAV-20L, and UAVDT. Our approach runs at 16.9 frames per second on a low-cost Central Processing Unit, which makes it suitable for tracking on drones. The code and raw results will be made publicly available at: https://github.com/visualperceptlab/TEDS.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"181 ","pages":"106839"},"PeriodicalIF":6.0000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enabling deformation slack in tracking with temporally even correlation filters.\",\"authors\":\"Yuanming Zhang, Huihui Pan, Jue Wang\",\"doi\":\"10.1016/j.neunet.2024.106839\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Discriminative correlation filters with temporal regularization have recently attracted much attention in mobile video tracking, due to the challenges of target occlusion and background interference. However, rigidly penalizing the variability of templates between adjacent frames makes trackers lazy for target evolution, leading to inaccurate responses or even tracking failure when deformation occurs. In this paper, we address the problem of instant template learning when the target undergoes drastic variations in appearance and aspect ratio. We first propose a temporally even model featuring deformation slack, which theoretically supports the ability of the template to respond quickly to variations while suppressing disturbances. Then, an optimal derivation of our model is formulated, and the closed form solutions are deduced to facilitate the algorithm implementation. Further, we introduce a cyclic shift methodology for mirror factors to achieve scale estimation of varying aspect ratios, thereby dramatically improving the cross-area accuracy. Comprehensive comparisons on seven datasets demonstrate our excellent performance: DroneTB-70, VisDrone-SOT2019, VOT-2019, LaSOT, TC-128, UAV-20L, and UAVDT. Our approach runs at 16.9 frames per second on a low-cost Central Processing Unit, which makes it suitable for tracking on drones. The code and raw results will be made publicly available at: https://github.com/visualperceptlab/TEDS.</p>\",\"PeriodicalId\":49763,\"journal\":{\"name\":\"Neural Networks\",\"volume\":\"181 \",\"pages\":\"106839\"},\"PeriodicalIF\":6.0000,\"publicationDate\":\"2024-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neural Networks\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1016/j.neunet.2024.106839\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1016/j.neunet.2024.106839","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Enabling deformation slack in tracking with temporally even correlation filters.
Discriminative correlation filters with temporal regularization have recently attracted much attention in mobile video tracking, due to the challenges of target occlusion and background interference. However, rigidly penalizing the variability of templates between adjacent frames makes trackers lazy for target evolution, leading to inaccurate responses or even tracking failure when deformation occurs. In this paper, we address the problem of instant template learning when the target undergoes drastic variations in appearance and aspect ratio. We first propose a temporally even model featuring deformation slack, which theoretically supports the ability of the template to respond quickly to variations while suppressing disturbances. Then, an optimal derivation of our model is formulated, and the closed form solutions are deduced to facilitate the algorithm implementation. Further, we introduce a cyclic shift methodology for mirror factors to achieve scale estimation of varying aspect ratios, thereby dramatically improving the cross-area accuracy. Comprehensive comparisons on seven datasets demonstrate our excellent performance: DroneTB-70, VisDrone-SOT2019, VOT-2019, LaSOT, TC-128, UAV-20L, and UAVDT. Our approach runs at 16.9 frames per second on a low-cost Central Processing Unit, which makes it suitable for tracking on drones. The code and raw results will be made publicly available at: https://github.com/visualperceptlab/TEDS.
期刊介绍:
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.