{"title":"BAM-SORT:边界引导激活匹配在线多目标跟踪","authors":"Yuan Chao, Huaiyang Zhu, Hengyu Lu","doi":"10.1007/s10489-024-06037-1","DOIUrl":null,"url":null,"abstract":"<div><p>Multi-object tracking aims at estimating object bounding boxes and identity IDs in videos. Most tracking methods combine a detector and a Kalman filter using the IoU distance as a similarity metric for association matching of the previous trajectories with the current detection box. These methods usually suffer from ID switches and fragmented trajectories in response to congested and frequently occluded scenarios. To solve this problem, in this study, a simple and effective association method is proposed. First, a bottom edge cost matrix is introduced for the utilization of depth information to improve the data association and increase the robustness in the case of occlusion. Second, an asymmetric trajectory classification mechanism is proposed to distinguish the false-postive trajectories, and an activated trajectory matching strategy is introduced to reduce the interference of noise and transient objects in tracking. Finally, the trajectory deletion strategy is improved by introducing the number of trajectory state switches to delete the trajectories caused by spurious high-scoring detection boxes in real time, as a result, the number of fragmented trajectories is also reduced. These innovations achieve excellent performance on various benchmarks, including MOT17, MOT20, and especially DanceTrack where interactions and occlusions are frequent and severe. The code and models are available at https://github.com/djdodsjsjx/BAM-SORT/.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 5","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"BAM-SORT: border-guided activated matching for online multi-object tracking\",\"authors\":\"Yuan Chao, Huaiyang Zhu, Hengyu Lu\",\"doi\":\"10.1007/s10489-024-06037-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Multi-object tracking aims at estimating object bounding boxes and identity IDs in videos. Most tracking methods combine a detector and a Kalman filter using the IoU distance as a similarity metric for association matching of the previous trajectories with the current detection box. These methods usually suffer from ID switches and fragmented trajectories in response to congested and frequently occluded scenarios. To solve this problem, in this study, a simple and effective association method is proposed. First, a bottom edge cost matrix is introduced for the utilization of depth information to improve the data association and increase the robustness in the case of occlusion. Second, an asymmetric trajectory classification mechanism is proposed to distinguish the false-postive trajectories, and an activated trajectory matching strategy is introduced to reduce the interference of noise and transient objects in tracking. Finally, the trajectory deletion strategy is improved by introducing the number of trajectory state switches to delete the trajectories caused by spurious high-scoring detection boxes in real time, as a result, the number of fragmented trajectories is also reduced. These innovations achieve excellent performance on various benchmarks, including MOT17, MOT20, and especially DanceTrack where interactions and occlusions are frequent and severe. The code and models are available at https://github.com/djdodsjsjx/BAM-SORT/.</p></div>\",\"PeriodicalId\":8041,\"journal\":{\"name\":\"Applied Intelligence\",\"volume\":\"55 5\",\"pages\":\"\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-01-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10489-024-06037-1\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-024-06037-1","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
BAM-SORT: border-guided activated matching for online multi-object tracking
Multi-object tracking aims at estimating object bounding boxes and identity IDs in videos. Most tracking methods combine a detector and a Kalman filter using the IoU distance as a similarity metric for association matching of the previous trajectories with the current detection box. These methods usually suffer from ID switches and fragmented trajectories in response to congested and frequently occluded scenarios. To solve this problem, in this study, a simple and effective association method is proposed. First, a bottom edge cost matrix is introduced for the utilization of depth information to improve the data association and increase the robustness in the case of occlusion. Second, an asymmetric trajectory classification mechanism is proposed to distinguish the false-postive trajectories, and an activated trajectory matching strategy is introduced to reduce the interference of noise and transient objects in tracking. Finally, the trajectory deletion strategy is improved by introducing the number of trajectory state switches to delete the trajectories caused by spurious high-scoring detection boxes in real time, as a result, the number of fragmented trajectories is also reduced. These innovations achieve excellent performance on various benchmarks, including MOT17, MOT20, and especially DanceTrack where interactions and occlusions are frequent and severe. The code and models are available at https://github.com/djdodsjsjx/BAM-SORT/.
期刊介绍:
With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance.
The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.