{"title":"Multi-Object Tracking with Spatial-Temporal Correlation Memory Networks","authors":"Ming Xin, Wenjie Sun, Kaifang Li, Guancheng Hui","doi":"10.1109/cvidliccea56201.2022.9825193","DOIUrl":null,"url":null,"abstract":"Resistance to object appearance deformation and local occlusion is still one of the challenges of multi-object tracking algorithms. Most popular algorithms rely on time-consuming numerical optimization and complex manual design strategies to integrate object appearance information and motion information, so as to alleviate the adverse effects of object appearance deformation and local occlusion on the trajectory updating. This paper proposes a Spatial-Temporal Correlation Memory (STCM) module which can adaptively aggregate useful information from rich historical information in memory. By mining the time dimension information, the STCM module can guide the backbone network to extract the current frame effectively, and adapt to the change in the object’s appearance in the tracking process. Specifically, the STCM module can record the foreground-background information in the history frames and direct the backbone network to focus on the useful information in the current frame. Experiments on the MOT17 data set show that our method outperforms the baseline method and current advanced method in index MOTA and IDFI.","PeriodicalId":23649,"journal":{"name":"Vision","volume":"32 1","pages":"616-619"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/cvidliccea56201.2022.9825193","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Resistance to object appearance deformation and local occlusion is still one of the challenges of multi-object tracking algorithms. Most popular algorithms rely on time-consuming numerical optimization and complex manual design strategies to integrate object appearance information and motion information, so as to alleviate the adverse effects of object appearance deformation and local occlusion on the trajectory updating. This paper proposes a Spatial-Temporal Correlation Memory (STCM) module which can adaptively aggregate useful information from rich historical information in memory. By mining the time dimension information, the STCM module can guide the backbone network to extract the current frame effectively, and adapt to the change in the object’s appearance in the tracking process. Specifically, the STCM module can record the foreground-background information in the history frames and direct the backbone network to focus on the useful information in the current frame. Experiments on the MOT17 data set show that our method outperforms the baseline method and current advanced method in index MOTA and IDFI.