{"title":"基于关键帧选择和强化学习的自适应视觉跟踪","authors":"Ke Zhao, Yongan Lu, Zhizheng Zhang, Wei Wang","doi":"10.1109/IWECAI50956.2020.00039","DOIUrl":null,"url":null,"abstract":"When the appearance of a target changes dramatically, the traditional tracking methods can no longer be used for target tracking task. This paper proposes an adaptive visual tracking algorithm based on key frame selection and reinforcement learning (RL) to solve this problem. First of all, the probability value of the RL network output is analyzed, and the predicted value of output is normalized. At the model update stage of each frame, the probability value corresponding to the optimal behavior is judged. If it conforms to the preset rules, the last frame of the current frame is set as the key frame, and the network model is fine-tuned by using the key frame. The proposed algorithm is only fine-tuned in key frames to obtain multiple fixed prediction models. The experiment is conducted on 100 video sequences of the Object Tracking Benchmark to verify the effectiveness of key frame selection strategy. Compared with the original reinforcement learning based tracking algorithm, the tracking accuracy and the success rate of the proposed algorithm are improved respectively.","PeriodicalId":364789,"journal":{"name":"2020 International Workshop on Electronic Communication and Artificial Intelligence (IWECAI)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Adaptive Visual Tracking Based on Key Frame Selection and Reinforcement Learning\",\"authors\":\"Ke Zhao, Yongan Lu, Zhizheng Zhang, Wei Wang\",\"doi\":\"10.1109/IWECAI50956.2020.00039\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"When the appearance of a target changes dramatically, the traditional tracking methods can no longer be used for target tracking task. This paper proposes an adaptive visual tracking algorithm based on key frame selection and reinforcement learning (RL) to solve this problem. First of all, the probability value of the RL network output is analyzed, and the predicted value of output is normalized. At the model update stage of each frame, the probability value corresponding to the optimal behavior is judged. If it conforms to the preset rules, the last frame of the current frame is set as the key frame, and the network model is fine-tuned by using the key frame. The proposed algorithm is only fine-tuned in key frames to obtain multiple fixed prediction models. The experiment is conducted on 100 video sequences of the Object Tracking Benchmark to verify the effectiveness of key frame selection strategy. Compared with the original reinforcement learning based tracking algorithm, the tracking accuracy and the success rate of the proposed algorithm are improved respectively.\",\"PeriodicalId\":364789,\"journal\":{\"name\":\"2020 International Workshop on Electronic Communication and Artificial Intelligence (IWECAI)\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Workshop on Electronic Communication and Artificial Intelligence (IWECAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IWECAI50956.2020.00039\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Workshop on Electronic Communication and Artificial Intelligence (IWECAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IWECAI50956.2020.00039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Adaptive Visual Tracking Based on Key Frame Selection and Reinforcement Learning
When the appearance of a target changes dramatically, the traditional tracking methods can no longer be used for target tracking task. This paper proposes an adaptive visual tracking algorithm based on key frame selection and reinforcement learning (RL) to solve this problem. First of all, the probability value of the RL network output is analyzed, and the predicted value of output is normalized. At the model update stage of each frame, the probability value corresponding to the optimal behavior is judged. If it conforms to the preset rules, the last frame of the current frame is set as the key frame, and the network model is fine-tuned by using the key frame. The proposed algorithm is only fine-tuned in key frames to obtain multiple fixed prediction models. The experiment is conducted on 100 video sequences of the Object Tracking Benchmark to verify the effectiveness of key frame selection strategy. Compared with the original reinforcement learning based tracking algorithm, the tracking accuracy and the success rate of the proposed algorithm are improved respectively.