{"title":"A Camera Movement Guidance Method based on Multi-Object Tracking","authors":"Puchun Liu, Bo Li, Sheng Bi, Muye Li, Chen Zheng","doi":"10.1109/CYBER55403.2022.9907417","DOIUrl":null,"url":null,"abstract":"Multi-object tracking (MOT) attracts great attention in computer vision while playing an increasingly essential role in manufacturing and life. There is currently an urgent requirement for practical implementation of MOT algorithms to guide production in industry. Nevertheless, the algorithm guiding camera movement is prone to be naive in reality to some degree, which is not conducive to handle complex situations and may cause damage. In this paper, we propose a deep-learning-based MOT method to guide camera movement. In particular, jointly learnt detector and embedding model (JDE) is adopted to extract the features of live stream through data preprocessing and training, which detects the spacial and temporal information of objects, for instance, pedestrians. Moreover, we propose a cognitive-based network segmentation method which makes edge-cloud collaboration possible. Additionally, object location information provided by deep learning allows object clustering and weight allocation, followed by utilizing PID algorithm to guide camera motion. Our method is compared with conventional model through several metrics, especially Euclidean Average Distance, which indicates the effectiveness, reliability and robustness of our model.","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":"24 10","pages":"150-155"},"PeriodicalIF":1.5000,"publicationDate":"2022-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Cybersystems and Robotics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CYBER55403.2022.9907417","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Abstract
Multi-object tracking (MOT) attracts great attention in computer vision and plays an increasingly essential role in manufacturing and everyday life. There is currently an urgent need for practical MOT implementations that can guide production in industry. Nevertheless, the algorithms that guide camera movement in practice tend to be naive, which makes them poorly suited to complex situations and may cause damage. In this paper, we propose a deep-learning-based MOT method to guide camera movement. In particular, a jointly learnt detector and embedding model (JDE) is adopted to extract features from the live stream through data preprocessing and training, detecting the spatial and temporal information of objects such as pedestrians. Moreover, we propose a cognitive-based network segmentation method that makes edge-cloud collaboration possible. Additionally, the object location information provided by deep learning enables object clustering and weight allocation, after which a PID algorithm is used to guide camera motion. Our method is compared with a conventional model on several metrics, in particular Euclidean Average Distance, and the results indicate the effectiveness, reliability and robustness of our model.
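To make the clustering-plus-PID stage of the abstract concrete, the following is a minimal sketch, not the paper's implementation: it assumes hypothetical tracker output (boxes as x, y, w, h), illustrative cluster weights and PID gains, and made-up helper names (`PID`, `weighted_target`). It computes a weighted centroid of tracked objects and feeds the normalized offset from the frame center into two PID controllers for pan and tilt.

```python
import numpy as np

class PID:
    """Basic discrete PID controller (illustrative gains, not from the paper)."""
    def __init__(self, kp, ki, kd, dt=1.0 / 30.0):
        self.kp, self.ki, self.kd, self.dt = kp, ki, kd, dt
        self.integral = 0.0
        self.prev_error = 0.0

    def step(self, error):
        self.integral += error * self.dt
        derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def weighted_target(boxes, weights):
    """Weighted centroid of tracked-object box centers; boxes are rows of (x, y, w, h)."""
    centers = boxes[:, :2] + boxes[:, 2:4] / 2.0
    w = np.asarray(weights, dtype=float)
    return (centers * w[:, None]).sum(axis=0) / w.sum()


# Example: keep the weighted centroid of tracked pedestrians at the frame center.
frame_w, frame_h = 1920, 1080
pan_pid, tilt_pid = PID(0.8, 0.05, 0.1), PID(0.8, 0.05, 0.1)

boxes = np.array([[400, 300, 80, 200],     # hypothetical tracker detections
                  [1200, 500, 90, 210]])
weights = [1.0, 2.0]                       # e.g. a larger weight for a denser cluster

tx, ty = weighted_target(boxes, weights)
pan_cmd = pan_pid.step((tx - frame_w / 2) / frame_w)    # normalized horizontal error
tilt_cmd = tilt_pid.step((ty - frame_h / 2) / frame_h)  # normalized vertical error
print(pan_cmd, tilt_cmd)
```

In the paper's pipeline the object locations and weights would come from the JDE tracker and the clustering/weight-allocation step; here they are fixed values purely for illustration.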