Tushar Singh, Jayant Prakash, Tushar Bharti, A. Mandpura
{"title":"基于变压器的视觉伺服时间序列方法","authors":"Tushar Singh, Jayant Prakash, Tushar Bharti, A. Mandpura","doi":"10.1109/MECO58584.2023.10154996","DOIUrl":null,"url":null,"abstract":"We introduce Visual servoing using a novel deep-learning time-series architecture to control an unmanned aerial vehicle (UAV) with a mounted camera to track a target consisting of a finite set of stationary points lying in a plane. Many visual servoing approaches use computer vision along with estimation algorithms, sensors, and actuators' feedback to solve tasks like, tracking, obstacle avoidance, and localization. Nowadays, deep neural networks are gaining popularity in such tasks owing to their accuracy, adaptability, and flexibility. We propose a solution that employs a time-series architecture to learn temporal data from sequential values to output the control cues to the flight controller. Because of its low computational expense, the solution is deployable on less powerful onboard computers present on the UAV, ensuring real-time tracking of the target. The solution is tested both in a simulation environment and in real life, outperforming the current state-of-the-art in terms of time efficiency and accuracy.","PeriodicalId":187825,"journal":{"name":"2023 12th Mediterranean Conference on Embedded Computing (MECO)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Time Series Approach for Visual Servoing Using Transformers\",\"authors\":\"Tushar Singh, Jayant Prakash, Tushar Bharti, A. Mandpura\",\"doi\":\"10.1109/MECO58584.2023.10154996\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We introduce Visual servoing using a novel deep-learning time-series architecture to control an unmanned aerial vehicle (UAV) with a mounted camera to track a target consisting of a finite set of stationary points lying in a plane. Many visual servoing approaches use computer vision along with estimation algorithms, sensors, and actuators' feedback to solve tasks like, tracking, obstacle avoidance, and localization. Nowadays, deep neural networks are gaining popularity in such tasks owing to their accuracy, adaptability, and flexibility. We propose a solution that employs a time-series architecture to learn temporal data from sequential values to output the control cues to the flight controller. Because of its low computational expense, the solution is deployable on less powerful onboard computers present on the UAV, ensuring real-time tracking of the target. The solution is tested both in a simulation environment and in real life, outperforming the current state-of-the-art in terms of time efficiency and accuracy.\",\"PeriodicalId\":187825,\"journal\":{\"name\":\"2023 12th Mediterranean Conference on Embedded Computing (MECO)\",\"volume\":\"64 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 12th Mediterranean Conference on Embedded Computing (MECO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MECO58584.2023.10154996\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 12th Mediterranean Conference on Embedded Computing (MECO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MECO58584.2023.10154996","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Time Series Approach for Visual Servoing Using Transformers
We introduce Visual servoing using a novel deep-learning time-series architecture to control an unmanned aerial vehicle (UAV) with a mounted camera to track a target consisting of a finite set of stationary points lying in a plane. Many visual servoing approaches use computer vision along with estimation algorithms, sensors, and actuators' feedback to solve tasks like, tracking, obstacle avoidance, and localization. Nowadays, deep neural networks are gaining popularity in such tasks owing to their accuracy, adaptability, and flexibility. We propose a solution that employs a time-series architecture to learn temporal data from sequential values to output the control cues to the flight controller. Because of its low computational expense, the solution is deployable on less powerful onboard computers present on the UAV, ensuring real-time tracking of the target. The solution is tested both in a simulation environment and in real life, outperforming the current state-of-the-art in terms of time efficiency and accuracy.