{"title":"A Temporal CNN-based Approach for Autonomous Drone Racing","authors":"L. Rojas-Perez, J. Martínez-Carranza","doi":"10.1109/REDUAS47371.2019.8999703","DOIUrl":null,"url":null,"abstract":"Convolutional Neural Networks (CNN) and Deep Learning (DL) have become a popular tool to address all sorts of artificial intelligent challenges. The Autonomous Drone Racing is a challenge consisting of developing an autonomous drone capable of beating a human in a drone race, and DL is a tool that has been included in state of the art solutions to address this problem. Current works have proposed to use CNN and DL to detect the gates, whereas other works have proposed to use a CNN to obtain drone’s control commands and a goal point, with all of these approaches using a single image as input. In this work we propose a CNN based on the well known pose-net network. Originally used for camera relocalisation, we propose to use pose-net to provide control commands to drive the drone towards and to cross the gate autonomously. In contrast to previous works, we also propose to use a temporal set of images as input for the network. In specific, we use 6 images captured every 166 milliseconds in one second to create a mosaic. The latter is used as input of the CNN to predict the control commands. We compare this proposed temporal approach against using a single image as input for the CNN. Our results, although in simulation, demonstrate that the using only our temporal approach is feasible, less noisy and more effective than the single image approach, enabling the drone to autonomously cross a set of gates placed randomly, and even under the scenario where the gate moves dynamically.","PeriodicalId":351115,"journal":{"name":"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/REDUAS47371.2019.8999703","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Convolutional Neural Networks (CNN) and Deep Learning (DL) have become a popular tool to address all sorts of artificial intelligent challenges. The Autonomous Drone Racing is a challenge consisting of developing an autonomous drone capable of beating a human in a drone race, and DL is a tool that has been included in state of the art solutions to address this problem. Current works have proposed to use CNN and DL to detect the gates, whereas other works have proposed to use a CNN to obtain drone’s control commands and a goal point, with all of these approaches using a single image as input. In this work we propose a CNN based on the well known pose-net network. Originally used for camera relocalisation, we propose to use pose-net to provide control commands to drive the drone towards and to cross the gate autonomously. In contrast to previous works, we also propose to use a temporal set of images as input for the network. In specific, we use 6 images captured every 166 milliseconds in one second to create a mosaic. The latter is used as input of the CNN to predict the control commands. We compare this proposed temporal approach against using a single image as input for the CNN. Our results, although in simulation, demonstrate that the using only our temporal approach is feasible, less noisy and more effective than the single image approach, enabling the drone to autonomously cross a set of gates placed randomly, and even under the scenario where the gate moves dynamically.