{"title":"无人机视频中基于深度学习的人类和车辆检测","authors":"Bahar Bender, Mehmet Emre Atasoy, Fatih Semiz","doi":"10.1109/UBMK52708.2021.9558888","DOIUrl":null,"url":null,"abstract":"Nowadays, the detection and tracking of stationary or moving objects have begun to be of great importance for military applications as well as for civilian applications. In this case, it is necessary to use deep learning methodologies in order to effectively meet the emerging needs. This study, it is aimed to detect the people and vehicles in the videos recorded by drones in an environment suitable for field conditions. For this purpose, DarkNet-53 architecture in YOLOv3 was used to detect the presence of people and vehicles in motion in videos with 25 (Frame Per Second) images transferred to the screen in one second. The convolutional neural network has been developed by supporting it with various hyperparameter optimizations and an accuracy rate of 78 percent has been achieved.","PeriodicalId":106516,"journal":{"name":"2021 6th International Conference on Computer Science and Engineering (UBMK)","volume":"133 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Deep Learning-Based Human and Vehicle Detection in Drone Videos\",\"authors\":\"Bahar Bender, Mehmet Emre Atasoy, Fatih Semiz\",\"doi\":\"10.1109/UBMK52708.2021.9558888\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nowadays, the detection and tracking of stationary or moving objects have begun to be of great importance for military applications as well as for civilian applications. In this case, it is necessary to use deep learning methodologies in order to effectively meet the emerging needs. This study, it is aimed to detect the people and vehicles in the videos recorded by drones in an environment suitable for field conditions. For this purpose, DarkNet-53 architecture in YOLOv3 was used to detect the presence of people and vehicles in motion in videos with 25 (Frame Per Second) images transferred to the screen in one second. The convolutional neural network has been developed by supporting it with various hyperparameter optimizations and an accuracy rate of 78 percent has been achieved.\",\"PeriodicalId\":106516,\"journal\":{\"name\":\"2021 6th International Conference on Computer Science and Engineering (UBMK)\",\"volume\":\"133 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 6th International Conference on Computer Science and Engineering (UBMK)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/UBMK52708.2021.9558888\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 6th International Conference on Computer Science and Engineering (UBMK)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UBMK52708.2021.9558888","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Learning-Based Human and Vehicle Detection in Drone Videos
Nowadays, the detection and tracking of stationary or moving objects have begun to be of great importance for military applications as well as for civilian applications. In this case, it is necessary to use deep learning methodologies in order to effectively meet the emerging needs. This study, it is aimed to detect the people and vehicles in the videos recorded by drones in an environment suitable for field conditions. For this purpose, DarkNet-53 architecture in YOLOv3 was used to detect the presence of people and vehicles in motion in videos with 25 (Frame Per Second) images transferred to the screen in one second. The convolutional neural network has been developed by supporting it with various hyperparameter optimizations and an accuracy rate of 78 percent has been achieved.