{"title":"利用深度学习改进无人机图像中的物体检测","authors":"Grishma Poudel","doi":"10.47760/cognizance.2024.v04i07.009","DOIUrl":null,"url":null,"abstract":"The use of unmanned aerial vehicles (UAV) for computer vision analysis is a significant trend in the current scenario. UAV technology is highly utilized for various purposes, including object detection, tracking, traffic management, environment monitoring, and agriculture sector, mainly due to the ease of data collection compared to conventional remote sensing methods through satellites. This study focuses on enhancing the YOLOv5 architecture to effectively detect small targets. The modifications made to the YOLOv5 framework specifically target the architecture, resulting in improved performance in object identification. The addition of a new feature fusion layer within the feature pyramid section of YOLOv5 plays a crucial role in achieving these improvements. To maintain resolution and prevent the loss of valuable feature information in the deeper sections of the network, a lateral connection is introduced, connecting this layer to an earlier part of the network. This addition ensures that crucial details and feature data are preserved throughout the network architecture. Additionally, data augmentation techniques such as image saturation and cropping are employed.","PeriodicalId":151974,"journal":{"name":"Cognizance Journal of Multidisciplinary Studies","volume":"1 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved Object Detection in UAV Images using Deep Learning\",\"authors\":\"Grishma Poudel\",\"doi\":\"10.47760/cognizance.2024.v04i07.009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of unmanned aerial vehicles (UAV) for computer vision analysis is a significant trend in the current scenario. UAV technology is highly utilized for various purposes, including object detection, tracking, traffic management, environment monitoring, and agriculture sector, mainly due to the ease of data collection compared to conventional remote sensing methods through satellites. This study focuses on enhancing the YOLOv5 architecture to effectively detect small targets. The modifications made to the YOLOv5 framework specifically target the architecture, resulting in improved performance in object identification. The addition of a new feature fusion layer within the feature pyramid section of YOLOv5 plays a crucial role in achieving these improvements. To maintain resolution and prevent the loss of valuable feature information in the deeper sections of the network, a lateral connection is introduced, connecting this layer to an earlier part of the network. This addition ensures that crucial details and feature data are preserved throughout the network architecture. Additionally, data augmentation techniques such as image saturation and cropping are employed.\",\"PeriodicalId\":151974,\"journal\":{\"name\":\"Cognizance Journal of Multidisciplinary Studies\",\"volume\":\"1 6\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cognizance Journal of Multidisciplinary Studies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.47760/cognizance.2024.v04i07.009\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognizance Journal of Multidisciplinary Studies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47760/cognizance.2024.v04i07.009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improved Object Detection in UAV Images using Deep Learning
The use of unmanned aerial vehicles (UAV) for computer vision analysis is a significant trend in the current scenario. UAV technology is highly utilized for various purposes, including object detection, tracking, traffic management, environment monitoring, and agriculture sector, mainly due to the ease of data collection compared to conventional remote sensing methods through satellites. This study focuses on enhancing the YOLOv5 architecture to effectively detect small targets. The modifications made to the YOLOv5 framework specifically target the architecture, resulting in improved performance in object identification. The addition of a new feature fusion layer within the feature pyramid section of YOLOv5 plays a crucial role in achieving these improvements. To maintain resolution and prevent the loss of valuable feature information in the deeper sections of the network, a lateral connection is introduced, connecting this layer to an earlier part of the network. This addition ensures that crucial details and feature data are preserved throughout the network architecture. Additionally, data augmentation techniques such as image saturation and cropping are employed.