Maham Misbah;Farooq Alam Orakazi;Laiba Tanveer;Zeeshan Kaleem;Chau Yuen
{"title":"TF-BiFPN 改进 YOLOv5:增强黑暗中的小型多类无人机探测能力","authors":"Maham Misbah;Farooq Alam Orakazi;Laiba Tanveer;Zeeshan Kaleem;Chau Yuen","doi":"10.1109/TAES.2024.3464548","DOIUrl":null,"url":null,"abstract":"Detecting drones at night with complex backgrounds poses a considerable challenge. To address this issue, we propose an enhanced variant of the YOLOv5 model, termed as the tiny feature and bidirectional feature pyramid network (BiFPN). This method incorporates efficient residual bottleneck (ERB) and efficient multireceptive pooling (EMRP) layers along with convolutional block attention module (CBAM). Utilizing ERB, the network achieves enhanced feature extraction through residual connections, whereas the EMRP layers incorporate multiple receptive fields, enabling the model to better understand and process varied spatial hierarchies within the data. Bi-FPN is added within the model's head to enhance feature representation and capture multiple features at various scales. To optimize model efficiency, cross-convolution replaces simple convolution, leading to a notable reduction in parameters. Furthermore, auto-anchor and auto-batch mechanisms are introduced to ensure efficient GPU utilization. Experimental evaluations conducted on a custom multiclass dataset illustrate a significant enhancement over baseline and state-of-the-art algorithms.","PeriodicalId":13157,"journal":{"name":"IEEE Transactions on Aerospace and Electronic Systems","volume":"61 2","pages":"5354-5361"},"PeriodicalIF":5.7000,"publicationDate":"2024-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"TF-BiFPN Improves YOLOv5: Enhancing Small-Scale Multiclass Drone Detection in Dark\",\"authors\":\"Maham Misbah;Farooq Alam Orakazi;Laiba Tanveer;Zeeshan Kaleem;Chau Yuen\",\"doi\":\"10.1109/TAES.2024.3464548\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Detecting drones at night with complex backgrounds poses a considerable challenge. To address this issue, we propose an enhanced variant of the YOLOv5 model, termed as the tiny feature and bidirectional feature pyramid network (BiFPN). This method incorporates efficient residual bottleneck (ERB) and efficient multireceptive pooling (EMRP) layers along with convolutional block attention module (CBAM). Utilizing ERB, the network achieves enhanced feature extraction through residual connections, whereas the EMRP layers incorporate multiple receptive fields, enabling the model to better understand and process varied spatial hierarchies within the data. Bi-FPN is added within the model's head to enhance feature representation and capture multiple features at various scales. To optimize model efficiency, cross-convolution replaces simple convolution, leading to a notable reduction in parameters. Furthermore, auto-anchor and auto-batch mechanisms are introduced to ensure efficient GPU utilization. Experimental evaluations conducted on a custom multiclass dataset illustrate a significant enhancement over baseline and state-of-the-art algorithms.\",\"PeriodicalId\":13157,\"journal\":{\"name\":\"IEEE Transactions on Aerospace and Electronic Systems\",\"volume\":\"61 2\",\"pages\":\"5354-5361\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2024-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Aerospace and Electronic Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10684481/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, AEROSPACE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Aerospace and Electronic Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10684481/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}
TF-BiFPN Improves YOLOv5: Enhancing Small-Scale Multiclass Drone Detection in Dark
Detecting drones at night with complex backgrounds poses a considerable challenge. To address this issue, we propose an enhanced variant of the YOLOv5 model, termed as the tiny feature and bidirectional feature pyramid network (BiFPN). This method incorporates efficient residual bottleneck (ERB) and efficient multireceptive pooling (EMRP) layers along with convolutional block attention module (CBAM). Utilizing ERB, the network achieves enhanced feature extraction through residual connections, whereas the EMRP layers incorporate multiple receptive fields, enabling the model to better understand and process varied spatial hierarchies within the data. Bi-FPN is added within the model's head to enhance feature representation and capture multiple features at various scales. To optimize model efficiency, cross-convolution replaces simple convolution, leading to a notable reduction in parameters. Furthermore, auto-anchor and auto-batch mechanisms are introduced to ensure efficient GPU utilization. Experimental evaluations conducted on a custom multiclass dataset illustrate a significant enhancement over baseline and state-of-the-art algorithms.
期刊介绍:
IEEE Transactions on Aerospace and Electronic Systems focuses on the organization, design, development, integration, and operation of complex systems for space, air, ocean, or ground environment. These systems include, but are not limited to, navigation, avionics, spacecraft, aerospace power, radar, sonar, telemetry, defense, transportation, automated testing, and command and control.