An improved SSD lightweight network with coordinate attention for aircraft target recognition in scene videos

IF 1 4区计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Journal of Intelligent & Fuzzy Systems Pub Date : 2023-11-02 DOI:10.3233/jifs-231423

Weidong Li, Zhenying Li, Chisheng Wang, Xuehai Zhang, Jinlong Duan

{"title":"An improved SSD lightweight network with coordinate attention for aircraft target recognition in scene videos","authors":"Weidong Li, Zhenying Li, Chisheng Wang, Xuehai Zhang, Jinlong Duan","doi":"10.3233/jifs-231423","DOIUrl":null,"url":null,"abstract":"Accurate identification and monitoring of aircraft on the airport surface can assist managers in rational scheduling and reduce the probability of aircraft conflicts, an important application value for constructing a \"smart airport.\" For the airport surface video monitoring, there are small aircraft targets, aircraft obscuring each other, and affected by different weather, the aircraft target clarity is low, and other complex monitoring problems. In this paper, a lightweight model network for video aircraft recognition in airport field video in complex environments is proposed based on SSD network incorporating coordinate attention mechanism. First, the model designs a lightweight feature extraction network with five feature extraction layers. Each feature extraction layer consists of two modules, Block_A and Block_I. The Block_A module incorporates the coordinate attention mechanism and the channel attention mechanism to improve the detection of obscured aircraft and to enhance the detection of small targets. The Block_I module uses multi-scale feature fusion to extract feature information with rich semantic meaning to enhance the feature extraction capability of the network in complex environments. Then, the designed feature extraction network is applied to the improved SSD detection algorithm, which enhances the recognition accuracy of airport field aircraft in complex environments. It was tested and subjected to ablation experiments under different complex weather conditions. The results show that compared with the Faster R-CNN, SSD, and YOLOv3 models, the detection accuracy of the improved model has been increased by 3.2% , 14.3% , and 10.9% , respectively, and the model parameters have been reduced by 83.9% , 73.1% , and 78.2% respectively. Compared with the YOLOv5 model, the model parameters are reduced by 38.9% when the detection accuracy is close, and the detection speed is increased by 24.4% , reaching 38.2fps, which can well meet the demand for real-time detection of aircraft on airport surfaces.","PeriodicalId":54795,"journal":{"name":"Journal of Intelligent & Fuzzy Systems","volume":"153 4","pages":"0"},"PeriodicalIF":1.0000,"publicationDate":"2023-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Intelligent & Fuzzy Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/jifs-231423","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate identification and monitoring of aircraft on the airport surface can assist managers in rational scheduling and reduce the probability of aircraft conflicts, an important application value for constructing a "smart airport." For the airport surface video monitoring, there are small aircraft targets, aircraft obscuring each other, and affected by different weather, the aircraft target clarity is low, and other complex monitoring problems. In this paper, a lightweight model network for video aircraft recognition in airport field video in complex environments is proposed based on SSD network incorporating coordinate attention mechanism. First, the model designs a lightweight feature extraction network with five feature extraction layers. Each feature extraction layer consists of two modules, Block_A and Block_I. The Block_A module incorporates the coordinate attention mechanism and the channel attention mechanism to improve the detection of obscured aircraft and to enhance the detection of small targets. The Block_I module uses multi-scale feature fusion to extract feature information with rich semantic meaning to enhance the feature extraction capability of the network in complex environments. Then, the designed feature extraction network is applied to the improved SSD detection algorithm, which enhances the recognition accuracy of airport field aircraft in complex environments. It was tested and subjected to ablation experiments under different complex weather conditions. The results show that compared with the Faster R-CNN, SSD, and YOLOv3 models, the detection accuracy of the improved model has been increased by 3.2% , 14.3% , and 10.9% , respectively, and the model parameters have been reduced by 83.9% , 73.1% , and 78.2% respectively. Compared with the YOLOv5 model, the model parameters are reduced by 38.9% when the detection accuracy is close, and the detection speed is increased by 24.4% , reaching 38.2fps, which can well meet the demand for real-time detection of aircraft on airport surfaces.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一种改进的基于坐标关注的SSD轻量级网络用于场景视频中飞机目标识别

对机场地面上的飞机进行准确的识别和监控，可以帮助管理者合理调度，减少飞机冲突的概率，是建设“智慧机场”的重要应用价值。对于机场地面视频监控，存在飞机目标小、飞机相互遮挡、受不同天气影响、飞机目标清晰度低等复杂的监控问题。基于SSD网络，结合坐标关注机制，提出了一种用于复杂环境下机场现场视频飞机识别的轻量化模型网络。首先，该模型设计了一个包含五个特征提取层的轻量级特征提取网络。每个特征提取层由Block_A和Block_I两个模块组成。Block_A模块结合了坐标注意机制和通道注意机制，提高了对被遮挡飞机的探测能力，增强了对小目标的探测能力。Block_I模块通过多尺度特征融合提取语义丰富的特征信息，增强网络在复杂环境下的特征提取能力。然后，将设计的特征提取网络应用于改进的SSD检测算法，提高了复杂环境下机场野战飞机的识别精度。对其进行了不同复杂天气条件下的烧蚀试验。结果表明，与Faster R-CNN、SSD和YOLOv3模型相比，改进模型的检测准确率分别提高了3.2%、14.3%和10.9%，模型参数分别降低了83.9%、73.1%和78.2%。与YOLOv5模型相比，在检测精度接近时，模型参数降低了38.9%，检测速度提高了24.4%，达到38.2fps，可以很好地满足对机场表面上飞机的实时检测需求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Intelligent & Fuzzy Systems 工程技术-计算机：人工智能

CiteScore

3.40

自引率

10.00%

发文量

965

审稿时长

5.1 months

期刊介绍： The purpose of the Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology is to foster advancements of knowledge and help disseminate results concerning recent applications and case studies in the areas of fuzzy logic, intelligent systems, and web-based applications among working professionals and professionals in education and research, covering a broad cross-section of technical disciplines.