Hold surrounding’s key-you only look once version 7: a real-time pedestrian and vehicle detection algorithm in the low-signal-to-noise ratio infrared image
Yang Liu, Fulong Yi, Yuhua Ma, Yongfu Wang, Dianhui Wang
{"title":"Hold surrounding’s key-you only look once version 7: a real-time pedestrian and vehicle detection algorithm in the low-signal-to-noise ratio infrared image","authors":"Yang Liu, Fulong Yi, Yuhua Ma, Yongfu Wang, Dianhui Wang","doi":"10.1117/1.OE.62.11.118105","DOIUrl":null,"url":null,"abstract":"Abstract. Compared with visible light imaging technology, infrared optical imaging technology (see Fig. 1) is less affected by weather and illumination and is a potential auxiliary solution method for the future of the autonomous driving system. Based on the above characteristics, this study proposes the hold surrounding’s key (HSK)-you only look once (YOLOv7) algorithm. First, based on the characteristics of the infrared image, this study optimizes the network structure of YOLOv7 and proposes a vehicle and pedestrian detection algorithm based on the improved YOLOv7. Aiming at the problem that the reasoning speed and detection accuracy of pedestrian and vehicle detection algorithms based on infrared images are challenging to balance, occupy large storage space, and are difficult to deploy and run in real-time in low and medium-performance devices, the MPConv is added to replace the Conv structure in YOLOv7. In view of the false detection, missed detection, mutual occlusion and overlap of detected objects, and other situations that the YOLOv7 algorithm is prone to cause in the actual deployment environment easily, a tiny object detection layer is added. At the same time, to solve the problem that infrared optical imaging systems are prone to noise caused by external factors, this study introduces the TRPCA method for image denoising in the preprocessing process of the YOLOv7 algorithm. In the end, the HSK-YOLOv7 algorithm is verified using the self-made infrared traffic object detection dataset and the publicly available FLIR dataset to verify the detection effect of the HSK-YOLOv7 algorithm on near-infrared images and thermal infrared images. The parameter quantity of our algorithm is 37.3M, and the computing throughput is 107.5 GFLOPs. The detection speed on the self-made dataset and FLIR dataset reaches 163 frames per second (FPS) and 71.8 FPS, respectively, and the mAP@0.5 indicator reaches 94.08% and 61.3%, respectively. In general, HSK-YOLOv7 can meet the real-time requirements of the autonomous driving system while ensuring detection accuracy.","PeriodicalId":19561,"journal":{"name":"Optical Engineering","volume":"34 1","pages":"118105 - 118105"},"PeriodicalIF":1.1000,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optical Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1117/1.OE.62.11.118105","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract. Compared with visible light imaging technology, infrared optical imaging technology (see Fig. 1) is less affected by weather and illumination and is a potential auxiliary solution method for the future of the autonomous driving system. Based on the above characteristics, this study proposes the hold surrounding’s key (HSK)-you only look once (YOLOv7) algorithm. First, based on the characteristics of the infrared image, this study optimizes the network structure of YOLOv7 and proposes a vehicle and pedestrian detection algorithm based on the improved YOLOv7. Aiming at the problem that the reasoning speed and detection accuracy of pedestrian and vehicle detection algorithms based on infrared images are challenging to balance, occupy large storage space, and are difficult to deploy and run in real-time in low and medium-performance devices, the MPConv is added to replace the Conv structure in YOLOv7. In view of the false detection, missed detection, mutual occlusion and overlap of detected objects, and other situations that the YOLOv7 algorithm is prone to cause in the actual deployment environment easily, a tiny object detection layer is added. At the same time, to solve the problem that infrared optical imaging systems are prone to noise caused by external factors, this study introduces the TRPCA method for image denoising in the preprocessing process of the YOLOv7 algorithm. In the end, the HSK-YOLOv7 algorithm is verified using the self-made infrared traffic object detection dataset and the publicly available FLIR dataset to verify the detection effect of the HSK-YOLOv7 algorithm on near-infrared images and thermal infrared images. The parameter quantity of our algorithm is 37.3M, and the computing throughput is 107.5 GFLOPs. The detection speed on the self-made dataset and FLIR dataset reaches 163 frames per second (FPS) and 71.8 FPS, respectively, and the mAP@0.5 indicator reaches 94.08% and 61.3%, respectively. In general, HSK-YOLOv7 can meet the real-time requirements of the autonomous driving system while ensuring detection accuracy.
期刊介绍:
Optical Engineering publishes peer-reviewed papers reporting on research and development in optical science and engineering and the practical applications of known optical science, engineering, and technology.