{"title":"UAV image defect detection method for steel structure of high-speed railway bridge girder","authors":"Zonghan Mu, Yong Qin, Chongchong Yu, Huaizhi Yang, Ninghai Qiu","doi":"10.1109/PHM-Yantai55411.2022.9942099","DOIUrl":null,"url":null,"abstract":"Railway bridge is an important infrastructure along the railway, which needs regular inspection and maintenance. Using UAV technology to inspect railway infrastructure is a hot issue in current research. However, because of the detection target of steel structure of railway bridge girder, such as bolts, rust, etc., are small and dense targets, and the manual annotation workload is huge in the early stage. Therefore, this paper first proposes a method for constructing the sample set of semi-automatic annotation of railway bridge defects, which can greatly improve the efficiency of data annotation. Secondly, in order to solve the problem that it is difficult to detect large-scale images of UAV and the target scale changes dramatically due to the flight distance and height changes, this paper proposes an adaptive cropping strategy for image of steel structure UAV of railway bridge girder. Compared with YOLOv5s that does not adopt the strategy, its mAP improves by 32%. In order to reduce the GPU memory usage and the number of parameters, and improve the parallel running efficiency of the model, the GYOLOv5 model is proposed by combining Ghost Bottleneck and SIoU loss function. Compared with YOLOv5s, which also adopts the adaptive cropping strategy, the mAP of GYOLOv5 model increases by 6%, and the number of parameters can be reduced by 1.7 million. Finally, in order to help the network to find the region of interest in the images covered by the UAV and improve the detection accuracy, the Attention mechanism of CBAM (Convolutional Block Attention Module) is combined with YOLOv5s. Compared with the original YOLOv5s model, the number of parameters is only 600, but the total mAP is increased by 12%.","PeriodicalId":315994,"journal":{"name":"2022 Global Reliability and Prognostics and Health Management (PHM-Yantai)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Global Reliability and Prognostics and Health Management (PHM-Yantai)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PHM-Yantai55411.2022.9942099","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Railway bridge is an important infrastructure along the railway, which needs regular inspection and maintenance. Using UAV technology to inspect railway infrastructure is a hot issue in current research. However, because of the detection target of steel structure of railway bridge girder, such as bolts, rust, etc., are small and dense targets, and the manual annotation workload is huge in the early stage. Therefore, this paper first proposes a method for constructing the sample set of semi-automatic annotation of railway bridge defects, which can greatly improve the efficiency of data annotation. Secondly, in order to solve the problem that it is difficult to detect large-scale images of UAV and the target scale changes dramatically due to the flight distance and height changes, this paper proposes an adaptive cropping strategy for image of steel structure UAV of railway bridge girder. Compared with YOLOv5s that does not adopt the strategy, its mAP improves by 32%. In order to reduce the GPU memory usage and the number of parameters, and improve the parallel running efficiency of the model, the GYOLOv5 model is proposed by combining Ghost Bottleneck and SIoU loss function. Compared with YOLOv5s, which also adopts the adaptive cropping strategy, the mAP of GYOLOv5 model increases by 6%, and the number of parameters can be reduced by 1.7 million. Finally, in order to help the network to find the region of interest in the images covered by the UAV and improve the detection accuracy, the Attention mechanism of CBAM (Convolutional Block Attention Module) is combined with YOLOv5s. Compared with the original YOLOv5s model, the number of parameters is only 600, but the total mAP is increased by 12%.