{"title":"A high-speed YOLO detection model for steel surface defects with the channel residual convolution and fusion-distribution","authors":"建行 Huang 黄, Xinliang Zhang, Lijie Jia, Yitian Zhou","doi":"10.1088/1361-6501/ad6281","DOIUrl":null,"url":null,"abstract":"\n Accurately and efficiently detecting steel surface defects is a critical step in steel manufacturing. However, the compromise between the detection speed and accuracy remains a major challenge, especially for steel surface defects with large variations in the scale. To address the issue, an improved YOLO based detection model is proposed through the reinforcement of its backbone and neck. Firstly, for the reduction of the redundant parameters and also the improvement of the characterization ability of the model, an effective channel residual structure is adopted to construct a channel residual convolution module (CRCM) and channel residual cross stage partial (CRCSP) module as components of the backbone network, respectively. They realize the extraction of both the shallow feature and multi-scale feature simultaneously under a small number of convolutional parameters. Secondly, in the neck of YOLO, a fusion-distribution (FD) strategy is employed, which extracts and fuses multi-scale feature maps from the backbone network to provide global information, and then distributes global information into local features of different branches through an inject attention mechanism, thus enhancing the feature gap between different branches. Then, a model called CRFD-YOLO is derived for the steel surface defect detection and localization for the situations where both speed and accuracy are demanding. Finally, extensive experimental validations are conducted to evaluate the performance of CRFD-YOLO. The validation results indicate that CRFD-YOLO achieves a satisfactory detection performance with a mean average precision of 81.3% on the NEU-DET and 71.1% on the GC10-DET. Additionally, CRFD-YOLO achieves a speed of 161 frames per second, giving a great potential in real-time detection and localization tasks.","PeriodicalId":18526,"journal":{"name":"Measurement Science and Technology","volume":null,"pages":null},"PeriodicalIF":2.7000,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Measurement Science and Technology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6501/ad6281","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately and efficiently detecting steel surface defects is a critical step in steel manufacturing. However, the compromise between the detection speed and accuracy remains a major challenge, especially for steel surface defects with large variations in the scale. To address the issue, an improved YOLO based detection model is proposed through the reinforcement of its backbone and neck. Firstly, for the reduction of the redundant parameters and also the improvement of the characterization ability of the model, an effective channel residual structure is adopted to construct a channel residual convolution module (CRCM) and channel residual cross stage partial (CRCSP) module as components of the backbone network, respectively. They realize the extraction of both the shallow feature and multi-scale feature simultaneously under a small number of convolutional parameters. Secondly, in the neck of YOLO, a fusion-distribution (FD) strategy is employed, which extracts and fuses multi-scale feature maps from the backbone network to provide global information, and then distributes global information into local features of different branches through an inject attention mechanism, thus enhancing the feature gap between different branches. Then, a model called CRFD-YOLO is derived for the steel surface defect detection and localization for the situations where both speed and accuracy are demanding. Finally, extensive experimental validations are conducted to evaluate the performance of CRFD-YOLO. The validation results indicate that CRFD-YOLO achieves a satisfactory detection performance with a mean average precision of 81.3% on the NEU-DET and 71.1% on the GC10-DET. Additionally, CRFD-YOLO achieves a speed of 161 frames per second, giving a great potential in real-time detection and localization tasks.
期刊介绍:
Measurement Science and Technology publishes articles on new measurement techniques and associated instrumentation. Papers that describe experiments must represent an advance in measurement science or measurement technique rather than the application of established experimental technique. Bearing in mind the multidisciplinary nature of the journal, authors must provide an introduction to their work that makes clear the novelty, significance, broader relevance of their work in a measurement context and relevance to the readership of Measurement Science and Technology. All submitted articles should contain consideration of the uncertainty, precision and/or accuracy of the measurements presented.
Subject coverage includes the theory, practice and application of measurement in physics, chemistry, engineering and the environmental and life sciences from inception to commercial exploitation. Publications in the journal should emphasize the novelty of reported methods, characterize them and demonstrate their performance using examples or applications.