A high-speed YOLO detection model for steel surface defects with the channel residual convolution and fusion-distribution

IF 2.7 3区工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Measurement Science and Technology Pub Date : 2024-07-12 DOI:10.1088/1361-6501/ad6281

建行 Huang 黄, Xinliang Zhang, Lijie Jia, Yitian Zhou

{"title":"A high-speed YOLO detection model for steel surface defects with the channel residual convolution and fusion-distribution","authors":"建行 Huang 黄, Xinliang Zhang, Lijie Jia, Yitian Zhou","doi":"10.1088/1361-6501/ad6281","DOIUrl":null,"url":null,"abstract":"\n Accurately and efficiently detecting steel surface defects is a critical step in steel manufacturing. However, the compromise between the detection speed and accuracy remains a major challenge, especially for steel surface defects with large variations in the scale. To address the issue, an improved YOLO based detection model is proposed through the reinforcement of its backbone and neck. Firstly, for the reduction of the redundant parameters and also the improvement of the characterization ability of the model, an effective channel residual structure is adopted to construct a channel residual convolution module (CRCM) and channel residual cross stage partial (CRCSP) module as components of the backbone network, respectively. They realize the extraction of both the shallow feature and multi-scale feature simultaneously under a small number of convolutional parameters. Secondly, in the neck of YOLO, a fusion-distribution (FD) strategy is employed, which extracts and fuses multi-scale feature maps from the backbone network to provide global information, and then distributes global information into local features of different branches through an inject attention mechanism, thus enhancing the feature gap between different branches. Then, a model called CRFD-YOLO is derived for the steel surface defect detection and localization for the situations where both speed and accuracy are demanding. Finally, extensive experimental validations are conducted to evaluate the performance of CRFD-YOLO. The validation results indicate that CRFD-YOLO achieves a satisfactory detection performance with a mean average precision of 81.3% on the NEU-DET and 71.1% on the GC10-DET. Additionally, CRFD-YOLO achieves a speed of 161 frames per second, giving a great potential in real-time detection and localization tasks.","PeriodicalId":18526,"journal":{"name":"Measurement Science and Technology","volume":null,"pages":null},"PeriodicalIF":2.7000,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Measurement Science and Technology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6501/ad6281","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Accurately and efficiently detecting steel surface defects is a critical step in steel manufacturing. However, the compromise between the detection speed and accuracy remains a major challenge, especially for steel surface defects with large variations in the scale. To address the issue, an improved YOLO based detection model is proposed through the reinforcement of its backbone and neck. Firstly, for the reduction of the redundant parameters and also the improvement of the characterization ability of the model, an effective channel residual structure is adopted to construct a channel residual convolution module (CRCM) and channel residual cross stage partial (CRCSP) module as components of the backbone network, respectively. They realize the extraction of both the shallow feature and multi-scale feature simultaneously under a small number of convolutional parameters. Secondly, in the neck of YOLO, a fusion-distribution (FD) strategy is employed, which extracts and fuses multi-scale feature maps from the backbone network to provide global information, and then distributes global information into local features of different branches through an inject attention mechanism, thus enhancing the feature gap between different branches. Then, a model called CRFD-YOLO is derived for the steel surface defect detection and localization for the situations where both speed and accuracy are demanding. Finally, extensive experimental validations are conducted to evaluate the performance of CRFD-YOLO. The validation results indicate that CRFD-YOLO achieves a satisfactory detection performance with a mean average precision of 81.3% on the NEU-DET and 71.1% on the GC10-DET. Additionally, CRFD-YOLO achieves a speed of 161 frames per second, giving a great potential in real-time detection and localization tasks.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用通道残余卷积和融合分布建立的钢表面缺陷高速 YOLO 检测模型

准确高效地检测钢材表面缺陷是钢材生产的关键步骤。然而，检测速度与精度之间的折衷仍然是一个重大挑战，尤其是对于尺度变化较大的钢材表面缺陷。为解决这一问题，本文提出了一种基于 YOLO 的改进型检测模型，并对其主干和颈部进行了强化。首先，为了减少冗余参数，提高模型的表征能力，采用了有效的通道残差结构，分别构建了通道残差卷积模块（CRCM）和通道残差交叉阶段局部模块（CRCSP）作为骨干网络的组成部分。它们实现了在少量卷积参数下同时提取浅层特征和多尺度特征。其次，在 "YOLO "颈中采用了融合分布（FD）策略，从骨干网络中提取并融合多尺度特征图，提供全局信息，然后通过注入注意机制将全局信息分布到不同分支的局部特征中，从而增强不同分支之间的特征差距。然后，针对速度和精度要求都很高的情况，推导出一种名为 CRFD-YOLO 的模型，用于钢材表面缺陷的检测和定位。最后，对 CRFD-YOLO 的性能进行了广泛的实验验证。验证结果表明，CRFD-YOLO 的检测性能令人满意，在 NEU-DET 上的平均精度为 81.3%，在 GC10-DET 上的平均精度为 71.1%。此外，CRFD-YOLO 的速度达到每秒 161 帧，在实时检测和定位任务中具有巨大潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Measurement Science and Technology 工程技术-工程：综合

CiteScore

4.30

自引率

16.70%

发文量

656

审稿时长

4.9 months

期刊介绍： Measurement Science and Technology publishes articles on new measurement techniques and associated instrumentation. Papers that describe experiments must represent an advance in measurement science or measurement technique rather than the application of established experimental technique. Bearing in mind the multidisciplinary nature of the journal, authors must provide an introduction to their work that makes clear the novelty, significance, broader relevance of their work in a measurement context and relevance to the readership of Measurement Science and Technology. All submitted articles should contain consideration of the uncertainty, precision and/or accuracy of the measurements presented. Subject coverage includes the theory, practice and application of measurement in physics, chemistry, engineering and the environmental and life sciences from inception to commercial exploitation. Publications in the journal should emphasize the novelty of reported methods, characterize them and demonstrate their performance using examples or applications.