A high-speed YOLO detection model for steel surface defects with the channel residual convolution and fusion-distribution

IF 2.7 3区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Measurement Science and Technology Pub Date : 2024-07-12 DOI:10.1088/1361-6501/ad6281
建行 Huang 黄, Xinliang Zhang, Lijie Jia, Yitian Zhou
{"title":"A high-speed YOLO detection model for steel surface defects with the channel residual convolution and fusion-distribution","authors":"建行 Huang 黄, Xinliang Zhang, Lijie Jia, Yitian Zhou","doi":"10.1088/1361-6501/ad6281","DOIUrl":null,"url":null,"abstract":"\n Accurately and efficiently detecting steel surface defects is a critical step in steel manufacturing. However, the compromise between the detection speed and accuracy remains a major challenge, especially for steel surface defects with large variations in the scale. To address the issue, an improved YOLO based detection model is proposed through the reinforcement of its backbone and neck. Firstly, for the reduction of the redundant parameters and also the improvement of the characterization ability of the model, an effective channel residual structure is adopted to construct a channel residual convolution module (CRCM) and channel residual cross stage partial (CRCSP) module as components of the backbone network, respectively. They realize the extraction of both the shallow feature and multi-scale feature simultaneously under a small number of convolutional parameters. Secondly, in the neck of YOLO, a fusion-distribution (FD) strategy is employed, which extracts and fuses multi-scale feature maps from the backbone network to provide global information, and then distributes global information into local features of different branches through an inject attention mechanism, thus enhancing the feature gap between different branches. Then, a model called CRFD-YOLO is derived for the steel surface defect detection and localization for the situations where both speed and accuracy are demanding. Finally, extensive experimental validations are conducted to evaluate the performance of CRFD-YOLO. The validation results indicate that CRFD-YOLO achieves a satisfactory detection performance with a mean average precision of 81.3% on the NEU-DET and 71.1% on the GC10-DET. Additionally, CRFD-YOLO achieves a speed of 161 frames per second, giving a great potential in real-time detection and localization tasks.","PeriodicalId":18526,"journal":{"name":"Measurement Science and Technology","volume":null,"pages":null},"PeriodicalIF":2.7000,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Measurement Science and Technology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6501/ad6281","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Accurately and efficiently detecting steel surface defects is a critical step in steel manufacturing. However, the compromise between the detection speed and accuracy remains a major challenge, especially for steel surface defects with large variations in the scale. To address the issue, an improved YOLO based detection model is proposed through the reinforcement of its backbone and neck. Firstly, for the reduction of the redundant parameters and also the improvement of the characterization ability of the model, an effective channel residual structure is adopted to construct a channel residual convolution module (CRCM) and channel residual cross stage partial (CRCSP) module as components of the backbone network, respectively. They realize the extraction of both the shallow feature and multi-scale feature simultaneously under a small number of convolutional parameters. Secondly, in the neck of YOLO, a fusion-distribution (FD) strategy is employed, which extracts and fuses multi-scale feature maps from the backbone network to provide global information, and then distributes global information into local features of different branches through an inject attention mechanism, thus enhancing the feature gap between different branches. Then, a model called CRFD-YOLO is derived for the steel surface defect detection and localization for the situations where both speed and accuracy are demanding. Finally, extensive experimental validations are conducted to evaluate the performance of CRFD-YOLO. The validation results indicate that CRFD-YOLO achieves a satisfactory detection performance with a mean average precision of 81.3% on the NEU-DET and 71.1% on the GC10-DET. Additionally, CRFD-YOLO achieves a speed of 161 frames per second, giving a great potential in real-time detection and localization tasks.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用通道残余卷积和融合分布建立的钢表面缺陷高速 YOLO 检测模型
准确高效地检测钢材表面缺陷是钢材生产的关键步骤。然而,检测速度与精度之间的折衷仍然是一个重大挑战,尤其是对于尺度变化较大的钢材表面缺陷。为解决这一问题,本文提出了一种基于 YOLO 的改进型检测模型,并对其主干和颈部进行了强化。首先,为了减少冗余参数,提高模型的表征能力,采用了有效的通道残差结构,分别构建了通道残差卷积模块(CRCM)和通道残差交叉阶段局部模块(CRCSP)作为骨干网络的组成部分。它们实现了在少量卷积参数下同时提取浅层特征和多尺度特征。其次,在 "YOLO "颈中采用了融合分布(FD)策略,从骨干网络中提取并融合多尺度特征图,提供全局信息,然后通过注入注意机制将全局信息分布到不同分支的局部特征中,从而增强不同分支之间的特征差距。然后,针对速度和精度要求都很高的情况,推导出一种名为 CRFD-YOLO 的模型,用于钢材表面缺陷的检测和定位。最后,对 CRFD-YOLO 的性能进行了广泛的实验验证。验证结果表明,CRFD-YOLO 的检测性能令人满意,在 NEU-DET 上的平均精度为 81.3%,在 GC10-DET 上的平均精度为 71.1%。此外,CRFD-YOLO 的速度达到每秒 161 帧,在实时检测和定位任务中具有巨大潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Measurement Science and Technology
Measurement Science and Technology 工程技术-工程:综合
CiteScore
4.30
自引率
16.70%
发文量
656
审稿时长
4.9 months
期刊介绍: Measurement Science and Technology publishes articles on new measurement techniques and associated instrumentation. Papers that describe experiments must represent an advance in measurement science or measurement technique rather than the application of established experimental technique. Bearing in mind the multidisciplinary nature of the journal, authors must provide an introduction to their work that makes clear the novelty, significance, broader relevance of their work in a measurement context and relevance to the readership of Measurement Science and Technology. All submitted articles should contain consideration of the uncertainty, precision and/or accuracy of the measurements presented. Subject coverage includes the theory, practice and application of measurement in physics, chemistry, engineering and the environmental and life sciences from inception to commercial exploitation. Publications in the journal should emphasize the novelty of reported methods, characterize them and demonstrate their performance using examples or applications.
期刊最新文献
Morphological characterization of concave particle based on convex decomposition TSMDA: intelligent fault diagnosis of rolling bearing with two stage multi-source domain adaptation Precise orbit determination of integrated BDS-3 and LEO satellites with ambiguity fixing under regional ground stations High-accuracy and lightweight weld surface defect detector based on graph convolution decoupling head Gap Measurement Method Based on Projection Lines and Convex Analysis of 3D Points Cloud
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1