{"title":"SSA-YOLO: An Improved YOLO for Hot-Rolled Strip Steel Surface Defect Detection","authors":"Xiaohua Huang;Jiahao Zhu;Ying Huo","doi":"10.1109/TIM.2024.3488136","DOIUrl":null,"url":null,"abstract":"In the manufacturing process of hot-rolled steel strips, various mechanical forces, and environmental conditions can cause surface defects, making their detection crucial for ensuring high-quality product production and preventing significant economic losses in the industry. However, existing models within the you only look once (YOLO) family, commonly employed for steel surface defect detection, have exhibited limited effectiveness. In this article, we propose an improved version of YOLO, namely, YOLO enhanced by a convolution squeeze-and-excitation (CSE) module, Conv2d-BatchNorm-SiLU (CBS) with Swin transformer (CST) module, and adaptive spatial feature fusion (ASFF) detection head module, i.e., SSA-YOLO, specifically tailored for end-to-end surface defect detection. Our approach incorporates several key modifications aimed at improving performance. First, we integrate a channel attention mechanism module into the shallow convolutional network module of the backbone. This enhancement focuses on channel information to improve feature extraction related to small defects while reducing redundant information in candidate boxes. In addition, we fuse a Swin transformer (Swin-T) module into the neck to enhance feature representation for detecting diverse and multiscale defects. Finally, the ASFF is introduced in YOLO to increase cross-interaction between high and low levels in the feature pyramid network (FPN). Experimental results demonstrate the superior performance and effectiveness of our SSA-YOLO model compared to other state-of-the-art models. Our approach achieves higher accuracy and sensitivity in detecting surface defects, offering significant advancements in steel strip production quality control. The code is available at \n<uri>https://github.com/MIPIT-Team/SSA-YOLO</uri>\n.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":null,"pages":null},"PeriodicalIF":5.6000,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10739337/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
In the manufacturing process of hot-rolled steel strips, various mechanical forces, and environmental conditions can cause surface defects, making their detection crucial for ensuring high-quality product production and preventing significant economic losses in the industry. However, existing models within the you only look once (YOLO) family, commonly employed for steel surface defect detection, have exhibited limited effectiveness. In this article, we propose an improved version of YOLO, namely, YOLO enhanced by a convolution squeeze-and-excitation (CSE) module, Conv2d-BatchNorm-SiLU (CBS) with Swin transformer (CST) module, and adaptive spatial feature fusion (ASFF) detection head module, i.e., SSA-YOLO, specifically tailored for end-to-end surface defect detection. Our approach incorporates several key modifications aimed at improving performance. First, we integrate a channel attention mechanism module into the shallow convolutional network module of the backbone. This enhancement focuses on channel information to improve feature extraction related to small defects while reducing redundant information in candidate boxes. In addition, we fuse a Swin transformer (Swin-T) module into the neck to enhance feature representation for detecting diverse and multiscale defects. Finally, the ASFF is introduced in YOLO to increase cross-interaction between high and low levels in the feature pyramid network (FPN). Experimental results demonstrate the superior performance and effectiveness of our SSA-YOLO model compared to other state-of-the-art models. Our approach achieves higher accuracy and sensitivity in detecting surface defects, offering significant advancements in steel strip production quality control. The code is available at
https://github.com/MIPIT-Team/SSA-YOLO
.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.