ETDNet: Efficient Transformer-Based Detection Network for Surface Defect Detection

IF 5.9 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Instrumentation and Measurement Pub Date : 2023-01-01 DOI:10.1109/TIM.2023.3307753
Hantao Zhou;Rui Yang;Runze Hu;Chang Shu;Xiaochu Tang;Xiu Li
{"title":"ETDNet: Efficient Transformer-Based Detection Network for Surface Defect Detection","authors":"Hantao Zhou;Rui Yang;Runze Hu;Chang Shu;Xiaochu Tang;Xiu Li","doi":"10.1109/TIM.2023.3307753","DOIUrl":null,"url":null,"abstract":"Deep learning (DL)-based surface defect detectors play a crucial role in ensuring product quality during inspection processes. However, accurately and efficiently detecting defects remain challenging due to specific characteristics inherent in defective images, involving a high degree of foreground–background similarity, scale variation, and shape variation. To address this challenge, we propose an efficient transformer-based detection network, ETDNet, consisting of three novel designs to achieve superior performance. First, ETDNet takes a lightweight vision transformer (ViT) to extract representative global features. This approach ensures an accurate feature characterization of defects even with similar backgrounds. Second, a channel-modulated feature pyramid network (CM-FPN) is devised to fuse multilevel features and maintain critical information from corresponding levels. Finally, a novel task-oriented decoupled (TOD) head is introduced to tackle inconsistent representation between classification and regression tasks. The TOD head employs a local feature representation (LFR) module to learn object-aware local features and introduces a global feature representation (GFR) module, based on the attention mechanism, to learn content-aware global features. By integrating these two modules into the head, ETDNet can effectively classify and perceive defects with varying shapes and scales. Extensive experiments on various defect detection datasets demonstrate the effectiveness of the proposed ETDNet. For instance, it achieves AP 46.7% (versus 45.9%) and $\\mathrm {AP_{50}}~80.2$ % (versus 79.1%) with 49 frames/s on NEU-DET. The code is available at https://github.com/zht8506/ETDNet.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"72 ","pages":"1-14"},"PeriodicalIF":5.9000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10227321/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 1

Abstract

Deep learning (DL)-based surface defect detectors play a crucial role in ensuring product quality during inspection processes. However, accurately and efficiently detecting defects remain challenging due to specific characteristics inherent in defective images, involving a high degree of foreground–background similarity, scale variation, and shape variation. To address this challenge, we propose an efficient transformer-based detection network, ETDNet, consisting of three novel designs to achieve superior performance. First, ETDNet takes a lightweight vision transformer (ViT) to extract representative global features. This approach ensures an accurate feature characterization of defects even with similar backgrounds. Second, a channel-modulated feature pyramid network (CM-FPN) is devised to fuse multilevel features and maintain critical information from corresponding levels. Finally, a novel task-oriented decoupled (TOD) head is introduced to tackle inconsistent representation between classification and regression tasks. The TOD head employs a local feature representation (LFR) module to learn object-aware local features and introduces a global feature representation (GFR) module, based on the attention mechanism, to learn content-aware global features. By integrating these two modules into the head, ETDNet can effectively classify and perceive defects with varying shapes and scales. Extensive experiments on various defect detection datasets demonstrate the effectiveness of the proposed ETDNet. For instance, it achieves AP 46.7% (versus 45.9%) and $\mathrm {AP_{50}}~80.2$ % (versus 79.1%) with 49 frames/s on NEU-DET. The code is available at https://github.com/zht8506/ETDNet.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
ETDNet:用于表面缺陷检测的高效变压器检测网络
基于深度学习(DL)的表面缺陷检测器在确保检测过程中的产品质量方面发挥着至关重要的作用。然而,由于缺陷图像固有的特定特征,包括高度的前景-背景相似性、尺度变化和形状变化,准确有效地检测缺陷仍然具有挑战性。为了应对这一挑战,我们提出了一种高效的基于变压器的检测网络ETDNet,该网络由三种新颖的设计组成,以实现卓越的性能。首先,ETDNet采用轻量级视觉转换器(ViT)来提取具有代表性的全局特征。这种方法确保了缺陷的精确特征表征,即使背景相似。其次,设计了一种信道调制特征金字塔网络(CM-FPN)来融合多级特征并保持来自相应级别的关键信息。最后,引入了一种新的面向任务的解耦(TOD)头来解决分类任务和回归任务之间的不一致表示问题。TOD头采用局部特征表示(LFR)模块来学习感知对象的局部特征,并引入了基于注意力机制的全局特征表示(GFR)模块,以学习感知内容的全局特征。通过将这两个模块集成到头部中,ETDNet可以有效地对不同形状和规模的缺陷进行分类和感知。在各种缺陷检测数据集上进行的大量实验证明了所提出的ETDNet的有效性。例如,它在NEU-DET上以49帧/秒的速度实现了46.7%的AP(对45.9%)和$\mathrm{AP_{50}}~80.2$%(对79.1%)。代码可在https://github.com/zht8506/ETDNet.
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Instrumentation and Measurement
IEEE Transactions on Instrumentation and Measurement 工程技术-工程:电子与电气
CiteScore
9.00
自引率
23.20%
发文量
1294
审稿时长
3.9 months
期刊介绍: Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.
期刊最新文献
Fast In-Motion Alignment for LiDAR–Inertial in Challenging Scenarios EcoSense: An Autonomous Polarization-Insensitive Wireless-Powered Battery-Free IoT Beacon for Live Environmental Sensing Dual Optical Measurement for Early Detection of Tracking in Polymeric Insulators Using Fluorescent Fiber and Emission Spectroscopy Extended-Bandwidth Spacecraft Attitude Jitter Detection Based on Adaptive Sequential Kalman Algorithm Multidirectional Strain-Insensitive Stretchable Temperature Sensor for Active Thermal Protection Flexible Skin
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1