SDD-DETR:带检测变压器的停用航空发动机叶片表面缺陷检测系统

IF 6.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automation Science and Engineering Pub Date : 2024-09-17 DOI:10.1109/TASE.2024.3457829
Xiangkun Sun;Kechen Song;Xin Wen;Yanyan Wang;Yunhui Yan
{"title":"SDD-DETR:带检测变压器的停用航空发动机叶片表面缺陷检测系统","authors":"Xiangkun Sun;Kechen Song;Xin Wen;Yanyan Wang;Yunhui Yan","doi":"10.1109/TASE.2024.3457829","DOIUrl":null,"url":null,"abstract":"Vision-based surface defect detection (SDD) for no-service aero-engine blades provides a fast and effective way to monitor product quality. Most existing detection algorithms for aero-engine blades are 1) based on CNN, including artificially designed non-maximum suppression (NMS) operations, and 2) focus on improving the detection accuracy rather than improving the inference speed and even ignoring the latter. To solve the above problems, we introduce a novel object detection paradigm, DEtection TRansformer (DETR), to design a novel network (SDD-DETR) with high accuracy for the SDD of aero-engine blades. To our knowledge, the paper is the first to introduce the DETR detector to SDD of aero-engine blades. While providing high accuracy, the inference speed of DETR remained slow due to self-attention operation and feed-forward network (FFN). Therefore, two lightweight modules have been designed for SDD of aero-engine blades: a progressive feature input multi-scale deformable attention module (PFI-MSDA) and a lightweight FFN (LW-FFN). PFI-MSDA hierarchically reduces the number of tokens input to the self-attention module, thereby reducing the time complexity of the self-attention layer. LW-FFN shrinks the complexity of multilayer perceptron. In addition, no parameter sharing of the detection head is utilized to compensate for the accuracy drop caused by the lightweight. Experiments verify that our method has the same AP and F1-score as DINO (a DETR-based detector), but our approach is lighter. Compared with DINO, the FLOPs are reduced by <inline-formula> <tex-math>$113.4{G}$ </tex-math></inline-formula>, the inference speed is increased by 42.4%, and the runtime memory usage is reduced by <inline-formula> <tex-math>$5.9{G}$ </tex-math></inline-formula>, which allows our method to be trained on low-end GPUs with more batch size, further improving the training efficiency. The code is available at <uri>https://github.com/VDT-2048/SDD-DETR</uri>.Note to Practitioners—The motivation for this paper is to design a high-precision and high-inference speed visual detection method for the SDD of aero-engine blades. Most high-precision vision methods are based on the transformer framework. However, its high complexity and poor compatibility in deployment environments lead to slower detection speeds. Although the application object in this paper is aero-engine blades, it is also applicable in other fields of industry, such as rail detection, plate and strip steel detection, etc. However, the method proposed cannot be supported by deployment environments such as RKNN because of the deformable attention operator, so it takes a certain amount of time to be deployed and put into practical use. Currently, the frameworks of visual and language large models are based on transformers, consistent with the framework of our method, which makes extending our approach to large visual and multi-modal models more accessible.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"6984-6997"},"PeriodicalIF":6.4000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SDD-DETR: Surface Defect Detection for No-Service Aero-Engine Blades With Detection Transformer\",\"authors\":\"Xiangkun Sun;Kechen Song;Xin Wen;Yanyan Wang;Yunhui Yan\",\"doi\":\"10.1109/TASE.2024.3457829\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Vision-based surface defect detection (SDD) for no-service aero-engine blades provides a fast and effective way to monitor product quality. Most existing detection algorithms for aero-engine blades are 1) based on CNN, including artificially designed non-maximum suppression (NMS) operations, and 2) focus on improving the detection accuracy rather than improving the inference speed and even ignoring the latter. To solve the above problems, we introduce a novel object detection paradigm, DEtection TRansformer (DETR), to design a novel network (SDD-DETR) with high accuracy for the SDD of aero-engine blades. To our knowledge, the paper is the first to introduce the DETR detector to SDD of aero-engine blades. While providing high accuracy, the inference speed of DETR remained slow due to self-attention operation and feed-forward network (FFN). Therefore, two lightweight modules have been designed for SDD of aero-engine blades: a progressive feature input multi-scale deformable attention module (PFI-MSDA) and a lightweight FFN (LW-FFN). PFI-MSDA hierarchically reduces the number of tokens input to the self-attention module, thereby reducing the time complexity of the self-attention layer. LW-FFN shrinks the complexity of multilayer perceptron. In addition, no parameter sharing of the detection head is utilized to compensate for the accuracy drop caused by the lightweight. Experiments verify that our method has the same AP and F1-score as DINO (a DETR-based detector), but our approach is lighter. Compared with DINO, the FLOPs are reduced by <inline-formula> <tex-math>$113.4{G}$ </tex-math></inline-formula>, the inference speed is increased by 42.4%, and the runtime memory usage is reduced by <inline-formula> <tex-math>$5.9{G}$ </tex-math></inline-formula>, which allows our method to be trained on low-end GPUs with more batch size, further improving the training efficiency. The code is available at <uri>https://github.com/VDT-2048/SDD-DETR</uri>.Note to Practitioners—The motivation for this paper is to design a high-precision and high-inference speed visual detection method for the SDD of aero-engine blades. Most high-precision vision methods are based on the transformer framework. However, its high complexity and poor compatibility in deployment environments lead to slower detection speeds. Although the application object in this paper is aero-engine blades, it is also applicable in other fields of industry, such as rail detection, plate and strip steel detection, etc. However, the method proposed cannot be supported by deployment environments such as RKNN because of the deformable attention operator, so it takes a certain amount of time to be deployed and put into practical use. Currently, the frameworks of visual and language large models are based on transformers, consistent with the framework of our method, which makes extending our approach to large visual and multi-modal models more accessible.\",\"PeriodicalId\":51060,\"journal\":{\"name\":\"IEEE Transactions on Automation Science and Engineering\",\"volume\":\"22 \",\"pages\":\"6984-6997\"},\"PeriodicalIF\":6.4000,\"publicationDate\":\"2024-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Automation Science and Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10681441/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10681441/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

基于视觉的航空发动机叶片表面缺陷检测为产品质量监测提供了一种快速有效的方法。现有的航空发动机叶片检测算法大多是1)基于CNN,包括人为设计的非最大抑制(NMS)操作;2)侧重于提高检测精度,而不是提高推理速度,甚至忽略了后者。为了解决上述问题,本文引入一种新的目标检测范式——检测变压器(detection TRansformer, DETR),设计了一种高精度的航空发动机叶片目标检测网络(SDD-DETR)。据我们所知,本文首次将DETR检测器引入到航空发动机叶片SDD中。在提供高准确率的同时,由于自注意操作和前馈网络(FFN)的原因,DETR的推理速度仍然很慢。因此,针对航空发动机叶片的SDD设计了两个轻量化模块:渐进式特征输入多尺度可变形注意模块(PFI-MSDA)和轻量化FFN (LW-FFN)。PFI-MSDA分层地减少了自关注模块的令牌输入数量,从而降低了自关注层的时间复杂度。LW-FFN减小了多层感知器的复杂度。此外,没有使用检测头的参数共享来补偿由于轻量化而导致的精度下降。实验证明,我们的方法具有与DINO(基于der的检测器)相同的AP和f1分数,但我们的方法更轻。与DINO相比,FLOPs降低了$113.4{G}$,推理速度提高了42.4%,运行时内存使用量降低了$5.9{G}$,使我们的方法能够在更批量的低端gpu上进行训练,进一步提高了训练效率。本文的动机是设计一种高精度、高推理速度的航空发动机叶片SDD视觉检测方法。大多数高精度视觉方法都是基于变压器框架的。但是,它的高复杂性和在部署环境中的兼容性较差导致检测速度较慢。虽然本文的应用对象是航空发动机叶片,但也适用于其他工业领域,如钢轨检测、板带钢检测等。然而,由于注意算子的可变形性,所提出的方法不支持RKNN等部署环境,需要一定的时间才能部署并投入实际使用。目前,可视化和语言大型模型的框架是基于转换的,这与我们方法的框架是一致的,这使得我们的方法更容易扩展到大型可视化和多模态模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
SDD-DETR: Surface Defect Detection for No-Service Aero-Engine Blades With Detection Transformer
Vision-based surface defect detection (SDD) for no-service aero-engine blades provides a fast and effective way to monitor product quality. Most existing detection algorithms for aero-engine blades are 1) based on CNN, including artificially designed non-maximum suppression (NMS) operations, and 2) focus on improving the detection accuracy rather than improving the inference speed and even ignoring the latter. To solve the above problems, we introduce a novel object detection paradigm, DEtection TRansformer (DETR), to design a novel network (SDD-DETR) with high accuracy for the SDD of aero-engine blades. To our knowledge, the paper is the first to introduce the DETR detector to SDD of aero-engine blades. While providing high accuracy, the inference speed of DETR remained slow due to self-attention operation and feed-forward network (FFN). Therefore, two lightweight modules have been designed for SDD of aero-engine blades: a progressive feature input multi-scale deformable attention module (PFI-MSDA) and a lightweight FFN (LW-FFN). PFI-MSDA hierarchically reduces the number of tokens input to the self-attention module, thereby reducing the time complexity of the self-attention layer. LW-FFN shrinks the complexity of multilayer perceptron. In addition, no parameter sharing of the detection head is utilized to compensate for the accuracy drop caused by the lightweight. Experiments verify that our method has the same AP and F1-score as DINO (a DETR-based detector), but our approach is lighter. Compared with DINO, the FLOPs are reduced by $113.4{G}$ , the inference speed is increased by 42.4%, and the runtime memory usage is reduced by $5.9{G}$ , which allows our method to be trained on low-end GPUs with more batch size, further improving the training efficiency. The code is available at https://github.com/VDT-2048/SDD-DETR.Note to Practitioners—The motivation for this paper is to design a high-precision and high-inference speed visual detection method for the SDD of aero-engine blades. Most high-precision vision methods are based on the transformer framework. However, its high complexity and poor compatibility in deployment environments lead to slower detection speeds. Although the application object in this paper is aero-engine blades, it is also applicable in other fields of industry, such as rail detection, plate and strip steel detection, etc. However, the method proposed cannot be supported by deployment environments such as RKNN because of the deformable attention operator, so it takes a certain amount of time to be deployed and put into practical use. Currently, the frameworks of visual and language large models are based on transformers, consistent with the framework of our method, which makes extending our approach to large visual and multi-modal models more accessible.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Transactions on Automation Science and Engineering
IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统
CiteScore
12.50
自引率
14.30%
发文量
404
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.
期刊最新文献
Six-DoF Coupled Dynamics Modeling and Intelligent Vibration Suppression of CMG With the Flexible Vibration Isolator Consensus-Based Privacy-Preserving Energy Management Strategies Based on Output Mask Approaches Outshining the Origin: A Pseudo Sources Fusion Approach via Knowledge Distillation with Feature Decoupling for Domain Generalization in Fault Diagnosis Adaptive Prescribed Performance Secure Consensus Control for High-Order Constrained Multi-Agent Systems With Time-Varying Powers and Arbitrary Initial States AS-TEB: An Adaptive and Smooth Timed Elastic Band-Based Path Planner for Mobile Robots
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1