Rapid inference for penetration prediction of plasma arc welding using enhanced ShuffleNetV2 and FOS-ELM

IF 2.4 4区材料科学 Q2 METALLURGY & METALLURGICAL ENGINEERING Welding in the World Pub Date : 2024-08-10 DOI:10.1007/s40194-024-01818-1

Zhi Zeng, Yuancheng Yang, Junrui Yuan, Bojin Qi

{"title":"Rapid inference for penetration prediction of plasma arc welding using enhanced ShuffleNetV2 and FOS-ELM","authors":"Zhi Zeng, Yuancheng Yang, Junrui Yuan, Bojin Qi","doi":"10.1007/s40194-024-01818-1","DOIUrl":null,"url":null,"abstract":"<div><p>Vision sensing is commonly employed in monitoring the forming process of medium and thick plate in plasma arc welding (PAW). However, due to physical constraints, direct observation of the backside forming process is impractical. Therefore, the weld image on the workpiece’s topside is commonly used to assess weld penetration status. Previous research typically relied on regression and machine learning algorithms to establish this relationship, while recent studies have employed deep learning methods for higher prediction accuracy, but they are computationally demanding, limiting real-time applications in welding. This study aims to improve deep learning model prediction times during welding. We avoid recursive neural network (RNN), vision transformer (ViT), and other high-accuracy approaches with significant computational overhead, opting instead for convolutional neural networks (CNN) for better real-time performance. After comparing six classical CNNs, ShuffleNetV2 backbone was chosen to extract features for its fast computational speed and high prediction accuracy. Innovatively, online sequential extreme learning machine with a forgetting mechanism (FOS-ELM) was introduced to classify penetration status instead of traditional full-layer classification for its high accuracy and speed. Welding experiments on a genuine embedded system validate our approach, reaching a prediction accuracy exceeding 94% on a small dataset, with a prediction time of just 5 ms per welded frame, meeting industrial-grade applications. On the basis of the ShuffleNetV2 backbone and OS-ELM model, transfer learning is used to expedite prediction convergence, while the squeeze excitation (SE) module is employed to enhance accuracy without compromising speed. Moreover, the model’s alignment with skilled welders’ key observation points is visually verified by using gradient-weighted class activation mapping (Grad-CAM). Finally, the deployment of the model in ONNX format on an industrial PC demonstrates its suitability for real-world PAW operations. Vision sensing is crucial for monitoring plasma arc welding (PAW) of medium and thick plates. However, direct observation of the backside formation process is impractical due to certain physical constraints. Therefore, weld image analysis from the workpiece’s topside is commonly used to assess weld penetration. Previous studies relied on fitting and machine learning algorithms, but recent research has shifted towards deep learning for improved accuracy. However, deep learning methods are computationally intensive, limiting their real-time application in welding. This study aims to enhance deep learning model prediction speed during welding by avoiding computationally demanding approaches like recurrent neural networks (RNNs) and vision transformers (ViTs). Instead, we utilize convolutional neural network (CNN) backbones for improved real-time performance. After evaluating six classical CNNs, we selected the ShuffleNetV2 backbone for its fast computational speed and high accuracy and introduced the online sequential extreme learning machine with a forgetting mechanism (FOS-ELM) for classification, achieving high accuracy and speed. Welding experiments validated the proposed approach, achieving over 94% prediction accuracy on a small dataset, with a prediction time of just 5 ms per welded frame. Transfer learning with the ShuffleNetV2 backbone and OS-ELM model expedited prediction convergence. The squeeze-and-excitation (SE) module enhanced accuracy without sacrificing speed. Visualization using gradient-weighted class activation mapping (Grad-CAM) verified the model’s alignment with skilled welders’ observations. Finally, deploying the model in ONNX format on an industrial PC demonstrated its suitability for real-world PAW operations.</p></div>","PeriodicalId":809,"journal":{"name":"Welding in the World","volume":"68 10","pages":"2625 - 2645"},"PeriodicalIF":2.4000,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Welding in the World","FirstCategoryId":"88","ListUrlMain":"https://link.springer.com/article/10.1007/s40194-024-01818-1","RegionNum":4,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"METALLURGY & METALLURGICAL ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Vision sensing is commonly employed in monitoring the forming process of medium and thick plate in plasma arc welding (PAW). However, due to physical constraints, direct observation of the backside forming process is impractical. Therefore, the weld image on the workpiece’s topside is commonly used to assess weld penetration status. Previous research typically relied on regression and machine learning algorithms to establish this relationship, while recent studies have employed deep learning methods for higher prediction accuracy, but they are computationally demanding, limiting real-time applications in welding. This study aims to improve deep learning model prediction times during welding. We avoid recursive neural network (RNN), vision transformer (ViT), and other high-accuracy approaches with significant computational overhead, opting instead for convolutional neural networks (CNN) for better real-time performance. After comparing six classical CNNs, ShuffleNetV2 backbone was chosen to extract features for its fast computational speed and high prediction accuracy. Innovatively, online sequential extreme learning machine with a forgetting mechanism (FOS-ELM) was introduced to classify penetration status instead of traditional full-layer classification for its high accuracy and speed. Welding experiments on a genuine embedded system validate our approach, reaching a prediction accuracy exceeding 94% on a small dataset, with a prediction time of just 5 ms per welded frame, meeting industrial-grade applications. On the basis of the ShuffleNetV2 backbone and OS-ELM model, transfer learning is used to expedite prediction convergence, while the squeeze excitation (SE) module is employed to enhance accuracy without compromising speed. Moreover, the model’s alignment with skilled welders’ key observation points is visually verified by using gradient-weighted class activation mapping (Grad-CAM). Finally, the deployment of the model in ONNX format on an industrial PC demonstrates its suitability for real-world PAW operations. Vision sensing is crucial for monitoring plasma arc welding (PAW) of medium and thick plates. However, direct observation of the backside formation process is impractical due to certain physical constraints. Therefore, weld image analysis from the workpiece’s topside is commonly used to assess weld penetration. Previous studies relied on fitting and machine learning algorithms, but recent research has shifted towards deep learning for improved accuracy. However, deep learning methods are computationally intensive, limiting their real-time application in welding. This study aims to enhance deep learning model prediction speed during welding by avoiding computationally demanding approaches like recurrent neural networks (RNNs) and vision transformers (ViTs). Instead, we utilize convolutional neural network (CNN) backbones for improved real-time performance. After evaluating six classical CNNs, we selected the ShuffleNetV2 backbone for its fast computational speed and high accuracy and introduced the online sequential extreme learning machine with a forgetting mechanism (FOS-ELM) for classification, achieving high accuracy and speed. Welding experiments validated the proposed approach, achieving over 94% prediction accuracy on a small dataset, with a prediction time of just 5 ms per welded frame. Transfer learning with the ShuffleNetV2 backbone and OS-ELM model expedited prediction convergence. The squeeze-and-excitation (SE) module enhanced accuracy without sacrificing speed. Visualization using gradient-weighted class activation mapping (Grad-CAM) verified the model’s alignment with skilled welders’ observations. Finally, deploying the model in ONNX format on an industrial PC demonstrated its suitability for real-world PAW operations.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

使用增强型 ShuffleNetV2 和 FOS-ELM 对等离子弧焊的熔透预测进行快速推理

在等离子弧焊（PAW）中，视觉传感通常用于监控中厚板的成形过程。然而，由于物理条件的限制，直接观察背面成形过程并不现实。因此，通常使用工件顶部的焊缝图像来评估焊缝熔透状态。以往的研究通常依靠回归和机器学习算法来建立这种关系，而最近的研究则采用了深度学习方法来提高预测精度，但这些方法对计算要求较高，限制了在焊接中的实时应用。本研究旨在提高深度学习模型在焊接过程中的预测时间。我们避免使用递归神经网络（RNN）、视觉变换器（ViT）和其他计算量巨大的高精度方法，而是选择了卷积神经网络（CNN），以获得更好的实时性能。在比较了六种经典的卷积神经网络之后，我们选择了 ShuffleNetV2 骨干网络来提取特征，因为它计算速度快，预测准确率高。创新性地引入了具有遗忘机制的在线连续极端学习机（FOS-ELM）来对渗透状态进行分类，而不是传统的全层分类，因为其准确率高且速度快。在真正的嵌入式系统上进行的焊接实验验证了我们的方法，在小型数据集上的预测准确率超过 94%，每焊接帧的预测时间仅为 5 毫秒，满足了工业级应用的要求。在 ShuffleNetV2 主干网和 OS-ELM 模型的基础上，利用迁移学习加快预测收敛，同时利用挤压激励（SE）模块在不影响速度的情况下提高准确性。此外，利用梯度加权类激活映射（Grad-CAM）直观地验证了模型与熟练焊工关键观察点的一致性。最后，在工业 PC 上部署 ONNX 格式的模型，证明了该模型适用于实际的 PAW 操作。视觉传感对于监控中厚板等离子弧焊（PAW）至关重要。然而，由于某些物理限制，直接观察背面成形过程是不切实际的。因此，通常使用工件顶部的焊接图像分析来评估焊缝熔透情况。以前的研究依赖于拟合和机器学习算法，但最近的研究已转向深度学习，以提高准确性。然而，深度学习方法计算密集，限制了其在焊接中的实时应用。本研究旨在提高深度学习模型在焊接过程中的预测速度，避免使用计算量大的方法，如递归神经网络（RNN）和视觉转换器（ViT）。相反，我们利用卷积神经网络（CNN）骨干来提高实时性能。在对六种经典卷积神经网络进行评估后，我们选择了计算速度快、准确率高的 ShuffleNetV2 骨干网，并引入了具有遗忘机制的在线连续极限学习机（FOS-ELM）进行分类，从而实现了高准确率和高速度。焊接实验验证了所提出的方法，在一个小型数据集上实现了 94% 以上的预测准确率，每个焊接帧的预测时间仅为 5 毫秒。使用 ShuffleNetV2 主干网和 OS-ELM 模型的迁移学习加快了预测的收敛速度。挤压激励（SE）模块在不影响速度的情况下提高了准确性。使用梯度加权类激活映射（Grad-CAM）进行的可视化验证了模型与熟练焊工观察结果的一致性。最后，在工业 PC 上部署 ONNX 格式的模型，证明了该模型适用于实际的 PAW 操作。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Welding in the World METALLURGY & METALLURGICAL ENGINEERING-

CiteScore

4.20

自引率

14.30%

发文量

181

审稿时长

6-12 weeks

期刊介绍： The journal Welding in the World publishes authoritative papers on every aspect of materials joining, including welding, brazing, soldering, cutting, thermal spraying and allied joining and fabrication techniques.