基于任务预训练和半监督学习增强的深度迁移学习的铝合金GTAW焊透可视化监测

IF 6.8 1区 工程技术 Q1 ENGINEERING, MANUFACTURING Journal of Manufacturing Processes Pub Date : 2025-01-17 Epub Date: 2024-12-09 DOI:10.1016/j.jmapro.2024.11.102
Boce Xue , Dong Du , Guodong Peng , Yanzhen Zhang , Runsheng Li , Zixiang Li
{"title":"基于任务预训练和半监督学习增强的深度迁移学习的铝合金GTAW焊透可视化监测","authors":"Boce Xue ,&nbsp;Dong Du ,&nbsp;Guodong Peng ,&nbsp;Yanzhen Zhang ,&nbsp;Runsheng Li ,&nbsp;Zixiang Li","doi":"10.1016/j.jmapro.2024.11.102","DOIUrl":null,"url":null,"abstract":"<div><div>Appropriate weld penetration is of vital significance for ensuring the welding quality of gas tungsten arc welding (GTAW). Visual monitoring based on deep learning has been widely applied in weld penetration monitoring. However, deep learning requires a large number of labeled samples to achieve satisfactory performance. Deep transfer learning (DTL) is an effective technique to address this issue, but the famous ImageNet dataset may not be suitable for pre-training a deep learning model for weld penetration prediction. In this study, a visual monitoring approach for weld penetration of aluminum alloy GTAW based on DTL enhanced by task-specific pre-training and semi-supervised learning (SSL) is proposed to obtain better prediction accuracy of the backside bead width with limited labeled data. Firstly, an active vision method is used to capture images of the weld pool. Next, a task-specific pre-training method is designed by constructing a keypoint localization task to pre-train a deep learning model with an encoder-decoder architecture, and SSL is introduced to reduce the required number of labeled data in pre-training. Finally, an encoder-based regression model is constructed and fine-tuned to predict the backside bead width. It is found that by using SSL in task-specific pre-training, the keypoint localization model trained with only 40 labeled samples can achieve ideal performance, and the performance of SSL outperforms fully-supervised learning (FSL) in terms of both keypoint localization accuracy and robustness to the randomness of labeled training samples. Moreover, the mean prediction error of backside bead width after fine-tuning is only 0.176 mm, which is reduced by 29.9 % compared to using ImageNet for pre-training. The proposed method also has good real-time performance and thus has the capability to be applied in the real-time monitoring and control of weld penetration.</div></div>","PeriodicalId":16148,"journal":{"name":"Journal of Manufacturing Processes","volume":"133 ","pages":"Pages 1038-1050"},"PeriodicalIF":6.8000,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Visual monitoring of weld penetration in aluminum alloy GTAW based on deep transfer learning enhanced by task-specific pre-training and semi-supervised learning\",\"authors\":\"Boce Xue ,&nbsp;Dong Du ,&nbsp;Guodong Peng ,&nbsp;Yanzhen Zhang ,&nbsp;Runsheng Li ,&nbsp;Zixiang Li\",\"doi\":\"10.1016/j.jmapro.2024.11.102\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Appropriate weld penetration is of vital significance for ensuring the welding quality of gas tungsten arc welding (GTAW). Visual monitoring based on deep learning has been widely applied in weld penetration monitoring. However, deep learning requires a large number of labeled samples to achieve satisfactory performance. Deep transfer learning (DTL) is an effective technique to address this issue, but the famous ImageNet dataset may not be suitable for pre-training a deep learning model for weld penetration prediction. In this study, a visual monitoring approach for weld penetration of aluminum alloy GTAW based on DTL enhanced by task-specific pre-training and semi-supervised learning (SSL) is proposed to obtain better prediction accuracy of the backside bead width with limited labeled data. Firstly, an active vision method is used to capture images of the weld pool. Next, a task-specific pre-training method is designed by constructing a keypoint localization task to pre-train a deep learning model with an encoder-decoder architecture, and SSL is introduced to reduce the required number of labeled data in pre-training. Finally, an encoder-based regression model is constructed and fine-tuned to predict the backside bead width. It is found that by using SSL in task-specific pre-training, the keypoint localization model trained with only 40 labeled samples can achieve ideal performance, and the performance of SSL outperforms fully-supervised learning (FSL) in terms of both keypoint localization accuracy and robustness to the randomness of labeled training samples. Moreover, the mean prediction error of backside bead width after fine-tuning is only 0.176 mm, which is reduced by 29.9 % compared to using ImageNet for pre-training. The proposed method also has good real-time performance and thus has the capability to be applied in the real-time monitoring and control of weld penetration.</div></div>\",\"PeriodicalId\":16148,\"journal\":{\"name\":\"Journal of Manufacturing Processes\",\"volume\":\"133 \",\"pages\":\"Pages 1038-1050\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2025-01-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Manufacturing Processes\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1526612524012726\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/12/9 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MANUFACTURING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Manufacturing Processes","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1526612524012726","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/9 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, MANUFACTURING","Score":null,"Total":0}
引用次数: 0

摘要

适当的焊深对保证钨极气体保护焊的焊接质量具有重要意义。基于深度学习的视觉监测在焊透监测中得到了广泛的应用。然而,深度学习需要大量的标记样本才能达到令人满意的性能。深度迁移学习(DTL)是解决这一问题的有效技术,但著名的ImageNet数据集可能不适合预训练用于熔透预测的深度学习模型。本文提出了一种基于任务预训练和半监督学习(SSL)增强的基于DTL的铝合金GTAW焊透可视化监测方法,以在有限的标记数据下获得更好的后焊头宽度预测精度。首先,采用主动视觉方法对焊缝熔池进行图像采集;其次,通过构建关键点定位任务,设计了一种针对特定任务的预训练方法,对具有编码器-解码器架构的深度学习模型进行预训练,并引入SSL来减少预训练中所需的标记数据数量。最后,构建了一个基于编码器的回归模型,并对其进行了微调,以预测后头宽度。研究发现,在特定任务的预训练中使用SSL,仅用40个标记样本训练的关键点定位模型就能达到理想的性能,并且在关键点定位精度和对标记训练样本随机性的鲁棒性方面,SSL的性能优于完全监督学习(FSL)。此外,微调后的后头宽度平均预测误差仅为0.176 mm,与使用ImageNet进行预训练相比降低了29.9%。该方法还具有良好的实时性,可应用于焊透过程的实时监测与控制。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Visual monitoring of weld penetration in aluminum alloy GTAW based on deep transfer learning enhanced by task-specific pre-training and semi-supervised learning
Appropriate weld penetration is of vital significance for ensuring the welding quality of gas tungsten arc welding (GTAW). Visual monitoring based on deep learning has been widely applied in weld penetration monitoring. However, deep learning requires a large number of labeled samples to achieve satisfactory performance. Deep transfer learning (DTL) is an effective technique to address this issue, but the famous ImageNet dataset may not be suitable for pre-training a deep learning model for weld penetration prediction. In this study, a visual monitoring approach for weld penetration of aluminum alloy GTAW based on DTL enhanced by task-specific pre-training and semi-supervised learning (SSL) is proposed to obtain better prediction accuracy of the backside bead width with limited labeled data. Firstly, an active vision method is used to capture images of the weld pool. Next, a task-specific pre-training method is designed by constructing a keypoint localization task to pre-train a deep learning model with an encoder-decoder architecture, and SSL is introduced to reduce the required number of labeled data in pre-training. Finally, an encoder-based regression model is constructed and fine-tuned to predict the backside bead width. It is found that by using SSL in task-specific pre-training, the keypoint localization model trained with only 40 labeled samples can achieve ideal performance, and the performance of SSL outperforms fully-supervised learning (FSL) in terms of both keypoint localization accuracy and robustness to the randomness of labeled training samples. Moreover, the mean prediction error of backside bead width after fine-tuning is only 0.176 mm, which is reduced by 29.9 % compared to using ImageNet for pre-training. The proposed method also has good real-time performance and thus has the capability to be applied in the real-time monitoring and control of weld penetration.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Manufacturing Processes
Journal of Manufacturing Processes ENGINEERING, MANUFACTURING-
CiteScore
10.20
自引率
11.30%
发文量
833
审稿时长
50 days
期刊介绍: The aim of the Journal of Manufacturing Processes (JMP) is to exchange current and future directions of manufacturing processes research, development and implementation, and to publish archival scholarly literature with a view to advancing state-of-the-art manufacturing processes and encouraging innovation for developing new and efficient processes. The journal will also publish from other research communities for rapid communication of innovative new concepts. Special-topic issues on emerging technologies and invited papers will also be published.
期刊最新文献
Achieving superior high-temperature strength-ductility in LPBF-processed aluminum matrix composite via in-situ oxide/intermetallic synergistically stabilizing AlSi eutectic networks From cut to crack: Modeling the shear-affected zone and its role in edge cracking of dual-phase steels A novel cellular automaton framework for elucidating process-dependent grain evolution during wire-arc directed energy deposition Decoding a novel process for feed-assisted progressive micropunching: Homogenized hardness and coordinated material flow enabled by microstructural refinement Multivariable waveform optimization for high-precision droplet volume control in Pico-liter resolution OLED printing based on hybrid inkjet model
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1