用于激光扫描焊接中焊点成形监测的交叉注意时间序列多特征融合视觉变换器

IF 7.9 1区 工程技术 Q1 ENGINEERING, MECHANICAL Mechanical Systems and Signal Processing Pub Date : 2025-03-03 DOI:10.1016/j.ymssp.2025.112531
Shenghong Yan , Bo Chen , Han Gao , Caiwang Tan , Xiaoguo Song , Guodong Wang
{"title":"用于激光扫描焊接中焊点成形监测的交叉注意时间序列多特征融合视觉变换器","authors":"Shenghong Yan ,&nbsp;Bo Chen ,&nbsp;Han Gao ,&nbsp;Caiwang Tan ,&nbsp;Xiaoguo Song ,&nbsp;Guodong Wang","doi":"10.1016/j.ymssp.2025.112531","DOIUrl":null,"url":null,"abstract":"<div><div>As laser scanning welding technology matures in engineering applications, it is a crucial step in developing diagnostics capable of monitoring weld joint forming and meeting the demands of increasingly structurally complex products. In this work, a unique multivariate time-series dataset encompassing keyhole and molten pool image streams was extracted from the collected visual signals. Keyhole and molten pool were respectively fed into a proposed Transformer-based model with two-branches, which incorporated multi-head self-attention and cross-attention mechanisms. The results show that the optimal architecture achieved an accuracy of 99.3%, which outperforms the previous state-of-the-art image-based models. The optimization and ablation experiments have also verified that the temporal characteristics of signals are one of the significant determining factors for the accuracy of laser scanning welding state recognition. The score maps of attention mechanism during the decision-making process demonstrate that the proposed model is able to accurately learn the time-series characteristics of keyhole and molten pool visual signals, exhibiting exceptional capability in effectively capturing fine-grained details of highly dynamic objects from visual signals under varying welding states. In summary, its excellent performance and visualization of the attention mechanism make it a promising diagnostic functional module as a novel strategy for laser scanning welded Joint formation monitoring.</div></div>","PeriodicalId":51124,"journal":{"name":"Mechanical Systems and Signal Processing","volume":"229 ","pages":"Article 112531"},"PeriodicalIF":7.9000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Cross-attention time-series multi-feature fusion vision transformer for joint formation monitoring in laser scanning welding\",\"authors\":\"Shenghong Yan ,&nbsp;Bo Chen ,&nbsp;Han Gao ,&nbsp;Caiwang Tan ,&nbsp;Xiaoguo Song ,&nbsp;Guodong Wang\",\"doi\":\"10.1016/j.ymssp.2025.112531\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>As laser scanning welding technology matures in engineering applications, it is a crucial step in developing diagnostics capable of monitoring weld joint forming and meeting the demands of increasingly structurally complex products. In this work, a unique multivariate time-series dataset encompassing keyhole and molten pool image streams was extracted from the collected visual signals. Keyhole and molten pool were respectively fed into a proposed Transformer-based model with two-branches, which incorporated multi-head self-attention and cross-attention mechanisms. The results show that the optimal architecture achieved an accuracy of 99.3%, which outperforms the previous state-of-the-art image-based models. The optimization and ablation experiments have also verified that the temporal characteristics of signals are one of the significant determining factors for the accuracy of laser scanning welding state recognition. The score maps of attention mechanism during the decision-making process demonstrate that the proposed model is able to accurately learn the time-series characteristics of keyhole and molten pool visual signals, exhibiting exceptional capability in effectively capturing fine-grained details of highly dynamic objects from visual signals under varying welding states. In summary, its excellent performance and visualization of the attention mechanism make it a promising diagnostic functional module as a novel strategy for laser scanning welded Joint formation monitoring.</div></div>\",\"PeriodicalId\":51124,\"journal\":{\"name\":\"Mechanical Systems and Signal Processing\",\"volume\":\"229 \",\"pages\":\"Article 112531\"},\"PeriodicalIF\":7.9000,\"publicationDate\":\"2025-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mechanical Systems and Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0888327025002328\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MECHANICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mechanical Systems and Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0888327025002328","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MECHANICAL","Score":null,"Total":0}
引用次数: 0

摘要

随着激光扫描焊接技术在工程应用中的不断成熟,开发能够监测焊点成型的诊断技术,满足结构日益复杂的产品的需求,是至关重要的一步。在这项工作中,我们从收集到的视觉信号中提取了一个独特的多变量时间序列数据集,其中包括钥匙孔和熔池图像流。钥匙孔和熔池分别被输入到一个基于变压器的双分支模型中,该模型包含多头自注意和交叉注意机制。结果表明,最优架构的准确率达到了 99.3%,优于之前最先进的基于图像的模型。优化和烧蚀实验还验证了信号的时间特性是激光扫描焊接状态识别准确性的重要决定因素之一。决策过程中注意力机制的得分图表明,所提出的模型能够准确学习钥匙孔和熔池视觉信号的时间序列特征,在不同焊接状态下从视觉信号中有效捕捉高动态物体的细粒度细节方面表现出卓越的能力。总之,作为激光扫描焊接接头成形监测的一种新策略,其卓越的性能和关注机制的可视化使其成为一种很有前途的诊断功能模块。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Cross-attention time-series multi-feature fusion vision transformer for joint formation monitoring in laser scanning welding
As laser scanning welding technology matures in engineering applications, it is a crucial step in developing diagnostics capable of monitoring weld joint forming and meeting the demands of increasingly structurally complex products. In this work, a unique multivariate time-series dataset encompassing keyhole and molten pool image streams was extracted from the collected visual signals. Keyhole and molten pool were respectively fed into a proposed Transformer-based model with two-branches, which incorporated multi-head self-attention and cross-attention mechanisms. The results show that the optimal architecture achieved an accuracy of 99.3%, which outperforms the previous state-of-the-art image-based models. The optimization and ablation experiments have also verified that the temporal characteristics of signals are one of the significant determining factors for the accuracy of laser scanning welding state recognition. The score maps of attention mechanism during the decision-making process demonstrate that the proposed model is able to accurately learn the time-series characteristics of keyhole and molten pool visual signals, exhibiting exceptional capability in effectively capturing fine-grained details of highly dynamic objects from visual signals under varying welding states. In summary, its excellent performance and visualization of the attention mechanism make it a promising diagnostic functional module as a novel strategy for laser scanning welded Joint formation monitoring.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Mechanical Systems and Signal Processing
Mechanical Systems and Signal Processing 工程技术-工程:机械
CiteScore
14.80
自引率
13.10%
发文量
1183
审稿时长
5.4 months
期刊介绍: Journal Name: Mechanical Systems and Signal Processing (MSSP) Interdisciplinary Focus: Mechanical, Aerospace, and Civil Engineering Purpose:Reporting scientific advancements of the highest quality Arising from new techniques in sensing, instrumentation, signal processing, modelling, and control of dynamic systems
期刊最新文献
Unidirectional Frequency-Steerable Acoustic Transducer for guided ultrasonic wave damage imaging An efficient stochastic harmonic function approach for the simulation of 3-directional wind field of large wind turbines based on physical turbulent spectral model Causal inference dynamic modeling for real-time surface roughness monitoring in the milling process Robust vision-based estimation of structural parameters using Kalman filtering A fast system estimation algorithm for a discontinuous dynamical model with coefficients coupling
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1