首页 > 最新文献

Journal of Manufacturing Systems最新文献

英文 中文
A comprehensive survey for real-world industrial surface defect detection: Challenges, approaches, and prospects 现实世界工业表面缺陷检测的综合调查:挑战、方法和前景
IF 14.2 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Pub Date : 2025-12-12 DOI: 10.1016/j.jmsy.2025.11.022
Yuqi Cheng , Yunkang Cao , Haiming Yao , Wei Luo , Cheng Jiang , Hui Zhang , Weiming Shen
Industrial surface defect detection is vital for upholding product quality across contemporary manufacturing systems. As the expectations for precision, automation, and scalability intensify, conventional inspection approaches are increasingly found wanting in addressing real-world demands. Notable progress in computer vision and deep learning has substantially bolstered defect detection capabilities across both 2D and 3D modalities. A significant development has been the pivot from closed-set to open-set defect detection frameworks, which diminishes the necessity for extensive defect annotations and facilitates the recognition of novel anomalies. Despite such strides, a cohesive and contemporary understanding of industrial defect detection remains elusive. Consequently, this survey delivers an in-depth analysis of both closed-set and open-set defect detection strategies within 2D and 3D modalities, charting their evolution in recent years and underscoring the rising prominence of open-set techniques. We distill critical challenges inherent in practical detection environments and illuminate emerging trends, thereby providing a current and comprehensive vista of this swiftly progressing field.
工业表面缺陷检测对于维护当代制造系统的产品质量至关重要。随着对精度、自动化和可扩展性的期望的增强,传统的检查方法越来越难以满足现实世界的需求。计算机视觉和深度学习的显著进步大大增强了2D和3D模式的缺陷检测能力。一个重要的发展是从闭集缺陷检测框架转向开集缺陷检测框架,这减少了大量缺陷注释的必要性,并促进了对新异常的识别。尽管取得了这样的进步,但对工业缺陷检测的连贯和现代理解仍然难以捉摸。因此,本调查对2D和3D模式下的闭集和开集缺陷检测策略进行了深入分析,绘制了它们近年来的演变图表,并强调了开集技术的日益突出。我们提炼了实际检测环境中固有的关键挑战,并阐明了新兴趋势,从而为这一迅速发展的领域提供了当前和全面的前景。
{"title":"A comprehensive survey for real-world industrial surface defect detection: Challenges, approaches, and prospects","authors":"Yuqi Cheng ,&nbsp;Yunkang Cao ,&nbsp;Haiming Yao ,&nbsp;Wei Luo ,&nbsp;Cheng Jiang ,&nbsp;Hui Zhang ,&nbsp;Weiming Shen","doi":"10.1016/j.jmsy.2025.11.022","DOIUrl":"10.1016/j.jmsy.2025.11.022","url":null,"abstract":"<div><div>Industrial surface defect detection is vital for upholding product quality across contemporary manufacturing systems. As the expectations for precision, automation, and scalability intensify, conventional inspection approaches are increasingly found wanting in addressing real-world demands. Notable progress in computer vision and deep learning has substantially bolstered defect detection capabilities across both 2D and 3D modalities. A significant development has been the pivot from closed-set to open-set defect detection frameworks, which diminishes the necessity for extensive defect annotations and facilitates the recognition of novel anomalies. Despite such strides, a cohesive and contemporary understanding of industrial defect detection remains elusive. Consequently, this survey delivers an in-depth analysis of both closed-set and open-set defect detection strategies within 2D and 3D modalities, charting their evolution in recent years and underscoring the rising prominence of open-set techniques. We distill critical challenges inherent in practical detection environments and illuminate emerging trends, thereby providing a current and comprehensive vista of this swiftly progressing field.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"84 ","pages":"Pages 152-172"},"PeriodicalIF":14.2,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145786611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep unsupervised learning-based supplier selection and ranking for assembly manufacturing 基于深度无监督学习的装配制造供应商选择与排序
IF 14.2 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Pub Date : 2025-12-12 DOI: 10.1016/j.jmsy.2025.12.004
Suyoung Park, Shreyes N. Melkote
Conventional supplier selection methods for assembled products have primarily relied on qualitative or business-level assessments of supplier capabilities, since manufacturing-related metrics such as product geometry, cost, time, and tolerance are heterogeneous and difficult to integrate into a unified evaluation. This reliance makes the identification of suppliers with adequate manufacturing capability particularly challenging as global supply chains grow increasingly complex. To address this gap, we propose the Deep Unsupervised Assembly Supplier Matcher (DU-ASM), an integrated data-driven framework that jointly embeds geometry, topology, and quantitative manufacturing attributes into a unified latent space for assembly-level supplier selection and ranking. Leveraging a graph autoencoder, DU-ASM reconstructs manufacturing attributes and supports robust supplier selection even with incomplete inputs. Experimental validation across multiple case studies demonstrates that DU-ASM achieves over 95 % supplier selection accuracy under complete requirements and over 90 % with partially masked inputs, while attaining mean normalized Discounted Cumulative Gain scores at top-k positions (nDCG@k) exceeding 0.99 in ranking tasks. By linking geometric, topological, and quantitative data, DU-ASM demonstrates both methodological novelty and strong quantitative performance, providing a scalable foundation for supplier matching at the assembly level and supporting multi-tier decision-making in future manufacturing supply networks.
装配产品的传统供应商选择方法主要依赖于对供应商能力的定性或业务级评估,因为与制造相关的指标(如产品几何形状、成本、时间和公差)是异构的,难以集成到统一的评估中。随着全球供应链变得越来越复杂,这种依赖使得识别具有足够制造能力的供应商尤其具有挑战性。为了解决这一差距,我们提出了深度无监督装配供应商匹配器(DU-ASM),这是一个集成的数据驱动框架,它将几何、拓扑和定量制造属性共同嵌入到一个统一的潜在空间中,用于装配级供应商的选择和排名。利用图形自动编码器,DU-ASM重建制造属性,即使在输入不完整的情况下也支持稳健的供应商选择。跨多个案例研究的实验验证表明,在完全要求下,DU-ASM的供应商选择准确率超过95 %,在部分屏蔽输入下,该方法的供应商选择准确率超过90 %,而在排名任务中,排名前k位(nDCG@k)的平均标准化贴现累积增益得分超过0.99。通过连接几何、拓扑和定量数据,DU-ASM展示了方法的新颖性和强大的定量性能,为装配级供应商匹配提供了可扩展的基础,并支持未来制造供应网络的多层决策。
{"title":"Deep unsupervised learning-based supplier selection and ranking for assembly manufacturing","authors":"Suyoung Park,&nbsp;Shreyes N. Melkote","doi":"10.1016/j.jmsy.2025.12.004","DOIUrl":"10.1016/j.jmsy.2025.12.004","url":null,"abstract":"<div><div>Conventional supplier selection methods for assembled products have primarily relied on qualitative or business-level assessments of supplier capabilities, since manufacturing-related metrics such as product geometry, cost, time, and tolerance are heterogeneous and difficult to integrate into a unified evaluation. This reliance makes the identification of suppliers with adequate manufacturing capability particularly challenging as global supply chains grow increasingly complex. To address this gap, we propose the Deep Unsupervised Assembly Supplier Matcher (DU-ASM), an integrated data-driven framework that jointly embeds geometry, topology, and quantitative manufacturing attributes into a unified latent space for assembly-level supplier selection and ranking. Leveraging a graph autoencoder, DU-ASM reconstructs manufacturing attributes and supports robust supplier selection even with incomplete inputs. Experimental validation across multiple case studies demonstrates that DU-ASM achieves over 95 % supplier selection accuracy under complete requirements and over 90 % with partially masked inputs, while attaining mean normalized Discounted Cumulative Gain scores at top-k positions (nDCG@k) exceeding 0.99 in ranking tasks. By linking geometric, topological, and quantitative data, DU-ASM demonstrates both methodological novelty and strong quantitative performance, providing a scalable foundation for supplier matching at the assembly level and supporting multi-tier decision-making in future manufacturing supply networks.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"84 ","pages":"Pages 173-188"},"PeriodicalIF":14.2,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145786758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Process mining-driven modeling and simulation to enhance fault diagnosis in cyber–physical systems 过程挖掘驱动的建模和仿真,以增强网络物理系统的故障诊断
IF 14.2 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Pub Date : 2025-12-12 DOI: 10.1016/j.jmsy.2025.12.005
Francesco Vitale , Nicola Dall’Ora , Sebastiano Gaiardelli , Enrico Fraccaroli , Nicola Mazzocca , Franco Fummi
Cyber–Physical Systems (CPSs) tightly interconnect digital and physical operations within production environments, enabling real-time monitoring, control, optimization, and autonomous decision-making that directly enhance manufacturing processes and productivity. The inherent complexity of these systems can lead to faults that require robust and interpretable diagnoses to maintain system dependability and operational efficiency. However, manual modeling of faulty behaviors requires extensive domain expertise and cannot leverage the low-level sensor data of the CPS. Furthermore, although powerful, deep learning-based techniques produce black-box diagnostics that lack interpretability, limiting their practical adoption. To address these challenges, we set forth a method that performs unsupervised characterization of system states and state transitions from low-level sensor data, uses several process mining techniques to model faults through interpretable stochastic Petri nets, simulates such Petri nets for a comprehensive understanding of system behavior under faulty conditions, and performs Petri net-based fault diagnosis. The method begins with detecting collective anomalies involving multiple samples in low-level sensor data. These anomalies are then transformed into structured event logs, enabling the data-driven discovery of interpretable Petri nets through process mining. By enhancing these Petri nets with timing distributions, the approach supports the simulation of faulty behaviors. Finally, faults can be diagnosed online by checking collective anomalies with the Petri nets and the corresponding simulations. The method is applied to the Robotic Arm Dataset (RoAD), a benchmark collected from a robotic arm deployed in a scale-replica smart manufacturing assembly line. The application to RoAD demonstrates the method’s effectiveness in modeling, simulating, and classifying faulty behaviors in CPSs. The modeling results demonstrate that our method achieves a satisfactory interpretability-simulation accuracy trade-off with up to 0.676 arc-degree simplicity, 0.395 R2, and 0.088 RMSE. In addition, the fault identification results show that the method achieves an F1 score of up to 98.925%, while maintaining a low conformance checking time of 0.020 s, which competes with other deep learning-based methods.
网络物理系统(cps)紧密连接生产环境中的数字和物理操作,实现实时监控、控制、优化和自主决策,直接提高制造过程和生产力。这些系统固有的复杂性可能导致故障,需要健壮和可解释的诊断来维持系统的可靠性和操作效率。然而,故障行为的手动建模需要广泛的领域专业知识,并且不能利用CPS的低级传感器数据。此外,尽管基于深度学习的强大技术产生了缺乏可解释性的黑箱诊断,限制了它们的实际应用。为了应对这些挑战,我们提出了一种方法,该方法从低级传感器数据中执行系统状态和状态转换的无监督特征,使用几种过程挖掘技术通过可解释的随机Petri网来建模故障,模拟这种Petri网以全面了解故障条件下的系统行为,并执行基于Petri网的故障诊断。该方法首先检测涉及低水平传感器数据中多个样本的集体异常。然后将这些异常转换为结构化事件日志,通过过程挖掘实现数据驱动的可解释Petri网发现。通过用时序分布增强这些Petri网,该方法支持故障行为的模拟。最后,利用Petri网对集体异常进行检测,并进行相应的仿真,实现故障在线诊断。该方法应用于机器人手臂数据集(RoAD),该数据集是从部署在按比例复制的智能制造装配线上的机器人手臂收集的基准。在RoAD中的应用证明了该方法在cps故障行为建模、仿真和分类方面的有效性。建模结果表明,我们的方法达到了令人满意的可解释性-模拟精度权衡,简单性高达0.676弧度,R2为0.395,RMSE为0.088。此外,故障识别结果表明,该方法达到了高达98.925%的F1分数,同时保持了较低的一致性检查时间(0.020 s),与其他基于深度学习的方法相竞争。
{"title":"Process mining-driven modeling and simulation to enhance fault diagnosis in cyber–physical systems","authors":"Francesco Vitale ,&nbsp;Nicola Dall’Ora ,&nbsp;Sebastiano Gaiardelli ,&nbsp;Enrico Fraccaroli ,&nbsp;Nicola Mazzocca ,&nbsp;Franco Fummi","doi":"10.1016/j.jmsy.2025.12.005","DOIUrl":"10.1016/j.jmsy.2025.12.005","url":null,"abstract":"<div><div>Cyber–Physical Systems (CPSs) tightly interconnect digital and physical operations within production environments, enabling real-time monitoring, control, optimization, and autonomous decision-making that directly enhance manufacturing processes and productivity. The inherent complexity of these systems can lead to faults that require robust and interpretable diagnoses to maintain system dependability and operational efficiency. However, manual modeling of faulty behaviors requires extensive domain expertise and cannot leverage the low-level sensor data of the CPS. Furthermore, although powerful, deep learning-based techniques produce black-box diagnostics that lack interpretability, limiting their practical adoption. To address these challenges, we set forth a method that performs unsupervised characterization of system states and state transitions from low-level sensor data, uses several process mining techniques to model faults through interpretable stochastic Petri nets, simulates such Petri nets for a comprehensive understanding of system behavior under faulty conditions, and performs Petri net-based fault diagnosis. The method begins with detecting collective anomalies involving multiple samples in low-level sensor data. These anomalies are then transformed into structured event logs, enabling the data-driven discovery of interpretable Petri nets through process mining. By enhancing these Petri nets with timing distributions, the approach supports the simulation of faulty behaviors. Finally, faults can be diagnosed online by checking collective anomalies with the Petri nets and the corresponding simulations. The method is applied to the Robotic Arm Dataset (RoAD), a benchmark collected from a robotic arm deployed in a scale-replica smart manufacturing assembly line. The application to RoAD demonstrates the method’s effectiveness in modeling, simulating, and classifying faulty behaviors in CPSs. The modeling results demonstrate that our method achieves a satisfactory interpretability-simulation accuracy trade-off with up to 0.676 arc-degree simplicity, 0.395 R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>, and 0.088 RMSE. In addition, the fault identification results show that the method achieves an F1 score of up to 98.925%, while maintaining a low conformance checking time of 0.020 s, which competes with other deep learning-based methods.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"84 ","pages":"Pages 189-206"},"PeriodicalIF":14.2,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145786757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PI-KAF: A physics-informed constrained online interpretable monitoring method for tool wear PI-KAF:一种基于物理信息的工具磨损约束在线可解释监测方法
IF 14.2 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Pub Date : 2025-12-10 DOI: 10.1016/j.jmsy.2025.11.024
Liguo Zhang , Qinghua Song , Haifeng Ma , Zhanqiang Liu
Deep learning-based tool wear monitoring methods often suffer from poor interpretability, parameter redundancy, and limited cross-domain generalization, especially under variable operating conditions and when only small sample sizes are available. These limitations hinder their practical deployment in practical manufacturing environments. To overcome these challenges, this study proposes a highly interpretable and lightweight tool wear monitoring framework tailored for small-sample, variable-condition scenarios. The method randomly extracts fixed-length segments from vibration signals to construct small-sample datasets, utilizing both signal envelope spectrum and cutting time as input features. Guided by the degradation law of tool wear, explicit physical constraints are imposed on the solution space of a Kolmogorov‑Arnold Fourier neural network, yielding a physics‑informed data‑driven model. SHAP analysis is employed to quantify the contribution of each feature, enhancing model transparency. Validation on public datasets under both single‑ and multi‑condition settings demonstrates that the proposed method delivers excellent performance across diverse operating conditions, achieving a stable prediction R² of up to 95 %, an inference latency of only 2 ms, and a reduction of approximately 90 % in model parameters. This solution can be integrated into the edge computing platform of CNC systems, making it particularly suitable for machining scenarios with high real-time requirements. It offers a lightweight, precise, and efficient monitoring capability for smart factories, contributing simultaneously to improvements in product quality and manufacturing efficiency.
基于深度学习的工具磨损监测方法通常存在可解释性差、参数冗余和有限的跨域泛化等问题,特别是在可变操作条件下和只有小样本量的情况下。这些限制阻碍了它们在实际制造环境中的实际部署。为了克服这些挑战,本研究提出了一个高度可解释和轻量级的工具磨损监测框架,为小样本、可变条件的场景量身定制。该方法利用信号包络谱和切割时间作为输入特征,从振动信号中随机提取固定长度的片段,构建小样本数据集。在刀具磨损退化规律的指导下,对Kolmogorov - Arnold傅里叶神经网络的解空间施加了明确的物理约束,从而产生了一个物理信息数据驱动的模型。采用SHAP分析来量化每个特征的贡献,提高模型的透明度。在单条件和多条件设置下对公共数据集的验证表明,所提出的方法在不同的操作条件下提供了出色的性能,实现了高达95 %的稳定预测R²,推理延迟仅为2 ms,模型参数减少了约90 %。该解决方案可集成到数控系统的边缘计算平台中,特别适合实时性要求高的加工场景。它为智能工厂提供了轻量级、精确和高效的监控能力,同时有助于提高产品质量和制造效率。
{"title":"PI-KAF: A physics-informed constrained online interpretable monitoring method for tool wear","authors":"Liguo Zhang ,&nbsp;Qinghua Song ,&nbsp;Haifeng Ma ,&nbsp;Zhanqiang Liu","doi":"10.1016/j.jmsy.2025.11.024","DOIUrl":"10.1016/j.jmsy.2025.11.024","url":null,"abstract":"<div><div>Deep learning-based tool wear monitoring methods often suffer from poor interpretability, parameter redundancy, and limited cross-domain generalization, especially under variable operating conditions and when only small sample sizes are available. These limitations hinder their practical deployment in practical manufacturing environments. To overcome these challenges, this study proposes a highly interpretable and lightweight tool wear monitoring framework tailored for small-sample, variable-condition scenarios. The method randomly extracts fixed-length segments from vibration signals to construct small-sample datasets, utilizing both signal envelope spectrum and cutting time as input features. Guided by the degradation law of tool wear, explicit physical constraints are imposed on the solution space of a Kolmogorov‑Arnold Fourier neural network, yielding a physics‑informed data‑driven model. SHAP analysis is employed to quantify the contribution of each feature, enhancing model transparency. Validation on public datasets under both single‑ and multi‑condition settings demonstrates that the proposed method delivers excellent performance across diverse operating conditions, achieving a stable prediction R² of up to 95 %, an inference latency of only 2 ms, and a reduction of approximately 90 % in model parameters. This solution can be integrated into the edge computing platform of CNC systems, making it particularly suitable for machining scenarios with high real-time requirements. It offers a lightweight, precise, and efficient monitoring capability for smart factories, contributing simultaneously to improvements in product quality and manufacturing efficiency.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"84 ","pages":"Pages 135-151"},"PeriodicalIF":14.2,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145786760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards perceptive assembly: An edge vision network-enabled augmented reality (AR) monitoring method for global shape and mechanical information in large aerospace components 面向感知装配:一种支持边缘视觉网络的增强现实(AR)监测方法,用于大型航空航天部件的全局形状和机械信息
IF 14.2 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Pub Date : 2025-12-10 DOI: 10.1016/j.jmsy.2025.11.017
Yang Zhang, Xu Wang, Jiacheng Cui, Yulin Jin, Qihang Chen, Yongkang Lu, Wei Liu
Real-time, high-fidelity, and interactive monitoring of global mechanical responses during the assembly of large-scale, flexible aerospace structures remains a critical and unresolved challenge. Here, we present a perceptive assembly framework that integrates a distributed edge vision network, physics-informed sparse sensing, and immersive augmented reality (AR) visualization to enable full-field structural state monitoring. A modular edge sensing system is deployed to achieve fast, high-precision measurement of distributed displacements across meter-scale components. To overcome view discontinuities, a hierarchical coordinate transformation pipeline is introduced for global registration under non-overlapping camera views. Building on sparse displacement data, we develop a constrained sensor optimization strategy that enables real-time reconstruction of global displacement and strain fields. Through HoloLens 2, the system provides intuitive AR overlays that deliver immersive, in-situ mechanical feedback during assembly. Validation experiments on composite panels demonstrate sub-millimeter reconstruction accuracy and real-time performance, significantly enhancing transparency and decision-making in the assembly process. This work establishes a scalable AR-based perception infrastructure for next-generation intelligent manufacturing of large aerospace structures.
在大型柔性航空航天结构装配过程中,实时、高保真和交互式的整体机械响应监测仍然是一个关键且未解决的挑战。在这里,我们提出了一个感知组装框架,该框架集成了分布式边缘视觉网络,物理信息稀疏感知和沉浸式增强现实(AR)可视化,以实现全场结构状态监测。部署了模块化边缘传感系统,以实现跨米尺度组件的分布式位移的快速、高精度测量。为了克服视图不连续的问题,在非重叠摄像机视图下引入了层次坐标变换流水线进行全局配准。基于稀疏位移数据,我们开发了一种约束传感器优化策略,可以实时重建全局位移和应变场。通过HoloLens 2,系统提供直观的AR叠加,在组装过程中提供身临其境的现场机械反馈。复合材料面板的验证实验证明了亚毫米级重构的精度和实时性,显著提高了装配过程的透明度和决策能力。这项工作为下一代大型航空航天结构的智能制造建立了可扩展的基于ar的感知基础设施。
{"title":"Towards perceptive assembly: An edge vision network-enabled augmented reality (AR) monitoring method for global shape and mechanical information in large aerospace components","authors":"Yang Zhang,&nbsp;Xu Wang,&nbsp;Jiacheng Cui,&nbsp;Yulin Jin,&nbsp;Qihang Chen,&nbsp;Yongkang Lu,&nbsp;Wei Liu","doi":"10.1016/j.jmsy.2025.11.017","DOIUrl":"10.1016/j.jmsy.2025.11.017","url":null,"abstract":"<div><div>Real-time, high-fidelity, and interactive monitoring of global mechanical responses during the assembly of large-scale, flexible aerospace structures remains a critical and unresolved challenge. Here, we present a perceptive assembly framework that integrates a distributed edge vision network, physics-informed sparse sensing, and immersive augmented reality (AR) visualization to enable full-field structural state monitoring. A modular edge sensing system is deployed to achieve fast, high-precision measurement of distributed displacements across meter-scale components. To overcome view discontinuities, a hierarchical coordinate transformation pipeline is introduced for global registration under non-overlapping camera views. Building on sparse displacement data, we develop a constrained sensor optimization strategy that enables real-time reconstruction of global displacement and strain fields. Through HoloLens 2, the system provides intuitive AR overlays that deliver immersive, in-situ mechanical feedback during assembly. Validation experiments on composite panels demonstrate sub-millimeter reconstruction accuracy and real-time performance, significantly enhancing transparency and decision-making in the assembly process. This work establishes a scalable AR-based perception infrastructure for next-generation intelligent manufacturing of large aerospace structures.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"84 ","pages":"Pages 117-134"},"PeriodicalIF":14.2,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145786759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Agentic digital twin-embedded maintenance methodology for energy equipment: A self-evolving operational paradigm 能源设备的代理数字双嵌入式维护方法:一种自进化的操作范式
IF 14.2 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Pub Date : 2025-12-09 DOI: 10.1016/j.jmsy.2025.11.020
Wang Cong, Wu Tao, Bao Jinsong
Intelligent operation and maintenance of energy equipment represents a critical component in ensuring the stable performance of new-generation power systems. Faced with complex operational conditions and nonlinear fault characteristics, conventional manual maintenance suffers from delayed perception and ambiguous causality. However, while digital twin technology can establish a virtual-real interaction space, its static modeling approach exhibits prediction failure in dynamic scenarios. To address these challenges, this study proposes an intelligent maintenance methodology based on cognitive agents and virtual-real co-evolution: constructing a dynamic environment representation model to achieve spatiotemporal feature correlation of equipment states and operational condition migration; designing a memory-planning-decision architecture to enhance causal reasoning capabilities for equipment faults and integrating with digital twin models for virtual-real interaction. The methodology is validated through an 18-month case study of a gas-steam boiler in a combined heat and power plant, utilizing 5.2 million historical operational records. Experimental results demonstrate that this approach achieves a 97.3 % accuracy rate in diagnosing non-stationary faults for gas-steam boiler equipment, realizes a 20-fold improvement in knowledge update time (from 48 to 2.3 h), and attains significant performance enhancements including 31.2 % cost efficiency improvement, 3-fold early warning lead time extension (from 24 to 72 h), and 16.2 % overall collaborative performance improvement (from 82.2 % to 95.5 %). The research validates the engineering value of dynamic cognitive paradigms in intelligent maintenance of power equipment, providing a feasible solution for autonomous decision-making in high-real-time scenarios.
能源设备的智能运维是保证新一代电力系统稳定运行的重要组成部分。面对复杂的运行条件和非线性的故障特征,传统的人工维修存在感知延迟和因果关系模糊的问题。然而,虽然数字孪生技术可以建立一个虚拟-真实的交互空间,但其静态建模方法在动态场景中表现出预测失败。针对这些挑战,本研究提出了一种基于认知代理和虚实协同进化的智能维护方法:构建动态环境表示模型,实现设备状态的时空特征关联和运行状态迁移;设计记忆-规划-决策体系结构,增强设备故障的因果推理能力,并与数字孪生模型集成,实现虚实交互。通过对一家热电联产电厂的燃气蒸汽锅炉进行为期18个月的案例研究,利用520万份历史运行记录,验证了该方法的有效性。实验结果表明,该方法对燃气蒸汽锅炉设备非平稳故障的诊断准确率达到97.3% %,知识更新时间从48到2.3 h提高了20倍,成本效率提高了31.2% %,预警提前期延长了3倍(从24到72 h),整体协同性能提高了16.2% %(从82.2 %到95.5 %)。研究验证了动态认知范式在电力设备智能维护中的工程价值,为高实时场景下的自主决策提供了可行的解决方案。
{"title":"Agentic digital twin-embedded maintenance methodology for energy equipment: A self-evolving operational paradigm","authors":"Wang Cong,&nbsp;Wu Tao,&nbsp;Bao Jinsong","doi":"10.1016/j.jmsy.2025.11.020","DOIUrl":"10.1016/j.jmsy.2025.11.020","url":null,"abstract":"<div><div>Intelligent operation and maintenance of energy equipment represents a critical component in ensuring the stable performance of new-generation power systems. Faced with complex operational conditions and nonlinear fault characteristics, conventional manual maintenance suffers from delayed perception and ambiguous causality. However, while digital twin technology can establish a virtual-real interaction space, its static modeling approach exhibits prediction failure in dynamic scenarios. To address these challenges, this study proposes an intelligent maintenance methodology based on cognitive agents and virtual-real co-evolution: constructing a dynamic environment representation model to achieve spatiotemporal feature correlation of equipment states and operational condition migration; designing a memory-planning-decision architecture to enhance causal reasoning capabilities for equipment faults and integrating with digital twin models for virtual-real interaction. The methodology is validated through an 18-month case study of a gas-steam boiler in a combined heat and power plant, utilizing 5.2 million historical operational records. Experimental results demonstrate that this approach achieves a 97.3 % accuracy rate in diagnosing non-stationary faults for gas-steam boiler equipment, realizes a 20-fold improvement in knowledge update time (from 48 to 2.3 h), and attains significant performance enhancements including 31.2 % cost efficiency improvement, 3-fold early warning lead time extension (from 24 to 72 h), and 16.2 % overall collaborative performance improvement (from 82.2 % to 95.5 %). The research validates the engineering value of dynamic cognitive paradigms in intelligent maintenance of power equipment, providing a feasible solution for autonomous decision-making in high-real-time scenarios.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"84 ","pages":"Pages 100-116"},"PeriodicalIF":14.2,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145786761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
End-to-end multimodal knowledge graph construction for industrial exploded views via attention-guided expert chains 基于注意力引导专家链的工业爆炸视图端到端多模态知识图谱构建
IF 14.2 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Pub Date : 2025-12-05 DOI: 10.1016/j.jmsy.2025.10.013
Xinxin Liang , Zuoxu Wang , Mingrui Li , Chun-Hsien Chen , Jihong Liu
Industrial exploded views (IEVs) integrate images, text, and part–assembly relations, which is essential for advancing intelligent manufacturing. However, semantic ambiguities, structural inconsistencies, and fragmented annotations hinder effective knowledge extraction and reuse. We cast extraction from IEVs as constrained inference over scene graphs and present a Scene-aware Cascade Expert Chain (SACEC) that incrementally resolves entities, relations, and assembly context. A Visual–Structural–Rule (VSR) validator then enforces domain rules and semantic consistency on every triple. A dynamic triple-cutting strategy selects credible triples by jointly balancing local evidence, contextual coherence, and assembly order, yielding a multimodal knowledge graph (MMKG). We also introduce the Industrial Exploded-View (IEV) dataset, with fine-grained component and relation annotations and assembly-order metadata. Experiments on VRD, VG150, and the IEV dataset demonstrate significant improvements over state-of-the-art baselines, achieving R@100 of 73.2%, 63.9%, and 67.4%, and TripleAcc of 31.8%, 20.2%, and 24.9%. At the triple level, we further obtain P@100 of 54.9%, 39.8%, and 49.6%, and F1@100 of 46.2%, 34.1%, and 45.1%. Against strong path- and context-based baselines, our method improves by up to +7.4 pp in recall@100, +2.7 pp in TripleAcc, +15.8 pp in Precision@100, and +13.5 pp in F1@100. The approach reduces manual annotation and yields interpretable, audit-ready outputs for intelligent design and process planning, offering a practical route to automated and interpretable knowledge extraction in industrial environments.
工业爆炸视图集成了图像、文本和零部件关系,对推进智能制造至关重要。然而,语义歧义、结构不一致和碎片化的注释阻碍了有效的知识提取和重用。我们将evs提取作为场景图上的约束推理,并提出了一个场景感知级联专家链(SACEC),该链可以增量地解析实体、关系和装配上下文。然后,可视化结构规则(VSR)验证器对每个三元组强制执行域规则和语义一致性。动态三重切割策略通过联合平衡局部证据、上下文一致性和装配顺序来选择可信的三元组,从而产生多模态知识图(MMKG)。我们还介绍了工业爆炸视图(IEV)数据集,该数据集具有细粒度的组件和关系注释以及装配顺序元数据。在VRD、VG150和IEV数据集上的实验表明,与最先进的基线相比,有了显著的改进,R@100的效率分别为73.2%、63.9%和67.4%,TripleAcc的效率分别为31.8%、20.2%和24.9%。在三重水平上,我们进一步得到P@100为54.9%、39.8%和49.6%,F1@100为46.2%、34.1%和45.1%。对于基于路径和上下文的强基线,我们的方法在recall@100中提高了+7.4 pp,在TripleAcc中提高了+2.7 pp,在Precision@100中提高了+15.8 pp,在F1@100中提高了+13.5 pp。该方法减少了手工注释,并为智能设计和过程规划提供了可解释的、可审计的输出,为工业环境中自动化和可解释的知识提取提供了一条实用的途径。
{"title":"End-to-end multimodal knowledge graph construction for industrial exploded views via attention-guided expert chains","authors":"Xinxin Liang ,&nbsp;Zuoxu Wang ,&nbsp;Mingrui Li ,&nbsp;Chun-Hsien Chen ,&nbsp;Jihong Liu","doi":"10.1016/j.jmsy.2025.10.013","DOIUrl":"10.1016/j.jmsy.2025.10.013","url":null,"abstract":"<div><div>Industrial exploded views (IEVs) integrate images, text, and part–assembly relations, which is essential for advancing intelligent manufacturing. However, semantic ambiguities, structural inconsistencies, and fragmented annotations hinder effective knowledge extraction and reuse. We cast extraction from IEVs as constrained inference over scene graphs and present a Scene-aware Cascade Expert Chain (SACEC) that incrementally resolves entities, relations, and assembly context. A Visual–Structural–Rule (VSR) validator then enforces domain rules and semantic consistency on every triple. A dynamic triple-cutting strategy selects credible triples by jointly balancing local evidence, contextual coherence, and assembly order, yielding a multimodal knowledge graph (MMKG). We also introduce the Industrial Exploded-View (IEV) dataset, with fine-grained component and relation annotations and assembly-order metadata. Experiments on VRD, VG150, and the IEV dataset demonstrate significant improvements over state-of-the-art baselines, achieving R@100 of 73.2%, 63.9%, and 67.4%, and TripleAcc of 31.8%, 20.2%, and 24.9%. At the triple level, we further obtain P@100 of 54.9%, 39.8%, and 49.6%, and F1@100 of 46.2%, 34.1%, and 45.1%. Against strong path- and context-based baselines, our method improves by up to +7.4<!--> <!-->pp in recall@100, +2.7<!--> <!-->pp in TripleAcc, +15.8<!--> <!-->pp in Precision@100, and +13.5<!--> <!-->pp in F1@100. The approach reduces manual annotation and yields interpretable, audit-ready outputs for intelligent design and process planning, offering a practical route to automated and interpretable knowledge extraction in industrial environments.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"84 ","pages":"Pages 85-99"},"PeriodicalIF":14.2,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145682162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Designing Synthetic Active Learning for model refinement in manufacturing parts detection 面向制造零件检测模型精化的综合主动学习设计
IF 14.2 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Pub Date : 2025-12-04 DOI: 10.1016/j.jmsy.2025.11.023
Xiaomeng Zhu , Jacob Henningsson , Pär Mårtensson , Lars Hanson , Mårten Björkman , Atsuto Maki
This paper introduces Synthetic Active Learning (SAL), a fully automatic model refinement strategy for manufacturing parts detection using only synthetic data actively generated with domain randomization for training. SAL iteratively updates the detection model by identifying its weaknesses, such as in specific categories, materials, or object sizes, using custom evaluators, and generating targeted synthetic data to address them; it selectively synthesizes new useful data with respect to active learning, where traditionally humans in the loop select data to label. During each iteration, model training and data generation occur simultaneously to improve efficiency. Evaluated on four use cases from two industrial datasets, SAL achieved mAP@50 improvements of 2 to 6% percentage points over static learning, which refers to training on a fixed, pre-generated dataset. It also showed notable gains in underperforming categories, leading to more balanced performance across classes. Another benefit is that it uses a consistent configuration across multiple use cases, avoiding the need for extensive hyperparameter tuning common in prior domain randomization studies. Given its encouraging performance across diverse scenarios, we believe that SAL can scale to broader industrial applications where training can be fully or mostly based on synthetic data.
本文介绍了一种基于领域随机化主动生成的合成数据进行训练的制造零件检测的全自动模型优化策略——合成主动学习(SAL)。SAL迭代地更新检测模型,通过识别它的弱点,例如在特定的类别,材料,或对象大小,使用自定义评估器,并生成目标合成数据来解决它们;相对于主动学习,它有选择地合成新的有用数据,传统上,人类在循环中选择数据来标记。在每次迭代中,模型训练和数据生成同时进行,以提高效率。通过对来自两个工业数据集的四个用例进行评估,SAL实现了mAP@50比静态学习提高了2到6%的百分点,静态学习指的是在固定的、预先生成的数据集上进行训练。在表现不佳的类别中也显示出显著的进步,导致各个类别的表现更加平衡。另一个好处是,它在多个用例中使用一致的配置,避免了在先前的领域随机化研究中常见的大量超参数调优的需要。考虑到它在不同场景中令人鼓舞的表现,我们相信SAL可以扩展到更广泛的工业应用,在这些应用中,训练可以完全或主要基于合成数据。
{"title":"Designing Synthetic Active Learning for model refinement in manufacturing parts detection","authors":"Xiaomeng Zhu ,&nbsp;Jacob Henningsson ,&nbsp;Pär Mårtensson ,&nbsp;Lars Hanson ,&nbsp;Mårten Björkman ,&nbsp;Atsuto Maki","doi":"10.1016/j.jmsy.2025.11.023","DOIUrl":"10.1016/j.jmsy.2025.11.023","url":null,"abstract":"<div><div>This paper introduces Synthetic Active Learning (SAL), a fully automatic model refinement strategy for manufacturing parts detection using only synthetic data actively generated with domain randomization for training. SAL iteratively updates the detection model by identifying its weaknesses, such as in specific categories, materials, or object sizes, using custom evaluators, and generating targeted synthetic data to address them; it selectively synthesizes new useful data with respect to active learning, where traditionally humans in the loop select data to label. During each iteration, model training and data generation occur simultaneously to improve efficiency. Evaluated on four use cases from two industrial datasets, SAL achieved mAP@50 improvements of 2 to 6% percentage points over static learning, which refers to training on a fixed, pre-generated dataset. It also showed notable gains in underperforming categories, leading to more balanced performance across classes. Another benefit is that it uses a consistent configuration across multiple use cases, avoiding the need for extensive hyperparameter tuning common in prior domain randomization studies. Given its encouraging performance across diverse scenarios, we believe that SAL can scale to broader industrial applications where training can be fully or mostly based on synthetic data.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"84 ","pages":"Pages 68-84"},"PeriodicalIF":14.2,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145682163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic data driven uncertain physical information self-awareness method for the aircraft composite component assembly system 飞机复合材料装配系统的动态数据驱动不确定物理信息自感知方法
IF 14.2 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Pub Date : 2025-12-01 DOI: 10.1016/j.jmsy.2025.11.001
Pengbo Yin, Yang Zhang, Jiacheng Cui, Jiangtao Zhao, Yulin Jin, Qihang Chen, Yongkang Lu, Wei Liu
High-precision assembly of large composite components is crucial for aircraft structural safety. To address geometric deviation of components caused by anisotropic material behaviors and time-varying process constraints during dynamic assembly, this paper proposes a dynamic data driven uncertain physical information self-awareness (DDDPIA) method. This approach accurately updates physical models of components by integrating dynamic data of the manufacturing process containing displacement information, load configurations, and model information through three key innovations: (1) A simplified affine mapping method from model parameters to system stiffness that decouples material properties from process constraints in deformation modeling. (2) A multi-source prior data-driven model parameter optimization framework enabling efficient identification of material parameters and process constraints while quantifying measurement uncertainty impacts and maintaining high-precision performance with measurement errors below 0.3 mm. (3) An industrial application-oriented shape regulation platform that leverages the updated physical model for precise load inversion to achieve specified shapes of composite components. Experimental and simulation results verify over 64% displacement error reduction relative to uncalibrated static modeling, while load inversion with sub-0.2 N solution accuracy achieves geometric deviations correction of components. This establishes a closed-loop measurement-data-model-assimilation paradigm, enhancing decision autonomy in aviation intelligent manufacturing systems.
大型复合材料部件的高精度装配对飞机结构安全至关重要。针对动态装配过程中由于材料各向异性行为和时变工艺约束导致的部件几何偏差,提出了一种动态数据驱动的不确定物理信息自我意识(DDDPIA)方法。该方法通过三个关键创新,将包含位移信息、载荷配置和模型信息的制造过程动态数据集成在一起,准确地更新部件的物理模型:(1)从模型参数到系统刚度的简化仿射映射方法,将变形建模中的材料属性与工艺约束解耦。(2)多源先验数据驱动的模型参数优化框架,能够有效识别材料参数和工艺约束,同时量化测量不确定度影响,并保持测量误差小于0.3 mm的高精度性能。(3)面向工业应用的形状调节平台,利用更新的物理模型进行精确载荷反演,实现复合材料部件的指定形状。实验和仿真结果表明,相对于未标定的静态建模,位移误差降低了64%以上,而低于0.2 N溶液精度的载荷反演实现了构件的几何偏差校正。建立了闭环测量-数据-模型-同化模式,提高了航空智能制造系统的决策自主性。
{"title":"Dynamic data driven uncertain physical information self-awareness method for the aircraft composite component assembly system","authors":"Pengbo Yin,&nbsp;Yang Zhang,&nbsp;Jiacheng Cui,&nbsp;Jiangtao Zhao,&nbsp;Yulin Jin,&nbsp;Qihang Chen,&nbsp;Yongkang Lu,&nbsp;Wei Liu","doi":"10.1016/j.jmsy.2025.11.001","DOIUrl":"10.1016/j.jmsy.2025.11.001","url":null,"abstract":"<div><div>High-precision assembly of large composite components is crucial for aircraft structural safety. To address geometric deviation of components caused by anisotropic material behaviors and time-varying process constraints during dynamic assembly, this paper proposes a dynamic data driven uncertain physical information self-awareness (DDDPIA) method. This approach accurately updates physical models of components by integrating dynamic data of the manufacturing process containing displacement information, load configurations, and model information through three key innovations: (1) A simplified affine mapping method from model parameters to system stiffness that decouples material properties from process constraints in deformation modeling. (2) A multi-source prior data-driven model parameter optimization framework enabling efficient identification of material parameters and process constraints while quantifying measurement uncertainty impacts and maintaining high-precision performance with measurement errors below 0.3 mm. (3) An industrial application-oriented shape regulation platform that leverages the updated physical model for precise load inversion to achieve specified shapes of composite components. Experimental and simulation results verify over 64% displacement error reduction relative to uncalibrated static modeling, while load inversion with sub-0.2 N solution accuracy achieves geometric deviations correction of components. This establishes a closed-loop measurement-data-model-assimilation paradigm, enhancing decision autonomy in aviation intelligent manufacturing systems.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"83 ","pages":"Pages 1009-1023"},"PeriodicalIF":14.2,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145680786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AR-assisted human-robot collaborative assembly system: Integrating visual language model and deep reinforcement learning for task planning and seamless interactive guidance ar辅助人机协同装配系统:集成视觉语言模型和深度强化学习,实现任务规划和无缝交互引导
IF 14.2 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Pub Date : 2025-11-29 DOI: 10.1016/j.jmsy.2025.11.019
Changchun Liu , Dunbing Tang , Haihua Zhu , Zequn Zhang , Liping Wang , Qingwei Nie
In the human-oriented context of Industry 5.0, human-robot collaboration (HRC) has become the core driving force for innovating the production model of assembly systems by integrating human flexibility ability with the precision and repeatability of robots. As the core link to achieve efficient collaboration, task planning is facing multiple challenges in dynamic and complex scenarios. On the one hand, it is hard to perform cognition of multimodal human-robot-environment in HRC assembly scenarios. On the other hand, heterogeneous capabilities of humans and robots (e.g., flexible decision-making by humans and precise execution by robots) are hard to be fully used to achieve reasonable task allocation and timing optimization in dynamic and complex HRC assembly scenarios. To address these issues, a Vision Language Model (VLM)-enhanced deep reinforcement learning-driven task planning approach is proposed towards Augmented Reality (AR)-assisted HRC assembly system. Firstly, a self-trained VLM is proposed through the integration of domain-specific knowledge and real-time situational data to enable context-aware in HRC assembly system. Through the fine-tuning of role configuration parameters for the pre-constructed VLM via prompt engineering, VLM can possess cognition of multi-dimensional assembly scenario elements. Reinforcement learning model can be endowed with the eyes to perceive HRC assembly scenarios through VLM-enhanced cognition of the dynamic HRC environment. Based on the VLM-enhanced cognition of the dynamic HRC environment, an improved multi-agent reinforcement learning-based HRC assembly task planning model is established to achieve humanized task planning, which can consider the competitive relationship between humans and robots with multi-agent conflict mechanism. Based on the HRC assembly task planning result, AR can enable operators to accomplish visual HRC assembly guidance through virtual-real mapping of HRC assembly information (e.g., HRC assembly procedures) and interact seamlessly with the VLM. Finally, experimental results show that the proposed method can improve the efficiency and well-being of HRC in human-centric assembly systems.
在工业5.0以人为本的背景下,人机协作(human-robot collaboration, HRC)将人的柔性能力与机器人的精度和可重复性相结合,成为创新装配系统生产模式的核心驱动力。任务规划作为实现高效协同的核心环节,在动态复杂的场景下面临着多重挑战。一方面,在HRC装配场景中,很难对多模态人-机器人-环境进行认知。另一方面,在动态复杂的HRC装配场景中,人与机器人的异构能力(如人的灵活决策和机器人的精确执行)难以充分发挥,难以实现任务的合理分配和时间优化。针对这些问题,提出了一种基于视觉语言模型(VLM)的深度强化学习驱动任务规划方法,用于增强现实(AR)辅助HRC装配系统。首先,通过整合领域知识和实时情景数据,提出了一种自训练的VLM,实现了HRC装配系统的上下文感知;通过快速工程化对预构建VLM的角色配置参数进行微调,使VLM具备对多维装配场景要素的认知能力。通过vlm增强对动态HRC环境的认知,可以赋予强化学习模型感知HRC装配场景的眼睛。基于vlm增强的HRC动态环境认知,建立了一种改进的基于多智能体强化学习的HRC装配任务规划模型,以实现任务规划的人性化,该模型考虑了具有多智能体冲突机制的人与机器人之间的竞争关系。基于HRC装配任务规划结果,AR可以使操作员通过HRC装配信息(如HRC装配过程)的虚实映射实现可视化的HRC装配指导,并与VLM实现无缝交互。最后,实验结果表明,该方法可以提高以人为中心的装配系统中HRC的效率和福祉。
{"title":"AR-assisted human-robot collaborative assembly system: Integrating visual language model and deep reinforcement learning for task planning and seamless interactive guidance","authors":"Changchun Liu ,&nbsp;Dunbing Tang ,&nbsp;Haihua Zhu ,&nbsp;Zequn Zhang ,&nbsp;Liping Wang ,&nbsp;Qingwei Nie","doi":"10.1016/j.jmsy.2025.11.019","DOIUrl":"10.1016/j.jmsy.2025.11.019","url":null,"abstract":"<div><div>In the human-oriented context of Industry 5.0, human-robot collaboration (HRC) has become the core driving force for innovating the production model of assembly systems by integrating human flexibility ability with the precision and repeatability of robots. As the core link to achieve efficient collaboration, task planning is facing multiple challenges in dynamic and complex scenarios. On the one hand, it is hard to perform cognition of multimodal human-robot-environment in HRC assembly scenarios. On the other hand, heterogeneous capabilities of humans and robots (e.g., flexible decision-making by humans and precise execution by robots) are hard to be fully used to achieve reasonable task allocation and timing optimization in dynamic and complex HRC assembly scenarios. To address these issues, a Vision Language Model (VLM)-enhanced deep reinforcement learning-driven task planning approach is proposed towards Augmented Reality (AR)-assisted HRC assembly system. Firstly, a self-trained VLM is proposed through the integration of domain-specific knowledge and real-time situational data to enable context-aware in HRC assembly system. Through the fine-tuning of role configuration parameters for the pre-constructed VLM via prompt engineering, VLM can possess cognition of multi-dimensional assembly scenario elements. Reinforcement learning model can be endowed with the eyes to perceive HRC assembly scenarios through VLM-enhanced cognition of the dynamic HRC environment. Based on the VLM-enhanced cognition of the dynamic HRC environment, an improved multi-agent reinforcement learning-based HRC assembly task planning model is established to achieve humanized task planning, which can consider the competitive relationship between humans and robots with multi-agent conflict mechanism. Based on the HRC assembly task planning result, AR can enable operators to accomplish visual HRC assembly guidance through virtual-real mapping of HRC assembly information (e.g., HRC assembly procedures) and interact seamlessly with the VLM. Finally, experimental results show that the proposed method can improve the efficiency and well-being of HRC in human-centric assembly systems.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"84 ","pages":"Pages 40-67"},"PeriodicalIF":14.2,"publicationDate":"2025-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145682161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Manufacturing Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1