首页 > 最新文献

Robotics and Computer-integrated Manufacturing最新文献

英文 中文
Knowledge graph-driven process reasoning of human-robot collaborative disassembly strategy for end-of-life products 知识图驱动的报废产品人机协同拆卸策略过程推理
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-06-01 Epub Date: 2025-12-20 DOI: 10.1016/j.rcim.2025.103211
Jinhua Xiao , Zhiwen Zhang , Yu Zheng , Peng Wu , Sergio Terzi , Marco Macchi
Due to the complex structures and heterogeneous information inherent in End-of-Life (EOL) products, determining optimal disassembly solutions based on Human-Robot Collaboration (HRC) remains a challenging task. As structural and functional uncertainties in EOL products increase, traditional disassembly approaches struggle to meet the practical disassembly demands. Although various algorithms have been proposed for optimizing disassembly processes, significant challenges persist. These include the limited adaptability of existing models and difficulties in representing dynamic structured information effectively. To address these challenges, this study proposes a novel method combining knowledge graph-driven neural networks with an information decomposition module. This mechanism enables the network to discover structural semantic information and relational connections, facilitating the prediction of optimal disassembly strategies and enhancing the process reasoning capability of EOL product data and knowledge. Similarly, the proposed method provides reliable decision support for HRC disassembly task allocations and tool selections, enabling efficient and safe disassembly operations within complex disassembly processes. Finally, we demonstrate the method’s efficacy by using an example of an EOL battery pack, reasoning optimal disassembly strategies and potential process relations in the complex HRC disassembly scenario.
由于报废产品固有的复杂结构和异构信息,确定基于人机协作(HRC)的最佳拆卸方案仍然是一项具有挑战性的任务。随着EOL产品结构和功能不确定性的增加,传统的拆卸方法难以满足实际拆卸需求。尽管已经提出了各种算法来优化拆卸过程,但仍然存在重大挑战。这些问题包括现有模型的适应性有限,以及有效表示动态结构化信息的困难。为了解决这些挑战,本研究提出了一种将知识图驱动神经网络与信息分解模块相结合的新方法。该机制使网络能够发现结构语义信息和关系连接,便于预测最优拆卸策略,增强EOL产品数据和知识的过程推理能力。同样,该方法为HRC拆卸任务分配和工具选择提供了可靠的决策支持,实现了复杂拆卸过程中高效安全的拆卸操作。最后,我们以一个EOL电池组为例,证明了该方法的有效性,推理了复杂HRC拆卸场景下的最佳拆卸策略和潜在过程关系。
{"title":"Knowledge graph-driven process reasoning of human-robot collaborative disassembly strategy for end-of-life products","authors":"Jinhua Xiao ,&nbsp;Zhiwen Zhang ,&nbsp;Yu Zheng ,&nbsp;Peng Wu ,&nbsp;Sergio Terzi ,&nbsp;Marco Macchi","doi":"10.1016/j.rcim.2025.103211","DOIUrl":"10.1016/j.rcim.2025.103211","url":null,"abstract":"<div><div>Due to the complex structures and heterogeneous information inherent in End-of-Life (EOL) products, determining optimal disassembly solutions based on Human-Robot Collaboration (HRC) remains a challenging task. As structural and functional uncertainties in EOL products increase, traditional disassembly approaches struggle to meet the practical disassembly demands. Although various algorithms have been proposed for optimizing disassembly processes, significant challenges persist. These include the limited adaptability of existing models and difficulties in representing dynamic structured information effectively. To address these challenges, this study proposes a novel method combining knowledge graph-driven neural networks with an information decomposition module. This mechanism enables the network to discover structural semantic information and relational connections, facilitating the prediction of optimal disassembly strategies and enhancing the process reasoning capability of EOL product data and knowledge. Similarly, the proposed method provides reliable decision support for HRC disassembly task allocations and tool selections, enabling efficient and safe disassembly operations within complex disassembly processes. Finally, we demonstrate the method’s efficacy by using an example of an EOL battery pack, reasoning optimal disassembly strategies and potential process relations in the complex HRC disassembly scenario.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103211"},"PeriodicalIF":11.4,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145784996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A temporal spatial human digital twin approach for modeling human behavior with uncertainty 不确定性人类行为建模的时空数字孪生方法
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-06-01 Epub Date: 2025-12-13 DOI: 10.1016/j.rcim.2025.103203
Hongquan Gui , Ming Li
Modeling human behavior is paramount in the process of human-robot interaction (HRI). Human motion during HRI is uncertain. Even when the same individual performs the same action, it is impossible to ensure that the motion trajectory will be identical every time. These factors make modeling human behavior extremely challenging. Beyond human uncertainty, there are dynamic temporal spatial dependencies in HRI. Effectively capturing uncertainty while fully integrating temporal spatial features presents a significant challenge. Moreover, representing human behavior solely through human skeleton is insufficient. Recently, human digital twins have been developed to represent human geometry. However, current human digital twins are not well-suited for dynamic HRI scenarios, as they struggle to accurately depict high-dimensional human parameters, leading to issues such as nonlinear mapping and joint drift. In summary, existing methods find it difficult to address the human uncertainty, the temporal spatial dependencies, and the high-dimensional human parameters. To address the above challenges, this study proposes a temporal spatial human digital twin (TSHDT) for modeling human behavior in HRI. The TSHDT is based on predicted human skeletons and integrates forward and inverse kinematics along with diffusion prior distribution to represent high-dimensional human parameters, thus preventing joint drift and nonlinear mapping between joints. In developing the TSHDT, we introduce the human robot temporal spatial (HRTS) diffusion model to mitigate the uncertainty in human motion. The unique diffusion and denoising processes of the HRTS diffusion model can effectively submerge uncertainty in noise and accurately predict human motion during subsequent denoising steps. To ensure that the denoising process favors accuracy over diversity, we propose the temporal spatial fusion graph convolutional network (TSFGCN) to capture temporal spatial features between humans and robots, embedding them into the HRTS diffusion model. Finally, the effectiveness of the TSHDT was validated via predictive collision detection in human-robot fabric cutting experiments. Results demonstrate that the proposed method accurately models human behavior in collision detection experiments, achieving outstanding F1 scores.
在人机交互(HRI)过程中,人类行为建模是至关重要的。HRI期间的人体运动是不确定的。即使同一个体进行相同的动作,也不可能保证每次的运动轨迹都是相同的。这些因素使得模拟人类行为极具挑战性。除了人类的不确定性之外,HRI还存在动态的时空依赖性。在充分整合时空特征的同时,有效地捕捉不确定性是一项重大挑战。此外,仅仅通过人体骨骼来表现人类行为是不够的。最近,人类数字双胞胎已经被开发出来,以代表人类的几何形状。然而,目前的人类数字孪生体并不适合动态HRI场景,因为它们难以准确描述高维人类参数,从而导致非线性映射和关节漂移等问题。综上所述,现有方法难以处理人为不确定性、时空依赖性和高维人为参数。为了解决上述挑战,本研究提出了一个时空人类数字孪生(TSHDT)来模拟HRI中的人类行为。TSHDT基于预测的人体骨骼,将正运动学和逆运动学以及扩散先验分布相结合,以表示高维人体参数,从而防止关节漂移和关节之间的非线性映射。在开发TSHDT时,我们引入了人-机器人时空(HRTS)扩散模型来减轻人体运动中的不确定性。HRTS扩散模型独特的扩散和去噪过程可以有效地消除噪声中的不确定性,并在后续去噪步骤中准确预测人体运动。为了确保去噪过程更有利于准确性而不是多样性,我们提出了时空融合图卷积网络(TSFGCN)来捕获人类和机器人之间的时空特征,并将其嵌入到HRTS扩散模型中。最后,在人机裁剪实验中,通过预测碰撞检测验证了TSHDT算法的有效性。结果表明,该方法在碰撞检测实验中准确地模拟了人类行为,取得了优异的F1分数。
{"title":"A temporal spatial human digital twin approach for modeling human behavior with uncertainty","authors":"Hongquan Gui ,&nbsp;Ming Li","doi":"10.1016/j.rcim.2025.103203","DOIUrl":"10.1016/j.rcim.2025.103203","url":null,"abstract":"<div><div>Modeling human behavior is paramount in the process of human-robot interaction (HRI). Human motion during HRI is uncertain. Even when the same individual performs the same action, it is impossible to ensure that the motion trajectory will be identical every time. These factors make modeling human behavior extremely challenging. Beyond human uncertainty, there are dynamic temporal spatial dependencies in HRI. Effectively capturing uncertainty while fully integrating temporal spatial features presents a significant challenge. Moreover, representing human behavior solely through human skeleton is insufficient. Recently, human digital twins have been developed to represent human geometry. However, current human digital twins are not well-suited for dynamic HRI scenarios, as they struggle to accurately depict high-dimensional human parameters, leading to issues such as nonlinear mapping and joint drift. In summary, existing methods find it difficult to address the human uncertainty, the temporal spatial dependencies, and the high-dimensional human parameters. To address the above challenges, this study proposes a temporal spatial human digital twin (TSHDT) for modeling human behavior in HRI. The TSHDT is based on predicted human skeletons and integrates forward and inverse kinematics along with diffusion prior distribution to represent high-dimensional human parameters, thus preventing joint drift and nonlinear mapping between joints. In developing the TSHDT, we introduce the human robot temporal spatial (HRTS) diffusion model to mitigate the uncertainty in human motion. The unique diffusion and denoising processes of the HRTS diffusion model can effectively submerge uncertainty in noise and accurately predict human motion during subsequent denoising steps. To ensure that the denoising process favors accuracy over diversity, we propose the temporal spatial fusion graph convolutional network (TSFGCN) to capture temporal spatial features between humans and robots, embedding them into the HRTS diffusion model. Finally, the effectiveness of the TSHDT was validated via predictive collision detection in human-robot fabric cutting experiments. Results demonstrate that the proposed method accurately models human behavior in collision detection experiments, achieving outstanding F1 scores.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103203"},"PeriodicalIF":11.4,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145731184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A mixed reality-assisted human-to-robot skill transfer approach for contact-rich assembly via visuomotor primitives 基于视觉运动原语的多接触装配的混合现实辅助人机技能转移方法
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-06-01 Epub Date: 2025-12-22 DOI: 10.1016/j.rcim.2025.103208
Duidi Wu , Qianyou Zhao , Yuliang Shen , Junlai Li , Pai Zheng , Jin Qi , Jie Hu
Industrial assembly represents a core of modern manufacturing but poses significant challenges to the reliability and adaptability of robot systems. As manufacturing shifts toward intelligent production, there is an urgent need for efficient human-to-robot skill transfer methods for mutual cognition. However, current embodied intelligence research has primarily focused on household tasks, while human-level performance in dexterous and long-horizon tasks remains largely unexplored within real-world industrial applications. To bridge this gap, we propose a skill transfer framework and establish a contact-rich assembly benchmark. It integrates an MR-assisted digital twin system for low-cost and diverse demonstrations, an end-to-end generative visuomotor imitation learning policy for continuous action, and primitive skills covering industrially-inspired tasks such as peg insertion, gear meshing, and disassembly. Experiments across six tasks demonstrate high success rates and robust positional generalization. This study explores a novel pathway, it is hoped that it will provide valuable insights for future human–robot collaboration, and serve as a critical precursor for the integration of physical intelligence with generative AI. The project website is available at: https://h2r-mrsta.github.io/.
工业装配是现代制造业的核心,但对机器人系统的可靠性和适应性提出了重大挑战。随着制造业向智能生产的转变,迫切需要一种高效的人机相互认知的技能转移方法。然而,目前的具身智能研究主要集中在家庭任务上,而在现实世界的工业应用中,人类在灵巧和长期任务中的表现仍未得到充分的探索。为了弥补这一差距,我们提出了一个技能转移框架,并建立了一个富有接触的装配基准。它集成了磁共振辅助数字孪生系统,用于低成本和多样化的演示,端到端生成视觉运动模仿学习策略,用于连续动作,以及涵盖工业启发任务(如钉插入,齿轮啮合和拆卸)的原始技能。六个任务的实验证明了高成功率和稳健的位置泛化。本研究探索了一条新的途径,希望它将为未来的人机协作提供有价值的见解,并作为物理智能与生成式人工智能集成的重要先驱。该项目的网站是:https://h2r-mrsta.github.io/。
{"title":"A mixed reality-assisted human-to-robot skill transfer approach for contact-rich assembly via visuomotor primitives","authors":"Duidi Wu ,&nbsp;Qianyou Zhao ,&nbsp;Yuliang Shen ,&nbsp;Junlai Li ,&nbsp;Pai Zheng ,&nbsp;Jin Qi ,&nbsp;Jie Hu","doi":"10.1016/j.rcim.2025.103208","DOIUrl":"10.1016/j.rcim.2025.103208","url":null,"abstract":"<div><div>Industrial assembly represents a core of modern manufacturing but poses significant challenges to the reliability and adaptability of robot systems. As manufacturing shifts toward intelligent production, there is an urgent need for efficient human-to-robot skill transfer methods for mutual cognition. However, current embodied intelligence research has primarily focused on household tasks, while human-level performance in dexterous and long-horizon tasks remains largely unexplored within real-world industrial applications. To bridge this gap, we propose a skill transfer framework and establish a contact-rich assembly benchmark. It integrates an MR-assisted digital twin system for low-cost and diverse demonstrations, an end-to-end generative visuomotor imitation learning policy for continuous action, and primitive skills covering industrially-inspired tasks such as peg insertion, gear meshing, and disassembly. Experiments across six tasks demonstrate high success rates and robust positional generalization. This study explores a novel pathway, it is hoped that it will provide valuable insights for future human–robot collaboration, and serve as a critical precursor for the integration of physical intelligence with generative AI. The project website is available at: <span><span>https://h2r-mrsta.github.io/</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103208"},"PeriodicalIF":11.4,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145813784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
3DprintMIND: An AI-Agent system using large language models and dynamic manufacturing knowledge graphs for smart manufacturing 3DprintMIND:使用大型语言模型和动态制造知识图的AI-Agent系统,用于智能制造
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-06-01 Epub Date: 2025-12-30 DOI: 10.1016/j.rcim.2025.103214
Laiyi Li, Yongwen Zhang, Inno Lorren Désir Makanda, Pingyu Jiang
As an advanced manufacturing paradigm, 3D printing offers significant opportunities for smart manufacturing (SM). However, the prevalence of data silos in its production lines frequently hinders effective process analysis and decision-making. While large language models (LLMs) possess powerful analytical capabilities, their reliability in industrial scenarios is constrained by hallucinations and a disconnection from real-time operational data. Dynamic manufacturing knowledge graphs (MKGs), functioning as structured databases, can integrate structured and unstructured manufacturing data while providing LLMs with real-time data support. Meanwhile, the development of retrieval-augmented generation (RAG) and AI-Agent technologies provides a feasible pathway for the industrial application of LLMs. This study proposes an end-to-end framework ranging from knowledge integration to 3D printing production line applications to enable reliable LLM-driven SM. Firstly, leveraging the semantic analysis capabilities of LLMs, production data and manufacturing knowledge are integrated into a dynamic MKG. Subsequently, an advanced temporal graph network (TGN) model is developed for representation learning on the dynamic MKG, forming the retrieval foundation of an RAG system. A closed-loop manufacturing logic tailored to 3D printing production lines and a “semantic-to-structured” workflow for anomaly analysis were proposed. Finally, an AI-Agent system and a prototype software platform have been developed, and a case study on a laboratory 3D printing production line has been conducted to evaluate the TGN model’s performance and the AI-Agent system’s effectiveness. The results indicate that the dynamic MKG enables continuous learning for SM and provides industries with robust AI-driven data support. The TGN model significantly outperforms baseline models, yielding higher-quality dynamic embeddings for downstream knowledge retrieval tasks. Moreover, the AI-Agent system offers SM reliable intelligent analysis and decision support in 3D printing production lines.
作为一种先进的制造模式,3D打印为智能制造(SM)提供了重要的机会。然而,在其生产线中普遍存在的数据孤岛经常阻碍有效的过程分析和决策。虽然大型语言模型(llm)具有强大的分析能力,但它们在工业场景中的可靠性受到幻觉和与实时操作数据脱节的限制。动态制造知识图(MKGs)作为结构化数据库,可以集成结构化和非结构化制造数据,同时为llm提供实时数据支持。同时,检索增强生成(retrieval-augmented generation, RAG)和AI-Agent技术的发展为llm的产业化应用提供了可行的途径。本研究提出了一个从知识集成到3D打印生产线应用的端到端框架,以实现可靠的llm驱动的SM。首先,利用llm的语义分析能力,将生产数据和制造知识集成到动态MKG中。在此基础上,提出了一种用于动态MKG表示学习的高级时态图网络(TGN)模型,为RAG系统的检索奠定了基础。提出了适合3D打印生产线的闭环制造逻辑和异常分析的“从语义到结构”工作流。最后,开发了AI-Agent系统和原型软件平台,并在实验室3D打印生产线上进行了案例研究,以评估TGN模型的性能和AI-Agent系统的有效性。结果表明,动态MKG使SM能够持续学习,并为行业提供强大的人工智能驱动数据支持。TGN模型显著优于基线模型,为下游知识检索任务提供更高质量的动态嵌入。此外,AI-Agent系统为SM提供了可靠的3D打印生产线智能分析和决策支持。
{"title":"3DprintMIND: An AI-Agent system using large language models and dynamic manufacturing knowledge graphs for smart manufacturing","authors":"Laiyi Li,&nbsp;Yongwen Zhang,&nbsp;Inno Lorren Désir Makanda,&nbsp;Pingyu Jiang","doi":"10.1016/j.rcim.2025.103214","DOIUrl":"10.1016/j.rcim.2025.103214","url":null,"abstract":"<div><div>As an advanced manufacturing paradigm, 3D printing offers significant opportunities for smart manufacturing (SM). However, the prevalence of data silos in its production lines frequently hinders effective process analysis and decision-making. While large language models (LLMs) possess powerful analytical capabilities, their reliability in industrial scenarios is constrained by hallucinations and a disconnection from real-time operational data. Dynamic manufacturing knowledge graphs (MKGs), functioning as structured databases, can integrate structured and unstructured manufacturing data while providing LLMs with real-time data support. Meanwhile, the development of retrieval-augmented generation (RAG) and AI-Agent technologies provides a feasible pathway for the industrial application of LLMs. This study proposes an end-to-end framework ranging from knowledge integration to 3D printing production line applications to enable reliable LLM-driven SM. Firstly, leveraging the semantic analysis capabilities of LLMs, production data and manufacturing knowledge are integrated into a dynamic MKG. Subsequently, an advanced temporal graph network (TGN) model is developed for representation learning on the dynamic MKG, forming the retrieval foundation of an RAG system. A closed-loop manufacturing logic tailored to 3D printing production lines and a “semantic-to-structured” workflow for anomaly analysis were proposed. Finally, an AI-Agent system and a prototype software platform have been developed, and a case study on a laboratory 3D printing production line has been conducted to evaluate the TGN model’s performance and the AI-Agent system’s effectiveness. The results indicate that the dynamic MKG enables continuous learning for SM and provides industries with robust AI-driven data support. The TGN model significantly outperforms baseline models, yielding higher-quality dynamic embeddings for downstream knowledge retrieval tasks. Moreover, the AI-Agent system offers SM reliable intelligent analysis and decision support in 3D printing production lines.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103214"},"PeriodicalIF":11.4,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145883838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A lightweight detection network integrating multi-scale semantic refinement for steel strip defects 基于多尺度语义细化的钢带缺陷轻量化检测网络
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-06-01 Epub Date: 2025-11-29 DOI: 10.1016/j.rcim.2025.103188
Baicheng Bian , Long Chen , Zongwang Han , Shiqing Wu , WeiDong Li , Hongguang Chen
Real-time surface defect detection in steel strips requires high accuracy under computational constraints. Existing methods often struggle to detect low-contrast defects in complex backgrounds while maintaining computational efficiency. This paper proposes MSR-Det, a lightweight network built on a multi-scale semantic refinement framework to address this challenge. The core innovations include: a Hierarchical Texture Enhancement Module (HTEM) that strengthens discriminative textural patterns, a Specific Cross-Level Interaction Module (SCLIM) that facilitates granular feature fusion across network depths, and a Feature Enhancement Module (FEM) that refines defect representations through deep-to-shallow semantic guidance. The architecture further incorporates two key modules: a Lightweight Large-Kernel Bottleneck Module (LLKBM) for efficient receptive field expansion, and an Adaptive Gating Spatial Pyramid Pooling (AGSPP) for dynamic multi-scale context integration. Extensive experiments on NEU-DET, GC10-DET, and our industrial SS-DET dataset demonstrate that MSR-Det achieves state-of-the-art performance of 86.6% Mean Average Precision at 50% Intersection over Union (mAP50) on SS-DET with only 5.0M parameters, also attaining a real-time speed of 180 FPS. This work provides a robust and practical solution for automated visual inspection in industrial settings, effectively balancing high precision with operational efficiency.
钢带表面缺陷实时检测在计算约束下要求精度高。现有的方法往往难以在保持计算效率的同时检测复杂背景下的低对比度缺陷。本文提出了基于多尺度语义细化框架的轻量级网络MSR-Det来解决这一挑战。核心创新包括:增强判别纹理模式的分层纹理增强模块(HTEM),促进跨网络深度颗粒特征融合的特定跨层交互模块(SCLIM),以及通过从深到浅的语义引导来细化缺陷表示的特征增强模块(FEM)。该架构进一步集成了两个关键模块:用于有效接受场扩展的轻量级大核瓶颈模块(LLKBM)和用于动态多尺度上下文集成的自适应门控空间金字塔池(AGSPP)。在nue - det、GC10-DET和我们的工业SS-DET数据集上进行的大量实验表明,MSR-Det在SS-DET上仅使用5.0M参数就能达到86.6%的平均精度(mAP50),并且实时速度达到180 FPS。这项工作为工业环境中的自动视觉检测提供了一个强大而实用的解决方案,有效地平衡了高精度和操作效率。
{"title":"A lightweight detection network integrating multi-scale semantic refinement for steel strip defects","authors":"Baicheng Bian ,&nbsp;Long Chen ,&nbsp;Zongwang Han ,&nbsp;Shiqing Wu ,&nbsp;WeiDong Li ,&nbsp;Hongguang Chen","doi":"10.1016/j.rcim.2025.103188","DOIUrl":"10.1016/j.rcim.2025.103188","url":null,"abstract":"<div><div>Real-time surface defect detection in steel strips requires high accuracy under computational constraints. Existing methods often struggle to detect low-contrast defects in complex backgrounds while maintaining computational efficiency. This paper proposes MSR-Det, a lightweight network built on a multi-scale semantic refinement framework to address this challenge. The core innovations include: a Hierarchical Texture Enhancement Module (HTEM) that strengthens discriminative textural patterns, a Specific Cross-Level Interaction Module (SCLIM) that facilitates granular feature fusion across network depths, and a Feature Enhancement Module (FEM) that refines defect representations through deep-to-shallow semantic guidance. The architecture further incorporates two key modules: a Lightweight Large-Kernel Bottleneck Module (LLKBM) for efficient receptive field expansion, and an Adaptive Gating Spatial Pyramid Pooling (AGSPP) for dynamic multi-scale context integration. Extensive experiments on NEU-DET, GC10-DET, and our industrial SS-DET dataset demonstrate that MSR-Det achieves state-of-the-art performance of 86.6% Mean Average Precision at 50% Intersection over Union (<span><math><msub><mrow><mtext>mAP</mtext></mrow><mrow><mn>50</mn></mrow></msub></math></span>) on SS-DET with only 5.0M parameters, also attaining a real-time speed of 180 FPS. This work provides a robust and practical solution for automated visual inspection in industrial settings, effectively balancing high precision with operational efficiency.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103188"},"PeriodicalIF":11.4,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145614038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A digital twin modeling framework with graphical software for rapid development of aircraft assembly systems 基于图形化软件的飞机装配系统快速开发数字孪生建模框架
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-06-01 Epub Date: 2025-12-23 DOI: 10.1016/j.rcim.2025.103213
Ruihao Kang , Junshan Hu , Zhengping Li , Liangxiang Wang , Jincheng Yang , Wei Tian
Digital Twin (DT) technology is pushing manufacturing toward higher intelligence and adaptability. However, existing DT modeling methods still rely heavily on customization, lacking universality and scalability for assembly-oriented manufacturing systems. To address this limitation, this paper proposes a modular DT control framework that couples graphical interaction with reusable functional modules. Based on the classical five-dimensional DT model, the virtual entity is refined into geometric and physical models, and the service system is expanded into behavior and task models, enabling a clearer description and direct correspondence between system structure and operational logic. A behavior-oriented modeling workflow and a data-mapping mechanism are established to enhance scenario adaptability and reduce modeling effort. A graphical DT modeling platform is developed on top of this framework. Multiple robotic manufacturing prototypes, including robotic drilling, robotic gluing, and hybrid drilling systems, are constructed to assess the generality and reconfigurability of the proposed approach. A drilling experiment is performed on the robotic drilling system to validate the DT-based control execution mechanism. The resulting holes exhibit an average positioning error of 0.23 mm and a diameter error of 0.012 mm, both meeting aerospace drilling requirements. This confirms that virtual task commands can be accurately executed on physical system under the proposed DT framework. Overall, the DT prototype implementations and drilling experiment jointly verify the scalability of the framework and its DT-based control capability, providing a practical approach for the rapid development and deployment of DT prototypes in aircraft assembly systems.
数字孪生(DT)技术正在推动制造业向更高的智能和适应性发展。然而,现有的DT建模方法仍然严重依赖于定制,缺乏面向装配制造系统的通用性和可扩展性。为了解决这一限制,本文提出了一个模块化的DT控制框架,该框架将图形交互与可重用的功能模块相结合。在经典五维DT模型的基础上,将虚拟实体细化为几何和物理模型,将业务系统扩展为行为和任务模型,使系统结构与业务逻辑的描述更加清晰,直接对应。建立了面向行为的建模工作流和数据映射机制,增强了场景适应性,减少了建模工作量。在此框架的基础上开发了图形化DT建模平台。构建了多个机器人制造原型,包括机器人钻井、机器人粘合和混合钻井系统,以评估所提出方法的通用性和可重构性。在机器人钻井系统上进行了钻井实验,验证了基于dt的控制执行机制。所得到的孔的平均定位误差为0.23 mm,直径误差为0.012 mm,均满足航空航天钻井要求。这证实了在所提出的DT框架下,虚拟任务命令可以在物理系统上准确执行。总体而言,DT原型实现和钻井实验共同验证了框架的可扩展性及其基于DT的控制能力,为飞机装配系统中DT原型的快速开发和部署提供了实用方法。
{"title":"A digital twin modeling framework with graphical software for rapid development of aircraft assembly systems","authors":"Ruihao Kang ,&nbsp;Junshan Hu ,&nbsp;Zhengping Li ,&nbsp;Liangxiang Wang ,&nbsp;Jincheng Yang ,&nbsp;Wei Tian","doi":"10.1016/j.rcim.2025.103213","DOIUrl":"10.1016/j.rcim.2025.103213","url":null,"abstract":"<div><div>Digital Twin (DT) technology is pushing manufacturing toward higher intelligence and adaptability. However, existing DT modeling methods still rely heavily on customization, lacking universality and scalability for assembly-oriented manufacturing systems. To address this limitation, this paper proposes a modular DT control framework that couples graphical interaction with reusable functional modules. Based on the classical five-dimensional DT model, the virtual entity is refined into geometric and physical models, and the service system is expanded into behavior and task models, enabling a clearer description and direct correspondence between system structure and operational logic. A behavior-oriented modeling workflow and a data-mapping mechanism are established to enhance scenario adaptability and reduce modeling effort. A graphical DT modeling platform is developed on top of this framework. Multiple robotic manufacturing prototypes, including robotic drilling, robotic gluing, and hybrid drilling systems, are constructed to assess the generality and reconfigurability of the proposed approach. A drilling experiment is performed on the robotic drilling system to validate the DT-based control execution mechanism. The resulting holes exhibit an average positioning error of 0.23 mm and a diameter error of 0.012 mm, both meeting aerospace drilling requirements. This confirms that virtual task commands can be accurately executed on physical system under the proposed DT framework. Overall, the DT prototype implementations and drilling experiment jointly verify the scalability of the framework and its DT-based control capability, providing a practical approach for the rapid development and deployment of DT prototypes in aircraft assembly systems.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103213"},"PeriodicalIF":11.4,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145823074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Auction-based privacy-preserving cloud-edge collaborative scheduling considering flexible service ability for multi-source manufacturing tasks 考虑多源制造任务灵活服务能力的基于拍卖的保密性云边缘协同调度
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-06-01 Epub Date: 2025-12-13 DOI: 10.1016/j.rcim.2025.103206
Weimin Jing , Yong Yan , Yonghui Zhang , Xiang Ji , Wen Huang , Youling Chen , Huan Zhang
In the context of cloud manufacturing service, scheduling manufacturing tasks is a crucial area of research because it directly influences service quality and efficiency. As intelligent manufacturing technologies advance, edge manufacturing service providers have gained increasingly flexible ability, enabling them to adjust local production schedules to adapt cloud manufacturing tasks. However, because local manufacturing information is private, traditional centralized cloud manufacturing scheduling methods cannot fully leverage edge flexibility to collaboratively schedule multi-source manufacturing tasks (including both cloud manufacturing tasks and local tasks of edge service providers) without risking the disclosure of sensitive information, thereby limiting improvements in both performance and efficiency of cloud manufacturing service. Therefore, we propose a privacy-preserving cloud-edge collaborative decision-making approach based on auction theory to schedule multi-source manufacturing tasks. First, a mathematical model that accounts for the objectives of both service providers and demanders is established to characterize the collaborative scheduling of multi-sourced tasks. Subsequently, a cloud-edge collaborative scheduling decision framework is introduced. Building upon this, a multi-stage scheduling method based on combinatorial iterative auctions is proposed, featuring novel bidding with a flexible execution timeline and distributed winner determination process incorporating bid consolidation mechanisms to enhance the efficiency of cloud-edge collaborative decision. Finally, to validate the superiority of the proposed method, computational experiments are conducted, comparing it with traditional centralized manufacturing task scheduling methods. The results present that the proposed method not only completes cloud manufacturing tasks within a relatively shorter makespan but also provides higher-value manufacturing services to demanders. Moreover, as the cloud manufacturing task load increases, this advantage becomes even more pronounced.
在云制造服务背景下,制造任务调度是一个重要的研究领域,因为它直接影响到服务质量和效率。随着智能制造技术的进步,边缘制造服务商获得了越来越灵活的能力,使他们能够调整本地生产计划以适应云制造任务。然而,由于本地制造信息是私有的,传统的集中式云制造调度方法无法充分利用边缘灵活性,在不泄露敏感信息的情况下协同调度多源制造任务(包括云制造任务和边缘服务提供商的本地任务),从而限制了云制造服务的性能和效率提升。因此,我们提出了一种基于拍卖理论的多源制造任务调度的隐私保护云边缘协同决策方法。首先,建立了考虑服务提供者和需求者目标的数学模型来表征多源任务协同调度。随后,介绍了一种云边缘协同调度决策框架。在此基础上,提出了一种基于组合迭代拍卖的多阶段调度方法,采用具有灵活执行时间的新型竞价和包含竞价整合机制的分布式中标人确定过程,提高了云边缘协同决策的效率。最后,为了验证该方法的优越性,进行了计算实验,并与传统的集中式制造任务调度方法进行了比较。结果表明,该方法不仅可以在相对较短的makespan内完成云制造任务,而且可以为需求者提供更高价值的制造服务。此外,随着云制造任务负载的增加,这种优势变得更加明显。
{"title":"Auction-based privacy-preserving cloud-edge collaborative scheduling considering flexible service ability for multi-source manufacturing tasks","authors":"Weimin Jing ,&nbsp;Yong Yan ,&nbsp;Yonghui Zhang ,&nbsp;Xiang Ji ,&nbsp;Wen Huang ,&nbsp;Youling Chen ,&nbsp;Huan Zhang","doi":"10.1016/j.rcim.2025.103206","DOIUrl":"10.1016/j.rcim.2025.103206","url":null,"abstract":"<div><div>In the context of cloud manufacturing service, scheduling manufacturing tasks is a crucial area of research because it directly influences service quality and efficiency. As intelligent manufacturing technologies advance, edge manufacturing service providers have gained increasingly flexible ability, enabling them to adjust local production schedules to adapt cloud manufacturing tasks. However, because local manufacturing information is private, traditional centralized cloud manufacturing scheduling methods cannot fully leverage edge flexibility to collaboratively schedule multi-source manufacturing tasks (including both cloud manufacturing tasks and local tasks of edge service providers) without risking the disclosure of sensitive information, thereby limiting improvements in both performance and efficiency of cloud manufacturing service. Therefore, we propose a privacy-preserving cloud-edge collaborative decision-making approach based on auction theory to schedule multi-source manufacturing tasks. First, a mathematical model that accounts for the objectives of both service providers and demanders is established to characterize the collaborative scheduling of multi-sourced tasks. Subsequently, a cloud-edge collaborative scheduling decision framework is introduced. Building upon this, a multi-stage scheduling method based on combinatorial iterative auctions is proposed, featuring novel bidding with a flexible execution timeline and distributed winner determination process incorporating bid consolidation mechanisms to enhance the efficiency of cloud-edge collaborative decision. Finally, to validate the superiority of the proposed method, computational experiments are conducted, comparing it with traditional centralized manufacturing task scheduling methods. The results present that the proposed method not only completes cloud manufacturing tasks within a relatively shorter makespan but also provides higher-value manufacturing services to demanders. Moreover, as the cloud manufacturing task load increases, this advantage becomes even more pronounced.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103206"},"PeriodicalIF":11.4,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145753457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
iLSPR: A Learning-based Scene Point-cloud Registration method for robotic spatial awareness in intelligent manufacturing 基于学习的智能制造机器人空间感知场景点云配准方法
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-06-01 Epub Date: 2025-12-16 DOI: 10.1016/j.rcim.2025.103204
Yusen Wan, Xu Chen
A critical capability for intelligent manufacturing is the ability for robotic systems to understand the spatial operation environment – the ability for robots to precisely recognize and estimate the spatial positions and orientations of objects in industrial scenes. Existing scene reconstruction methods are designed for general settings with low precision needs of objects and abundant data. However, manufacturing hinges on high object precision and operates with limited data. Addressing such challenges and limitations, we propose a novel Learning-based Scene Point-cloud Registration framework for automatic industrial scene reconstruction (iLSPR). The proposed iLSPR framework leverages point cloud representation and integrates three key innovations: (i) a Multi-Feature Robust Point Matching Network (MF-RPMN) that learns from both raw data and deep features of the objects to accurately align point clouds, (ii) a Geometric-Primitive-based Data Generation (GPDG) method for efficient synthetic data generation, and (iii) a digital model library of industrial target objects. During operation, vision sensors capture point clouds in the scenes, and the iLSPR method registers high-fidelity object models in the scenes using MF-RPMN, pre-trained with GPDG-generated data. We introduce an Industrial Scene Object Point-cloud Registration (ISOPR) dataset in IsaacSim to benchmark performance. Experimental results demonstrate that iLSPR significantly outperforms existing methods in accuracy and robustness. We further validate the approach on a real-world robotic manufacturing system, demonstrating reliable digital reconstruction of industrial scenes.
智能制造的一个关键能力是机器人系统理解空间操作环境的能力——机器人精确识别和估计工业场景中物体的空间位置和方向的能力。现有的场景重建方法都是针对一般场景而设计的,对物体的精度要求不高,数据量大。然而,制造业依赖于高物体精度和有限的数据。针对这些挑战和限制,我们提出了一种新的基于学习的场景点云配准框架,用于自动工业场景重建(iLSPR)。提出的iLSPR框架利用点云表示,并集成了三个关键创新:(i)多特征鲁棒点匹配网络(MF-RPMN),从原始数据和对象的深层特征中学习,以精确对齐点云;(ii)基于几何原语的数据生成(GPDG)方法,用于高效的合成数据生成;(iii)工业目标对象的数字模型库。在操作过程中,视觉传感器捕获场景中的点云,iLSPR方法使用MF-RPMN注册场景中的高保真目标模型,并使用gpdg生成的数据进行预训练。我们在IsaacSim中引入一个工业场景对象点云配准(ISOPR)数据集来对性能进行基准测试。实验结果表明,iLSPR在准确性和鲁棒性方面明显优于现有方法。我们进一步在现实世界的机器人制造系统中验证了该方法,展示了工业场景的可靠数字重建。
{"title":"iLSPR: A Learning-based Scene Point-cloud Registration method for robotic spatial awareness in intelligent manufacturing","authors":"Yusen Wan,&nbsp;Xu Chen","doi":"10.1016/j.rcim.2025.103204","DOIUrl":"10.1016/j.rcim.2025.103204","url":null,"abstract":"<div><div>A critical capability for intelligent manufacturing is the ability for robotic systems to understand the spatial operation environment – the ability for robots to precisely recognize and estimate the spatial positions and orientations of objects in industrial scenes. Existing scene reconstruction methods are designed for general settings with low precision needs of objects and abundant data. However, manufacturing hinges on high object precision and operates with limited data. Addressing such challenges and limitations, we propose a novel <strong>L</strong>earning-based <strong>S</strong>cene <strong>P</strong>oint-cloud <strong>R</strong>egistration framework for automatic <strong>i</strong>ndustrial scene reconstruction (iLSPR). The proposed iLSPR framework leverages point cloud representation and integrates three key innovations: (i) a <strong>M</strong>ulti-<strong>F</strong>eature <strong>R</strong>obust <strong>P</strong>oint <strong>M</strong>atching <strong>N</strong>etwork (MF-RPMN) that learns from both raw data and deep features of the objects to accurately align point clouds, (ii) a <strong>G</strong>eometric-<strong>P</strong>rimitive-based <strong>D</strong>ata <strong>G</strong>eneration (GPDG) method for efficient synthetic data generation, and (iii) a digital model library of industrial target objects. During operation, vision sensors capture point clouds in the scenes, and the iLSPR method registers high-fidelity object models in the scenes using MF-RPMN, pre-trained with GPDG-generated data. We introduce an <strong>I</strong>ndustrial <strong>S</strong>cene <strong>O</strong>bject <strong>P</strong>oint-cloud <strong>R</strong>egistration (ISOPR) dataset in IsaacSim to benchmark performance. Experimental results demonstrate that iLSPR significantly outperforms existing methods in accuracy and robustness. We further validate the approach on a real-world robotic manufacturing system, demonstrating reliable digital reconstruction of industrial scenes.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103204"},"PeriodicalIF":11.4,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145785000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Pivot-Move Strategy for Dual-Robot Manipulator Additive Manufacturing: Enabling Collision Avoidance without Halting Deposition 双机器人机械臂增材制造的一种新型支点移动策略:避免碰撞而不停止沉积
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-06-01 Epub Date: 2025-11-27 DOI: 10.1016/j.rcim.2025.103177
C.L. Li , Y.C. Jiao , K. Ren , N. Liu , Y.F. Zhang
Robot-assisted additive manufacturing (AM) has been gaining increasing popularity due to its great flexibility and reachability. Moreover, an AM system with dual deposition-heads held by robot manipulators would significantly shorten the building time, especially for large-scale parts. However, motion planning (MP) for the dual robot manipulators AM is highly challengeable due to various constraints imposed by the setup and the AM process aiming to improve the qualities of the component, e.g., maintaining travelling speed and posture of the deposition head and avoiding collision. In this paper, a novel pivot-move strategy is proposed for MP in AM with dual robot manipulators. Given the sequenced deposition toolpath segments to each deposition head, an initial MP solution including robot configuration at each time sample waypoint is firstly generated for each robot manipulator, respectively. This is followed by conducting a check-and-correct process at each waypoint, where the collision among the links of two robot manipulators is identified and corrected. Specially, the robot manipulator is designed to simultaneously pivot and move to avoid the collision while maintaining the traveling speed unchanged. Numerical simulation, physical implementation, and benchmarking were conducted to exhibit a 78.295% deposition time reduction and high-quality deposition with the developed strategy. To the best of the authors' knowledge, this study represents the pioneering effort in addressing the collision issue in dual robot manipulators depositing on the same heated bed, achieving collision avoidance without interrupting the ongoing deposition process. It can be a valuable supplement to the state of the art in this area.
机器人辅助增材制造(AM)由于其巨大的灵活性和可达性而越来越受欢迎。此外,一个具有双沉积头的增材制造系统,由机器人机械手握住,将大大缩短建造时间,特别是对于大型零件。然而,由于设置和增材制造过程施加的各种限制,双机器人机械手增材制造的运动规划(MP)具有很高的挑战性,这些限制旨在提高组件的质量,例如,保持沉积头的移动速度和姿态,并避免碰撞。本文提出了一种新的双机械手增材制造中MP的支点移动策略。给定每个沉积头的顺序沉积刀具路径段,首先为每个机器人机械手分别生成包含每个时间样本路径点的机器人配置的初始MP解。随后,在每个航路点进行检查和纠正过程,在此过程中,两个机器人操纵器之间的链接之间的碰撞被识别和纠正。特别地,机器人机械手被设计为在保持行进速度不变的情况下同时转动和移动以避免碰撞。数值模拟、物理实现和基准测试表明,采用所开发的策略,沉积时间缩短78.295%,沉积质量高。据作者所知,这项研究代表了解决在同一加热床上沉积的双机器人操纵器碰撞问题的开创性努力,在不中断正在进行的沉积过程的情况下实现了碰撞避免。它可以成为这一领域最新技术的有价值的补充。
{"title":"A Novel Pivot-Move Strategy for Dual-Robot Manipulator Additive Manufacturing: Enabling Collision Avoidance without Halting Deposition","authors":"C.L. Li ,&nbsp;Y.C. Jiao ,&nbsp;K. Ren ,&nbsp;N. Liu ,&nbsp;Y.F. Zhang","doi":"10.1016/j.rcim.2025.103177","DOIUrl":"10.1016/j.rcim.2025.103177","url":null,"abstract":"<div><div>Robot-assisted additive manufacturing (AM) has been gaining increasing popularity due to its great flexibility and reachability. Moreover, an AM system with dual deposition-heads held by robot manipulators would significantly shorten the building time, especially for large-scale parts. However, motion planning (MP) for the dual robot manipulators AM is highly challengeable due to various constraints imposed by the setup and the AM process aiming to improve the qualities of the component, e.g., maintaining travelling speed and posture of the deposition head and avoiding collision. In this paper, a novel pivot-move strategy is proposed for MP in AM with dual robot manipulators. Given the sequenced deposition toolpath segments to each deposition head, an initial MP solution including robot configuration at each time sample waypoint is firstly generated for each robot manipulator, respectively. This is followed by conducting a check-and-correct process at each waypoint, where the collision among the links of two robot manipulators is identified and corrected. Specially, the robot manipulator is designed to simultaneously pivot and move to avoid the collision while maintaining the traveling speed unchanged. Numerical simulation, physical implementation, and benchmarking were conducted to exhibit a 78.295% deposition time reduction and high-quality deposition with the developed strategy. To the best of the authors' knowledge, this study represents the pioneering effort in addressing the collision issue in dual robot manipulators depositing on the same heated bed, achieving collision avoidance without interrupting the ongoing deposition process. It can be a valuable supplement to the state of the art in this area.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103177"},"PeriodicalIF":11.4,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145611798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hierarchical human behavior modeling framework for safe and efficient human-robot collaborative assembly 安全高效人机协同装配的分层人行为建模框架
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-06-01 Epub Date: 2025-12-10 DOI: 10.1016/j.rcim.2025.103202
Guoyi Xia , Zied Ghrairi , Aaron Heuermann , Klaus-Dieter Thoben
Human-robot collaboration (HRC) offers promising solutions in industrial assembly by combining human dexterity and robot efficiency. However, the proximity of the human and robot raises safety concerns that can limit efficiency. Previous studies have typically addressed safety or efficiency separately, relying on incomplete models of human behavior, which has led to robot control strategies with limited adaptability. To bridge this gap, this paper proposes a hierarchical human behavior modeling framework that integrates human motion prediction (HMP) and human action segmentation. The proposed model captures both the fine-grained motion dynamics and the higher-level task structure, enabling a more complete and context-aware understanding of human behavior. The model can enhance robot decision-making for both proactive safety mechanisms and dynamic task allocation. HMP compares three predictive models (Convolutional Neural Networks - Long Short-Term Memory (CNN-LSTM), Spatial-Temporal Graph Convolutional Networks (ST-GCN), Transformer). CNN-LSTM and ST-GCN outperformed Transformer, demonstrating better short-term predictive accuracy. Human action segmentation includes feature extraction, dimensionality reduction, clustering, and two-stage temporal segmentation. The HMP (CNN-LSTM) based features achieve the highest clustering performance. Two-stage segmentation demonstrates high accuracy, achieving normalized edit distances (NED) of 0.029 and 0.07 for task-level and sub-task-level segmentation, respectively. Evaluation results show that proactive collision avoidance using predicted motions increased safety distance (from 0.4633 m to 0.4717 m), while dynamic task allocation based on action segmentation improves robot efficiency (from 84.95 % to 98.56 %). These results validate the effectiveness of the proposed hierarchical human behavior modeling framework in simultaneously enhancing safety and efficiency in HRC assembly.
人机协作(HRC)通过结合人的灵活性和机器人的效率,为工业装配提供了有前途的解决方案。然而,人类和机器人的接近引发了安全问题,这可能会限制效率。以前的研究通常是分别解决安全或效率问题,依赖于不完整的人类行为模型,这导致机器人控制策略具有有限的适应性。为了弥补这一缺陷,本文提出了一种将人体运动预测(HMP)和人体动作分割相结合的分层人体行为建模框架。所提出的模型既捕获了细粒度的运动动态,又捕获了更高级别的任务结构,从而能够对人类行为进行更完整和上下文感知的理解。该模型可以增强机器人的主动安全机制和动态任务分配的决策能力。HMP比较了三种预测模型(卷积神经网络-长短期记忆(CNN-LSTM),时空图卷积网络(ST-GCN), Transformer)。CNN-LSTM和ST-GCN表现优于Transformer,表现出更好的短期预测准确性。人体动作分割包括特征提取、降维、聚类和两阶段时间分割。基于HMP (CNN-LSTM)的特征实现了最高的聚类性能。两阶段分割具有较高的准确率,任务级和子任务级分割的归一化编辑距离(NED)分别为0.029和0.07。评估结果表明,基于预测运动的主动避撞提高了机器人的安全距离(从0.4633 m提高到0.4717 m),而基于动作分割的动态任务分配提高了机器人的效率(从84.95%提高到98.56%)。这些结果验证了所提出的分层人类行为建模框架在提高HRC装配安全性和效率方面的有效性。
{"title":"A hierarchical human behavior modeling framework for safe and efficient human-robot collaborative assembly","authors":"Guoyi Xia ,&nbsp;Zied Ghrairi ,&nbsp;Aaron Heuermann ,&nbsp;Klaus-Dieter Thoben","doi":"10.1016/j.rcim.2025.103202","DOIUrl":"10.1016/j.rcim.2025.103202","url":null,"abstract":"<div><div>Human-robot collaboration (HRC) offers promising solutions in industrial assembly by combining human dexterity and robot efficiency. However, the proximity of the human and robot raises safety concerns that can limit efficiency. Previous studies have typically addressed safety or efficiency separately, relying on incomplete models of human behavior, which has led to robot control strategies with limited adaptability. To bridge this gap, this paper proposes a hierarchical human behavior modeling framework that integrates human motion prediction (HMP) and human action segmentation. The proposed model captures both the fine-grained motion dynamics and the higher-level task structure, enabling a more complete and context-aware understanding of human behavior. The model can enhance robot decision-making for both proactive safety mechanisms and dynamic task allocation. HMP compares three predictive models (Convolutional Neural Networks - Long Short-Term Memory (CNN-LSTM), Spatial-Temporal Graph Convolutional Networks (ST-GCN), Transformer). CNN-LSTM and ST-GCN outperformed Transformer, demonstrating better short-term predictive accuracy. Human action segmentation includes feature extraction, dimensionality reduction, clustering, and two-stage temporal segmentation. The HMP (CNN-LSTM) based features achieve the highest clustering performance. Two-stage segmentation demonstrates high accuracy, achieving normalized edit distances (NED) of 0.029 and 0.07 for task-level and sub-task-level segmentation, respectively. Evaluation results show that proactive collision avoidance using predicted motions increased safety distance (from 0.4633 m to 0.4717 m), while dynamic task allocation based on action segmentation improves robot efficiency (from 84.95 % to 98.56 %). These results validate the effectiveness of the proposed hierarchical human behavior modeling framework in simultaneously enhancing safety and efficiency in HRC assembly.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103202"},"PeriodicalIF":11.4,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145731795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Robotics and Computer-integrated Manufacturing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1