首页 > 最新文献

Robotics and Computer-integrated Manufacturing最新文献

英文 中文
A hierarchical human behavior modeling framework for safe and efficient human-robot collaborative assembly 安全高效人机协同装配的分层人行为建模框架
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-10 DOI: 10.1016/j.rcim.2025.103202
Guoyi Xia , Zied Ghrairi , Aaron Heuermann , Klaus-Dieter Thoben
Human-robot collaboration (HRC) offers promising solutions in industrial assembly by combining human dexterity and robot efficiency. However, the proximity of the human and robot raises safety concerns that can limit efficiency. Previous studies have typically addressed safety or efficiency separately, relying on incomplete models of human behavior, which has led to robot control strategies with limited adaptability. To bridge this gap, this paper proposes a hierarchical human behavior modeling framework that integrates human motion prediction (HMP) and human action segmentation. The proposed model captures both the fine-grained motion dynamics and the higher-level task structure, enabling a more complete and context-aware understanding of human behavior. The model can enhance robot decision-making for both proactive safety mechanisms and dynamic task allocation. HMP compares three predictive models (Convolutional Neural Networks - Long Short-Term Memory (CNN-LSTM), Spatial-Temporal Graph Convolutional Networks (ST-GCN), Transformer). CNN-LSTM and ST-GCN outperformed Transformer, demonstrating better short-term predictive accuracy. Human action segmentation includes feature extraction, dimensionality reduction, clustering, and two-stage temporal segmentation. The HMP (CNN-LSTM) based features achieve the highest clustering performance. Two-stage segmentation demonstrates high accuracy, achieving normalized edit distances (NED) of 0.029 and 0.07 for task-level and sub-task-level segmentation, respectively. Evaluation results show that proactive collision avoidance using predicted motions increased safety distance (from 0.4633 m to 0.4717 m), while dynamic task allocation based on action segmentation improves robot efficiency (from 84.95 % to 98.56 %). These results validate the effectiveness of the proposed hierarchical human behavior modeling framework in simultaneously enhancing safety and efficiency in HRC assembly.
人机协作(HRC)通过结合人的灵活性和机器人的效率,为工业装配提供了有前途的解决方案。然而,人类和机器人的接近引发了安全问题,这可能会限制效率。以前的研究通常是分别解决安全或效率问题,依赖于不完整的人类行为模型,这导致机器人控制策略具有有限的适应性。为了弥补这一缺陷,本文提出了一种将人体运动预测(HMP)和人体动作分割相结合的分层人体行为建模框架。所提出的模型既捕获了细粒度的运动动态,又捕获了更高级别的任务结构,从而能够对人类行为进行更完整和上下文感知的理解。该模型可以增强机器人的主动安全机制和动态任务分配的决策能力。HMP比较了三种预测模型(卷积神经网络-长短期记忆(CNN-LSTM),时空图卷积网络(ST-GCN), Transformer)。CNN-LSTM和ST-GCN表现优于Transformer,表现出更好的短期预测准确性。人体动作分割包括特征提取、降维、聚类和两阶段时间分割。基于HMP (CNN-LSTM)的特征实现了最高的聚类性能。两阶段分割具有较高的准确率,任务级和子任务级分割的归一化编辑距离(NED)分别为0.029和0.07。评估结果表明,基于预测运动的主动避撞提高了机器人的安全距离(从0.4633 m提高到0.4717 m),而基于动作分割的动态任务分配提高了机器人的效率(从84.95%提高到98.56%)。这些结果验证了所提出的分层人类行为建模框架在提高HRC装配安全性和效率方面的有效性。
{"title":"A hierarchical human behavior modeling framework for safe and efficient human-robot collaborative assembly","authors":"Guoyi Xia ,&nbsp;Zied Ghrairi ,&nbsp;Aaron Heuermann ,&nbsp;Klaus-Dieter Thoben","doi":"10.1016/j.rcim.2025.103202","DOIUrl":"10.1016/j.rcim.2025.103202","url":null,"abstract":"<div><div>Human-robot collaboration (HRC) offers promising solutions in industrial assembly by combining human dexterity and robot efficiency. However, the proximity of the human and robot raises safety concerns that can limit efficiency. Previous studies have typically addressed safety or efficiency separately, relying on incomplete models of human behavior, which has led to robot control strategies with limited adaptability. To bridge this gap, this paper proposes a hierarchical human behavior modeling framework that integrates human motion prediction (HMP) and human action segmentation. The proposed model captures both the fine-grained motion dynamics and the higher-level task structure, enabling a more complete and context-aware understanding of human behavior. The model can enhance robot decision-making for both proactive safety mechanisms and dynamic task allocation. HMP compares three predictive models (Convolutional Neural Networks - Long Short-Term Memory (CNN-LSTM), Spatial-Temporal Graph Convolutional Networks (ST-GCN), Transformer). CNN-LSTM and ST-GCN outperformed Transformer, demonstrating better short-term predictive accuracy. Human action segmentation includes feature extraction, dimensionality reduction, clustering, and two-stage temporal segmentation. The HMP (CNN-LSTM) based features achieve the highest clustering performance. Two-stage segmentation demonstrates high accuracy, achieving normalized edit distances (NED) of 0.029 and 0.07 for task-level and sub-task-level segmentation, respectively. Evaluation results show that proactive collision avoidance using predicted motions increased safety distance (from 0.4633 m to 0.4717 m), while dynamic task allocation based on action segmentation improves robot efficiency (from 84.95 % to 98.56 %). These results validate the effectiveness of the proposed hierarchical human behavior modeling framework in simultaneously enhancing safety and efficiency in HRC assembly.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103202"},"PeriodicalIF":11.4,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145731795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Target-oriented collision-free robot grasping using task-attendance teachers-student knowledge distillation for various dense-clutter scenarios 基于任务出勤师生知识蒸馏的面向目标无碰撞机器人抓取
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-10 DOI: 10.1016/j.rcim.2025.103201
Shaodong Li , Cheng Xiang , Wei Du , Xi Liu , Huajian Song , Feng Shuang
The target-oriented collision-free robot grasping in dense-clutter scenarios has made significant progress. However, the prior studies primarily focus on the complex problem brought about by an increase in object quantity. Actually, the various scenarios will also result in the challenge. That means skill models will inevitably start to obtain bloat, as more subtasks appear. Our previous study presents a multi-teacher distillation strategy, thereby effectively avoiding bloat. Unfortunately, it still has trouble in execution capability when the scenarios have more subtasks and highly similar subtasks. The root cause is attributed to the inadequate capability of distilled student in subtask identification. It is naive to develop the subtask classifier based on the traditional data-driven way when human experience fails to work. Therefore, we propose a dataset-free classifier by migrating the classification capability of teachers to the student. Then, we further propose a task-attendance teachers-student knowledge distillation strategy to afford the various scenarios involving more and highly similar subtasks, thus significantly enhancing the performance of grasping. And we also leverage the language instruction to ensure the mask map of target object through LLM, improving the intuitiveness of human-robot cooperation. Extensive comparative experiment verifies the advantage of our framework. We measure the capability of teachers-student distillation, and value of dataset-free classifier in our framework. Importantly, the performance of segmentation strategy is tested under ambiguous instruction, visual occlusion, and color conflict. Furthermore, the excellent generalization and robustness are exhibited according to the real-world experiments involving unseen object, unseen task modality, and disturbance existing.
面向目标的无碰撞机器人在密集杂波环境下抓取的研究取得了重大进展。然而,以往的研究主要集中在对象数量增加所带来的复杂问题上。实际上,各种各样的场景也会带来挑战。这意味着随着更多子任务的出现,技能模型将不可避免地开始膨胀。我们之前的研究提出了一个多教师蒸馏策略,从而有效地避免了膨胀。不幸的是,当场景具有更多的子任务和高度相似的子任务时,它在执行能力方面仍然存在问题。其根本原因在于学生在子任务识别方面的能力不足。当人的经验不起作用时,基于传统的数据驱动方法开发子任务分类器是幼稚的。因此,我们提出了一种无数据集的分类器,将教师的分类能力转移到学生身上。在此基础上,我们进一步提出了任务-考勤-师生知识蒸馏策略,以应对涉及更多且高度相似的子任务的各种场景,从而显著提高了抓取性能。并且利用语言指令通过LLM保证目标物体的掩模图,提高了人机合作的直观性。大量的对比实验验证了该框架的优越性。在我们的框架中,我们衡量了师生蒸馏的能力和无数据集分类器的价值。重要的是,在模糊指令、视觉遮挡和颜色冲突的情况下测试了分割策略的性能。此外,通过对不可见目标、不可见任务模态和存在干扰的实际实验,证明了该方法具有良好的泛化和鲁棒性。
{"title":"Target-oriented collision-free robot grasping using task-attendance teachers-student knowledge distillation for various dense-clutter scenarios","authors":"Shaodong Li ,&nbsp;Cheng Xiang ,&nbsp;Wei Du ,&nbsp;Xi Liu ,&nbsp;Huajian Song ,&nbsp;Feng Shuang","doi":"10.1016/j.rcim.2025.103201","DOIUrl":"10.1016/j.rcim.2025.103201","url":null,"abstract":"<div><div>The target-oriented collision-free robot grasping in dense-clutter scenarios has made significant progress. However, the prior studies primarily focus on the complex problem brought about by an increase in object quantity. Actually, the various scenarios will also result in the challenge. That means skill models will inevitably start to obtain bloat, as more subtasks appear. Our previous study presents a multi-teacher distillation strategy, thereby effectively avoiding bloat. Unfortunately, it still has trouble in execution capability when the scenarios have more subtasks and highly similar subtasks. The root cause is attributed to the inadequate capability of distilled student in subtask identification. It is naive to develop the subtask classifier based on the traditional data-driven way when human experience fails to work. Therefore, we propose a dataset-free classifier by migrating the classification capability of teachers to the student. Then, we further propose a task-attendance teachers-student knowledge distillation strategy to afford the various scenarios involving more and highly similar subtasks, thus significantly enhancing the performance of grasping. And we also leverage the language instruction to ensure the mask map of target object through LLM, improving the intuitiveness of human-robot cooperation. Extensive comparative experiment verifies the advantage of our framework. We measure the capability of teachers-student distillation, and value of dataset-free classifier in our framework. Importantly, the performance of segmentation strategy is tested under ambiguous instruction, visual occlusion, and color conflict. Furthermore, the excellent generalization and robustness are exhibited according to the real-world experiments involving unseen object, unseen task modality, and disturbance existing.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103201"},"PeriodicalIF":11.4,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145731770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of support head with joint-synchronized structure and tuned mass damper for robotic mirror milling 机器人铣镜关节同步结构和调谐质量阻尼器支撑头设计
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-08 DOI: 10.1016/j.rcim.2025.103200
Kun Chen , Chenghao Huang , Haonan Ma , Peng Xu , Bing Li
Machined thin-walled parts are critical to lightweight aerospace structures; however, their low stiffness poses significant challenges in controlling deformation and vibrations during their machining. To counteract these force-induced errors, robotic mirror milling has emerged as a promising technique, because a support robot can actively neutralize the cutting forces. However, the support robot requires an appropriate end-effector to protect the workpiece surface, while providing stable support. In this study, a support head with plastic rollers and a multi-branched tuned mass damper (TMD) is developed. The support head, synchronized with the Joint 6 of the support robot, contacts the thin-walled part via plastic rollers. A follow-up motion plan algorithm is proposed for the support head to synchronize its motion with the milling tool. The multi-branched TMD is designed to suppress the vibrations of the thin-walled parts and support head in different directions, thus ensuring an effective contact between them. A frequency-response function model for the TMD is proposed for parameter design. Simulations and experiments confirm that the designed motion-planning algorithm reduces the velocity of Joint 6 by 78.6 %, while the TMD decreases the machined surface roughness by 32.8 % and 21.4 % at axial depths of cut of 1.0 and 1.5 mm, respectively. The results demonstrate that the joint-synchronized support head, combined with the multi-branched TMD, effectively ensures stable support to the workpiece, while preventing damage to its surface.
机加工薄壁件是航空航天轻量化结构的关键;然而,它们的低刚度在加工过程中对控制变形和振动提出了重大挑战。为了抵消这些力引起的误差,机器人镜面铣削已经成为一种很有前途的技术,因为支撑机器人可以主动抵消切削力。然而,支撑机器人需要一个合适的末端执行器来保护工件表面,同时提供稳定的支撑。本文设计了一种带有塑料滚子和多分支调谐质量阻尼器的支撑头。支撑头与支撑机器人的关节6同步,通过塑料滚轮与薄壁件接触。提出了一种支撑头与铣刀同步运动的跟踪运动规划算法。多分支TMD的设计是为了抑制薄壁零件和支承头在不同方向上的振动,从而保证它们之间的有效接触。提出了TMD的频率响应函数模型,用于参数设计。仿真和实验结果表明,在轴向切削深度为1.0 mm和1.5 mm时,所设计的运动规划算法可使关节6的切削速度降低78.6%,而TMD可使加工表面粗糙度分别降低32.8%和21.4%。结果表明,联合同步支撑头与多分支TMD相结合,有效地保证了工件的稳定支撑,同时防止了工件表面的损伤。
{"title":"Design of support head with joint-synchronized structure and tuned mass damper for robotic mirror milling","authors":"Kun Chen ,&nbsp;Chenghao Huang ,&nbsp;Haonan Ma ,&nbsp;Peng Xu ,&nbsp;Bing Li","doi":"10.1016/j.rcim.2025.103200","DOIUrl":"10.1016/j.rcim.2025.103200","url":null,"abstract":"<div><div>Machined thin-walled parts are critical to lightweight aerospace structures; however, their low stiffness poses significant challenges in controlling deformation and vibrations during their machining. To counteract these force-induced errors, robotic mirror milling has emerged as a promising technique, because a support robot can actively neutralize the cutting forces. However, the support robot requires an appropriate end-effector to protect the workpiece surface, while providing stable support. In this study, a support head with plastic rollers and a multi-branched tuned mass damper (TMD) is developed. The support head, synchronized with the Joint 6 of the support robot, contacts the thin-walled part via plastic rollers. A follow-up motion plan algorithm is proposed for the support head to synchronize its motion with the milling tool. The multi-branched TMD is designed to suppress the vibrations of the thin-walled parts and support head in different directions, thus ensuring an effective contact between them. A frequency-response function model for the TMD is proposed for parameter design. Simulations and experiments confirm that the designed motion-planning algorithm reduces the velocity of Joint 6 by 78.6 %, while the TMD decreases the machined surface roughness by 32.8 % and 21.4 % at axial depths of cut of 1.0 and 1.5 mm, respectively. The results demonstrate that the joint-synchronized support head, combined with the multi-branched TMD, effectively ensures stable support to the workpiece, while preventing damage to its surface.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103200"},"PeriodicalIF":11.4,"publicationDate":"2025-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145697314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time–torque coordinated optimization for trajectory planning of industrial robots 工业机器人轨迹规划的时间-力矩协调优化
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-01 DOI: 10.1016/j.rcim.2025.103199
Zeyun Xiao, Danfeng Sun, Donglai Zhu, Yong Wang, Yi Yan, Huifeng Wu
Trajectory optimization is vital for improving the operational efficiency and reliability of industrial robotic arms. However, increasing task execution speed frequently induces sharp fluctuations in joint torque, which elevates energy consumption, places excessive stress on actuators, and excites structural vibrations. To tackle these issues, this study introduces a unified time–torque optimization framework that combines predictive modeling with heuristic search. The framework adopts a neural network incorporating frequency-domain correlation and a delay-aware mechanism to model dynamic torque variations. Based on the predicted torque profiles, the robot’s motion trajectory is parameterized by quintic polynomials, and a multi-objective loss function is constructed to jointly minimize execution time and torque variation rate. Particle Swarm Optimization (PSO) is employed to perform a global search for intermediate joint velocities and accelerations, improving convergence toward near-optimal solutions. Experiments on a six-axis industrial robotic platform demonstrate that the proposed method effectively reduces execution time and smooths torque transitions, confirming its practicality for industrial applications.
轨迹优化是提高工业机械臂运行效率和可靠性的关键。然而,任务执行速度的提高往往会引起关节扭矩的急剧波动,从而增加能量消耗,对执行器施加过大的应力,并激发结构振动。为了解决这些问题,本研究引入了一个统一的时间-扭矩优化框架,该框架将预测建模与启发式搜索相结合。该框架采用结合频域相关和延迟感知机制的神经网络对动态转矩变化进行建模。基于预测的转矩曲线,采用五次多项式参数化机器人运动轨迹,构建多目标损失函数,共同最小化执行时间和转矩变化率。采用粒子群算法(PSO)对中间关节速度和加速度进行全局搜索,提高了接近最优解的收敛性。在六轴工业机器人平台上的实验表明,该方法有效地缩短了执行时间,平滑了转矩转换,验证了其在工业应用中的实用性。
{"title":"Time–torque coordinated optimization for trajectory planning of industrial robots","authors":"Zeyun Xiao,&nbsp;Danfeng Sun,&nbsp;Donglai Zhu,&nbsp;Yong Wang,&nbsp;Yi Yan,&nbsp;Huifeng Wu","doi":"10.1016/j.rcim.2025.103199","DOIUrl":"10.1016/j.rcim.2025.103199","url":null,"abstract":"<div><div>Trajectory optimization is vital for improving the operational efficiency and reliability of industrial robotic arms. However, increasing task execution speed frequently induces sharp fluctuations in joint torque, which elevates energy consumption, places excessive stress on actuators, and excites structural vibrations. To tackle these issues, this study introduces a unified time–torque optimization framework that combines predictive modeling with heuristic search. The framework adopts a neural network incorporating frequency-domain correlation and a delay-aware mechanism to model dynamic torque variations. Based on the predicted torque profiles, the robot’s motion trajectory is parameterized by quintic polynomials, and a multi-objective loss function is constructed to jointly minimize execution time and torque variation rate. Particle Swarm Optimization (PSO) is employed to perform a global search for intermediate joint velocities and accelerations, improving convergence toward near-optimal solutions. Experiments on a six-axis industrial robotic platform demonstrate that the proposed method effectively reduces execution time and smooths torque transitions, confirming its practicality for industrial applications.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103199"},"PeriodicalIF":11.4,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145651051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A lightweight detection network integrating multi-scale semantic refinement for steel strip defects 基于多尺度语义细化的钢带缺陷轻量化检测网络
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-29 DOI: 10.1016/j.rcim.2025.103188
Baicheng Bian , Long Chen , Zongwang Han , Shiqing Wu , WeiDong Li , Hongguang Chen
Real-time surface defect detection in steel strips requires high accuracy under computational constraints. Existing methods often struggle to detect low-contrast defects in complex backgrounds while maintaining computational efficiency. This paper proposes MSR-Det, a lightweight network built on a multi-scale semantic refinement framework to address this challenge. The core innovations include: a Hierarchical Texture Enhancement Module (HTEM) that strengthens discriminative textural patterns, a Specific Cross-Level Interaction Module (SCLIM) that facilitates granular feature fusion across network depths, and a Feature Enhancement Module (FEM) that refines defect representations through deep-to-shallow semantic guidance. The architecture further incorporates two key modules: a Lightweight Large-Kernel Bottleneck Module (LLKBM) for efficient receptive field expansion, and an Adaptive Gating Spatial Pyramid Pooling (AGSPP) for dynamic multi-scale context integration. Extensive experiments on NEU-DET, GC10-DET, and our industrial SS-DET dataset demonstrate that MSR-Det achieves state-of-the-art performance of 86.6% Mean Average Precision at 50% Intersection over Union (mAP50) on SS-DET with only 5.0M parameters, also attaining a real-time speed of 180 FPS. This work provides a robust and practical solution for automated visual inspection in industrial settings, effectively balancing high precision with operational efficiency.
钢带表面缺陷实时检测在计算约束下要求精度高。现有的方法往往难以在保持计算效率的同时检测复杂背景下的低对比度缺陷。本文提出了基于多尺度语义细化框架的轻量级网络MSR-Det来解决这一挑战。核心创新包括:增强判别纹理模式的分层纹理增强模块(HTEM),促进跨网络深度颗粒特征融合的特定跨层交互模块(SCLIM),以及通过从深到浅的语义引导来细化缺陷表示的特征增强模块(FEM)。该架构进一步集成了两个关键模块:用于有效接受场扩展的轻量级大核瓶颈模块(LLKBM)和用于动态多尺度上下文集成的自适应门控空间金字塔池(AGSPP)。在nue - det、GC10-DET和我们的工业SS-DET数据集上进行的大量实验表明,MSR-Det在SS-DET上仅使用5.0M参数就能达到86.6%的平均精度(mAP50),并且实时速度达到180 FPS。这项工作为工业环境中的自动视觉检测提供了一个强大而实用的解决方案,有效地平衡了高精度和操作效率。
{"title":"A lightweight detection network integrating multi-scale semantic refinement for steel strip defects","authors":"Baicheng Bian ,&nbsp;Long Chen ,&nbsp;Zongwang Han ,&nbsp;Shiqing Wu ,&nbsp;WeiDong Li ,&nbsp;Hongguang Chen","doi":"10.1016/j.rcim.2025.103188","DOIUrl":"10.1016/j.rcim.2025.103188","url":null,"abstract":"<div><div>Real-time surface defect detection in steel strips requires high accuracy under computational constraints. Existing methods often struggle to detect low-contrast defects in complex backgrounds while maintaining computational efficiency. This paper proposes MSR-Det, a lightweight network built on a multi-scale semantic refinement framework to address this challenge. The core innovations include: a Hierarchical Texture Enhancement Module (HTEM) that strengthens discriminative textural patterns, a Specific Cross-Level Interaction Module (SCLIM) that facilitates granular feature fusion across network depths, and a Feature Enhancement Module (FEM) that refines defect representations through deep-to-shallow semantic guidance. The architecture further incorporates two key modules: a Lightweight Large-Kernel Bottleneck Module (LLKBM) for efficient receptive field expansion, and an Adaptive Gating Spatial Pyramid Pooling (AGSPP) for dynamic multi-scale context integration. Extensive experiments on NEU-DET, GC10-DET, and our industrial SS-DET dataset demonstrate that MSR-Det achieves state-of-the-art performance of 86.6% Mean Average Precision at 50% Intersection over Union (<span><math><msub><mrow><mtext>mAP</mtext></mrow><mrow><mn>50</mn></mrow></msub></math></span>) on SS-DET with only 5.0M parameters, also attaining a real-time speed of 180 FPS. This work provides a robust and practical solution for automated visual inspection in industrial settings, effectively balancing high precision with operational efficiency.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103188"},"PeriodicalIF":11.4,"publicationDate":"2025-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145614038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual-service combination optimization of manufacturing and logistics: models for self-managed and third-party logistics in cloud manufacturing 制造与物流双服务组合优化:云制造下的自营物流与第三方物流模式
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-29 DOI: 10.1016/j.rcim.2025.103178
Chunhua Tang , Shuangyao Zhao , Ting Huang , Mark Goh
Service combination (SC) is a critical technique in cloud manufacturing, enabling the integration of multiple services to deliver value-added solutions. Logistics plays a pivotal role in SC by ensuring seamless coordination across various manufacturing stages, thereby maximizing the efficiency of production flows. This implies that the SC process must integrate both manufacturing services (MSs) and logistics services (LSs) to determine the optimal combination strategy. Prior research has focused mainly on MS performance, often overlooking the critical impact of logistics on SC outcomes. Although some studies have incorporated logistics considerations, they have largely treated logistics attributes as secondary components of MS evaluations or adopted linear aggregation methods to jointly configure MSs and LSs. These approaches fail to capture the dynamic nature of logistics performance and the interdependencies between MSs and LSs. To address these gaps, this study develops two optimization models for SC that integrate both MSs and LSs, tailored for self-managed and third-party logistics modes. In particular, an innovative bi-level optimization model is introduced to capture the sequential dependencies and dynamic interactions between MSs and LSs in logistics outsourcing, ensuring seamless integration. The upper level focuses on optimizing the MS selection, while the lower level identifies the optimal LSs based on the determined MSs. Improved genetic algorithms incorporating adaptive and parallel mechanisms are developed to address the models, dynamically adjusting parameters to improve solution accuracy and efficiency. Case studies and numerical experiments validate the effectiveness of the proposed models and algorithms, offering actionable managerial insights grounded in the results.
服务组合(SC)是云制造中的一项关键技术,可以集成多个服务以提供增值解决方案。物流在供应链中发挥着关键作用,确保了各个制造阶段的无缝协调,从而最大限度地提高了生产流程的效率。这意味着供应链过程必须整合制造服务(MSs)和物流服务(LSs),以确定最佳的组合策略。先前的研究主要集中在供应链绩效上,往往忽略了物流对供应链结果的关键影响。尽管一些研究纳入了物流方面的考虑,但它们在很大程度上将物流属性作为MS评估的次要组成部分,或采用线性聚合方法共同配置MS和ls。这些方法未能捕捉到物流绩效的动态性质以及物流服务提供商和物流服务提供商之间的相互依赖关系。为了解决这些差距,本研究针对自主管理和第三方物流模式开发了两种集成了物流管理和物流服务的物流管理优化模型。特别是,引入了创新的双层优化模型,以捕获物流外包中物流服务提供商和物流服务提供商之间的顺序依赖关系和动态交互,确保无缝集成。上层侧重于优化质谱选择,下层则根据确定的质谱识别出最优的质谱。采用改进的遗传算法,结合自适应和并行机制来求解模型,动态调整参数以提高求解精度和效率。案例研究和数值实验验证了所提出的模型和算法的有效性,提供了基于结果的可操作的管理见解。
{"title":"Dual-service combination optimization of manufacturing and logistics: models for self-managed and third-party logistics in cloud manufacturing","authors":"Chunhua Tang ,&nbsp;Shuangyao Zhao ,&nbsp;Ting Huang ,&nbsp;Mark Goh","doi":"10.1016/j.rcim.2025.103178","DOIUrl":"10.1016/j.rcim.2025.103178","url":null,"abstract":"<div><div>Service combination (SC) is a critical technique in cloud manufacturing, enabling the integration of multiple services to deliver value-added solutions. Logistics plays a pivotal role in SC by ensuring seamless coordination across various manufacturing stages, thereby maximizing the efficiency of production flows. This implies that the SC process must integrate both manufacturing services (MSs) and logistics services (LSs) to determine the optimal combination strategy. Prior research has focused mainly on MS performance, often overlooking the critical impact of logistics on SC outcomes. Although some studies have incorporated logistics considerations, they have largely treated logistics attributes as secondary components of MS evaluations or adopted linear aggregation methods to jointly configure MSs and LSs. These approaches fail to capture the dynamic nature of logistics performance and the interdependencies between MSs and LSs. To address these gaps, this study develops two optimization models for SC that integrate both MSs and LSs, tailored for self-managed and third-party logistics modes. In particular, an innovative bi-level optimization model is introduced to capture the sequential dependencies and dynamic interactions between MSs and LSs in logistics outsourcing, ensuring seamless integration. The upper level focuses on optimizing the MS selection, while the lower level identifies the optimal LSs based on the determined MSs. Improved genetic algorithms incorporating adaptive and parallel mechanisms are developed to address the models, dynamically adjusting parameters to improve solution accuracy and efficiency. Case studies and numerical experiments validate the effectiveness of the proposed models and algorithms, offering actionable managerial insights grounded in the results.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103178"},"PeriodicalIF":11.4,"publicationDate":"2025-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145614037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From drawings to decisions: A hybrid vision-language framework for parsing 2D engineering drawings into structured manufacturing knowledge 从图纸到决策:用于将2D工程图纸解析为结构化制造知识的混合视觉语言框架
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-28 DOI: 10.1016/j.rcim.2025.103186
Muhammad Tayyab Khan , Lequn Chen , Zane Yong , Jun Ming Tan , Wenhe Feng , Seung Ki Moon
Efficient and accurate extraction of key information from 2D engineering drawings is essential for advancing digital manufacturing workflows. This information includes elements such as geometric dimensioning and tolerancing (GD&T), measures, material specifications, and textual annotations. Manual extraction remains slow and labor-intensive, while generic optical character recognition (OCR) models often fail to interpret 2D drawings accurately due to complex layouts, engineering symbols, and rotated annotations. These limitations result in incomplete and unreliable outputs. To address these challenges, this paper proposes a hybrid vision-language framework that integrates a rotation-aware object detection model (YOLOv11-obb) with a transformer-based vision-language parser. We introduce a structured parsing pipeline that first applies YOLOv11-obb to localize annotations and extract oriented bounding box (OBB) image patches, which are subsequently parsed into structured outputs using a fine-tuned, lightweight vision-language model (VLM). To develop and evaluate this pipeline, we curate a dataset of 1367 2D mechanical drawings manually annotated across nine key categories: GD&Ts, General Tolerances, Measures, Materials, Notes, Radii, Surface Roughness, Threads, and Title Blocks. YOLOv11-obb is trained on this dataset to detect OBBs and extract annotation patches. These image patches are then parsed using two fine-tuned open-source VLMs. The first is Donut, a transformer-based model that combines a Swin-B visual encoder with a BART text decoder, enabling end-to-end parsing directly from images without relying on OCR. The second is Florence-2, a prompt-driven encoder–decoder model that integrates a DaViT vision backbone and supports structured output generation through multimodal token alignment. Both models are lightweight and well-suited for specialized industrial tasks under limited computational overhead. Following fine-tuning of both models on the curated dataset of image patches paired with structured annotation labels, a comparative experiment is conducted to evaluate parsing performance across four key metrics. Donut outperforms Florence-2, achieving 89.2 % precision, 99.2 % recall, and a 94 % F1-score, with a hallucination rate of 10.8 %. Finally, a case study demonstrates how the extracted structured information supports downstream manufacturing tasks such as process and tool selection, showcasing the practical utility of the proposed framework in modernizing 2D drawing interpretation.
从2D工程图纸中高效准确地提取关键信息对于推进数字化制造工作流程至关重要。这些信息包括几何尺寸和公差(gdt)、测量、材料规格和文本注释等元素。人工提取仍然是缓慢和劳动密集型的,而一般的光学字符识别(OCR)模型往往不能准确地解释2D图纸,因为复杂的布局,工程符号,和旋转的注释。这些限制导致输出不完整和不可靠。为了解决这些挑战,本文提出了一种混合视觉语言框架,该框架将旋转感知对象检测模型(YOLOv11-obb)与基于转换器的视觉语言解析器集成在一起。我们引入了一个结构化解析管道,该管道首先应用YOLOv11-obb来定位注释并提取面向边界框(OBB)图像补丁,随后使用微调的轻量级视觉语言模型(VLM)将其解析为结构化输出。为了开发和评估这一管道,我们整理了一个1367张2D机械图纸的数据集,这些图纸手动标注了9个关键类别:gds、一般公差、测量、材料、注释、半径、表面粗糙度、螺纹和标题块。YOLOv11-obb在此数据集上进行训练,检测obb并提取标注补丁。然后使用两个经过微调的开源vlm解析这些图像补丁。第一个是Donut,一个基于转换器的模型,它结合了swing -b视觉编码器和BART文本解码器,可以直接从图像进行端到端解析,而不依赖于OCR。第二个是Florence-2,这是一个提示驱动的编码器-解码器模型,它集成了DaViT视觉主干,并通过多模态令牌对齐支持结构化输出生成。这两种模型都是轻量级的,非常适合计算开销有限的专门工业任务。在对两种模型在与结构化注释标签配对的图像补丁的策划数据集上进行微调之后,进行了一个比较实验,以评估四个关键指标的解析性能。Donut优于Florence-2,准确率为89.2%,召回率为99.2%,f1得分为94%,幻觉率为10.8%。最后,一个案例研究展示了提取的结构化信息如何支持下游制造任务,如工艺和工具选择,展示了所提出的框架在现代化2D绘图解释中的实际用途。
{"title":"From drawings to decisions: A hybrid vision-language framework for parsing 2D engineering drawings into structured manufacturing knowledge","authors":"Muhammad Tayyab Khan ,&nbsp;Lequn Chen ,&nbsp;Zane Yong ,&nbsp;Jun Ming Tan ,&nbsp;Wenhe Feng ,&nbsp;Seung Ki Moon","doi":"10.1016/j.rcim.2025.103186","DOIUrl":"10.1016/j.rcim.2025.103186","url":null,"abstract":"<div><div>Efficient and accurate extraction of key information from 2D engineering drawings is essential for advancing digital manufacturing workflows. This information includes elements such as geometric dimensioning and tolerancing (GD&amp;T), measures, material specifications, and textual annotations. Manual extraction remains slow and labor-intensive, while generic optical character recognition (OCR) models often fail to interpret 2D drawings accurately due to complex layouts, engineering symbols, and rotated annotations. These limitations result in incomplete and unreliable outputs. To address these challenges, this paper proposes a hybrid vision-language framework that integrates a rotation-aware object detection model (YOLOv11-obb) with a transformer-based vision-language parser. We introduce a structured parsing pipeline that first applies YOLOv11-obb to localize annotations and extract oriented bounding box (OBB) image patches, which are subsequently parsed into structured outputs using a fine-tuned, lightweight vision-language model (VLM). To develop and evaluate this pipeline, we curate a dataset of 1367 2D mechanical drawings manually annotated across nine key categories: GD&amp;Ts, General Tolerances, Measures, Materials, Notes, Radii, Surface Roughness, Threads, and Title Blocks. YOLOv11-obb is trained on this dataset to detect OBBs and extract annotation patches. These image patches are then parsed using two fine-tuned open-source VLMs. The first is Donut, a transformer-based model that combines a Swin-B visual encoder with a BART text decoder, enabling end-to-end parsing directly from images without relying on OCR. The second is Florence-2, a prompt-driven encoder–decoder model that integrates a DaViT vision backbone and supports structured output generation through multimodal token alignment. Both models are lightweight and well-suited for specialized industrial tasks under limited computational overhead. Following fine-tuning of both models on the curated dataset of image patches paired with structured annotation labels, a comparative experiment is conducted to evaluate parsing performance across four key metrics. Donut outperforms Florence-2, achieving 89.2 % precision, 99.2 % recall, and a 94 % F1-score, with a hallucination rate of 10.8 %. Finally, a case study demonstrates how the extracted structured information supports downstream manufacturing tasks such as process and tool selection, showcasing the practical utility of the proposed framework in modernizing 2D drawing interpretation.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103186"},"PeriodicalIF":11.4,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145614041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Pivot-Move Strategy for Dual-Robot Manipulator Additive Manufacturing: Enabling Collision Avoidance without Halting Deposition 双机器人机械臂增材制造的一种新型支点移动策略:避免碰撞而不停止沉积
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-27 DOI: 10.1016/j.rcim.2025.103177
C.L. Li , Y.C. Jiao , K. Ren , N. Liu , Y.F. Zhang
Robot-assisted additive manufacturing (AM) has been gaining increasing popularity due to its great flexibility and reachability. Moreover, an AM system with dual deposition-heads held by robot manipulators would significantly shorten the building time, especially for large-scale parts. However, motion planning (MP) for the dual robot manipulators AM is highly challengeable due to various constraints imposed by the setup and the AM process aiming to improve the qualities of the component, e.g., maintaining travelling speed and posture of the deposition head and avoiding collision. In this paper, a novel pivot-move strategy is proposed for MP in AM with dual robot manipulators. Given the sequenced deposition toolpath segments to each deposition head, an initial MP solution including robot configuration at each time sample waypoint is firstly generated for each robot manipulator, respectively. This is followed by conducting a check-and-correct process at each waypoint, where the collision among the links of two robot manipulators is identified and corrected. Specially, the robot manipulator is designed to simultaneously pivot and move to avoid the collision while maintaining the traveling speed unchanged. Numerical simulation, physical implementation, and benchmarking were conducted to exhibit a 78.295% deposition time reduction and high-quality deposition with the developed strategy. To the best of the authors' knowledge, this study represents the pioneering effort in addressing the collision issue in dual robot manipulators depositing on the same heated bed, achieving collision avoidance without interrupting the ongoing deposition process. It can be a valuable supplement to the state of the art in this area.
机器人辅助增材制造(AM)由于其巨大的灵活性和可达性而越来越受欢迎。此外,一个具有双沉积头的增材制造系统,由机器人机械手握住,将大大缩短建造时间,特别是对于大型零件。然而,由于设置和增材制造过程施加的各种限制,双机器人机械手增材制造的运动规划(MP)具有很高的挑战性,这些限制旨在提高组件的质量,例如,保持沉积头的移动速度和姿态,并避免碰撞。本文提出了一种新的双机械手增材制造中MP的支点移动策略。给定每个沉积头的顺序沉积刀具路径段,首先为每个机器人机械手分别生成包含每个时间样本路径点的机器人配置的初始MP解。随后,在每个航路点进行检查和纠正过程,在此过程中,两个机器人操纵器之间的链接之间的碰撞被识别和纠正。特别地,机器人机械手被设计为在保持行进速度不变的情况下同时转动和移动以避免碰撞。数值模拟、物理实现和基准测试表明,采用所开发的策略,沉积时间缩短78.295%,沉积质量高。据作者所知,这项研究代表了解决在同一加热床上沉积的双机器人操纵器碰撞问题的开创性努力,在不中断正在进行的沉积过程的情况下实现了碰撞避免。它可以成为这一领域最新技术的有价值的补充。
{"title":"A Novel Pivot-Move Strategy for Dual-Robot Manipulator Additive Manufacturing: Enabling Collision Avoidance without Halting Deposition","authors":"C.L. Li ,&nbsp;Y.C. Jiao ,&nbsp;K. Ren ,&nbsp;N. Liu ,&nbsp;Y.F. Zhang","doi":"10.1016/j.rcim.2025.103177","DOIUrl":"10.1016/j.rcim.2025.103177","url":null,"abstract":"<div><div>Robot-assisted additive manufacturing (AM) has been gaining increasing popularity due to its great flexibility and reachability. Moreover, an AM system with dual deposition-heads held by robot manipulators would significantly shorten the building time, especially for large-scale parts. However, motion planning (MP) for the dual robot manipulators AM is highly challengeable due to various constraints imposed by the setup and the AM process aiming to improve the qualities of the component, e.g., maintaining travelling speed and posture of the deposition head and avoiding collision. In this paper, a novel pivot-move strategy is proposed for MP in AM with dual robot manipulators. Given the sequenced deposition toolpath segments to each deposition head, an initial MP solution including robot configuration at each time sample waypoint is firstly generated for each robot manipulator, respectively. This is followed by conducting a check-and-correct process at each waypoint, where the collision among the links of two robot manipulators is identified and corrected. Specially, the robot manipulator is designed to simultaneously pivot and move to avoid the collision while maintaining the traveling speed unchanged. Numerical simulation, physical implementation, and benchmarking were conducted to exhibit a 78.295% deposition time reduction and high-quality deposition with the developed strategy. To the best of the authors' knowledge, this study represents the pioneering effort in addressing the collision issue in dual robot manipulators depositing on the same heated bed, achieving collision avoidance without interrupting the ongoing deposition process. It can be a valuable supplement to the state of the art in this area.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103177"},"PeriodicalIF":11.4,"publicationDate":"2025-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145611798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A robotic framework for high-throughput and multi-view 3D digital image correlation (3D-DIC): Increasing measurement volume and versatility for deformation analysis 用于高通量和多视图3D数字图像相关(3D- dic)的机器人框架:增加变形分析的测量量和多功能性
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-26 DOI: 10.1016/j.rcim.2025.103187
Özgüç Bertuğ Çapunaman , Alale Mohseni , Dennis Dombrovskij , Kaiyang Yin , Benay Gürsoy , Max David Mylo
Three-dimensional digital image correlation (3D-DIC) is a widely applicable, non-contact optical imaging technique for accurately quantifying full-field surface displacements and strains in materials and structures. However, conventional 3D-DIC implementations relying on fixed stereo camera positions face trade-offs between the field-of-view and spatial resolution and lack high-throughput for long-duration measurements. Here we present an integrated robotic 3D-DIC framework that employs an industrial robotic arm to autonomously and repeatedly reposition stereo cameras. This enables automated calibration, monitoring of multiple samples over extended periods, and expansion of the effective spatial coverage and data throughput, all while maintaining calibration stability and measurement fidelity. We validate this approach on rigid and deforming reference samples and demonstrate its ability to quantify material deformation of bio-composite samples simultaneously during the drying process. Under robotic repositioning, rigid samples exhibit stable displacement and strain measurements while benefiting from significantly increased volumetric coverage and reduced manual oversight. Thus, the proposed system improves experimental efficiency and allows for the incorporation of advanced techniques, such as multi-view stitching, to characterize complex geometries with higher effective resolution. When applied to slowly deforming bio-composites, the system can capture time-lapse images from multiple viewpoints, providing a more comprehensive assessment of complex, evolving material behaviors. These enhancements in 3D-DIC further improve geometric accuracy, increase data density, and expand applicability to a broader range of materials and experimental conditions. Ultimately, the proposed robot-assisted 3D-DIC system creates a robust, high-throughput monitoring framework for bio-fabrication, additive manufacturing, and advanced composite processing, paving the way for targeted programming of shape changes, among other applications.
三维数字图像相关(3D-DIC)是一种应用广泛的非接触式光学成像技术,用于精确量化材料和结构的全场表面位移和应变。然而,传统的3D-DIC实现依赖于固定的立体摄像机位置,面临着视场和空间分辨率之间的权衡,并且缺乏长时间测量的高通量。在这里,我们提出了一个集成的机器人3D-DIC框架,它采用工业机械臂来自主地反复重新定位立体摄像机。这可以实现自动校准,长时间监测多个样品,扩大有效的空间覆盖和数据吞吐量,同时保持校准稳定性和测量保真度。我们在刚性和变形参考样品上验证了这种方法,并证明了它在干燥过程中同时量化生物复合材料样品的材料变形的能力。在机器人重新定位下,刚性样品表现出稳定的位移和应变测量,同时受益于显著增加的体积覆盖和减少人工监督。因此,所提出的系统提高了实验效率,并允许结合先进的技术,如多视图拼接,以更高的有效分辨率表征复杂的几何形状。当应用于缓慢变形的生物复合材料时,该系统可以从多个视点捕获延时图像,从而对复杂的、不断变化的材料行为提供更全面的评估。3D-DIC的这些增强功能进一步提高了几何精度,增加了数据密度,并扩展了对更广泛的材料和实验条件的适用性。最终,提出的机器人辅助3D-DIC系统为生物制造、增材制造和先进复合材料加工创造了一个强大的、高通量的监测框架,为有针对性的形状变化编程铺平了道路,以及其他应用。
{"title":"A robotic framework for high-throughput and multi-view 3D digital image correlation (3D-DIC): Increasing measurement volume and versatility for deformation analysis","authors":"Özgüç Bertuğ Çapunaman ,&nbsp;Alale Mohseni ,&nbsp;Dennis Dombrovskij ,&nbsp;Kaiyang Yin ,&nbsp;Benay Gürsoy ,&nbsp;Max David Mylo","doi":"10.1016/j.rcim.2025.103187","DOIUrl":"10.1016/j.rcim.2025.103187","url":null,"abstract":"<div><div>Three-dimensional digital image correlation (3D-DIC) is a widely applicable, non-contact optical imaging technique for accurately quantifying full-field surface displacements and strains in materials and structures. However, conventional 3D-DIC implementations relying on fixed stereo camera positions face trade-offs between the field-of-view and spatial resolution and lack high-throughput for long-duration measurements. Here we present an integrated robotic 3D-DIC framework that employs an industrial robotic arm to autonomously and repeatedly reposition stereo cameras. This enables automated calibration, monitoring of multiple samples over extended periods, and expansion of the effective spatial coverage and data throughput, all while maintaining calibration stability and measurement fidelity. We validate this approach on rigid and deforming reference samples and demonstrate its ability to quantify material deformation of bio-composite samples simultaneously during the drying process. Under robotic repositioning, rigid samples exhibit stable displacement and strain measurements while benefiting from significantly increased volumetric coverage and reduced manual oversight. Thus, the proposed system improves experimental efficiency and allows for the incorporation of advanced techniques, such as multi-view stitching, to characterize complex geometries with higher effective resolution. When applied to slowly deforming bio-composites, the system can capture time-lapse images from multiple viewpoints, providing a more comprehensive assessment of complex, evolving material behaviors. These enhancements in 3D-DIC further improve geometric accuracy, increase data density, and expand applicability to a broader range of materials and experimental conditions. Ultimately, the proposed robot-assisted 3D-DIC system creates a robust, high-throughput monitoring framework for bio-fabrication, additive manufacturing, and advanced composite processing, paving the way for targeted programming of shape changes, among other applications.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"99 ","pages":"Article 103187"},"PeriodicalIF":11.4,"publicationDate":"2025-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145595008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A lightweight object detection approach for precision gripping in multiple peg-in-hole assembly tasks 一种用于多孔钉装配任务中精确夹持的轻量目标检测方法
IF 11.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-21 DOI: 10.1016/j.rcim.2025.103185
Jianjun Jiao , Zonggang Li , Guangqing Xia , Guoping Wang , Yinjuan Chen , Ruibing Gao
Automating product assembly using manipulators in manufacturing remains challenging. This is mainly because detection and gripping prior to component assembly still depend heavily on manual operations and traditional teaching methods, resulting in a low overall level of automation. The primary difficulty in detection and gripping arises from the precise recognition of rotation angles and the complex demands for accuracy, real-time performance, and stability. This paper presents an improved lightweight model, IDPC-YOLOv8, for multiple peg-in-hole workpiece detection and gripping to address these challenges. The proposed approach integrates adaptive image preprocessing to enhance visual clarity under varying lighting conditions and employs an efficient network architecture that jointly exploits global and local features to improve detection precision and computational efficiency. In addition, a rotation-aware detection strategy is introduced to enable accurate prediction of object orientation. Moreover, a network optimization scheme further reduces model parameters, making the system suitable for real-time deployment. Experimental results reveal that the IDPC-YOLOv8 model achieves an accuracy of 97.8% and a detection speed of 126.59 FPS, representing improvements of 4% and 8.3%, respectively, over the original YOLOv8-OBB model. Compared to several state-of-the-art rotation detection models, IDPC-YOLOv8 demonstrates superior integration and generalization capabilities. The effectiveness of the proposed method is further validated through excellent gripping success rates achieved in real-world experiments using the AUBO-i5 manipulator.
在制造中使用机械手自动化产品装配仍然具有挑战性。这主要是因为组件组装前的检测和抓取仍然严重依赖人工操作和传统的教学方法,导致整体自动化水平较低。检测和抓握的主要困难来自旋转角度的精确识别以及对精度、实时性和稳定性的复杂要求。本文提出了一种改进的轻量级模型IDPC-YOLOv8,用于多个孔内钉工件检测和夹持,以解决这些挑战。该方法集成了自适应图像预处理以增强不同光照条件下的视觉清晰度,并采用高效的网络架构,共同利用全局和局部特征来提高检测精度和计算效率。此外,还引入了一种旋转感知检测策略,以实现对目标方向的准确预测。此外,网络优化方案进一步减少了模型参数,使系统适合实时部署。实验结果表明,IDPC-YOLOv8模型的准确率为97.8%,检测速度为126.59 FPS,比原YOLOv8-OBB模型分别提高了4%和8.3%。与几种最先进的旋转检测模型相比,IDPC-YOLOv8展示了卓越的集成和泛化能力。通过AUBO-i5机械手在实际实验中取得的优异抓取成功率,进一步验证了所提出方法的有效性。
{"title":"A lightweight object detection approach for precision gripping in multiple peg-in-hole assembly tasks","authors":"Jianjun Jiao ,&nbsp;Zonggang Li ,&nbsp;Guangqing Xia ,&nbsp;Guoping Wang ,&nbsp;Yinjuan Chen ,&nbsp;Ruibing Gao","doi":"10.1016/j.rcim.2025.103185","DOIUrl":"10.1016/j.rcim.2025.103185","url":null,"abstract":"<div><div>Automating product assembly using manipulators in manufacturing remains challenging. This is mainly because detection and gripping prior to component assembly still depend heavily on manual operations and traditional teaching methods, resulting in a low overall level of automation. The primary difficulty in detection and gripping arises from the precise recognition of rotation angles and the complex demands for accuracy, real-time performance, and stability. This paper presents an improved lightweight model, IDPC-YOLOv8, for multiple peg-in-hole workpiece detection and gripping to address these challenges. The proposed approach integrates adaptive image preprocessing to enhance visual clarity under varying lighting conditions and employs an efficient network architecture that jointly exploits global and local features to improve detection precision and computational efficiency. In addition, a rotation-aware detection strategy is introduced to enable accurate prediction of object orientation. Moreover, a network optimization scheme further reduces model parameters, making the system suitable for real-time deployment. Experimental results reveal that the IDPC-YOLOv8 model achieves an accuracy of 97.8% and a detection speed of 126.59 FPS, representing improvements of 4% and 8.3%, respectively, over the original YOLOv8-OBB model. Compared to several state-of-the-art rotation detection models, IDPC-YOLOv8 demonstrates superior integration and generalization capabilities. The effectiveness of the proposed method is further validated through excellent gripping success rates achieved in real-world experiments using the AUBO-i5 manipulator.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"98 ","pages":"Article 103185"},"PeriodicalIF":11.4,"publicationDate":"2025-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145567450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Robotics and Computer-integrated Manufacturing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1