Robotics and Computer-integrated Manufacturing最新文献_第6页

Semantic map construction approach for human-robot collaborative manufacturing 人机协作制造的语义图构建方法

IF 9.1 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Robotics and Computer-integrated Manufacturing

Pub Date : 2024-08-02 DOI: 10.1016/j.rcim.2024.102845

Chen Zheng , Yuyang Du , Jinhua Xiao , Tengfei Sun , Zhanxi Wang , Benoît Eynard , Yicha Zhang

Map construction is the initial step of mobile robots for their localization, navigation, and path planning in unknown environments. Considering the human-robot collaboration (HRC) scenarios in modern manufacturing, where the human workers’ capabilities are closely integrated with the efficiency and precision of robots in the same workspace, a map integrating geometric and semantic information is considered as the technical foundation for intelligent interactions between human workers and robots, such as motion planning, reasoning, and context-aware decision-making. Although different map construction methods have been proposed for mobile robots’ perception in the working environment, it is still a challenging task when applied to such human-robot collaborative manufacturing scenarios to achieve the afore-mentioned intelligent interactions between human workers and robots due to the poor integration of semantic information in the constructed map. On the one hand, due to the lack of ability for differentiating the dynamic objects, the mobile robot might sometimes wrongly use the dynamic objects as the spatial references to calculate the pose transformation between the two successive frames, which negatively affects the accuracy of the robot's localization and pose estimation. On the other hand, the map that integrates both the geometric and semantic information can hardly be constructed in real-time, which cannot provide an effective support for the real-time reasoning and decision making during the human-robot collaboration process.

This study proposes a novel map construction approach containing semantic information generation, geometric information generation, and semantic & geometric information fusion modules, which enables the integration of the semantic and geometric information in the constructed map. First, the semantic information generation module analyzes the captured images of the dynamic working environment, eliminates the features of dynamic objects, and generates the semantic information of the static objects. Meanwhile, the geometric information generation module is adopted to generate the accurate geometric information of the robot's motion plane by using the environment data. Finally, a map integrating semantic and geometric information in real-time can be constructed by the semantic & geometric fusion module. The experimental results demonstrate the effectiveness of the proposed semantic map construction approach.

地图构建是移动机器人在未知环境中进行定位、导航和路径规划的第一步。考虑到现代制造业中的人机协作（HRC）场景，即在同一工作空间中，人类工人的能力与机器人的效率和精度紧密结合，集成了几何和语义信息的地图被认为是人类工人与机器人进行智能交互（如运动规划、推理和情境感知决策）的技术基础。尽管针对移动机器人在工作环境中的感知提出了不同的地图构建方法，但由于构建的地图中语义信息集成度较低，因此应用于此类人机协同制造场景以实现上述人机之间的智能交互仍是一项具有挑战性的任务。一方面，由于缺乏对动态物体的区分能力，移动机器人有时可能会错误地将动态物体作为空间参照物来计算连续两帧之间的姿态变换，从而对机器人定位和姿态估计的准确性造成负面影响。另一方面，集成了几何信息和语义信息的地图难以实时构建，无法为人机协作过程中的实时推理和决策提供有效支持。

{"title":"Semantic map construction approach for human-robot collaborative manufacturing","authors":"Chen Zheng , Yuyang Du , Jinhua Xiao , Tengfei Sun , Zhanxi Wang , Benoît Eynard , Yicha Zhang","doi":"10.1016/j.rcim.2024.102845","DOIUrl":"10.1016/j.rcim.2024.102845","url":null,"abstract":"<div><p>Map construction is the initial step of mobile robots for their localization, navigation, and path planning in unknown environments. Considering the human-robot collaboration (HRC) scenarios in modern manufacturing, where the human workers’ capabilities are closely integrated with the efficiency and precision of robots in the same workspace, a map integrating geometric and semantic information is considered as the technical foundation for intelligent interactions between human workers and robots, such as motion planning, reasoning, and context-aware decision-making. Although different map construction methods have been proposed for mobile robots’ perception in the working environment, it is still a challenging task when applied to such human-robot collaborative manufacturing scenarios to achieve the afore-mentioned intelligent interactions between human workers and robots due to the poor integration of semantic information in the constructed map. On the one hand, due to the lack of ability for differentiating the dynamic objects, the mobile robot might sometimes wrongly use the dynamic objects as the spatial references to calculate the pose transformation between the two successive frames, which negatively affects the accuracy of the robot's localization and pose estimation. On the other hand, the map that integrates both the geometric and semantic information can hardly be constructed in real-time, which cannot provide an effective support for the real-time reasoning and decision making during the human-robot collaboration process.</p><p>This study proposes a novel map construction approach containing semantic information generation, geometric information generation, and semantic & geometric information fusion modules, which enables the integration of the semantic and geometric information in the constructed map. First, the semantic information generation module analyzes the captured images of the dynamic working environment, eliminates the features of dynamic objects, and generates the semantic information of the static objects. Meanwhile, the geometric information generation module is adopted to generate the accurate geometric information of the robot's motion plane by using the environment data. Finally, a map integrating semantic and geometric information in real-time can be constructed by the semantic & geometric fusion module. The experimental results demonstrate the effectiveness of the proposed semantic map construction approach.</p></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"91 ","pages":"Article 102845"},"PeriodicalIF":9.1,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0736584524001327/pdfft?md5=501e9b621a0fd7fba981460b0f82d5b7&pid=1-s2.0-S0736584524001327-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141891967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

IF 9.1 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Robotics and Computer-integrated Manufacturing

Pub Date : 2024-07-31 DOI: 10.1016/j.rcim.2024.102842

Jun-Wan Yun, Minwoo Na, Yuhyeon Hwang, Jae-Bok Song

In practice, the process of robot assembly in an unstructured environment faces difficulties due to the presence of unpredictable environmental errors related to vision and pose. Therefore, to minimize the uncertain environmental errors during the robotic assembly process in an unstructured environment, several studies have considered a reinforcement learning (RL)-based approach. However, if assembly parts are changed, it becomes difficult to apply RL-based methods to assemble various parts because additional learning may be required. Especially in the case of connector assembly, fine-tuning is essential because the shape changes depending on the type of connector. In this study, we propose a similar assembly state discriminator that transforms the state information (force, velocity, and RGB image) of reinforcement learning into generalized features to respond various types of connector assembly tasks. This method processes the data to include essential features for assembly regardless of connector type. By learning the RL model with the processed data using this method, the RL model trained for a specific connector can be efficiently applied to other types of connectors without fine-tuning. The assembly success rate for the 7 types of connectors (Harting, HDMI, USB, power, air jack, banana plug and PCIE) using the proposed method was demonstrated to be over 96 %.

在实践中，由于存在与视觉和姿态相关的不可预测的环境误差，在非结构化环境中进行机器人装配过程会遇到很多困难。因此，为了尽量减少非结构化环境中机器人装配过程中的不确定环境误差，一些研究考虑了基于强化学习（RL）的方法。但是，如果装配部件发生变化，就很难应用基于 RL 的方法来装配各种部件，因为可能需要额外的学习。特别是在连接器装配的情况下，由于形状会根据连接器的类型发生变化，因此微调是必不可少的。在本研究中，我们提出了一种类似的装配状态判别器，它能将强化学习的状态信息（力、速度和 RGB 图像）转化为通用特征，以应对各种类型的连接器装配任务。这种方法处理数据时，无论连接器类型如何，都会包含装配的基本特征。通过使用这种方法利用处理过的数据学习 RL 模型，为特定连接器训练的 RL 模型可以有效地应用于其他类型的连接器，而无需进行微调。使用所提出的方法，7 种连接器（Harting、HDMI、USB、电源、空气插孔、香蕉插头和 PCIE）的装配成功率超过 96%。

{"title":"Similar assembly state discriminator for reinforcement learning-based robotic connector assembly","authors":"Jun-Wan Yun, Minwoo Na, Yuhyeon Hwang, Jae-Bok Song","doi":"10.1016/j.rcim.2024.102842","DOIUrl":"10.1016/j.rcim.2024.102842","url":null,"abstract":"<div><p>In practice, the process of robot assembly in an unstructured environment faces difficulties due to the presence of unpredictable environmental errors related to vision and pose. Therefore, to minimize the uncertain environmental errors during the robotic assembly process in an unstructured environment, several studies have considered a reinforcement learning (RL)-based approach. However, if assembly parts are changed, it becomes difficult to apply RL-based methods to assemble various parts because additional learning may be required. Especially in the case of connector assembly, fine-tuning is essential because the shape changes depending on the type of connector. In this study, we propose a similar assembly state discriminator that transforms the state information (force, velocity, and RGB image) of reinforcement learning into generalized features to respond various types of connector assembly tasks. This method processes the data to include essential features for assembly regardless of connector type. By learning the RL model with the processed data using this method, the RL model trained for a specific connector can be efficiently applied to other types of connectors without fine-tuning. The assembly success rate for the 7 types of connectors (Harting, HDMI, USB, power, air jack, banana plug and PCIE) using the proposed method was demonstrated to be over 96 %.</p></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"91 ","pages":"Article 102842"},"PeriodicalIF":9.1,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141892021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Out-of-order execution enabled deep reinforcement learning for dynamic additive manufacturing scheduling 针对动态增材制造排程的超序执行深度强化学习

IF 9.1 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Robotics and Computer-integrated Manufacturing

Pub Date : 2024-07-31 DOI: 10.1016/j.rcim.2024.102841

Mingyue Sun , Jiyuchen Ding , Zhiheng Zhao , Jian Chen , George Q. Huang , Lihui Wang

Additive Manufacturing (AM) has revolutionized the production landscape by enabling on-demand customized manufacturing. However, the efficient management of dynamic AM orders poses significant challenges for production planning and scheduling. This paper addresses the dynamic scheduling problem considering batch processing, random order arrival and machine eligibility constraints, aiming to minimize total tardiness in a parallel non-identical AM machine environment. To tackle this problem, we propose the out-of-order enabled dueling deep Q network (O3-DDQN) approach. In the proposed approach, the problem is formulated as a Markov decision process (MDP). Three-dimensional features, encompassing dynamic orders, AM machines, and delays, are extracted using a ‘look around’ method to represent the production status at a rescheduling point. Additionally, five novel composite scheduling rules based on the out-of-order principle are introduced for selection when an AM machine completes processing or a new order arrives. Moreover, we design a reward function that is strongly correlated with the objective to evaluate the agent’s chosen action. Experimental results demonstrate the superiority of the O3-DDQN approach over single scheduling rules, randomly selected rules, and the classic DQN method. The average improvement rate of performance reaches 13.09% compared to composite scheduling rules and random rules. Additionally, the O3-DDQN outperforms the classic DQN agent with a 6.54% improvement rate. The O3-DDQN algorithm improves scheduling in dynamic AM environments, enhancing productivity and on-time delivery. This research contributes to advancing AM production and offers insights into efficient resource allocation.

快速成型制造（AM）实现了按需定制生产，从而彻底改变了生产格局。然而，如何有效管理动态 AM 订单给生产计划和调度带来了巨大挑战。本文探讨了动态调度问题，考虑了批量处理、随机订单到达和机器资格约束，旨在最大限度地减少并行非相同 AM 机器环境中的总迟到时间。为解决这一问题，我们提出了失序启用决斗深 Q 网络（O3-DDQN）方法。在所提出的方法中，问题被表述为马尔可夫决策过程（MDP）。使用 "环顾 "方法提取了包括动态订单、AM 机器和延迟在内的三维特征，以表示重新安排点的生产状态。此外，我们还引入了五种基于失序原则的新型复合调度规则，用于在 AM 机器完成加工或新订单到来时进行选择。此外，我们还设计了一个与目标密切相关的奖励函数，用于评估代理选择的行动。实验结果表明，O3-DDQN 方法优于单一调度规则、随机选择规则和经典 DQN 方法。与复合调度规则和随机规则相比，性能平均提高了 13.09%。此外，O3-DDQN 的改进率为 6.54%，优于经典 DQN 代理。O3-DDQN 算法改善了动态 AM 环境中的调度，提高了生产率和准时交货率。这项研究有助于推动 AM 生产，并为高效资源分配提供了见解。

{"title":"Out-of-order execution enabled deep reinforcement learning for dynamic additive manufacturing scheduling","authors":"Mingyue Sun , Jiyuchen Ding , Zhiheng Zhao , Jian Chen , George Q. Huang , Lihui Wang","doi":"10.1016/j.rcim.2024.102841","DOIUrl":"10.1016/j.rcim.2024.102841","url":null,"abstract":"<div><p>Additive Manufacturing (AM) has revolutionized the production landscape by enabling on-demand customized manufacturing. However, the efficient management of dynamic AM orders poses significant challenges for production planning and scheduling. This paper addresses the dynamic scheduling problem considering batch processing, random order arrival and machine eligibility constraints, aiming to minimize total tardiness in a parallel non-identical AM machine environment. To tackle this problem, we propose the out-of-order enabled dueling deep Q network (O3-DDQN) approach. In the proposed approach, the problem is formulated as a Markov decision process (MDP). Three-dimensional features, encompassing dynamic orders, AM machines, and delays, are extracted using a ‘look around’ method to represent the production status at a rescheduling point. Additionally, five novel composite scheduling rules based on the out-of-order principle are introduced for selection when an AM machine completes processing or a new order arrives. Moreover, we design a reward function that is strongly correlated with the objective to evaluate the agent’s chosen action. Experimental results demonstrate the superiority of the O3-DDQN approach over single scheduling rules, randomly selected rules, and the classic DQN method. The average improvement rate of performance reaches 13.09% compared to composite scheduling rules and random rules. Additionally, the O3-DDQN outperforms the classic DQN agent with a 6.54% improvement rate. The O3-DDQN algorithm improves scheduling in dynamic AM environments, enhancing productivity and on-time delivery. This research contributes to advancing AM production and offers insights into efficient resource allocation.</p></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"91 ","pages":"Article 102841"},"PeriodicalIF":9.1,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141891968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Time-series classification in smart manufacturing systems: An experimental evaluation of state-of-the-art machine learning algorithms 智能制造系统中的时间序列分类：最先进机器学习算法的实验评估

IF 9.1 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Robotics and Computer-integrated Manufacturing

Pub Date : 2024-07-30 DOI: 10.1016/j.rcim.2024.102839

Mojtaba A. Farahani , M.R. McCormick , Ramy Harik , Thorsten Wuest

Manufacturing is transformed towards smart manufacturing, entering a new data-driven era fueled by digital technologies. The resulting Smart Manufacturing Systems (SMS) gather extensive amounts of diverse data, thanks to the growing number of sensors and rapid advances in sensing technologies. Among the various data types available in SMS settings, time-series data plays a pivotal role. Hence, Time-Series Classification (TSC) emerges as a crucial task in this domain. Over the past decade, researchers have introduced numerous methods for TSC, necessitating not only algorithmic development and analysis but also validation and empirical comparison. This dual approach holds substantial value for practitioners by streamlining choices and revealing insights into models’ strengths and weaknesses. The objective of this study is to fill this gap by providing a rigorous experimental evaluation of the state-of-the-art Machine Learning (ML) and Deep Learning (DL) algorithms for TSC tasks in manufacturing and industrial settings. We first explored and compiled a comprehensive list of more than 92 state-of-the-art algorithms from both TSC and manufacturing literature. Following this, we methodologically selected the 36 most representative algorithms from this list. To evaluate their performance across various manufacturing classification tasks, we curated a set of 22 manufacturing datasets, representative of different characteristics that cover diverse manufacturing problems. Subsequently, we implemented and evaluated the algorithms on the manufacturing benchmark datasets, and analyzed the results for each dataset. Based on the results, ResNet, DrCIF, InceptionTime, and ARSENAL emerged as the top-performing algorithms, boasting an average accuracy of over 96.6 % across all 22 manufacturing TSC datasets. These findings underscore the robustness, efficiency, scalability, and effectiveness of convolutional kernels in capturing temporal features in time-series data collected from manufacturing systems for TSC tasks, as three out of the top four performing algorithms leverage these kernels for feature extraction. Additionally, LSTM, BiLSTM, and TS-LSTM algorithms deserve recognition for their effectiveness in capturing features within manufacturing time-series data using RNN-based structures.

制造业正在向智能制造转型，进入一个由数字技术推动的数据驱动的新时代。由于传感器数量的不断增加和传感技术的飞速发展，智能制造系统（SMS）收集了大量不同的数据。在 SMS 环境中的各种数据类型中，时间序列数据起着举足轻重的作用。因此，时间序列分类（TSC）成为该领域的一项重要任务。在过去的十年中，研究人员推出了许多 TSC 方法，不仅需要算法开发和分析，还需要验证和经验比较。这种双重方法可简化选择并揭示模型的优缺点，对从业人员具有重要价值。本研究旨在填补这一空白，针对制造业和工业环境中的 TSC 任务，对最先进的机器学习（ML）和深度学习（DL）算法进行严格的实验评估。我们首先从 TSC 和制造业文献中探索并汇编了一份超过 92 种最先进算法的综合列表。随后，我们从该列表中筛选出 36 种最具代表性的算法。为了评估这些算法在各种制造分类任务中的性能，我们策划了一组 22 个制造数据集，这些数据集代表了涵盖各种制造问题的不同特征。随后，我们在制造业基准数据集上实施和评估了这些算法，并分析了每个数据集的结果。根据结果，ResNet、DrCIF、InceptionTime 和 ARSENAL 成为表现最佳的算法，在所有 22 个制造业 TSC 数据集上的平均准确率超过 96.6%。这些发现凸显了卷积内核在捕捉从制造系统收集的时间序列数据中的时间特征以执行 TSC 任务方面的稳健性、高效性、可扩展性和有效性，因为在表现最好的四种算法中，有三种都利用了这些内核进行特征提取。此外，LSTM、BiLSTM 和 TS-LSTM 算法在使用基于 RNN 的结构捕捉制造时间序列数据中的特征方面的有效性也值得肯定。

{"title":"Time-series classification in smart manufacturing systems: An experimental evaluation of state-of-the-art machine learning algorithms","authors":"Mojtaba A. Farahani , M.R. McCormick , Ramy Harik , Thorsten Wuest","doi":"10.1016/j.rcim.2024.102839","DOIUrl":"10.1016/j.rcim.2024.102839","url":null,"abstract":"<div><p>Manufacturing is transformed towards smart manufacturing, entering a new data-driven era fueled by digital technologies. The resulting Smart Manufacturing Systems (SMS) gather extensive amounts of diverse data, thanks to the growing number of sensors and rapid advances in sensing technologies. Among the various data types available in SMS settings, time-series data plays a pivotal role. Hence, Time-Series Classification (TSC) emerges as a crucial task in this domain. Over the past decade, researchers have introduced numerous methods for TSC, necessitating not only algorithmic development and analysis but also validation and empirical comparison. This dual approach holds substantial value for practitioners by streamlining choices and revealing insights into models’ strengths and weaknesses. The objective of this study is to fill this gap by providing a rigorous experimental evaluation of the state-of-the-art Machine Learning (ML) and Deep Learning (DL) algorithms for TSC tasks in manufacturing and industrial settings. We first explored and compiled a comprehensive list of more than 92 state-of-the-art algorithms from both TSC and manufacturing literature. Following this, we methodologically selected the 36 most representative algorithms from this list. To evaluate their performance across various manufacturing classification tasks, we curated a set of 22 manufacturing datasets, representative of different characteristics that cover diverse manufacturing problems. Subsequently, we implemented and evaluated the algorithms on the manufacturing benchmark datasets, and analyzed the results for each dataset. Based on the results, ResNet, DrCIF, InceptionTime, and ARSENAL emerged as the top-performing algorithms, boasting an average accuracy of over 96.6 % across all 22 manufacturing TSC datasets. These findings underscore the robustness, efficiency, scalability, and effectiveness of convolutional kernels in capturing temporal features in time-series data collected from manufacturing systems for TSC tasks, as three out of the top four performing algorithms leverage these kernels for feature extraction. Additionally, LSTM, BiLSTM, and TS-LSTM algorithms deserve recognition for their effectiveness in capturing features within manufacturing time-series data using RNN-based structures.</p></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"91 ","pages":"Article 102839"},"PeriodicalIF":9.1,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141891969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Intelligent seam tracking in foils joining based on spatial–temporal deep learning from molten pool serial images 基于熔池序列图像的时空深度学习的箔接缝智能跟踪技术

IF 9.1 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Robotics and Computer-integrated Manufacturing

Pub Date : 2024-07-29 DOI: 10.1016/j.rcim.2024.102840

Yuxiang Hong , Yuxuan Jiang , Mingxuan Yang , Baohua Chang , Dong DU

Vision-based weld seam tracking has become one of the key technologies to realize intelligent robotic welding, and weld deviation detection is an essential step. However, accurate and robust detection of weld deviations during the microwelding of ultrathin metal foils remains a significant challenge. This challenge can be attributed to the fusion zone at the mesoscopic scale and the complex time-varying interference (pulsed arcs and reflected light from the workpiece surface). In this paper, an intelligent seam tracking approach for foils joining based on spatial–temporal deep learning from molten pool serial images is proposed. More specifically, a microscopic passive vision sensor is designed to capture molten pool and seam trajectory images under pulsed arc lights. A 3D convolutional neural network (3DCNN) and long short-term memory (LSTM)-based welding torch offset prediction network (WTOP-net) is established to implement highly accurate deviation prediction by capturing long-term dependence of spatial–temporal features. Then, expert knowledge is further incorporated into the spatio-temporal features to improve the robustness of the model. In addition, the slime mould algorithm (SMA) is used to prevent local optima and improve accuracy, efficiency of WTOP-net. The experimental results indicate that the maximum error detected by our method fluctuates within $\pm$ 0.08 mm and the average error is within $\pm$ 0.011 mm when joining two 0.12 mm thickness stainless steel diaphragms. The proposed approach provides a basis for automated robotic seam tracking and intelligent precision manufacturing of ultrathin sheets welded components in aerospace and other fields.

基于视觉的焊缝跟踪已成为实现智能机器人焊接的关键技术之一，而焊缝偏差检测则是其中必不可少的一步。然而，在超薄金属箔的微焊接过程中准确、稳健地检测焊缝偏差仍然是一项重大挑战。这一挑战可归因于介观尺度的熔合区和复杂的时变干扰（脉冲电弧和来自工件表面的反射光）。本文提出了一种基于熔池序列图像时空深度学习的箔片接合智能接缝跟踪方法。具体而言，设计了一种微型被动视觉传感器，用于捕捉脉冲弧光灯下的熔池和接缝轨迹图像。建立了基于三维卷积神经网络（3DCNN）和长短期记忆（LSTM）的焊枪偏移预测网络（WTOP-net），通过捕捉空间-时间特征的长期依赖性实现高精度偏差预测。然后，将专家知识进一步纳入时空特征，以提高模型的鲁棒性。此外，还使用了粘液模算法（SMA）来防止局部最优，提高 WTOP 网络的精度和效率。实验结果表明，在连接两个 0.12 毫米厚的不锈钢隔膜时，我们的方法检测到的最大误差在 0.08 毫米以内，平均误差在 0.011 毫米以内。所提出的方法为航空航天和其他领域的自动机器人焊缝跟踪和超薄板焊接部件的智能精密制造奠定了基础。

{"title":"Intelligent seam tracking in foils joining based on spatial–temporal deep learning from molten pool serial images","authors":"Yuxiang Hong , Yuxuan Jiang , Mingxuan Yang , Baohua Chang , Dong DU","doi":"10.1016/j.rcim.2024.102840","DOIUrl":"10.1016/j.rcim.2024.102840","url":null,"abstract":"<div><p>Vision-based weld seam tracking has become one of the key technologies to realize intelligent robotic welding, and weld deviation detection is an essential step. However, accurate and robust detection of weld deviations during the microwelding of ultrathin metal foils remains a significant challenge. This challenge can be attributed to the fusion zone at the mesoscopic scale and the complex time-varying interference (pulsed arcs and reflected light from the workpiece surface). In this paper, an intelligent seam tracking approach for foils joining based on spatial–temporal deep learning from molten pool serial images is proposed. More specifically, a microscopic passive vision sensor is designed to capture molten pool and seam trajectory images under pulsed arc lights. A 3D convolutional neural network (3DCNN) and long short-term memory (LSTM)-based welding torch offset prediction network (WTOP-net) is established to implement highly accurate deviation prediction by capturing long-term dependence of spatial–temporal features. Then, expert knowledge is further incorporated into the spatio-temporal features to improve the robustness of the model. In addition, the slime mould algorithm (SMA) is used to prevent local optima and improve accuracy, efficiency of WTOP-net. The experimental results indicate that the maximum error detected by our method fluctuates within <span><math><mo>±</mo></math></span> 0.08 mm and the average error is within <span><math><mo>±</mo></math></span> 0.011 mm when joining two 0.12 mm thickness stainless steel diaphragms. The proposed approach provides a basis for automated robotic seam tracking and intelligent precision manufacturing of ultrathin sheets welded components in aerospace and other fields.</p></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"91 ","pages":"Article 102840"},"PeriodicalIF":9.1,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141891971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Online motion accuracy compensation of industrial servomechanisms using machine learning approaches 利用机器学习方法对工业伺服电机进行在线运动精度补偿

IF 9.1 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Robotics and Computer-integrated Manufacturing

Pub Date : 2024-07-29 DOI: 10.1016/j.rcim.2024.102838

Pietro Bilancia , Alberto Locatelli , Alessio Tutarini , Mirko Mucciarini , Manuel Iori , Marcello Pellicciari

This paper addresses the crucial aspect of position error modeling and compensation in industrial servomechanisms with the aim to achieve accurate control and high-performance operation in industrial robots and automated production systems. The inherent complexity and nonlinear behavior of these modules, usually consisting of a servomotor and a speed reducer, often challenge traditional analytical modeling approaches. In response, the study extensively explores the design and implementation of Machine Learning (ML) algorithms to obtain a comprehensive model of the Transmission Error (TE) in rotating vector reducers, which is a main source of robot motion accuracy errors. The ML models are trained with experimental data obtained from a special purpose test rig, where the reducer is tested under different combinations of input speed, applied load and oil temperature. In the second part of the work, the resulting predictive model, tailored to capture the intricate dynamics of the analyzed reducer, is imported into a programmable logic controller to enable online compensation strategies during the execution of custom motion profiles. Experimental tests are conducted using two distinct motion profiles: one generated with a cycloidal law, typical of industrial machinery, and the other extrapolated from the joints of an industrial robot during a pick-and-place task. The results demonstrate the effectiveness of the proposed approach, enabling accurate prediction and substantial reductions (over 90%) in the overall reducer TE through the implemented predictive model.

本文探讨了工业伺服机构中位置误差建模和补偿的关键问题，旨在实现工业机器人和自动化生产系统的精确控制和高性能运行。这些模块通常由伺服电机和减速器组成，其固有的复杂性和非线性行为往往对传统的分析建模方法构成挑战。为此，本研究广泛探讨了机器学习（ML）算法的设计和实施，以获得旋转矢量减速器传输误差（TE）的综合模型，这是机器人运动精度误差的主要来源。ML 模型是利用从特殊用途测试平台上获得的实验数据进行训练的，在该测试平台上，减速器在不同的输入速度、应用负载和油温组合下进行测试。在工作的第二部分，为捕捉所分析减速器的复杂动态而定制的预测模型被导入到可编程逻辑控制器中，以便在执行自定义运动曲线时启用在线补偿策略。实验测试使用了两种不同的运动曲线：一种是由典型的工业机械摆线定律产生的，另一种是在拾放任务中从工业机器人的关节中推断出来的。实验结果证明了所提方法的有效性，通过实施预测模型，实现了精确预测并大幅降低（超过 90%）整体减速器 TE。

{"title":"Online motion accuracy compensation of industrial servomechanisms using machine learning approaches","authors":"Pietro Bilancia , Alberto Locatelli , Alessio Tutarini , Mirko Mucciarini , Manuel Iori , Marcello Pellicciari","doi":"10.1016/j.rcim.2024.102838","DOIUrl":"10.1016/j.rcim.2024.102838","url":null,"abstract":"<div><p>This paper addresses the crucial aspect of position error modeling and compensation in industrial servomechanisms with the aim to achieve accurate control and high-performance operation in industrial robots and automated production systems. The inherent complexity and nonlinear behavior of these modules, usually consisting of a servomotor and a speed reducer, often challenge traditional analytical modeling approaches. In response, the study extensively explores the design and implementation of Machine Learning (ML) algorithms to obtain a comprehensive model of the Transmission Error (TE) in rotating vector reducers, which is a main source of robot motion accuracy errors. The ML models are trained with experimental data obtained from a special purpose test rig, where the reducer is tested under different combinations of input speed, applied load and oil temperature. In the second part of the work, the resulting predictive model, tailored to capture the intricate dynamics of the analyzed reducer, is imported into a programmable logic controller to enable online compensation strategies during the execution of custom motion profiles. Experimental tests are conducted using two distinct motion profiles: one generated with a cycloidal law, typical of industrial machinery, and the other extrapolated from the joints of an industrial robot during a pick-and-place task. The results demonstrate the effectiveness of the proposed approach, enabling accurate prediction and substantial reductions (over 90%) in the overall reducer TE through the implemented predictive model.</p></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"91 ","pages":"Article 102838"},"PeriodicalIF":9.1,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S073658452400125X/pdfft?md5=07da8c50389e390e8f49cb701257c0b4&pid=1-s2.0-S073658452400125X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141891970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Smart and user-centric manufacturing information recommendation using multimodal learning to support human-robot collaboration in mixed reality environments 利用多模态学习支持混合现实环境中的人机协作，实现以用户为中心的智能制造信息推荐

IF 9.1 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Robotics and Computer-integrated Manufacturing

Pub Date : 2024-07-26 DOI: 10.1016/j.rcim.2024.102836

Sung Ho Choi , Minseok Kim , Jae Yeol Lee

The future manufacturing system must be capable of supporting customized mass production while reducing cost and must be flexible enough to accommodate market demands. Additionally, workers must possess the knowledge and skills to adapt to the evolving manufacturing environment. Previous studies have been conducted to provide customized manufacturing information to the worker. However, most have not considered the worker's situation or region of interest (ROI), so they had difficulty providing information tailored to the worker. Thus, a manufacturing information recommendation system should utilize not only manufacturing data but also the worker's situational information and intent to assist the worker in adjusting to the evolving working environment. This study presents a smart and user-centric manufacturing information recommendation system that harnesses the vision and text dual encoder-based multimodal deep learning model to offer the most relevant information based on the worker's vision and query, which can support human-robot collaboration (HRC) in a mixed reality (MR) environment. The proposed recommendation model can assist the worker by analyzing the manufacturing environment image acquired from smart glasses, the worker's specific question, and the related manufacturing document. By establishing correlations between the MR-based visual information and the worker's query using the multimodal deep learning model, the proposed approach identifies the most suitable information to be recommended. Furthermore, the recommended information can be visualized through MR smart glasses to support HRC. For quantitative and qualitative evaluation, we compared the proposed model with existing vision-text dual models, and the results demonstrated that the proposed approach outperformed previous studies. Thus, the proposed approach has the potential to assist workers more effectively in MR-based manufacturing environments, enhancing their overall productivity and adaptability.

未来的制造系统必须能够支持定制化大规模生产，同时降低成本，并且必须足够灵活，以适应市场需求。此外，工人必须具备适应不断变化的制造环境的知识和技能。以往的研究都是为了向工人提供定制化生产信息。但是，大多数研究都没有考虑工人的实际情况或关注区域（ROI），因此难以提供适合工人的信息。因此，制造信息推荐系统不仅要利用制造数据，还要利用工人的情景信息和意图，以帮助工人适应不断变化的工作环境。本研究提出了一种以用户为中心的智能制造信息推荐系统，该系统利用基于视觉和文本双编码器的多模态深度学习模型，根据工人的视觉和查询提供最相关的信息，从而支持混合现实（MR）环境中的人机协作（HRC）。所提出的推荐模型可以通过分析智能眼镜获取的制造环境图像、工人的具体问题以及相关的制造文档来帮助工人。通过使用多模态深度学习模型在基于 MR 的视觉信息和工人的查询之间建立关联，所提出的方法可以识别出最适合推荐的信息。此外，推荐的信息可以通过磁共振智能眼镜可视化，以支持 HRC。为了进行定量和定性评估，我们将所提出的模型与现有的视觉-文本双重模型进行了比较，结果表明所提出的方法优于之前的研究。因此，所提出的方法有望在基于磁共振的制造环境中更有效地帮助工人，提高他们的整体生产率和适应能力。

{"title":"Smart and user-centric manufacturing information recommendation using multimodal learning to support human-robot collaboration in mixed reality environments","authors":"Sung Ho Choi , Minseok Kim , Jae Yeol Lee","doi":"10.1016/j.rcim.2024.102836","DOIUrl":"10.1016/j.rcim.2024.102836","url":null,"abstract":"<div><p>The future manufacturing system must be capable of supporting customized mass production while reducing cost and must be flexible enough to accommodate market demands. Additionally, workers must possess the knowledge and skills to adapt to the evolving manufacturing environment. Previous studies have been conducted to provide customized manufacturing information to the worker. However, most have not considered the worker's situation or region of interest (ROI), so they had difficulty providing information tailored to the worker. Thus, a manufacturing information recommendation system should utilize not only manufacturing data but also the worker's situational information and intent to assist the worker in adjusting to the evolving working environment. This study presents a smart and user-centric manufacturing information recommendation system that harnesses the vision and text dual encoder-based multimodal deep learning model to offer the most relevant information based on the worker's vision and query, which can support human-robot collaboration (HRC) in a mixed reality (MR) environment. The proposed recommendation model can assist the worker by analyzing the manufacturing environment image acquired from smart glasses, the worker's specific question, and the related manufacturing document. By establishing correlations between the MR-based visual information and the worker's query using the multimodal deep learning model, the proposed approach identifies the most suitable information to be recommended. Furthermore, the recommended information can be visualized through MR smart glasses to support HRC. For quantitative and qualitative evaluation, we compared the proposed model with existing vision-text dual models, and the results demonstrated that the proposed approach outperformed previous studies. Thus, the proposed approach has the potential to assist workers more effectively in MR-based manufacturing environments, enhancing their overall productivity and adaptability.</p></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"91 ","pages":"Article 102836"},"PeriodicalIF":9.1,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141954344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A resilient scheduling framework for multi-robot multi-station welding flow shop scheduling against robot failures 针对机器人故障的多机器人多工位焊接流动车间弹性调度框架

IF 9.1 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Robotics and Computer-integrated Manufacturing

Pub Date : 2024-07-26 DOI: 10.1016/j.rcim.2024.102835

Ming Wang , Peng Zhang , Guoqing Zhang , Kexin Sun , Jie Zhang , Mengyu Jin

With the development of intelligent manufacturing, robots are being increasingly applied in manufacturing systems due to their high flexibility. To avoid production disruptions caused by robot failures, higher requirements are imposed on the resilience of systems, specifically in terms of resistance, response, and recovery capabilities. In response to this, this paper investigates the resilient scheduling framework for multi-robot multi-station welding flow shop, thereby endowing and enhancing the resilience of the system. Within the resilient scheduling framework, a proactive scheduling method maximizing resistance capability is firstly proposed based on an improved NSGA-III with variable neighborhood search. Secondly, to improve the response and recovery capabilities of the system, a recovery scheduling method is presented. Therein, an adaptive trigger policy based on deep reinforcement learning is introduced to enhance the rapid response capability for disturbances, while the recovery optimization grants the system the ability to recover its performance that has been degraded due to the impact of disturbances. Finally, through simulation experiments and case study, it is verified that the proposed algorithms and framework possess superior performance of multi-objective optimization, which can endow the multi-robot multi-station welding flow shop with resilience to against uncertain robot failures.

随着智能制造的发展，机器人因其高度灵活性而越来越多地应用于制造系统。为了避免机器人故障导致生产中断，对系统的弹性提出了更高的要求，特别是在抗干扰能力、响应能力和恢复能力方面。为此，本文研究了多机器人多工位焊接流动车间的弹性调度框架，从而赋予并增强系统的弹性。在弹性调度框架内，首先提出了一种基于可变邻域搜索的改进型 NSGA-III 的主动调度方法，最大限度地提高了抵抗能力。其次，为了提高系统的响应和恢复能力，提出了一种恢复调度方法。其中，引入了基于深度强化学习的自适应触发策略，以增强对干扰的快速响应能力，而恢复优化则赋予系统恢复因干扰影响而降低的性能的能力。最后，通过仿真实验和案例研究，验证了所提出的算法和框架具有优越的多目标优化性能，能够赋予多机器人多工位焊接流水车间以应对不确定机器人故障的弹性。

{"title":"A resilient scheduling framework for multi-robot multi-station welding flow shop scheduling against robot failures","authors":"Ming Wang , Peng Zhang , Guoqing Zhang , Kexin Sun , Jie Zhang , Mengyu Jin","doi":"10.1016/j.rcim.2024.102835","DOIUrl":"10.1016/j.rcim.2024.102835","url":null,"abstract":"<div><p>With the development of intelligent manufacturing, robots are being increasingly applied in manufacturing systems due to their high flexibility. To avoid production disruptions caused by robot failures, higher requirements are imposed on the resilience of systems, specifically in terms of resistance, response, and recovery capabilities. In response to this, this paper investigates the resilient scheduling framework for multi-robot multi-station welding flow shop, thereby endowing and enhancing the resilience of the system. Within the resilient scheduling framework, a proactive scheduling method maximizing resistance capability is firstly proposed based on an improved NSGA-III with variable neighborhood search. Secondly, to improve the response and recovery capabilities of the system, a recovery scheduling method is presented. Therein, an adaptive trigger policy based on deep reinforcement learning is introduced to enhance the rapid response capability for disturbances, while the recovery optimization grants the system the ability to recover its performance that has been degraded due to the impact of disturbances. Finally, through simulation experiments and case study, it is verified that the proposed algorithms and framework possess superior performance of multi-objective optimization, which can endow the multi-robot multi-station welding flow shop with resilience to against uncertain robot failures.</p></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"91 ","pages":"Article 102835"},"PeriodicalIF":9.1,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141954345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Hand-eye calibration method for a line structured light robot vision system based on a single planar constraint 基于单一平面约束的线结构光机器人视觉系统手眼校准方法

IF 9.1 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Robotics and Computer-integrated Manufacturing

Pub Date : 2024-07-26 DOI: 10.1016/j.rcim.2024.102825

Kaifan Zhong , Jingxin Lin , Tao Gong , Xianmin Zhang , Nianfeng Wang

Hand-eye calibration is a prerequisite for robot vision system applications. However, due to the lack of image features, implementing hand-eye calibration with a line-structured light sensor is limited by complex procedures and special reference objects. The aim of this paper is to develop a stable and undemanding eye-in-hand calibration method with a 2D laser sensor that can be adapted to random measurement strategies in most common scenes. The proposed method can use an easily accessible plane from machined workpieces, except for specialized calibration objects, which facilitates the automation of industrial robots. Moreover, in this method, a two-step iterative method is combined with fast simulated annealing based on a single planar constraint to overcome large initial deviations and sensor errors. Simulations and calibration experiments are conducted to assess the method performance and demonstrate the feasibility and accuracy of the proposed eye-in-hand calibration method.

手眼校准是机器人视觉系统应用的先决条件。然而，由于缺乏图像特征，使用线结构光传感器进行手眼校准受到复杂程序和特殊参考对象的限制。本文旨在利用二维激光传感器开发一种稳定且要求不高的手眼校准方法，该方法可适用于大多数常见场景中的随机测量策略。除了专门的校准对象外，所提出的方法可以使用加工工件上容易获得的平面，这有利于工业机器人的自动化。此外，在该方法中，两步迭代法与基于单一平面约束的快速模拟退火相结合，克服了较大的初始偏差和传感器误差。为了评估该方法的性能，我们进行了模拟和校准实验，证明了所提出的手眼校准方法的可行性和准确性。

引用次数: 0

Dual data mapping with fine-tuned large language models and asset administration shells toward interoperable knowledge representation 利用微调大型语言模型和资产管理外壳进行双重数据映射，实现可互操作的知识表示法

IF 9.1 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Robotics and Computer-integrated Manufacturing

Pub Date : 2024-07-26 DOI: 10.1016/j.rcim.2024.102837

Dachuan Shi , Olga Meyer , Michael Oberle , Thomas Bauernhansl

In the context of Industry 4.0, ensuring the compatibility of digital twins (DTs) with existing software systems in the manufacturing sector presents a significant challenge. The Asset Administration Shell (AAS), conceptualized as the standardized DT for an asset, offers a powerful framework that connects the DT with the established software infrastructure through interoperable knowledge representation. Although the IEC 63278 series specifies the AAS metamodel, it lacks a matching strategy for automating the mapping between proprietary data from existing software and AAS information models. Addressing this gap, we introduce a novel dual data mapping system (DDMS) that utilizes a fine-tuned open-source large language model (LLM) for entity matching. This system facilitates not only the mapping between existing software and AAS models but also between AAS models and standardized vocabulary dictionaries, thereby enhancing the model's semantic interoperability. A case study within the injection molding domain illustrates the practical application of DDMS for the automated creation of AAS instances, seamlessly integrating the manufacturer's existing data. Furthermore, we extensively investigate the potential of fine-tuning decode-only LLMs as generative classifiers and encoding-based classifiers for the entity matching task. To this end, we establish two AAS-specific datasets by collecting and compiling AAS-related resources. In addition, supplementary experiments are performed on general entity-matching benchmark datasets to ensure that our empirical conclusions and insights are generally applicable. The experiment results indicate that the fine-tuned generative LLM classifier achieves slightly better results, while the encoding-based classifier enables much faster inference. Furthermore, the fine-tuned LLM surpasses all state-of-the-art approaches for entity matching, including GPT-4 enhanced with in-context learning and chain of thoughts. This evidence highlights the effectiveness of the proposed DDMS in bridging the interoperability gap within DT applications, offering a scalable solution for the manufacturing industry.

在工业 4.0 的背景下，确保数字孪生（DT）与制造业现有软件系统的兼容性是一项重大挑战。资产管理外壳（AAS）的概念是资产的标准化数字孪生，它提供了一个强大的框架，通过可互操作的知识表示将数字孪生与现有的软件基础设施连接起来。虽然 IEC 63278 系列规定了 AAS 元模型，但它缺乏自动映射现有软件专有数据和 AAS 信息模型的匹配策略。为了弥补这一不足，我们引入了一种新颖的双数据映射系统（DDMS），该系统利用经过微调的开源大型语言模型（LLM）进行实体匹配。该系统不仅能促进现有软件与 AAS 模型之间的映射，还能促进 AAS 模型与标准化词汇词典之间的映射，从而增强模型的语义互操作性。注塑成型领域的一个案例研究说明了 DDMS 在自动创建 AAS 实例方面的实际应用，无缝集成了制造商的现有数据。此外，我们还广泛研究了微调解码 LLM 作为生成分类器和基于编码的分类器在实体匹配任务中的潜力。为此，我们通过收集和编译 AAS 相关资源，建立了两个 AAS 专用数据集。此外，我们还在一般实体匹配基准数据集上进行了补充实验，以确保我们的经验结论和见解具有普遍适用性。实验结果表明，经过微调的生成式 LLM 分类器取得了稍好的结果，而基于编码的分类器的推理速度要快得多。此外，经过微调的 LLM 超越了所有最先进的实体匹配方法，包括通过上下文学习和思维链增强的 GPT-4。这些证据凸显了所提出的 DDMS 在弥合 DT 应用程序中的互操作性差距方面的有效性，为制造业提供了一个可扩展的解决方案。

{"title":"Dual data mapping with fine-tuned large language models and asset administration shells toward interoperable knowledge representation","authors":"Dachuan Shi , Olga Meyer , Michael Oberle , Thomas Bauernhansl","doi":"10.1016/j.rcim.2024.102837","DOIUrl":"10.1016/j.rcim.2024.102837","url":null,"abstract":"<div><p>In the context of Industry 4.0, ensuring the compatibility of digital twins (DTs) with existing software systems in the manufacturing sector presents a significant challenge. The Asset Administration Shell (AAS), conceptualized as the standardized DT for an asset, offers a powerful framework that connects the DT with the established software infrastructure through interoperable knowledge representation. Although the IEC 63278 series specifies the AAS metamodel, it lacks a matching strategy for automating the mapping between proprietary data from existing software and AAS information models. Addressing this gap, we introduce a novel dual data mapping system (DDMS) that utilizes a fine-tuned open-source large language model (LLM) for entity matching. This system facilitates not only the mapping between existing software and AAS models but also between AAS models and standardized vocabulary dictionaries, thereby enhancing the model's semantic interoperability. A case study within the injection molding domain illustrates the practical application of DDMS for the automated creation of AAS instances, seamlessly integrating the manufacturer's existing data. Furthermore, we extensively investigate the potential of fine-tuning decode-only LLMs as generative classifiers and encoding-based classifiers for the entity matching task. To this end, we establish two AAS-specific datasets by collecting and compiling AAS-related resources. In addition, supplementary experiments are performed on general entity-matching benchmark datasets to ensure that our empirical conclusions and insights are generally applicable. The experiment results indicate that the fine-tuned generative LLM classifier achieves slightly better results, while the encoding-based classifier enables much faster inference. Furthermore, the fine-tuned LLM surpasses all state-of-the-art approaches for entity matching, including GPT-4 enhanced with in-context learning and chain of thoughts. This evidence highlights the effectiveness of the proposed DDMS in bridging the interoperability gap within DT applications, offering a scalable solution for the manufacturing industry.</p></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"91 ","pages":"Article 102837"},"PeriodicalIF":9.1,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0736584524001248/pdfft?md5=ce13f8902e3f0294b6ee2bb0f02f505e&pid=1-s2.0-S0736584524001248-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141954346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0