首页 > 最新文献

Frontiers in Robotics and AI最新文献

英文 中文
FORMIGA: a fleet management framework for sustainable human-robot collaboration in field robotics. FORMIGA:用于现场机器人中可持续人机协作的车队管理框架。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-08 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1706910
Beril Yalcinkaya, Micael S Couceiro, Salviano Soares, António Valente

Robotic fleet management systems are increasingly vital for sustainable operations in agriculture, forestry, and other field domains where labor shortages, efficiency, and environmental concerns intersect. We present FORMIGA, a fleet management framework that integrates human operators and autonomous robots into a collaborative ecosystem. FORMIGA combines standardised communication through the Robot Operating System with a user-centered interface for monitoring and intervention, while also leveraging large language models to generate executable task code from natural language prompts. The framework was deployed and validated within the FEROX project, a European initiative addressing sustainable berry harvesting in remote environments. In simulation-based trials, FORMIGA demonstrated adaptive task allocation, reduced operator workload, and faster task completion compared to semi-autonomous control, enabling dynamic labor division between humans and robots. By enhancing productivity, supporting worker safety, and promoting resource-efficient operations, FORMIGA contributes to the economic, and environmental dimensions of sustainability, offering a transferable tool for advancing human-robot collaboration in field robotics.

机器人车队管理系统对于农业、林业和其他领域的可持续运营越来越重要,这些领域的劳动力短缺、效率和环境问题相互交织。我们提出了FORMIGA,这是一个将人类操作员和自主机器人集成到协作生态系统中的车队管理框架。FORMIGA通过机器人操作系统将标准化通信与以用户为中心的监控和干预界面相结合,同时还利用大型语言模型从自然语言提示生成可执行的任务代码。该框架在FEROX项目中进行了部署和验证,该项目是一项旨在解决偏远环境中可持续浆果收获的欧洲倡议。在基于仿真的试验中,FORMIGA展示了自适应任务分配,减少了操作人员的工作量,与半自主控制相比,任务完成速度更快,实现了人与机器人之间的动态分工。通过提高生产率,支持工人安全,促进资源高效运营,FORMIGA为可持续发展的经济和环境方面做出了贡献,为推进现场机器人领域的人机协作提供了一种可转移的工具。
{"title":"FORMIGA: a fleet management framework for sustainable human-robot collaboration in field robotics.","authors":"Beril Yalcinkaya, Micael S Couceiro, Salviano Soares, António Valente","doi":"10.3389/frobt.2025.1706910","DOIUrl":"10.3389/frobt.2025.1706910","url":null,"abstract":"<p><p>Robotic fleet management systems are increasingly vital for sustainable operations in agriculture, forestry, and other field domains where labor shortages, efficiency, and environmental concerns intersect. We present FORMIGA, a fleet management framework that integrates human operators and autonomous robots into a collaborative ecosystem. FORMIGA combines standardised communication through the Robot Operating System with a user-centered interface for monitoring and intervention, while also leveraging large language models to generate executable task code from natural language prompts. The framework was deployed and validated within the FEROX project, a European initiative addressing sustainable berry harvesting in remote environments. In simulation-based trials, FORMIGA demonstrated adaptive task allocation, reduced operator workload, and faster task completion compared to semi-autonomous control, enabling dynamic labor division between humans and robots. By enhancing productivity, supporting worker safety, and promoting resource-efficient operations, FORMIGA contributes to the economic, and environmental dimensions of sustainability, offering a transferable tool for advancing human-robot collaboration in field robotics.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1706910"},"PeriodicalIF":3.0,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12823971/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-driven acceleration of mixed-integer bilinear programs: a comparative study for robot motion planning. 混合整数双线性规划的数据驱动加速:机器人运动规划的比较研究。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-08 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1656564
Xuan Lin

This paper presents a comparative study of data-driven acceleration techniques for mixed-integer bilinear programs (MIBLPs) applied to robot motion planning. MIBLPs combine discrete decision variables and nonlinear constraints, making them computationally challenging for real-time robotics applications. We investigate two reformulation strategies: (1) converting binary variables into continuous variables with complementarity constraints (MPCC), and (2) converting bilinear constraints into mixed-integer linear constraints using McCormick envelopes (MICP). Using offline computed solutions as datasets, we apply K-nearest neighbor methods to warm-start both reformulations. We experimented with the proposed data-driven MIBLP formulation for motion planning on a linear inverted pendulum with contacts, and planning motion using a single rigid body model with mode transitions and contacts. Our results demonstrate that when sufficient data is available, MICP achieves consistently fast solving speeds that are suitable for real-time computation, while MPCC achieves higher success rates with limited amount of data. Our approach is capable of planning motions for the SCALER robot platform to transition between bipedal and quadrupedal configurations to navigate around obstacles without pre-specified gaits. Code for reproducing our results is available at https://github.com/XuanLin/MIBLP_benchmark.

本文对混合整数双线性规划(MIBLPs)中数据驱动加速技术在机器人运动规划中的应用进行了比较研究。miblp结合了离散决策变量和非线性约束,使其在实时机器人应用中具有计算挑战性。本文研究了两种重构策略:(1)利用互补约束将二元变量转化为连续变量(MPCC),(2)利用麦考密克包络(MICP)将双线性约束转化为混合整数线性约束。使用离线计算解作为数据集,我们应用k近邻方法来热启动两个重新公式。我们将提出的数据驱动的MIBLP公式用于具有接触的线性倒立摆的运动规划,并使用具有模式转换和接触的单一刚体模型进行运动规划。我们的研究结果表明,当有足够的数据可用时,MICP可以实现适合实时计算的快速求解速度,而MPCC在有限的数据量下可以实现更高的成功率。我们的方法能够为SCALER机器人平台规划运动,在两足和四足构型之间转换,在没有预先指定步态的情况下绕过障碍物。用于复制我们的结果的代码可在https://github.com/XuanLin/MIBLP_benchmark上获得。
{"title":"Data-driven acceleration of mixed-integer bilinear programs: a comparative study for robot motion planning.","authors":"Xuan Lin","doi":"10.3389/frobt.2025.1656564","DOIUrl":"https://doi.org/10.3389/frobt.2025.1656564","url":null,"abstract":"<p><p>This paper presents a comparative study of data-driven acceleration techniques for mixed-integer bilinear programs (MIBLPs) applied to robot motion planning. MIBLPs combine discrete decision variables and nonlinear constraints, making them computationally challenging for real-time robotics applications. We investigate two reformulation strategies: (1) converting binary variables into continuous variables with complementarity constraints (MPCC), and (2) converting bilinear constraints into mixed-integer linear constraints using McCormick envelopes (MICP). Using offline computed solutions as datasets, we apply K-nearest neighbor methods to warm-start both reformulations. We experimented with the proposed data-driven MIBLP formulation for motion planning on a linear inverted pendulum with contacts, and planning motion using a single rigid body model with mode transitions and contacts. Our results demonstrate that when sufficient data is available, MICP achieves consistently fast solving speeds that are suitable for real-time computation, while MPCC achieves higher success rates with limited amount of data. Our approach is capable of planning motions for the SCALER robot platform to transition between bipedal and quadrupedal configurations to navigate around obstacles without pre-specified gaits. Code for reproducing our results is available at https://github.com/XuanLin/MIBLP_benchmark.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1656564"},"PeriodicalIF":3.0,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12824877/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146047146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LG-H-PPO: offline hierarchical PPO for robot path planning on a latent graph. LG-H-PPO:基于隐图的机器人路径规划的离线分层PPO。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-07 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1737238
Xiang Han

The path planning capability of autonomous robots in complex environments is crucial for their widespread application in the real world. However, long-term decision-making and sparse reward signals pose significant challenges to traditional reinforcement learning (RL) algorithms. Offline hierarchical reinforcement learning offers an effective approach by decomposing tasks into two stages: high-level subgoal generation and low-level subgoal attainment. Advanced Offline HRL methods, such as Guider and HIQL, typically introduce latent spaces in high-level policies to represent subgoals, thereby handling high-dimensional states and enhancing generalization. However, these approaches require the high-level policy to search and generate sub-objectives within a continuous latent space. This remains a complex and sample-inefficient challenge for policy optimization algorithms-particularly policy gradient-based PPO-often leading to unstable training and slow convergence. To address this core limitation, this paper proposes a novel offline hierarchical PPO framework-LG-H-PPO (Latent Graph-based Hierarchical PPO). The core innovation of LG-H-PPO lies in discretizing the continuous latent space into a structured "latent graph." By transforming high-level planning from challenging "continuous creation" to simple "discrete selection," LG-H-PPO substantially reduces the learning difficulty for the high-level policy. Preliminary experiments on standard D4RL offline navigation benchmarks demonstrate that LG-H-PPO achieves significant advantages over advanced baselines like Guider and HIQL in both convergence speed and final task success rates. The main contribution of this paper is introducing graph structures into latent variable HRL planning. This effectively simplifies the action space for high-level policies, enhancing the training efficiency and stability of offline HRL algorithms for long-sequence navigation tasks. It lays the foundation for future offline HRL research combining latent variable representations with explicit graph planning.

自主机器人在复杂环境中的路径规划能力对其在现实世界中的广泛应用至关重要。然而,长期决策和稀疏奖励信号对传统的强化学习(RL)算法提出了重大挑战。离线分层强化学习提供了一种有效的方法,它将任务分解为两个阶段:高级子目标生成和低级子目标实现。高级离线HRL方法,如Guider和HIQL,通常在高级策略中引入潜在空间来表示子目标,从而处理高维状态并增强泛化。然而,这些方法需要高层策略在连续的潜在空间中搜索和生成子目标。对于策略优化算法(尤其是基于策略梯度的ppo)来说,这仍然是一个复杂且样本效率低下的挑战,通常会导致训练不稳定和收敛缓慢。为了解决这一核心限制,本文提出了一种新的离线分层PPO框架- lg - h- PPO (Latent Graph-based hierarchical PPO)。LG-H-PPO的核心创新在于将连续潜空间离散成结构化的“潜图”。通过将高层次规划从具有挑战性的“连续创造”转变为简单的“离散选择”,LG-H-PPO大大降低了高层次政策的学习难度。在标准D4RL离线导航基准上的初步实验表明,与Guider和HIQL等先进基线相比,LG-H-PPO在收敛速度和最终任务成功率方面都具有显著优势。本文的主要贡献是将图结构引入到潜在变量HRL规划中。这有效地简化了高层策略的动作空间,提高了离线HRL算法对长序列导航任务的训练效率和稳定性。将潜在变量表示与显式图规划相结合,为未来的离线HRL研究奠定了基础。
{"title":"LG-H-PPO: offline hierarchical PPO for robot path planning on a latent graph.","authors":"Xiang Han","doi":"10.3389/frobt.2025.1737238","DOIUrl":"https://doi.org/10.3389/frobt.2025.1737238","url":null,"abstract":"<p><p>The path planning capability of autonomous robots in complex environments is crucial for their widespread application in the real world. However, long-term decision-making and sparse reward signals pose significant challenges to traditional reinforcement learning (RL) algorithms. Offline hierarchical reinforcement learning offers an effective approach by decomposing tasks into two stages: high-level subgoal generation and low-level subgoal attainment. Advanced Offline HRL methods, such as Guider and HIQL, typically introduce latent spaces in high-level policies to represent subgoals, thereby handling high-dimensional states and enhancing generalization. However, these approaches require the high-level policy to search and generate sub-objectives within a continuous latent space. This remains a complex and sample-inefficient challenge for policy optimization algorithms-particularly policy gradient-based PPO-often leading to unstable training and slow convergence. To address this core limitation, this paper proposes a novel offline hierarchical PPO framework-LG-H-PPO (Latent Graph-based Hierarchical PPO). The core innovation of LG-H-PPO lies in discretizing the continuous latent space into a structured \"latent graph.\" By transforming high-level planning from challenging \"continuous creation\" to simple \"discrete selection,\" LG-H-PPO substantially reduces the learning difficulty for the high-level policy. Preliminary experiments on standard D4RL offline navigation benchmarks demonstrate that LG-H-PPO achieves significant advantages over advanced baselines like Guider and HIQL in both convergence speed and final task success rates. The main contribution of this paper is introducing graph structures into latent variable HRL planning. This effectively simplifies the action space for high-level policies, enhancing the training efficiency and stability of offline HRL algorithms for long-sequence navigation tasks. It lays the foundation for future offline HRL research combining latent variable representations with explicit graph planning.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1737238"},"PeriodicalIF":3.0,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12819167/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146031202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accuracy of robot-assisted anterior transpedicular screws in the subaxial cervical spine: an experimental study on human specimens. 机器人辅助下颈椎前路经椎弓根螺钉的准确性:人类标本的实验研究。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-07 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1686350
Lin Cong, Xiaowei Sun, Xiaolu Xi, Ke Yuan, Yajing Cao, Qiang Xie, Yue Zhu

Study design: Prospective study.

Objectives: This study aimed to evaluate the accuracy and safety of robot-assisted anterior transpedicular screw (ATPS) fixation in human cervical spine specimens.

Methods: A spine robotic system was used to implant thirty-six 1.2 mm Kirschner wires (K-wires) into the cervical pedicles (C4-C7) of five human specimens. Accuracy was assessed by comparing the planned trajectories with the actual K-wire positions. The Gertzbein-Robbins classification system (GRS), adapted for cervical pedicles, was used to evaluate accuracy; Grades A and B (<2 mm pedicle breach) were considered clinically acceptable. Secondary metrics included entry point and angle offsets.

Results: Of the 36 K-wires implanted, nine were placed in C4 and C6, 10 in C5, and eight in C7. According to the adapted GRS, 25 placements (69.4%) were Grade A, 10 (27.8%) were Grade B, and one was Grade C, resulting in a 97.2% clinically acceptable placement rate. The mean target offset was 2.29 ± 1.72 mm, the entry offset was 2.47 ± 1.57 mm, and the angle offset was 5.67° ± 3.72°. No significant differences were observed between the left and right sides (p > 0.05).

Conclusion: Robot-assisted ATPS fixation in cervical specimens achieved high accuracy with 97.2% of placements rated clinically acceptable, indicating its technical feasibility and potential utility in anterior cervical procedures.

研究设计:前瞻性研究。目的:本研究旨在评估机器人辅助椎弓根前路螺钉(ATPS)固定在人颈椎标本中的准确性和安全性。方法:采用脊柱机器人系统将36根1.2 mm克氏针(k -丝)植入5例人颈椎椎弓根(C4-C7)。通过比较计划轨迹与实际k线位置来评估准确性。采用适用于颈椎椎弓根的Gertzbein-Robbins分类系统(GRS)评估准确性;A、B级(结果:36根k针中,9根放置在C4、C6, 10根放置在C5, 8根放置在C7。根据调整后的GRS, A级安置25个(69.4%),B级安置10个(27.8%),C级安置1个,临床可接受安置率为97.2%。平均靶偏移量为2.29±1.72 mm,入射偏移量为2.47±1.57 mm,角度偏移量为5.67°±3.72°。左右两侧无显著差异(p < 0.05)。结论:机器人辅助ATPS固定颈椎标本的准确率较高,97.2%的放置位置被临床认可,表明其技术可行性和在颈椎前路手术中的潜在应用。
{"title":"Accuracy of robot-assisted anterior transpedicular screws in the subaxial cervical spine: an experimental study on human specimens.","authors":"Lin Cong, Xiaowei Sun, Xiaolu Xi, Ke Yuan, Yajing Cao, Qiang Xie, Yue Zhu","doi":"10.3389/frobt.2025.1686350","DOIUrl":"10.3389/frobt.2025.1686350","url":null,"abstract":"<p><strong>Study design: </strong>Prospective study.</p><p><strong>Objectives: </strong>This study aimed to evaluate the accuracy and safety of robot-assisted anterior transpedicular screw (ATPS) fixation in human cervical spine specimens.</p><p><strong>Methods: </strong>A spine robotic system was used to implant thirty-six 1.2 mm Kirschner wires (K-wires) into the cervical pedicles (C4-C7) of five human specimens. Accuracy was assessed by comparing the planned trajectories with the actual K-wire positions. The Gertzbein-Robbins classification system (GRS), adapted for cervical pedicles, was used to evaluate accuracy; Grades A and B (<2 mm pedicle breach) were considered clinically acceptable. Secondary metrics included entry point and angle offsets.</p><p><strong>Results: </strong>Of the 36 K-wires implanted, nine were placed in C4 and C6, 10 in C5, and eight in C7. According to the adapted GRS, 25 placements (69.4%) were Grade A, 10 (27.8%) were Grade B, and one was Grade C, resulting in a 97.2% clinically acceptable placement rate. The mean target offset was 2.29 ± 1.72 mm, the entry offset was 2.47 ± 1.57 mm, and the angle offset was 5.67° ± 3.72°. No significant differences were observed between the left and right sides (p > 0.05).</p><p><strong>Conclusion: </strong>Robot-assisted ATPS fixation in cervical specimens achieved high accuracy with 97.2% of placements rated clinically acceptable, indicating its technical feasibility and potential utility in anterior cervical procedures.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1686350"},"PeriodicalIF":3.0,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12819262/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146031243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling the biomechanical features affecting the metabolic rate of walking with a powered ankle-foot prosthesis. 模拟影响动力踝足假体行走代谢率的生物力学特征。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-07 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1708564
Mikayla Schneider, Zane A Colvin, Alena M Grabowski, Cara Gonzalez Welker

For individuals with unilateral transtibial amputation, powered ankle-foot prostheses have the potential to reduce the metabolic rate of walking, which could contribute to improvements in mobility and quality of life; however, physiological improvements have not been consistently demonstrated in experimental studies. To improve our understanding of the biomechanical mechanisms that drive metabolic rate outcomes, we used a machine learning approach to model the relationship between multimodal biomechanical factors and the metabolic rate of walking with a powered ankle-foot prosthesis. Our model included 50 features describing spatiotemporal parameters, step-to-step transition work, joint kinematics, muscle activity, ground reaction forces, prosthesis settings, and subject characteristics, and resulted in a pseudo-R2 of 0.986. Accumulated local effects plots were used to visualize the direction and magnitude of the relationship between each feature and the metabolic rate of walking. The features with the largest effect on metabolic rate were peak unaffected side ankle inversion angle, leading affected leg positive work during the step-to-step transition, and peak affected knee extension angle. This work furthers our knowledge about the biomechanical and physiological response to powered ankle-foot prosthesis use and could assist in developing new strategies to drive reductions in metabolic rate.

对于单侧经胫骨截肢的患者,动力踝足假体有可能降低步行的代谢率,这可能有助于改善活动能力和生活质量;然而,生理上的改善并没有在实验研究中得到一致的证明。为了提高我们对驱动代谢率结果的生物力学机制的理解,我们使用机器学习方法来模拟多模态生物力学因素与使用动力踝足假体行走的代谢率之间的关系。我们的模型包括50个特征,描述了时空参数、台阶到台阶的过渡工作、关节运动学、肌肉活动、地面反作用力、假体设置和受试者特征,结果的伪r2为0.986。累积的局部效应图用于可视化每个特征与步行代谢率之间关系的方向和大小。对代谢率影响最大的特征是未受影响的侧踝关节内翻角峰值,导致步向过渡时腿部正向功的影响,以及受影响的膝关节伸角峰值。这项工作进一步加深了我们对动力踝足假体使用的生物力学和生理反应的了解,并有助于制定新的策略来降低代谢率。
{"title":"Modeling the biomechanical features affecting the metabolic rate of walking with a powered ankle-foot prosthesis.","authors":"Mikayla Schneider, Zane A Colvin, Alena M Grabowski, Cara Gonzalez Welker","doi":"10.3389/frobt.2025.1708564","DOIUrl":"10.3389/frobt.2025.1708564","url":null,"abstract":"<p><p>For individuals with unilateral transtibial amputation, powered ankle-foot prostheses have the potential to reduce the metabolic rate of walking, which could contribute to improvements in mobility and quality of life; however, physiological improvements have not been consistently demonstrated in experimental studies. To improve our understanding of the biomechanical mechanisms that drive metabolic rate outcomes, we used a machine learning approach to model the relationship between multimodal biomechanical factors and the metabolic rate of walking with a powered ankle-foot prosthesis. Our model included 50 features describing spatiotemporal parameters, step-to-step transition work, joint kinematics, muscle activity, ground reaction forces, prosthesis settings, and subject characteristics, and resulted in a pseudo-R<sup>2</sup> of 0.986. Accumulated local effects plots were used to visualize the direction and magnitude of the relationship between each feature and the metabolic rate of walking. The features with the largest effect on metabolic rate were peak unaffected side ankle inversion angle, leading affected leg positive work during the step-to-step transition, and peak affected knee extension angle. This work furthers our knowledge about the biomechanical and physiological response to powered ankle-foot prosthesis use and could assist in developing new strategies to drive reductions in metabolic rate.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1708564"},"PeriodicalIF":3.0,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12819243/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146031265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A feasibility study: a non-inferiority study comparing head-mounted and console-based virtual reality for robotic surgery training. 可行性研究:一项比较头戴式和基于控制台的虚拟现实机器人手术训练的非劣效性研究。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-06 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1616462
Kazuho Kawashima, Shadi Ghali, Justin W Collins, Ali Esmaeili

Background: Head-mounted virtual reality (VR) simulations are increasingly explored in healthcare, particularly in patient education, stroke rehabilitation, and surgical training. While VR-based simulation plays a growing role in robotic-assisted surgery (RAS) training, the implications of head-mounted VR in this context remain underexamined.

Method: This prospective, randomised, controlled trial with a single-arm crossover compared two training modalities: a head-mounted VR simulation and a conventional console-based simulation. Participants in the experimental group used head-mounted VR as their primary training method, while the control group trained on a conventional console. Both groups completed a running suture task at baseline, midterm, and final assessments on the surgical console. The primary outcome was the composite score from the final assessment.

Results: Fourteen participants were equally distributed between the two arms. Baseline scores showed no significant differences. A two-way repeated measures ANOVA demonstrated significant overall improvement across assessments (F (1.688, 20.26) = 48.34, p < 0.001, partial η2 = 0.80). No statistical difference was found in final composite scores (mean difference: 8.4 ± 9.45, p = 0.391, Cohen's d = -0.48), midterm scores, or granular kinematic data. However, non-inferiority could not be established as the confidence interval fell outside our pre-set margin. The crossover group required less time (mean difference: 39 ± 9.01 min, p = 0.004) and fewer attempts (mean difference: 8 ± 2.2, p = 0.009) to reach benchmark performance compared to the control group.

Conclusion: Both head-mounted VR and console-based training significantly improved fundamental RAS skills in novices. While our study showed that the VR training shortened the time and attempts required to reach proficiency benchmarks, the small scale of this trial and the breadth of the confidence intervals mean the results should be viewed as preliminary observations. These results provide an initial signal of feasibility that warrants confirmation in larger studies.

背景:头戴式虚拟现实(VR)模拟越来越多地应用于医疗保健领域,特别是在患者教育、中风康复和外科培训方面。虽然基于VR的模拟在机器人辅助手术(RAS)训练中发挥着越来越大的作用,但在这种情况下,头戴式VR的影响仍未得到充分研究。方法:这项前瞻性、随机、对照试验采用单臂交叉试验,比较了两种训练方式:头戴式VR模拟和传统的基于控制台的模拟。实验组的参与者使用头戴式VR作为他们的主要训练方法,而对照组则在传统的控制台进行训练。两组均在手术控制台上完成了基线、中期和最终评估的连续缝合任务。主要结果是最终评估的综合得分。结果:14名参与者平均分布在两组之间。基线评分无显著差异。双向重复测量方差分析显示各评估的总体改善显著(F (1.688, 20.26) = 48.34, p < 0.001,部分η2 = 0.80)。最终综合评分(平均差值:8.4±9.45,p = 0.391, Cohen’s d = -0.48)、中期评分或颗粒运动数据均无统计学差异。然而,由于置信区间超出了我们预先设定的范围,因此无法建立非劣效性。与对照组相比,交叉组达到基准性能所需的时间更短(平均差值:39±9.01 min, p = 0.004),尝试次数更少(平均差值:8±2.2,p = 0.009)。结论:头戴式VR和基于控制台的训练均能显著提高新手的RAS基础技能。虽然我们的研究表明虚拟现实训练缩短了达到熟练程度基准所需的时间和尝试,但该试验的小规模和置信区间的广度意味着结果应被视为初步观察结果。这些结果提供了可行性的初步信号,值得在更大规模的研究中得到证实。
{"title":"A feasibility study: a non-inferiority study comparing head-mounted and console-based virtual reality for robotic surgery training.","authors":"Kazuho Kawashima, Shadi Ghali, Justin W Collins, Ali Esmaeili","doi":"10.3389/frobt.2025.1616462","DOIUrl":"10.3389/frobt.2025.1616462","url":null,"abstract":"<p><strong>Background: </strong>Head-mounted virtual reality (VR) simulations are increasingly explored in healthcare, particularly in patient education, stroke rehabilitation, and surgical training. While VR-based simulation plays a growing role in robotic-assisted surgery (RAS) training, the implications of head-mounted VR in this context remain underexamined.</p><p><strong>Method: </strong>This prospective, randomised, controlled trial with a single-arm crossover compared two training modalities: a head-mounted VR simulation and a conventional console-based simulation. Participants in the experimental group used head-mounted VR as their primary training method, while the control group trained on a conventional console. Both groups completed a running suture task at baseline, midterm, and final assessments on the surgical console. The primary outcome was the composite score from the final assessment.</p><p><strong>Results: </strong>Fourteen participants were equally distributed between the two arms. Baseline scores showed no significant differences. A two-way repeated measures ANOVA demonstrated significant overall improvement across assessments (F (1.688, 20.26) = 48.34, p < 0.001, partial η<sup>2</sup> = 0.80). No statistical difference was found in final composite scores (mean difference: 8.4 ± 9.45, p = 0.391, Cohen's d = -0.48), midterm scores, or granular kinematic data. However, non-inferiority could not be established as the confidence interval fell outside our pre-set margin. The crossover group required less time (mean difference: 39 ± 9.01 min, p = 0.004) and fewer attempts (mean difference: 8 ± 2.2, p = 0.009) to reach benchmark performance compared to the control group.</p><p><strong>Conclusion: </strong>Both head-mounted VR and console-based training significantly improved fundamental RAS skills in novices. While our study showed that the VR training shortened the time and attempts required to reach proficiency benchmarks, the small scale of this trial and the breadth of the confidence intervals mean the results should be viewed as preliminary observations. These results provide an initial signal of feasibility that warrants confirmation in larger studies.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1616462"},"PeriodicalIF":3.0,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12816980/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146020069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the ethical, legal, and social implications of cybernetic avatars. 探索控制论化身的伦理、法律和社会含义。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-05 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1724149
Ryuma Shineha

A cybernetic avatar (CA) is a concept that encompasses not only avatars representing virtual bodies in cyberspace but also information and communication technology (ICT) and robotic technologies that enhance the physical, cognitive, and perceptual capabilities of humans. CAs can enable multiple people to remotely operate numerous avatars and robots together to perform complex tasks on a large scale and create the necessary infrastructure for their operation and other related activities. However, due to the novelty of this concept, the ethical, legal, and social implications (ELSI) of CAs have not been discussed sufficiently. Therefore, the objective of this paper is to provide an overview of ELSI in the context of a CA, taking into account the implications from fields similar to that of CAs, such as robotic avatars, virtual avatars, metaverses, virtual reality, extended reality, social robots, human-robot interaction, telepresence, telexistence, embodied technology, and exoskeletons. In our review of ELSI in related fields, we found common themes: safety and security, data privacy, identity theft and identity loss, manipulation, intellectual property management, user addiction and overdependence, cyber abuse, risk management in a specific domain (e.g., medical applications), regulatory gaps, balance between free expression and harmful content, accountability, transparency, distributive justice, prevention of inequalities, dual use, and conceptual changes of familiarity. These issues should not be ignored when considering the social implementation of CAs.

控制论化身(cybernetic avatar, CA)是一个概念,它不仅包括在网络空间中代表虚拟身体的化身,还包括增强人类身体、认知和感知能力的信息和通信技术(ICT)和机器人技术。ca可以使多人远程操作多个化身和机器人,一起执行大规模的复杂任务,并为其操作和其他相关活动创建必要的基础设施。然而,由于这一概念的新颖性,ca的伦理、法律和社会影响(ELSI)尚未得到充分的讨论。因此,本文的目的是在CA的背景下概述ELSI,并考虑到与CA类似的领域的影响,如机器人化身、虚拟化身、元现实、虚拟现实、扩展现实、社交机器人、人机交互、远程呈现、远程存在、具体化技术和外骨骼。在我们对相关领域ELSI的回顾中,我们发现了共同的主题:安全和保障、数据隐私、身份盗窃和身份丢失、操纵、知识产权管理、用户成瘾和过度依赖、网络滥用、特定领域(例如医疗应用)的风险管理、监管差距、自由表达与有害内容之间的平衡、问责制、透明度、分配正义、预防不平等、双重用途以及熟悉的概念变化。在考虑ca的社会实施时,这些问题不应被忽视。
{"title":"Exploring the ethical, legal, and social implications of cybernetic avatars.","authors":"Ryuma Shineha","doi":"10.3389/frobt.2025.1724149","DOIUrl":"10.3389/frobt.2025.1724149","url":null,"abstract":"<p><p>A cybernetic avatar (CA) is a concept that encompasses not only avatars representing virtual bodies in cyberspace but also information and communication technology (ICT) and robotic technologies that enhance the physical, cognitive, and perceptual capabilities of humans. CAs can enable multiple people to remotely operate numerous avatars and robots together to perform complex tasks on a large scale and create the necessary infrastructure for their operation and other related activities. However, due to the novelty of this concept, the ethical, legal, and social implications (ELSI) of CAs have not been discussed sufficiently. Therefore, the objective of this paper is to provide an overview of ELSI in the context of a CA, taking into account the implications from fields similar to that of CAs, such as robotic avatars, virtual avatars, metaverses, virtual reality, extended reality, social robots, human-robot interaction, telepresence, telexistence, embodied technology, and exoskeletons. In our review of ELSI in related fields, we found common themes: safety and security, data privacy, identity theft and identity loss, manipulation, intellectual property management, user addiction and overdependence, cyber abuse, risk management in a specific domain (e.g., medical applications), regulatory gaps, balance between free expression and harmful content, accountability, transparency, distributive justice, prevention of inequalities, dual use, and conceptual changes of familiarity. These issues should not be ignored when considering the social implementation of CAs.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1724149"},"PeriodicalIF":3.0,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12812702/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146012845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Wetware network-based AI: a chemical approach to embodied cognition for robotics and artificial intelligence. 基于湿软件网络的人工智能:机器人和人工智能具身认知的化学方法。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-05 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1694338
Luisa Damiano, Antonio Fleres, Andrea Roli, Pasquale Stano

Wetware Network-Based Artificial Intelligence (WNAI) introduces a new approach to robotic cognition and artificial intelligence: autonomous cognitive agents built from synthetic chemical networks. Rooted in Wetware Neuromorphic Engineering, WNAI shifts the focus of this emerging field from disembodied computation and biological mimicry to reticular chemical self-organization as a substrate for cognition. At the theoretical level, WNAI integrates insights from network cybernetics, autopoietic theory and enaction to frame cognition as a materially grounded, emergent phenomenon. At the heuristic level, WNAI defines its role as complementary to existing leading approaches. On the one hand, it complements embodied AI and xenobotics by expanding the design space of artificial embodied cognition into fully synthetic domains. On the other hand, it engages in mutual exchange with neural network architectures, advancing cross-substrate principles of network-based cognition. At the technological level, WNAI offers a roadmap for implementing chemical neural networks and protocellular agents, with potential applications in robotic systems requiring minimal, adaptive, and substrate-sensitive intelligence. By situating wetware neuromorphic engineering within the broader landscape of robotics and AI, this article outlines a programmatic framework that highlights its potential to expand artificial cognition beyond silicon and biohybrid systems.

基于湿件网络的人工智能(WNAI)为机器人认知和人工智能引入了一种新的方法:基于合成化学网络构建的自主认知代理。基于Wetware神经形态工程,WNAI将这一新兴领域的重点从无实体计算和生物模拟转移到作为认知基础的网状化学自组织。在理论层面,WNAI整合了来自网络控制论、自创生理论和行为的见解,将认知构建为一种基于物质的、涌现的现象。在启发式层面,WNAI将其角色定义为对现有领先方法的补充。一方面,它通过将人工具身认知的设计空间扩展到全合成领域,对具身AI和异种机器人进行了补充。另一方面,它与神经网络架构相互交流,推进基于网络的认知的跨基原则。在技术层面上,WNAI为实现化学神经网络和原细胞因子提供了一个路线图,在机器人系统中有潜在的应用,需要最小的、自适应的和对基质敏感的智能。通过将湿软件神经形态工程置于更广阔的机器人和人工智能领域,本文概述了一个程序化框架,强调了其将人工认知扩展到硅和生物混合系统之外的潜力。
{"title":"Wetware network-based AI: a chemical approach to embodied cognition for robotics and artificial intelligence.","authors":"Luisa Damiano, Antonio Fleres, Andrea Roli, Pasquale Stano","doi":"10.3389/frobt.2025.1694338","DOIUrl":"10.3389/frobt.2025.1694338","url":null,"abstract":"<p><p><i>Wetware Network-Based Artificial Intelligence</i> (WNAI) introduces a new approach to robotic cognition and artificial intelligence: autonomous cognitive agents built from synthetic chemical networks. Rooted in <i>Wetware Neuromorphic Engineering</i>, WNAI shifts the focus of this emerging field from disembodied computation and biological mimicry to reticular chemical self-organization as a substrate for cognition. At the <i>theoretical level</i>, WNAI integrates insights from network cybernetics, autopoietic theory and enaction to frame cognition as a materially grounded, emergent phenomenon. At the <i>heuristic level</i>, WNAI defines its role as complementary to existing leading approaches. On the one hand, it complements embodied AI and xenobotics by expanding the design space of artificial embodied cognition into fully synthetic domains. On the other hand, it engages in mutual exchange with neural network architectures, advancing cross-substrate principles of network-based cognition. At the <i>technological level</i>, WNAI offers a roadmap for implementing chemical neural networks and protocellular agents, with potential applications in robotic systems requiring minimal, adaptive, and substrate-sensitive intelligence. By situating wetware neuromorphic engineering within the broader landscape of robotics and AI, this article outlines a programmatic framework that highlights its potential to expand artificial cognition beyond silicon and biohybrid systems.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1694338"},"PeriodicalIF":3.0,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12812610/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146012858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Embarrassment in HRI: remediation and the role of robot responses in emotion control. HRI中的尴尬:修复和机器人反应在情绪控制中的作用。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-02 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1569040
Ahmed Salem, Kaoru Sumi

As robots became increasingly integrated into daily life, their ability to influence human emotions through verbal and nonverbal expressions is gaining attention. While robots have been explored for their role in emotional expression, their potential in emotion regulation particularly in mitigating or amplifying embarrassment remains under-explored in human-robot interaction. To address this gap, this study investigates whether and how robots can regulate the embarrassment emotion through their responses. A between-subjects experiment was conducted with 96 participants (48 males and 48 females) using the social robot Furhat. Participants experienced an embarrassing situation induced by a failure of meshing scenario, followed by the robot adopting one of three response attitudes: neutral, empathic, or ridiculing. Additionally, the robot's social agency was manipulated by varying its facial appearance between a human-like and an anime-like appearances. The findings indicate that embarrassment was effectively induced, as evidenced by physiological data, body movements, facial expressions, and participants' verbal responses. The anime-faced robot elicited lower embarrassment and arousal due to its lower perceived social agency and anthropomorphism. The robot's attitude was the dominant factor shaping participants' emotional responses and perceptions. The neutral and empathic attitudes with an anime face were found to be the most effective in eliciting mild emotions and mitigating embarrassment. Interestingly, an empathic attitude is suspected to be favored over a neutral one as it elicited the lowest embarrassment. However, an empathic attitude risks shaming the participant due to its indirect confrontation that inherently acknowledges the embarrassing incident which is undesirable in Japanese culture. Nevertheless, in terms of the robot's perceived evaluation by participants, a neutral attitude was the most favored. This study highlights the role of robot responses in emotion regulation, particularly in embarrassment control, and provides insights into designing socially intelligent robots that can modulate human emotions effectively.

随着机器人越来越多地融入日常生活,它们通过语言和非语言表达影响人类情绪的能力越来越受到关注。虽然机器人在情感表达方面的作用已经被探索,但它们在情感调节方面的潜力,特别是在减轻或放大尴尬方面,在人机交互方面仍未得到充分的探索。为了解决这一差距,本研究调查了机器人是否以及如何通过他们的反应来调节尴尬情绪。研究人员利用社交机器人Furhat对96名参与者(48名男性和48名女性)进行了一项受试者间实验。参与者经历了一个由网格失败引起的尴尬情景,随后机器人采取了三种反应态度之一:中立、移情或嘲笑。此外,机器人的社会代理被操纵,通过改变其面部外观之间的人类和类似动画的外观。研究结果表明,从生理数据、身体动作、面部表情和参与者的语言反应来看,尴尬是有效诱导的。由于其较低的感知社会能动性和拟人化,具有动漫面孔的机器人引起的尴尬和兴奋程度较低。机器人的态度是影响参与者情绪反应和感知的主要因素。研究发现,动画脸的中性和移情态度在引发温和情绪和减轻尴尬方面最有效。有趣的是,移情态度被怀疑比中立态度更受青睐,因为它引起的尴尬最少。然而,移情态度可能会让参与者感到羞耻,因为它的间接对抗本质上承认了尴尬的事件,这在日本文化中是不受欢迎的。然而,就参与者对机器人的感知评价而言,中立的态度是最受欢迎的。本研究强调了机器人反应在情绪调节中的作用,特别是在尴尬控制中的作用,并为设计能够有效调节人类情绪的社交智能机器人提供了见解。
{"title":"Embarrassment in HRI: remediation and the role of robot responses in emotion control.","authors":"Ahmed Salem, Kaoru Sumi","doi":"10.3389/frobt.2025.1569040","DOIUrl":"10.3389/frobt.2025.1569040","url":null,"abstract":"<p><p>As robots became increasingly integrated into daily life, their ability to influence human emotions through verbal and nonverbal expressions is gaining attention. While robots have been explored for their role in emotional expression, their potential in emotion regulation particularly in mitigating or amplifying embarrassment remains under-explored in human-robot interaction. To address this gap, this study investigates whether and how robots can regulate the embarrassment emotion through their responses. A between-subjects experiment was conducted with 96 participants (48 males and 48 females) using the social robot Furhat. Participants experienced an embarrassing situation induced by a failure of meshing scenario, followed by the robot adopting one of three response attitudes: neutral, empathic, or ridiculing. Additionally, the robot's social agency was manipulated by varying its facial appearance between a human-like and an anime-like appearances. The findings indicate that embarrassment was effectively induced, as evidenced by physiological data, body movements, facial expressions, and participants' verbal responses. The anime-faced robot elicited lower embarrassment and arousal due to its lower perceived social agency and anthropomorphism. The robot's attitude was the dominant factor shaping participants' emotional responses and perceptions. The neutral and empathic attitudes with an anime face were found to be the most effective in eliciting mild emotions and mitigating embarrassment. Interestingly, an empathic attitude is suspected to be favored over a neutral one as it elicited the lowest embarrassment. However, an empathic attitude risks shaming the participant due to its indirect confrontation that inherently acknowledges the embarrassing incident which is undesirable in Japanese culture. Nevertheless, in terms of the robot's perceived evaluation by participants, a neutral attitude was the most favored. This study highlights the role of robot responses in emotion regulation, particularly in embarrassment control, and provides insights into designing socially intelligent robots that can modulate human emotions effectively.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1569040"},"PeriodicalIF":3.0,"publicationDate":"2026-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12807912/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145999440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human intention recognition by deep LSTM and transformer networks for real-time human-robot collaboration. 基于深度LSTM和变压器网络的人机实时协作人类意图识别。
IF 3 Q2 ROBOTICS Pub Date : 2025-12-19 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1708987
Matija Mavsar, Mihael Simonič, Aleš Ude

Collaboration between humans and robots is essential for optimizing the performance of complex tasks in industrial environments, reducing worker strain, and improving safety. This paper presents an integrated human-robot collaboration (HRC) system that leverages advanced intention recognition for real-time task sharing and interaction. By utilizing state-of-the-art human pose estimation combined with deep learning models, we developed a robust framework for detecting and predicting worker intentions. Specifically, we employed LSTM-based and transformer-based neural networks with convolutional and pooling layers to classify human hand trajectories, achieving higher accuracy compared to previous approaches. Additionally, our system integrates dynamic movement primitives (DMPs) for smooth robot motion transitions, collision prevention, and automatic motion onset/cessation detection. We validated the system in a real-world industrial assembly task, demonstrating its effectiveness in enhancing the fluency, safety, and efficiency of human-robot collaboration. The proposed method shows promise in improving real-time decision-making in collaborative environments, offering a safer and more intuitive interaction between humans and robots.

人类和机器人之间的协作对于优化工业环境中复杂任务的性能、减少工人压力和提高安全性至关重要。本文提出了一种集成的人机协作(HRC)系统,该系统利用先进的意图识别进行实时任务共享和交互。通过利用最先进的人体姿势估计与深度学习模型相结合,我们开发了一个强大的框架来检测和预测工人的意图。具体来说,我们采用基于lstm和基于变压器的神经网络,结合卷积和池化层对人手轨迹进行分类,与之前的方法相比,获得了更高的精度。此外,我们的系统集成了动态运动原语(dmp),用于平滑机器人运动转换,碰撞预防和自动运动开始/停止检测。我们在一个真实的工业装配任务中验证了该系统,证明了它在提高人机协作的流畅性、安全性和效率方面的有效性。该方法有望改善协作环境中的实时决策,提供人与机器人之间更安全、更直观的交互。
{"title":"Human intention recognition by deep LSTM and transformer networks for real-time human-robot collaboration.","authors":"Matija Mavsar, Mihael Simonič, Aleš Ude","doi":"10.3389/frobt.2025.1708987","DOIUrl":"10.3389/frobt.2025.1708987","url":null,"abstract":"<p><p>Collaboration between humans and robots is essential for optimizing the performance of complex tasks in industrial environments, reducing worker strain, and improving safety. This paper presents an integrated human-robot collaboration (HRC) system that leverages advanced intention recognition for real-time task sharing and interaction. By utilizing state-of-the-art human pose estimation combined with deep learning models, we developed a robust framework for detecting and predicting worker intentions. Specifically, we employed LSTM-based and transformer-based neural networks with convolutional and pooling layers to classify human hand trajectories, achieving higher accuracy compared to previous approaches. Additionally, our system integrates dynamic movement primitives (DMPs) for smooth robot motion transitions, collision prevention, and automatic motion onset/cessation detection. We validated the system in a real-world industrial assembly task, demonstrating its effectiveness in enhancing the fluency, safety, and efficiency of human-robot collaboration. The proposed method shows promise in improving real-time decision-making in collaborative environments, offering a safer and more intuitive interaction between humans and robots.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1708987"},"PeriodicalIF":3.0,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12757248/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145900997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Frontiers in Robotics and AI
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1