首页 > 最新文献

Frontiers in Robotics and AI最新文献

英文 中文
A substation robot path planning algorithm based on deep reinforcement learning enhanced by ant colony optimization. 基于蚁群优化的深度强化学习的变电站机器人路径规划算法。
IF 3 Q2 ROBOTICS Pub Date : 2026-02-04 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1759501
Hongwei Zhang, Lijun Sun, Weihong Tan, Siyu Bao, Xing He, Jinguo Chen

Substation robots face significant challenges in path planning due to the complex electromagnetic environment, dense equipment layout, and safety-critical operational requirements. This paper proposes a path planning algorithm based on deep reinforcement learning enhanced by ant colony optimization, establishing a synergistic optimization framework that combines bio-inspired algorithms with deep learning. The proposed method addresses critical path planning issues in substation inspection and maintenance operations. The approach includes: 1) designing a pheromone-guided exploration strategy that transforms environmental prior knowledge into spatial bias to reduce ineffective exploration; 2) establishing a high-quality sample screening mechanism that enhances Q-network training through ant colony path experience to improve sample efficiency; 3) implementing dynamic decision weight adjustment that enables gradual transition from heuristic guidance to autonomous learning decisions. Experimental results in complex environments demonstrate the method's superiority. Compared to state-of-the-art baselines including PPO, DDQN, and A*, the proposed method achieves 24% higher sample efficiency, 18% reduction in average path length, and superior dynamic obstacle avoidance. Field validation in a 2,500-square-meter substation confirms a 14.8% improvement in task completion rate compared to standard DRL approaches.

由于复杂的电磁环境、密集的设备布局和安全关键的操作要求,变电站机器人在路径规划方面面临着重大挑战。本文提出了一种基于蚁群优化增强的深度强化学习的路径规划算法,建立了仿生算法与深度学习相结合的协同优化框架。提出的方法解决了变电站检查和维护操作中的关键路径规划问题。该方法包括:1)设计信息素引导的勘探策略,将环境先验知识转化为空间偏差,以减少无效勘探;2)建立高质量样本筛选机制,通过蚁群路径经验增强q网络训练,提高样本效率;3)实施动态决策权重调整,使启发式指导逐步过渡到自主学习决策。在复杂环境下的实验结果证明了该方法的优越性。与PPO、DDQN和A*等最先进的基线相比,该方法的样本效率提高了24%,平均路径长度减少了18%,并且具有更好的动态避障能力。在2500平方米变电站的现场验证证实,与标准DRL方法相比,任务完成率提高了14.8%。
{"title":"A substation robot path planning algorithm based on deep reinforcement learning enhanced by ant colony optimization.","authors":"Hongwei Zhang, Lijun Sun, Weihong Tan, Siyu Bao, Xing He, Jinguo Chen","doi":"10.3389/frobt.2025.1759501","DOIUrl":"https://doi.org/10.3389/frobt.2025.1759501","url":null,"abstract":"<p><p>Substation robots face significant challenges in path planning due to the complex electromagnetic environment, dense equipment layout, and safety-critical operational requirements. This paper proposes a path planning algorithm based on deep reinforcement learning enhanced by ant colony optimization, establishing a synergistic optimization framework that combines bio-inspired algorithms with deep learning. The proposed method addresses critical path planning issues in substation inspection and maintenance operations. The approach includes: 1) designing a pheromone-guided exploration strategy that transforms environmental prior knowledge into spatial bias to reduce ineffective exploration; 2) establishing a high-quality sample screening mechanism that enhances Q-network training through ant colony path experience to improve sample efficiency; 3) implementing dynamic decision weight adjustment that enables gradual transition from heuristic guidance to autonomous learning decisions. Experimental results in complex environments demonstrate the method's superiority. Compared to state-of-the-art baselines including PPO, DDQN, and A*, the proposed method achieves 24% higher sample efficiency, 18% reduction in average path length, and superior dynamic obstacle avoidance. Field validation in a 2,500-square-meter substation confirms a 14.8% improvement in task completion rate compared to standard DRL approaches.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1759501"},"PeriodicalIF":3.0,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12914723/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146229228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coulomb force-guided deep reinforcement learning for effective and explainable robotic motion planning. 库仑力引导的深度强化学习,用于有效和可解释的机器人运动规划。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-30 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1697155
Sirui Song, Trevor Bihl, Jundong Liu

Training mobile robots through digital twins with deep reinforcement learning (DRL) has gained increasing attention to ensure efficient and safe navigation in complex environments. In this paper, we propose a novel physics-inspired DRL framework that achieves both effective and explainable motion planning. We represent the robot, destination, and obstacles as electrical charges and model their interactions using Coulomb forces. These forces are incorporated into the reward function, providing both attractive and repulsive signals to guide robot behavior. In addition, obstacle boundaries extracted from LiDAR segmentation are integrated as anticipatory rewards, allowing the robot to avoid collisions from a distance. The proposed model is first trained in Gazebo simulation environments and subsequently deployed on a real TurtleBot v3 robot. Extensive experiments in both simulation and real-world scenarios demonstrate the effectiveness of the proposed framework. Results show that our method significantly reduces collisions, maintains safe distances from obstacles, and generates safer trajectories toward the destinations.

利用数字孪生和深度强化学习(DRL)来训练移动机器人,以确保在复杂环境下的高效和安全导航,已受到越来越多的关注。在本文中,我们提出了一个新的物理启发的DRL框架,实现了有效和可解释的运动规划。我们将机器人、目的地和障碍物表示为电荷,并使用库仑力对它们的相互作用进行建模。这些力量被整合到奖励函数中,提供吸引和排斥信号来指导机器人的行为。此外,从LiDAR分割中提取的障碍物边界被集成为预期奖励,使机器人能够从远处避免碰撞。提出的模型首先在Gazebo模拟环境中进行训练,随后部署在真正的TurtleBot v3机器人上。在模拟和现实场景中的大量实验证明了所提出框架的有效性。结果表明,我们的方法可以显著减少碰撞,保持与障碍物的安全距离,并生成更安全的目的地轨迹。
{"title":"Coulomb force-guided deep reinforcement learning for effective and explainable robotic motion planning.","authors":"Sirui Song, Trevor Bihl, Jundong Liu","doi":"10.3389/frobt.2025.1697155","DOIUrl":"10.3389/frobt.2025.1697155","url":null,"abstract":"<p><p>Training mobile robots through digital twins with deep reinforcement learning (DRL) has gained increasing attention to ensure efficient and safe navigation in complex environments. In this paper, we propose a novel physics-inspired DRL framework that achieves both effective and explainable motion planning. We represent the robot, destination, and obstacles as electrical charges and model their interactions using Coulomb forces. These forces are incorporated into the reward function, providing both attractive and repulsive signals to guide robot behavior. In addition, obstacle boundaries extracted from LiDAR segmentation are integrated as anticipatory rewards, allowing the robot to avoid collisions from a distance. The proposed model is first trained in Gazebo simulation environments and subsequently deployed on a real TurtleBot v3 robot. Extensive experiments in both simulation and real-world scenarios demonstrate the effectiveness of the proposed framework. Results show that our method significantly reduces collisions, maintains safe distances from obstacles, and generates safer trajectories toward the destinations.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1697155"},"PeriodicalIF":3.0,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12900773/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146203288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding accounting professionals' intention to adopt robotic process automation: a TOE-based empirical assessment from an emerging country. 了解会计专业人员采用机器人流程自动化的意图:来自新兴国家的基于toe的实证评估。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-29 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1747539
Nusirat Ojuolape Gold, Husain Coovadia, Katlego Thipe

The proliferation of the Fourth Industrial Revolution (4IR) is transforming the accounting landscape, with technologies such as Robotic Process Automation (RPA) changing the face of traditional accounting processes. This study investigates the level of RPA adoption among accountants in South Africa and examines how technological-organizational-environmental (TOE) factors influence the behavioral intention of RPA adoption. The study employed an exploratory cross-sectional survey comprising responses from 100 professional accountants in practice to analyze its data, combining descriptive statistics with a multiple linear regression model supported by correlation tests to determine significant predictors of RPA adoption intention. The robustness of the model, which was verified by multiple pre- and post-analysis checks, indicated that institutional support, particularly normative pressure, has the strongest influence on adoption intention, with an adjusted R2 value of 0.27 highly significant. This highlights the crucial role that organizational readiness, managerial support, and technology readiness play in enabling RPA adoption. On the other hand, mimetic pressure showed a negative influence, indicating that the industry-wide adoption of RPA technology may raise concerns and anxiety about job displacement. Overall, the findings reinforce the importance of organizational capacity-building in fostering RPA adoption while also revealing the complexity of environmental and technological factors that influence the adoption decisions of professional accountants in a developing-economy context. The findings support SDG 9 by emphasizing capacity building and inclusive digital transformation.

第四次工业革命(4IR)的扩散正在改变会计领域,机器人流程自动化(RPA)等技术改变了传统会计流程的面貌。本研究调查了南非会计师采用RPA的水平,并考察了技术-组织-环境(TOE)因素如何影响RPA采用的行为意愿。本研究采用了一种探索性的横断面调查方法,收集了100名专业会计师在实践中的反馈,并对其数据进行了分析,将描述性统计与相关检验支持的多元线性回归模型相结合,以确定RPA采用意愿的显著预测因素。通过多次分析前和分析后的检验,模型的稳健性表明,制度支持,特别是规范压力,对收养意愿的影响最大,调整后的R2值为0.27,非常显著。这突出了组织准备、管理支持和技术准备在启用RPA采用中所起的关键作用。另一方面,模仿压力表现出负面影响,表明全行业采用RPA技术可能会引起对工作取代的担忧和焦虑。总体而言,研究结果强调了组织能力建设在促进采用RPA方面的重要性,同时也揭示了在发展中经济体背景下影响专业会计师采用RPA决策的环境和技术因素的复杂性。调查结果通过强调能力建设和包容性数字化转型来支持可持续发展目标9。
{"title":"Understanding accounting professionals' intention to adopt robotic process automation: a TOE-based empirical assessment from an emerging country.","authors":"Nusirat Ojuolape Gold, Husain Coovadia, Katlego Thipe","doi":"10.3389/frobt.2025.1747539","DOIUrl":"10.3389/frobt.2025.1747539","url":null,"abstract":"<p><p>The proliferation of the Fourth Industrial Revolution (4IR) is transforming the accounting landscape, with technologies such as Robotic Process Automation (RPA) changing the face of traditional accounting processes. This study investigates the level of RPA adoption among accountants in South Africa and examines how technological-organizational-environmental (TOE) factors influence the behavioral intention of RPA adoption. The study employed an exploratory cross-sectional survey comprising responses from 100 professional accountants in practice to analyze its data, combining descriptive statistics with a multiple linear regression model supported by correlation tests to determine significant predictors of RPA adoption intention. The robustness of the model, which was verified by multiple pre- and post-analysis checks, indicated that institutional support, particularly normative pressure, has the strongest influence on adoption intention, with an adjusted R<sup>2</sup> value of 0.27 highly significant. This highlights the crucial role that organizational readiness, managerial support, and technology readiness play in enabling RPA adoption. On the other hand, mimetic pressure showed a negative influence, indicating that the industry-wide adoption of RPA technology may raise concerns and anxiety about job displacement. Overall, the findings reinforce the importance of organizational capacity-building in fostering RPA adoption while also revealing the complexity of environmental and technological factors that influence the adoption decisions of professional accountants in a developing-economy context. The findings support SDG 9 by emphasizing capacity building and inclusive digital transformation.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1747539"},"PeriodicalIF":3.0,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12895344/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146203248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transforming dementia caregiver support with AI-powered social robotics. 用人工智能社交机器人改变痴呆症护理人员的支持。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-27 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1704313
Tyler Morris, Conor Brown, Xiaopeng Zhao, Linda Nichols, Jennifer Martindale-Adams, Sharon Bowland, Wenjun Zhou

Introduction: Informal dementia caregivers face significant emotional and physical burdens, yet evidence-based interventions like REACH are often limited by high labor costs and scalability constraints.

Methods: We design a Robot-based Information and Support to Enhance Alzheimer's Caregiver Health (RISE) system, which uses novel social robotics and generative AI to deliver automated and personalized caregiver training and stress management. RISE uses retrieval-augmented generative AI (RAG-AI) grounded in the verified REACH Caregiver Notebook to ensure content safety and minimize hallucinations. It employs the social robot Pepper to deliver interactive presentations, Q&A sessions, review quizzes, and stress reduction activities. A technical evaluation and a two-phase user evaluation was conducted.

Results: We found that the RISE's RAG-AI backend achieved 87% correctness and 92% relevancy when compared to ground truth. User feedback indicated strong acceptance, with Likert-scale usability scores ranging from 3.6 to 4.6 out of 5 across all components.

Discussion: These results suggest that combining verifiable AI architectures with embodied social robotics offers a feasible, scalable solution for enhancing caregiver support and wellbeing. Future work could include a larger scale user study involving real informal dementia caregivers.

非正式的痴呆症护理人员面临着巨大的情感和身体负担,然而,像REACH这样的循证干预措施往往受到高昂的劳动力成本和可扩展性限制。方法:我们设计了一个基于机器人的信息和支持来增强阿尔茨海默氏症护理人员健康(RISE)系统,该系统使用新型社交机器人和生成式人工智能来提供自动化和个性化的护理人员培训和压力管理。RISE使用检索增强生成人工智能(RAG-AI),以经过验证的REACH护理人员笔记本为基础,确保内容安全并最大限度地减少幻觉。它使用社交机器人Pepper来提供互动演示、问答环节、复习小测验和减压活动。进行了技术评价和两阶段的用户评价。结果:我们发现RISE的RAG-AI后端与基础事实相比达到了87%的正确性和92%的相关性。用户的反馈表明了强烈的接受度,所有组件的李克特可用性评分从3.6到4.6(满分5分)不等。讨论:这些结果表明,将可验证的人工智能架构与嵌入的社交机器人相结合,为增强护理人员的支持和福祉提供了一种可行的、可扩展的解决方案。未来的工作可能包括一个更大规模的用户研究,涉及真正的非正式痴呆症护理人员。
{"title":"Transforming dementia caregiver support with AI-powered social robotics.","authors":"Tyler Morris, Conor Brown, Xiaopeng Zhao, Linda Nichols, Jennifer Martindale-Adams, Sharon Bowland, Wenjun Zhou","doi":"10.3389/frobt.2025.1704313","DOIUrl":"10.3389/frobt.2025.1704313","url":null,"abstract":"<p><strong>Introduction: </strong>Informal dementia caregivers face significant emotional and physical burdens, yet evidence-based interventions like REACH are often limited by high labor costs and scalability constraints.</p><p><strong>Methods: </strong>We design a Robot-based Information and Support to Enhance Alzheimer's Caregiver Health (RISE) system, which uses novel social robotics and generative AI to deliver automated and personalized caregiver training and stress management. RISE uses retrieval-augmented generative AI (RAG-AI) grounded in the verified REACH Caregiver Notebook to ensure content safety and minimize hallucinations. It employs the social robot Pepper to deliver interactive presentations, Q&A sessions, review quizzes, and stress reduction activities. A technical evaluation and a two-phase user evaluation was conducted.</p><p><strong>Results: </strong>We found that the RISE's RAG-AI backend achieved 87% correctness and 92% relevancy when compared to ground truth. User feedback indicated strong acceptance, with Likert-scale usability scores ranging from 3.6 to 4.6 out of 5 across all components.</p><p><strong>Discussion: </strong>These results suggest that combining verifiable AI architectures with embodied social robotics offers a feasible, scalable solution for enhancing caregiver support and wellbeing. Future work could include a larger scale user study involving real informal dementia caregivers.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1704313"},"PeriodicalIF":3.0,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12886016/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146167354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robotic transesophageal echocardiography: system design and deep learning-based kinematic modeling. 机器人经食管超声心动图:系统设计和基于深度学习的运动学建模。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-27 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1705142
Seyed MohammadReza Sajadi, Abbas Tariverdi, Henrik Brun, Ole Jakob Elle, Kim Mathiassen

Introduction: This paper presents a robotic transesophageal echocardiography (TEE) system that replicates all essential degrees of freedom available in manual TEE procedures. The developed robotic system advances dual-subsystem architectures through enhanced mechanical design and deep learning-based kinematic modeling.

Methods: Building upon previous designs that manipulate the TEE probe from both handle and gastroscope tube, our system integrates with a teleoperated UR5 manipulator to accommodate both supine and left lateral decubitus patient positions, addressing the full spectrum of clinical TEE procedures. The system features 6 DOF at the probe handle and 2 DOF at the gastroscope tube. Together, these create optimal gastroscope tube geometry, minimizing cable tension asymmetry and friction-induced nonlinearities inherent in cable-driven mechanisms. The primary contribution is a data-driven kinematic model using recurrent neural networks with LSTM units that overcomes fundamental limitations of analytical approaches for continuum manipulators. Trained on 42,000 synchronized pose-command pairs collected across three gastroscope tube configurations (0°, 45°, 90° bends), the model effectively captures dead zones, hysteresis, and coupling effects between steering mechanisms.

Results: Experimental validation demonstrates strong position tracking across all three gastroscope tube configurations. The model achieves RMSE of 1.267 mm for the 0° configuration, 1.209 mm for the 45° configuration, and 1.194 mm for the 90° configuration. Mean orientation errors are 7.064° at 0°, 8.503° at 45°, and 4.947° at the clinically critical 90° configuration. The model exhibits coordinate frame independence with only 0.06 mm RMSE difference between original and rotated datasets. This confirms true kinematic learning rather than coordinate-specific patterns. With 1.8 ms inference time, the system achieves real-time performance essential for clinical deployment.

Discussion: This integration of robotic system design with deep learning establishes a foundation for semi-autonomous TEE systems. The developed system can support both diagnostic TEE examinations and TEE-guided structural heart interventions.

简介:本文介绍了一种机器人经食管超声心动图(TEE)系统,该系统复制了人工TEE程序中所有必要的自由度。开发的机器人系统通过增强的机械设计和基于深度学习的运动学建模,推进了双子系统架构。方法:基于先前的设计,通过手柄和胃镜管操作TEE探针,我们的系统集成了远程操作UR5机械手,以适应仰卧位和左侧侧卧位的患者,解决临床TEE手术的全部问题。该系统的特点是探头手柄的6自由度和胃镜管的2自由度。它们共同创造了最佳的胃镜管几何形状,最大限度地减少了电缆张力的不对称性和电缆驱动机制中固有的摩擦引起的非线性。主要贡献是一个数据驱动的运动学模型,该模型使用带有LSTM单元的递归神经网络,克服了连续统机械臂分析方法的基本限制。该模型对三种胃镜管配置(0°、45°、90°弯曲)收集的42,000对同步姿势命令进行了训练,有效捕获了死区、迟滞和转向机构之间的耦合效应。结果:实验验证表明在所有三种胃镜管配置中都有很强的位置跟踪。该模型在0°配置下的RMSE为1.267 mm, 45°配置下的RMSE为1.209 mm, 90°配置下的RMSE为1.194 mm。平均定向误差在0°时为7.064°,在45°时为8.503°,在临床关键的90°配置时为4.947°。该模型具有坐标帧无关性,原始数据集与旋转数据集的RMSE差仅为0.06 mm。这证实了真正的运动学学习,而不是特定于坐标的模式。在1.8 ms的推理时间内,系统实现了临床部署所必需的实时性能。讨论:机器人系统设计与深度学习的结合为半自主TEE系统奠定了基础。开发的系统可以支持诊断TEE检查和TEE引导的结构性心脏干预。
{"title":"Robotic transesophageal echocardiography: system design and deep learning-based kinematic modeling.","authors":"Seyed MohammadReza Sajadi, Abbas Tariverdi, Henrik Brun, Ole Jakob Elle, Kim Mathiassen","doi":"10.3389/frobt.2025.1705142","DOIUrl":"10.3389/frobt.2025.1705142","url":null,"abstract":"<p><strong>Introduction: </strong>This paper presents a robotic transesophageal echocardiography (TEE) system that replicates all essential degrees of freedom available in manual TEE procedures. The developed robotic system advances dual-subsystem architectures through enhanced mechanical design and deep learning-based kinematic modeling.</p><p><strong>Methods: </strong>Building upon previous designs that manipulate the TEE probe from both handle and gastroscope tube, our system integrates with a teleoperated UR5 manipulator to accommodate both supine and left lateral decubitus patient positions, addressing the full spectrum of clinical TEE procedures. The system features 6 DOF at the probe handle and 2 DOF at the gastroscope tube. Together, these create optimal gastroscope tube geometry, minimizing cable tension asymmetry and friction-induced nonlinearities inherent in cable-driven mechanisms. The primary contribution is a data-driven kinematic model using recurrent neural networks with LSTM units that overcomes fundamental limitations of analytical approaches for continuum manipulators. Trained on 42,000 synchronized pose-command pairs collected across three gastroscope tube configurations (0°, 45°, 90° bends), the model effectively captures dead zones, hysteresis, and coupling effects between steering mechanisms.</p><p><strong>Results: </strong>Experimental validation demonstrates strong position tracking across all three gastroscope tube configurations. The model achieves RMSE of 1.267 mm for the 0° configuration, 1.209 mm for the 45° configuration, and 1.194 mm for the 90° configuration. Mean orientation errors are 7.064° at 0°, 8.503° at 45°, and 4.947° at the clinically critical 90° configuration. The model exhibits coordinate frame independence with only 0.06 mm RMSE difference between original and rotated datasets. This confirms true kinematic learning rather than coordinate-specific patterns. With 1.8 ms inference time, the system achieves real-time performance essential for clinical deployment.</p><p><strong>Discussion: </strong>This integration of robotic system design with deep learning establishes a foundation for semi-autonomous TEE systems. The developed system can support both diagnostic TEE examinations and TEE-guided structural heart interventions.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1705142"},"PeriodicalIF":3.0,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12887703/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146167361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ATRON: Autonomous trash retrieval for oceanic neatness. ATRON:自动回收垃圾,保持海洋整洁。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-22 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1718177
John Abanes, Hyunjin Jang, Behruz Erkinov, Jana Awadalla, Anthony Tzes

The subject of this article is the development of an unmanned surface vehicle (USV) for the removal of floating debris. A twin-hulled boat with four thrusters placed at the corners of the vessel is used for this purpose. The trash is collected in a storage space through a timing belt driven by an electric motor. The debris is accumulated in a funnel positioned at the front of the boat and subsequently raised through this belt into the garbage bin. The boat is equipped with a spherical camera, a long-range 2D LiDAR, and an inertial measurement unit (IMU) for simultaneous localization and mapping (SLAM). The floating debris is identified from rectified camera frames using YOLO, while the LiDAR and IMU concurrently provide the USV's odometry. Visual methods are utilized to determine the location of debris and obstacles in the 3D environment. The optimal order in which the debris is collected is determined by solving the orienteering problem, and the planar convex hull of the boat is combined with map and obstacle data via the Open Motion Planning Library (OMPL) to perform path planning. Pure pursuit is used to generate the trajectory from the obtained path. Limits on the linear and angular velocities are experimentally estimated, and a PID controller is tuned to improve path following. The USV is evaluated in an indoor swimming pool containing static obstacles and floating debris.

本文的主题是开发一种用于清除漂浮碎片的无人水面飞行器(USV)。一个双壳船与四个推进器放置在船的角落被用于这个目的。垃圾通过由电动机驱动的同步带收集在存储空间中。垃圾堆积在船前部的漏斗中,随后通过这条皮带上升到垃圾箱中。该艇装备有一个球形照相机、一个远程2D激光雷达和一个用于同时定位和绘图(SLAM)的惯性测量单元(IMU)。利用YOLO从整流相机帧中识别漂浮碎片,而激光雷达和IMU同时提供USV的里程计。利用视觉方法确定三维环境中碎片和障碍物的位置。通过求解定向问题确定碎片的最优收集顺序,并通过开放运动规划库(Open Motion Planning Library, OMPL)将船的平面凸壳与地图和障碍物数据相结合进行路径规划。采用纯追踪的方法,从得到的路径生成轨迹。实验估计了直线速度和角速度的极限,并调整了PID控制器以改善路径跟踪。USV在包含静态障碍物和漂浮碎片的室内游泳池中进行评估。
{"title":"ATRON: Autonomous trash retrieval for oceanic neatness.","authors":"John Abanes, Hyunjin Jang, Behruz Erkinov, Jana Awadalla, Anthony Tzes","doi":"10.3389/frobt.2025.1718177","DOIUrl":"10.3389/frobt.2025.1718177","url":null,"abstract":"<p><p>The subject of this article is the development of an unmanned surface vehicle (USV) for the removal of floating debris. A twin-hulled boat with four thrusters placed at the corners of the vessel is used for this purpose. The trash is collected in a storage space through a timing belt driven by an electric motor. The debris is accumulated in a funnel positioned at the front of the boat and subsequently raised through this belt into the garbage bin. The boat is equipped with a spherical camera, a long-range 2D LiDAR, and an inertial measurement unit (IMU) for simultaneous localization and mapping (SLAM). The floating debris is identified from rectified camera frames using YOLO, while the LiDAR and IMU concurrently provide the USV's odometry. Visual methods are utilized to determine the location of debris and obstacles in the 3D environment. The optimal order in which the debris is collected is determined by solving the orienteering problem, and the planar convex hull of the boat is combined with map and obstacle data via the Open Motion Planning Library (OMPL) to perform path planning. Pure pursuit is used to generate the trajectory from the obtained path. Limits on the linear and angular velocities are experimentally estimated, and a PID controller is tuned to improve path following. The USV is evaluated in an indoor swimming pool containing static obstacles and floating debris.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1718177"},"PeriodicalIF":3.0,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12873476/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146144161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On vibration suppression of a tendon-driven soft robotic neck for the social robot HARU. 社交机器人HARU肌腱驱动软机器人颈部振动抑制研究。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-22 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1698343
Seshagopalan Thorapalli Muralidharan, Randy Gomez, Georgios Andrikopoulos

Tendon-driven continuum actuators (TDCAs) provide compliant and lifelike motion that is well suited for human-robot interaction, but their structural compliance and underactuation make them susceptible to undesired vibrations, particularly along unactuated axes under load. This work addresses vibration suppression in such systems by proposing a real-time control strategy for a two-degree-of-freedom TDCA-based soft robotic neck used in the HARU social robot, where yaw motion is unactuated and prone to oscillations due to eccentric loading. The proposed approach combines a current-based tendon pretensioning routine, baseline PID control of the actuated pitch and roll axes, and a novel Coupled Axis Indirect Vibration Suppression (CIVS) mechanism. CIVS exploits mechanical cross-axis coupling by using high-pass filtered yaw acceleration from an inertial sensor to generate transient tension modulations in the actuated tendons, thereby increasing effective damping of the unactuated yaw mode without introducing additional hardware or compromising compliance. A classical sliding mode control is also implemented as a nonlinear benchmark under identical hardware constraints. Experimental validation on the HARU neck under representative loading conditions demonstrates that the proposed method achieves substantial vibration attenuation. Compared to the baseline controller, CIVS reduces yaw angular range by approximately 53% and yaw acceleration area by over 60%, while preserving smooth, expressive motion. The results further show that CIVS outperforms the sliding mode controller in suppressing vibrations on the unactuated axis. These findings indicate that indirect, feedback-driven tendon modulation provides an effective and low-complexity solution for mitigating load-induced vibrations in underactuated soft robotic systems, making the approach particularly suitable for interactive applications where safety, compliance, and motion expressivity are critical.

肌腱驱动连续执行器(tdca)提供柔顺和逼真的运动,非常适合人机交互,但其结构的顺应性和欠驱动使其容易受到不期望的振动,特别是在负载下沿非驱动轴。这项工作通过提出一种用于HARU社交机器人的基于tdca的两自由度软机器人颈部的实时控制策略,解决了此类系统中的振动抑制问题,其中偏摆运动是非驱动的,容易因偏心加载而产生振荡。提出的方法结合了基于电流的肌腱预紧程序,驱动的俯仰轴和滚轴的基线PID控制,以及一种新的耦合轴间接振动抑制(CIVS)机制。CIVS利用来自惯性传感器的高通滤波偏航加速度,在驱动肌腱中产生瞬态张力调制,从而增加非驱动偏航模式的有效阻尼,而无需引入额外的硬件或降低顺应性。在相同的硬件约束下,还实现了一种经典的滑模控制作为非线性基准。在典型载荷条件下的HARU颈部试验验证表明,该方法具有较好的减振效果。与基线控制器相比,CIVS将偏航角范围减少了约53%,偏航加速度面积减少了60%以上,同时保持了平滑、富有表现力的运动。结果进一步表明,CIVS在抑制非驱动轴上的振动方面优于滑模控制器。这些发现表明,间接的、反馈驱动的肌腱调制为减轻欠驱动软机器人系统中负载引起的振动提供了一种有效且低复杂性的解决方案,使该方法特别适用于安全性、顺应性和运动表达性至关重要的交互式应用。
{"title":"On vibration suppression of a tendon-driven soft robotic neck for the social robot HARU.","authors":"Seshagopalan Thorapalli Muralidharan, Randy Gomez, Georgios Andrikopoulos","doi":"10.3389/frobt.2025.1698343","DOIUrl":"10.3389/frobt.2025.1698343","url":null,"abstract":"<p><p>Tendon-driven continuum actuators (TDCAs) provide compliant and lifelike motion that is well suited for human-robot interaction, but their structural compliance and underactuation make them susceptible to undesired vibrations, particularly along unactuated axes under load. This work addresses vibration suppression in such systems by proposing a real-time control strategy for a two-degree-of-freedom TDCA-based soft robotic neck used in the HARU social robot, where yaw motion is unactuated and prone to oscillations due to eccentric loading. The proposed approach combines a current-based tendon pretensioning routine, baseline PID control of the actuated pitch and roll axes, and a novel Coupled Axis Indirect Vibration Suppression (CIVS) mechanism. CIVS exploits mechanical cross-axis coupling by using high-pass filtered yaw acceleration from an inertial sensor to generate transient tension modulations in the actuated tendons, thereby increasing effective damping of the unactuated yaw mode without introducing additional hardware or compromising compliance. A classical sliding mode control is also implemented as a nonlinear benchmark under identical hardware constraints. Experimental validation on the HARU neck under representative loading conditions demonstrates that the proposed method achieves substantial vibration attenuation. Compared to the baseline controller, CIVS reduces yaw angular range by approximately 53% and yaw acceleration area by over 60%, while preserving smooth, expressive motion. The results further show that CIVS outperforms the sliding mode controller in suppressing vibrations on the unactuated axis. These findings indicate that indirect, feedback-driven tendon modulation provides an effective and low-complexity solution for mitigating load-induced vibrations in underactuated soft robotic systems, making the approach particularly suitable for interactive applications where safety, compliance, and motion expressivity are critical.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1698343"},"PeriodicalIF":3.0,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12872555/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146144169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ecg2o: a seamless extension of g2o for equality-constrained factor graph optimization. ecg20: g20的无缝扩展,用于等式约束因子图优化。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-20 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1698333
Anas Abdelkarim, Daniel Görges, Holger Voos

Factor graph optimization serves as a fundamental framework for robotic perception, enabling applications such as pose estimation, simultaneous localization and mapping (SLAM), structure-from-motion (SfM), and situational modeling. Traditionally, these methods solve unconstrained least squares problems using algorithms such as Gauss-Newton and Levenberg-Marquardt. However, extending factor graphs with native support for hard equality constraints can yield more accurate state estimates and broaden their applicability, particularly in planning and control. Prior work has addressed equality handling either by soft penalties (large weights) or by nested-loop Augmented Lagrangian (AL) schemes. In this paper, we propose a novel extension of factor graphs that seamlessly incorporates hard equality constraints without requiring additional optimization techniques. Our approach maintains the efficiency and flexibility of existing second-order optimization techniques while ensuring constraint satisfaction. To validate the proposed method, an autonomous-vehicle velocity-tracking optimal control problem is solved and benchmarked against an AL baseline, both implemented in g2o. Additional comparisons are conducted in GTSAM, where the penalty method and AL are evaluated against our g2o implementations. Moreover, we introduce ecg2o, a header-only C++ library that extends the widely used g2o library with full support for hard equality-constrained optimization. This library, along with demonstrative examples and the optimal control problem, is available as open source at https://github.com/snt-arg/ecg2o.

因子图优化是机器人感知的基本框架,可以实现姿态估计、同步定位和绘图(SLAM)、运动结构(SfM)和情境建模等应用。传统上,这些方法使用高斯-牛顿和利文伯格-马夸特等算法来解决无约束最小二乘问题。然而,使用对硬相等约束的本地支持来扩展因子图可以产生更准确的状态估计并扩展其适用性,特别是在计划和控制方面。先前的工作已经通过软惩罚(大权重)或嵌套循环增广拉格朗日(AL)方案解决了相等性处理。在本文中,我们提出了一种新的因子图扩展,它无缝地结合了硬等式约束,而不需要额外的优化技术。我们的方法在保证约束满足的同时保持了现有二阶优化技术的效率和灵活性。为了验证所提出的方法,解决了一个自动驾驶汽车速度跟踪最优控制问题,并针对人工智能基线进行了基准测试,这两个问题都在2009年实现。在GTSAM中进行了额外的比较,其中根据我们的g20实现对惩罚方法和ai进行了评估。此外,我们还介绍了一个仅限头文件的c++库ecg2o,它扩展了广泛使用的g2o库,完全支持硬等式约束优化。这个库,连同演示示例和最优控制问题,可以在https://github.com/snt-arg/ecg2o上作为开放源代码获得。
{"title":"ecg2o: a seamless extension of g2o for equality-constrained factor graph optimization.","authors":"Anas Abdelkarim, Daniel Görges, Holger Voos","doi":"10.3389/frobt.2025.1698333","DOIUrl":"10.3389/frobt.2025.1698333","url":null,"abstract":"<p><p>Factor graph optimization serves as a fundamental framework for robotic perception, enabling applications such as pose estimation, simultaneous localization and mapping (SLAM), structure-from-motion (SfM), and situational modeling. Traditionally, these methods solve unconstrained least squares problems using algorithms such as Gauss-Newton and Levenberg-Marquardt. However, extending factor graphs with native support for hard equality constraints can yield more accurate state estimates and broaden their applicability, particularly in planning and control. Prior work has addressed equality handling either by soft penalties (large weights) or by nested-loop Augmented Lagrangian (AL) schemes. In this paper, we propose a novel extension of factor graphs that seamlessly incorporates hard equality constraints without requiring additional optimization techniques. Our approach maintains the efficiency and flexibility of existing second-order optimization techniques while ensuring constraint satisfaction. To validate the proposed method, an autonomous-vehicle velocity-tracking optimal control problem is solved and benchmarked against an AL baseline, both implemented in g2o. Additional comparisons are conducted in GTSAM, where the penalty method and AL are evaluated against our g2o implementations. Moreover, we introduce ecg2o, a header-only C++ library that extends the widely used g2o library with full support for hard equality-constrained optimization. This library, along with demonstrative examples and the optimal control problem, is available as open source at https://github.com/snt-arg/ecg2o.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1698333"},"PeriodicalIF":3.0,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12864083/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Eyes ahead: a scoping review of technologies enabling humanoid robots to follow human gaze. 前方的眼睛:使人形机器人跟随人类目光的技术的范围审查。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-16 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1723527
Leana Neuber, Wolf Culemann, Ruth Maria Ingendoh, Angela Heine

Gaze is a fundamental aspect of non-verbal communication in human interaction, playing an important role in conveying attention, intentions, and emotions. A key concept in gaze-based human interaction is joint attention, the focus of two individuals on an object in a shared environment. In the context of human-robot interaction (HRI), gaze-following has become a growing research area, as it enables robots to appear more socially intelligent, engaging, and likable. While various technical approaches have been developed to achieve this capability, a comprehensive overview of existing implementations has been lacking. This scoping review addresses this gap by systematically categorizing existing solutions, offering a structured perspective on how gaze-following behavior is technically realized in the field of HRI. A systematic search was conducted across four databases, leading to the identification of 28 studies. To structure the findings, a taxonomy was developed that categorizes technological approaches along three key functional dimensions: (1) environment tracking, which involves recognizing the objects in the robot's surroundings; (2) gaze tracking, which refers to detecting and interpreting human gaze direction; and (3) gaze-environment mapping, which connects gaze information with objects in the shared environment to enable appropriate robotic responses. Across studies, a distinction emerges between constrained and unconstrained solutions. While constrained approaches, such as predefined object positions, provide high accuracy, they are often limited to controlled settings. In contrast, unconstrained methods offer greater flexibility but pose significant technical challenges. The complexity of the implementations also varies significantly, from simple rule-based approaches to advanced, adaptive systems that integrate multiple data sources. These findings highlight ongoing challenges in achieving robust and real-time gaze-following in robots, particularly in dynamic, real-world environments. Future research should focus on refining unconstrained tracking methods and leveraging advances in machine learning and computer vision to make human-robot interactions more natural and socially intuitive.

凝视是人类非语言交际的一个基本方面,在传递注意力、意图和情感方面起着重要作用。基于注视的人类互动的一个关键概念是共同注意,即两个人在共享环境中对一个物体的关注。在人机交互(HRI)的背景下,目光跟随已经成为一个日益发展的研究领域,因为它使机器人看起来更有社交智能、更有吸引力、更讨人喜欢。虽然已经开发了各种技术方法来实现此功能,但缺乏对现有实现的全面概述。本文通过系统地对现有解决方案进行分类来解决这一问题,并提供了一个结构化的视角,说明注视跟随行为在HRI领域是如何在技术上实现的。在四个数据库中进行了系统搜索,最终确定了28项研究。为了构建研究结果,研究人员根据三个关键功能维度对技术方法进行了分类:(1)环境跟踪,涉及识别机器人周围环境中的物体;(2)注视跟踪,即检测和解释人的注视方向;(3)注视-环境映射,将注视信息与共享环境中的物体联系起来,使机器人能够做出适当的反应。在各种研究中,有约束和无约束的解决方案之间出现了区别。虽然约束方法(如预定义的对象位置)提供了高精度,但它们通常限于受控设置。相比之下,不受约束的方法提供了更大的灵活性,但也带来了重大的技术挑战。实现的复杂性也有很大的不同,从简单的基于规则的方法到集成多个数据源的高级自适应系统。这些发现突出了在机器人中实现鲁棒和实时注视跟踪的持续挑战,特别是在动态的现实世界环境中。未来的研究应该集中在改进无约束跟踪方法,利用机器学习和计算机视觉的进步,使人机交互更加自然和社会直观。
{"title":"Eyes ahead: a scoping review of technologies enabling humanoid robots to follow human gaze.","authors":"Leana Neuber, Wolf Culemann, Ruth Maria Ingendoh, Angela Heine","doi":"10.3389/frobt.2025.1723527","DOIUrl":"10.3389/frobt.2025.1723527","url":null,"abstract":"<p><p>Gaze is a fundamental aspect of non-verbal communication in human interaction, playing an important role in conveying attention, intentions, and emotions. A key concept in gaze-based human interaction is joint attention, the focus of two individuals on an object in a shared environment. In the context of human-robot interaction (HRI), gaze-following has become a growing research area, as it enables robots to appear more socially intelligent, engaging, and likable. While various technical approaches have been developed to achieve this capability, a comprehensive overview of existing implementations has been lacking. This scoping review addresses this gap by systematically categorizing existing solutions, offering a structured perspective on how gaze-following behavior is technically realized in the field of HRI. A systematic search was conducted across four databases, leading to the identification of 28 studies. To structure the findings, a taxonomy was developed that categorizes technological approaches along three key functional dimensions: (1) environment tracking, which involves recognizing the objects in the robot's surroundings; (2) gaze tracking, which refers to detecting and interpreting human gaze direction; and (3) gaze-environment mapping, which connects gaze information with objects in the shared environment to enable appropriate robotic responses. Across studies, a distinction emerges between constrained and unconstrained solutions. While constrained approaches, such as predefined object positions, provide high accuracy, they are often limited to controlled settings. In contrast, unconstrained methods offer greater flexibility but pose significant technical challenges. The complexity of the implementations also varies significantly, from simple rule-based approaches to advanced, adaptive systems that integrate multiple data sources. These findings highlight ongoing challenges in achieving robust and real-time gaze-following in robots, particularly in dynamic, real-world environments. Future research should focus on refining unconstrained tracking methods and leveraging advances in machine learning and computer vision to make human-robot interactions more natural and socially intuitive.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1723527"},"PeriodicalIF":3.0,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12856928/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146107600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-view object pose distribution tracking for pre-grasp planning on mobile robots. 移动机器人抓取前规划的多视点目标位姿分布跟踪。
IF 3 Q2 ROBOTICS Pub Date : 2026-01-14 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1683931
Lakshadeep Naik, Thorbjørn Mosekjær Iversen, Jakob Wilm, Norbert Krüger

The ability to track the 6D pose distribution of an object while a mobile manipulator is still approaching it can enable the robot to pre-plan grasps, thereby improving both the time efficiency and robustness of mobile manipulation. However, tracking a 6D object pose distribution on approach can be challenging due to the limited view of the robot camera. In this study, we present a particle filter-based multi-view 6D pose distribution tracking framework that compensates for the limited view of the moving robot camera while it approaches the object by fusing observations from external stationary cameras in the environment. We extend the single-view pose distribution tracking framework (PoseRBPF) to fuse observations from external cameras. We model the object pose posterior as a multi-modal distribution and introduce techniques for fusion, re-sampling, and pose estimation from the tracked distribution to effectively handle noisy and conflicting observations from different cameras. To evaluate our framework, we also contribute a real-world benchmark dataset. Our experiments demonstrate that the proposed framework yields a more accurate quantification of object pose and associated uncertainty than previous research. Finally, we apply our framework for pre-grasp planning on mobile robots, demonstrating its practical utility.

当移动机械手还在接近目标时,能够跟踪目标的6D位姿分布,使机器人能够预先计划抓取,从而提高移动操作的时间效率和鲁棒性。然而,由于机器人相机的视野有限,跟踪接近的6D物体姿态分布可能具有挑战性。在本研究中,我们提出了一种基于粒子滤波的多视角6D姿态分布跟踪框架,该框架通过融合环境中外部固定摄像机的观测值来补偿移动机器人摄像机在接近目标时的有限视角。我们扩展了单视图姿态分布跟踪框架(PoseRBPF),以融合来自外部摄像机的观测。我们将目标姿态后验建模为多模态分布,并引入融合、重采样和姿态估计技术,以有效处理来自不同相机的噪声和冲突观测。为了评估我们的框架,我们还提供了一个真实世界的基准数据集。我们的实验表明,所提出的框架比以前的研究更准确地量化了物体的姿态和相关的不确定性。最后,我们将我们的框架应用于移动机器人的预抓取规划,证明了它的实用性。
{"title":"Multi-view object pose distribution tracking for pre-grasp planning on mobile robots.","authors":"Lakshadeep Naik, Thorbjørn Mosekjær Iversen, Jakob Wilm, Norbert Krüger","doi":"10.3389/frobt.2025.1683931","DOIUrl":"https://doi.org/10.3389/frobt.2025.1683931","url":null,"abstract":"<p><p>The ability to track the 6D pose distribution of an object while a mobile manipulator is still approaching it can enable the robot to pre-plan grasps, thereby improving both the time efficiency and robustness of mobile manipulation. However, tracking a 6D object pose distribution on approach can be challenging due to the limited view of the robot camera. In this study, we present a particle filter-based multi-view 6D pose distribution tracking framework that compensates for the limited view of the moving robot camera while it approaches the object by fusing observations from external stationary cameras in the environment. We extend the single-view pose distribution tracking framework (PoseRBPF) to fuse observations from external cameras. We model the object pose posterior as a multi-modal distribution and introduce techniques for fusion, re-sampling, and pose estimation from the tracked distribution to effectively handle noisy and conflicting observations from different cameras. To evaluate our framework, we also contribute a real-world benchmark dataset. Our experiments demonstrate that the proposed framework yields a more accurate quantification of object pose and associated uncertainty than previous research. Finally, we apply our framework for pre-grasp planning on mobile robots, demonstrating its practical utility.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1683931"},"PeriodicalIF":3.0,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12848315/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146087630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Frontiers in Robotics and AI
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1