首页 > 最新文献

Frontiers in Robotics and AI最新文献

英文 中文
Adaptive querying for reward learning from human feedback. 基于人类反馈的奖励学习自适应查询。
IF 3 Q2 ROBOTICS Pub Date : 2026-02-12 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1734564
Yashwanthi Anand, Nnamdi Nwagwu, Kevin Sabbe, Naomi T Fitter, Sandhya Saisubramanian

Learning from human feedback is a popular approach to train robots to adapt to user preferences and improve safety. Existing approaches typically consider a single querying (interaction) format when seeking human feedback and do not leverage multiple modes of user interaction with a robot. We examine how to learn a penalty function associated with unsafe behaviors using multiple forms of human feedback, by optimizing both the query state and feedback format. Our proposed adaptive feedback selection is an iterative, two-phase approach which first selects critical states for querying, and then uses information gain to select a feedback format for querying across the sampled critical states. The feedback format selection also accounts for the cost and probability of receiving feedback in a certain format. Our experiments in simulation demonstrate the sample efficiency of our approach in learning to avoid undesirable behaviors. The results of our user study with a physical robot highlight the practicality and effectiveness of adaptive feedback selection in seeking informative, user-aligned feedback that accelerate learning. Experiment videos, code and supplementary materials are found on our website: https://tinyurl.com/AFS-learning.

从人类的反馈中学习是训练机器人适应用户偏好和提高安全性的一种流行方法。在寻求人类反馈时,现有的方法通常考虑单一的查询(交互)格式,而不利用用户与机器人交互的多种模式。我们研究了如何通过优化查询状态和反馈格式,使用多种形式的人类反馈来学习与不安全行为相关的惩罚函数。我们提出的自适应反馈选择是一种迭代的两阶段方法,首先选择用于查询的关键状态,然后使用信息增益选择用于跨采样关键状态查询的反馈格式。反馈格式的选择还考虑了以某种格式接收反馈的成本和概率。我们的模拟实验证明了我们的方法在学习避免不良行为方面的样本效率。我们对物理机器人的用户研究结果强调了自适应反馈选择在寻求信息丰富、与用户一致的反馈以加速学习方面的实用性和有效性。实验视频、代码和补充材料可以在我们的网站上找到:https://tinyurl.com/AFS-learning。
{"title":"Adaptive querying for reward learning from human feedback.","authors":"Yashwanthi Anand, Nnamdi Nwagwu, Kevin Sabbe, Naomi T Fitter, Sandhya Saisubramanian","doi":"10.3389/frobt.2025.1734564","DOIUrl":"https://doi.org/10.3389/frobt.2025.1734564","url":null,"abstract":"<p><p>Learning from human feedback is a popular approach to train robots to adapt to user preferences and improve safety. Existing approaches typically consider a single querying (interaction) format when seeking human feedback and do not leverage multiple modes of user interaction with a robot. We examine how to learn a penalty function associated with unsafe behaviors using <i>multiple</i> forms of human feedback, by optimizing both the <i>query state</i> and <i>feedback format</i>. Our proposed <i>adaptive feedback selection</i> is an iterative, two-phase approach which first selects critical states for querying, and then uses information gain to select a feedback format for querying across the sampled critical states. The feedback format selection also accounts for the cost and probability of receiving feedback in a certain format. Our experiments in simulation demonstrate the sample efficiency of our approach in learning to avoid undesirable behaviors. The results of our user study with a physical robot highlight the practicality and effectiveness of adaptive feedback selection in seeking informative, user-aligned feedback that accelerate learning. Experiment videos, code and supplementary materials are found on our website: https://tinyurl.com/AFS-learning.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1734564"},"PeriodicalIF":3.0,"publicationDate":"2026-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12935605/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147327734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Underground mine rescue robotic systems: insights into human-robot information exchange. 井下矿井救援机器人系统:洞察人机信息交换。
IF 3 Q2 ROBOTICS Pub Date : 2026-02-11 eCollection Date: 2026-01-01 DOI: 10.3389/frobt.2026.1698570
Roya Bakzadeh, Rana Alhaj-Bedar, Sarah Wilson, Vasileios Androulakis, Hassan Khaniani, Sihua Shao, Mostafa Hassanalian, Pedram Roghanchi

Mine emergencies demand rapid and informed decision-making under extreme conditions, often placing personnel in life-threatening situations. Robotic assistance offers the potential to reduce unnecessary human exposure during such operations. This study examines the specific informational needs and communication preferences of mine rescue personnel for designing robotic systems for underground emergency response. A semi-structured interview was developed and conducted with ten mine rescue personnel and subject matter experts (SMEs). Responses were analyzed using thematic analysis and compared with established cognitive models to derive key design recommendations. Drawing on both field experience and hypothetical rescue scenarios, participants provided insights into key functional aspects of robotic systems, including mapping and navigation, gas detection and environmental monitoring, communication capabilities, system reliability, control, and the robot's specific roles during operations. The qualitative data was transcribed and analyzed to identify recurring themes and critical user guidelines. The findings revealed insights into the informational and interface recommendations of rescue teams, particularly the need for real-time situational data and customizable human-robot interfaces tailored to emergency scenarios. These results expose key deficiencies in the current human-robot interaction systems and offer actionable guidance for designing robotic technologies that better align with the operational needs of experienced responders. The outcomes of this study can serve as practical guidelines for developing effective interfaces to support underground mine rescue missions.

矿山紧急情况要求在极端条件下做出迅速和明智的决策,往往使人员处于危及生命的境地。机器人辅助提供了在此类操作中减少不必要的人类暴露的潜力。本研究探讨矿井救援人员在设计井下应急机器人系统时的特定资讯需求与通讯偏好。对10名矿井救援人员和主题专家进行了半结构化访谈。使用主题分析对反馈进行分析,并与已建立的认知模型进行比较,以得出关键的设计建议。根据现场经验和假设的救援场景,与会者提供了机器人系统的关键功能方面的见解,包括测绘和导航、气体检测和环境监测、通信能力、系统可靠性、控制以及机器人在操作中的具体角色。对定性数据进行转录和分析,以确定反复出现的主题和关键的用户准则。调查结果揭示了对救援团队的信息和界面建议的见解,特别是对实时态势数据和针对紧急情况量身定制的人机界面的需求。这些结果揭示了当前人机交互系统的主要缺陷,并为设计机器人技术提供了可操作的指导,以更好地满足经验丰富的响应者的操作需求。本研究结果可作为开发有效界面以支持地下矿山救援任务的实用指南。
{"title":"Underground mine rescue robotic systems: insights into human-robot information exchange.","authors":"Roya Bakzadeh, Rana Alhaj-Bedar, Sarah Wilson, Vasileios Androulakis, Hassan Khaniani, Sihua Shao, Mostafa Hassanalian, Pedram Roghanchi","doi":"10.3389/frobt.2026.1698570","DOIUrl":"https://doi.org/10.3389/frobt.2026.1698570","url":null,"abstract":"<p><p>Mine emergencies demand rapid and informed decision-making under extreme conditions, often placing personnel in life-threatening situations. Robotic assistance offers the potential to reduce unnecessary human exposure during such operations. This study examines the specific informational needs and communication preferences of mine rescue personnel for designing robotic systems for underground emergency response. A semi-structured interview was developed and conducted with ten mine rescue personnel and subject matter experts (SMEs). Responses were analyzed using thematic analysis and compared with established cognitive models to derive key design recommendations. Drawing on both field experience and hypothetical rescue scenarios, participants provided insights into key functional aspects of robotic systems, including mapping and navigation, gas detection and environmental monitoring, communication capabilities, system reliability, control, and the robot's specific roles during operations. The qualitative data was transcribed and analyzed to identify recurring themes and critical user guidelines. The findings revealed insights into the informational and interface recommendations of rescue teams, particularly the need for real-time situational data and customizable human-robot interfaces tailored to emergency scenarios. These results expose key deficiencies in the current human-robot interaction systems and offer actionable guidance for designing robotic technologies that better align with the operational needs of experienced responders. The outcomes of this study can serve as practical guidelines for developing effective interfaces to support underground mine rescue missions.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"13 ","pages":"1698570"},"PeriodicalIF":3.0,"publicationDate":"2026-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12932144/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147311229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Operationalising reproducibility in soft robotics. 软机器人的操作再现性。
IF 3 Q2 ROBOTICS Pub Date : 2026-02-10 eCollection Date: 2026-01-01 DOI: 10.3389/frobt.2026.1751222
David Howard

Reproducibility is a particular challenge for soft robotics, yet remains a core part of its development and maturation as a field. This perspective dives into reproducibility: what it is, what it means, and how it can be applied to soft robotics. We first discuss reproducibility and delineate why it is a critical consideration for the field. Following this, our core contributions are in defining three moonshot goals that collectively chart a path towards a reproducible future for soft robotics. First, methods for testing and sharing data are discussed. Second, we show how testing procedures from other scientific disciplines can provide broad coverage over different types of soft robotics tests that we might want to complete. Finally, we highlight the need for methods to quantitatively compare the embodied intelligence that lies at the heart of soft robotics research. If successful, these steps would put the field in an excellent position to develop into the future.

对于软机器人来说,可重复性是一个特别的挑战,但仍然是其发展和成熟的核心部分。这一观点深入探讨了可重复性:它是什么,它意味着什么,以及它如何应用于软机器人。我们首先讨论可重复性,并描述为什么它是该领域的关键考虑因素。在此之后,我们的核心贡献是确定三个登月目标,这些目标共同为软机器人的可复制未来指明了道路。首先,讨论了测试和共享数据的方法。其次,我们展示了来自其他科学学科的测试程序如何为我们可能想要完成的不同类型的软机器人测试提供广泛的覆盖。最后,我们强调需要对软机器人研究核心的具身智能进行定量比较的方法。如果成功,这些步骤将使该领域处于一个极好的位置,以发展到未来。
{"title":"Operationalising reproducibility in soft robotics.","authors":"David Howard","doi":"10.3389/frobt.2026.1751222","DOIUrl":"https://doi.org/10.3389/frobt.2026.1751222","url":null,"abstract":"<p><p>Reproducibility is a particular challenge for soft robotics, yet remains a core part of its development and maturation as a field. This perspective dives into reproducibility: what it is, what it means, and how it can be applied to soft robotics. We first discuss reproducibility and delineate why it is a critical consideration for the field. Following this, our core contributions are in defining three moonshot goals that collectively chart a path towards a reproducible future for soft robotics. First, methods for testing and sharing data are discussed. Second, we show how testing procedures from other scientific disciplines can provide broad coverage over different types of soft robotics tests that we might want to complete. Finally, we highlight the need for methods to quantitatively compare the embodied intelligence that lies at the heart of soft robotics research. If successful, these steps would put the field in an excellent position to develop into the future.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"13 ","pages":"1751222"},"PeriodicalIF":3.0,"publicationDate":"2026-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12929158/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147291499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Entropy-dependent human motor modulation consistent with morphological computation in a single subject. 熵依赖的人类运动调制与形态学计算一致。
IF 3 Q2 ROBOTICS Pub Date : 2026-02-10 eCollection Date: 2026-01-01 DOI: 10.3389/frobt.2026.1734848
Tsubasa Wakatsuki, Norimasa Yamada

Morphological computation (MC)-the idea that body mechanics contribute to computation-has been widely explored in robotics and examined in humans from a physiological perspective. In this study, we report a behavioral pattern consistent with MC under temporal uncertainty. This proof-of-concept single-subject study examined whether human motor control shows behavioral signatures consistent with MC within a temporal-preparation paradigm. One participant completed 160 trials across four entropy levels (0, 1.0, 1.5, 2.0 bits) in two tasks: a low-embodiment button-pressing movement and a high-embodiment reaching movement. The reaching movement tended to show decreasing response variability (coefficient of variation, CV) with increasing temporal uncertainty, whereas the button-pressing movement tended to remain flat or slightly increase. Reaction time (RT) patterns also diverged: RTs tended to lengthen with longer foreperiods in the reaching condition but shortened in the button-pressing movement. Moreover, spatial accuracy in the reaching movement tended to improve across foreperiods. These adaptations emerged without explicit strategy instructions, may reflect sensitivity to temporal context. Taken together, these patterns appear consistent with MC-inspired accounts in which limb mechanics and modest co-contraction may filter temporal uncertainty rather than amplify it. Although constrained by a single-subject, four-level design, the findings offer preliminary evidence that is suggestive of embodied-intelligence principles that may generalize to human motor control, highlighting commonalities between biological and robotic systems in brain-body-environment dynamics.

形态计算(MC)——身体力学有助于计算的想法——已经在机器人领域得到了广泛的探索,并从生理学的角度对人类进行了检验。在本研究中,我们报告了在时间不确定性下与MC一致的行为模式。这项概念验证的单受试者研究检查了人类运动控制是否在时间准备范式中显示出与MC一致的行为特征。一名参与者在两个任务中完成了四个熵水平(0,1.0,1.5,2.0位)的160次试验:低体现的按键运动和高体现的伸手运动。随着时间不确定性的增加,伸手动作的响应变异性(变异系数,CV)呈下降趋势,而按下按钮的动作则趋于平缓或略有增加。反应时间(RT)模式也存在差异:在伸手条件下,反应时间倾向于延长,而在按下按钮的运动中,反应时间倾向于缩短。此外,伸展动作的空间精度在前期有提高的趋势。这些适应在没有明确策略指导的情况下出现,可能反映了对时间背景的敏感性。综上所述,这些模式似乎与mc启发的说法一致,其中肢体力学和适度的共同收缩可能会过滤时间的不确定性,而不是放大它。虽然受限于单受试者、四级设计,但研究结果提供了初步证据,表明体现智能原则可能推广到人类运动控制,突出了脑-体-环境动力学中生物系统和机器人系统之间的共性。
{"title":"Entropy-dependent human motor modulation consistent with morphological computation in a single subject.","authors":"Tsubasa Wakatsuki, Norimasa Yamada","doi":"10.3389/frobt.2026.1734848","DOIUrl":"https://doi.org/10.3389/frobt.2026.1734848","url":null,"abstract":"<p><p>Morphological computation (MC)-the idea that body mechanics contribute to computation-has been widely explored in robotics and examined in humans from a physiological perspective. In this study, we report a behavioral pattern consistent with MC under temporal uncertainty. This proof-of-concept single-subject study examined whether human motor control shows behavioral signatures consistent with MC within a temporal-preparation paradigm. One participant completed 160 trials across four entropy levels (0, 1.0, 1.5, 2.0 bits) in two tasks: a low-embodiment button-pressing movement and a high-embodiment reaching movement. The reaching movement tended to show decreasing response variability (coefficient of variation, CV) with increasing temporal uncertainty, whereas the button-pressing movement tended to remain flat or slightly increase. Reaction time (RT) patterns also diverged: RTs tended to lengthen with longer foreperiods in the reaching condition but shortened in the button-pressing movement. Moreover, spatial accuracy in the reaching movement tended to improve across foreperiods. These adaptations emerged without explicit strategy instructions, may reflect sensitivity to temporal context. Taken together, these patterns appear consistent with MC-inspired accounts in which limb mechanics and modest co-contraction may filter temporal uncertainty rather than amplify it. Although constrained by a single-subject, four-level design, the findings offer preliminary evidence that is suggestive of embodied-intelligence principles that may generalize to human motor control, highlighting commonalities between biological and robotic systems in brain-body-environment dynamics.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"13 ","pages":"1734848"},"PeriodicalIF":3.0,"publicationDate":"2026-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12929133/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147311247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Calibration-free per-finger force-feedback slip control for grasping by anthropomorphic hand with tri-axial tactile sensors. 具有三轴触觉传感器的拟人化手的无校准单指力反馈滑移控制。
IF 3 Q2 ROBOTICS Pub Date : 2026-02-09 eCollection Date: 2026-01-01 DOI: 10.3389/frobt.2026.1735467
Dickson Chiu Yu Wong, Zheng H Zhu

This paper addresses the challenge of detecting and recovering from slip during robotic grasping of unknown objects, with the objective of establishing a robust no on-site or per-object calibration slip-recovery controller for an anthropomorphic hand. This hand is equipped with tri-axial piezoresistive tactile force sensors on each finger, and the proposed approach is validated through experimental analysis. The proposed methodology eliminates the need for object- or pose-specific calibration, explicit friction modelling, dense tactile arrays, line-of-sight vision, and a data-hungry learning process, enabling real-time implementation with minimal computation and integration effort. Using a commonly acquired online baseline from initial readings, slip is detected from relative changes between consecutive samples of the baseline-subtracted resultant tangential force, and object engagement is determined when the normal force reading deviates from a no-slip baseline beyond a preset threshold. Upon detecting slip, each finger increases its gripping force in closed-loop control until the slip stops, while enforcing motor-current protection in finger control to prevent actuator overload and object damage. Experiments were conducted on objects with different rigidity, weight, and surface textures, including an aluminium tube, a plastic water bottle, and a sponge. Additionally, the response time and variations in gripping force were evaluated. The results demonstrate rapid slip response via localized per-finger correction, good object conformability, and effective re-stabilization under different lifting speeds and sudden external disturbances. The per-finger design utilizes the minimum necessary correction at the offending finger, reducing unnecessary force increases on other fingers and improving grasp efficiency. This approach represents a practical solution for warehouse picking, human-robot collaboration, and in situ manipulation where task-specific calibrations, visual access, or training datasets are impractical.

本文解决了机器人在抓取未知物体时检测和从滑动中恢复的挑战,目的是为拟人化的手建立一个鲁棒的无现场或每物体校准滑动恢复控制器。这只手的每个手指上都安装了三轴压阻式触觉力传感器,并通过实验分析验证了所提出的方法。所提出的方法消除了对特定对象或姿势的校准、明确的摩擦建模、密集的触觉阵列、视线视觉和数据饥渴的学习过程的需要,从而以最小的计算和集成工作实现实时实现。使用通常从初始读数获得的在线基线,从减去基线的切向合力的连续样本之间的相对变化中检测滑移,当法向力读数偏离无滑移基线超过预设阈值时,确定物体啮合。当检测到滑移时,在闭环控制中,每个手指增加其夹持力,直到滑移停止,同时在手指控制中执行电机电流保护,以防止执行器过载和物体损坏。实验对象是不同硬度、重量和表面纹理的物体,包括铝管、塑料水瓶和海绵。此外,还评估了响应时间和夹持力的变化。结果表明,在不同的提升速度和突然的外部干扰下,通过局部的每指校正,滑动响应快速,物体一致性好,并且有效地重新稳定。每个手指的设计利用了最小的必要的纠正在冒犯的手指,减少不必要的力增加在其他手指和提高抓握效率。这种方法代表了仓库拣选、人机协作和现场操作的实用解决方案,其中任务特定的校准、视觉访问或训练数据集是不切实际的。
{"title":"Calibration-free per-finger force-feedback slip control for grasping by anthropomorphic hand with tri-axial tactile sensors.","authors":"Dickson Chiu Yu Wong, Zheng H Zhu","doi":"10.3389/frobt.2026.1735467","DOIUrl":"https://doi.org/10.3389/frobt.2026.1735467","url":null,"abstract":"<p><p>This paper addresses the challenge of detecting and recovering from slip during robotic grasping of unknown objects, with the objective of establishing a robust no on-site or per-object calibration slip-recovery controller for an anthropomorphic hand. This hand is equipped with tri-axial piezoresistive tactile force sensors on each finger, and the proposed approach is validated through experimental analysis. The proposed methodology eliminates the need for object- or pose-specific calibration, explicit friction modelling, dense tactile arrays, line-of-sight vision, and a data-hungry learning process, enabling real-time implementation with minimal computation and integration effort. Using a commonly acquired online baseline from initial readings, slip is detected from relative changes between consecutive samples of the baseline-subtracted resultant tangential force, and object engagement is determined when the normal force reading deviates from a no-slip baseline beyond a preset threshold. Upon detecting slip, each finger increases its gripping force in closed-loop control until the slip stops, while enforcing motor-current protection in finger control to prevent actuator overload and object damage. Experiments were conducted on objects with different rigidity, weight, and surface textures, including an aluminium tube, a plastic water bottle, and a sponge. Additionally, the response time and variations in gripping force were evaluated. The results demonstrate rapid slip response via localized per-finger correction, good object conformability, and effective re-stabilization under different lifting speeds and sudden external disturbances. The per-finger design utilizes the minimum necessary correction at the offending finger, reducing unnecessary force increases on other fingers and improving grasp efficiency. This approach represents a practical solution for warehouse picking, human-robot collaboration, and <i>in situ</i> manipulation where task-specific calibrations, visual access, or training datasets are impractical.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"13 ","pages":"1735467"},"PeriodicalIF":3.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12926654/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147285918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial: Integrative approaches with BCI and robotics for improved human interaction. 社论:脑机接口和机器人技术的综合方法改善人类互动。
IF 3 Q2 ROBOTICS Pub Date : 2026-02-06 eCollection Date: 2026-01-01 DOI: 10.3389/frobt.2026.1785247
Hammad Nazeer, Farzan M Noori, Rayyan Azam Khan
{"title":"Editorial: Integrative approaches with BCI and robotics for improved human interaction.","authors":"Hammad Nazeer, Farzan M Noori, Rayyan Azam Khan","doi":"10.3389/frobt.2026.1785247","DOIUrl":"https://doi.org/10.3389/frobt.2026.1785247","url":null,"abstract":"","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"13 ","pages":"1785247"},"PeriodicalIF":3.0,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12921480/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147272509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning-based robotic cloth manipulation applications: systematic review, challenges and opportunities for physical AI. 基于深度学习的机器人布料操作应用:物理人工智能的系统回顾、挑战和机遇。
IF 3 Q2 ROBOTICS Pub Date : 2026-02-06 eCollection Date: 2026-01-01 DOI: 10.3389/frobt.2026.1752914
Ningquan Gu, Mitsuhiro Hayashibe, Kyo Kutsuzawa, Hui Yu

Cloth unfolding and folding are fundamental tasks in autonomous robotic cloth manipulation as Physical AI. Driven by recent advances in deep learning, this area has developed rapidly in recent years. This review aims to systematically identify and summarize current progress in deep learning-based cloth unfolding and folding. Following the Systematic Reviews and Meta-Analyses (PRISMA) guidelines, 41 relevant papers from 2019 to 2024 were selected for analysis. We examines various factors influencing cloth manipulation and find that, while current methods show impressive performance, several challenges remain unaddressed. These challenges include irregular cloth sizes and diverse initial garment states. Concerning datasets, there is a need for improved real-world data collection systems and more realistic cloth simulators, and the Sim2Real gap must be carefully considered. Additionally, the review highlights the importance of incorporating multi-modal sensors into current platforms and the emergence of novel primitive actions that enhance performance. The need for more consistent comparison metrics is emphasized, and strategies for addressing failure modes are discussed to further advance the field. From an algorithmic perspective, we reorganize existing learning methods into six learning and control paradigms: perception-guided heuristics, goal-conditioned manipulation policies, predictive and model-based state representation methods, reward-driven reinforcement learning over primitive actions, demonstration-driven skill transfer methods, and emerging large language model-based planning methods. We discuss how each paradigm contributes to unfolding and folding, their respective strengths and limitations, and the open problems that arise. Finally, we summarize the remaining challenges and provide future perspectives for physical AI.

布料展开和折叠是自主机器人布料操作的基本任务。在深度学习最新进展的推动下,这一领域近年来发展迅速。本文旨在系统地识别和总结基于深度学习的布料展开和折叠的最新进展。根据系统评价和荟萃分析(PRISMA)指南,选择2019 - 2024年的41篇相关论文进行分析。我们研究了影响布料操作的各种因素,发现虽然目前的方法表现出令人印象深刻的性能,但仍有几个挑战尚未解决。这些挑战包括不规则的布料尺寸和不同的初始服装状态。关于数据集,需要改进真实世界的数据收集系统和更逼真的布料模拟器,并且必须仔细考虑Sim2Real的差距。此外,该综述强调了将多模态传感器整合到当前平台的重要性,以及提高性能的新颖原始动作的出现。强调需要更一致的比较指标,并讨论了解决失效模式的策略,以进一步推进该领域。从算法的角度来看,我们将现有的学习方法重组为六种学习和控制范式:感知引导的启发式,目标条件操作策略,基于预测和模型的状态表示方法,奖励驱动的原始动作强化学习,演示驱动的技能转移方法,以及新兴的基于大型语言模型的规划方法。我们将讨论每种范式如何有助于展开和折叠,它们各自的优势和局限性,以及出现的开放问题。最后,我们总结了仍然存在的挑战,并提供了物理人工智能的未来前景。
{"title":"Deep learning-based robotic cloth manipulation applications: systematic review, challenges and opportunities for physical AI.","authors":"Ningquan Gu, Mitsuhiro Hayashibe, Kyo Kutsuzawa, Hui Yu","doi":"10.3389/frobt.2026.1752914","DOIUrl":"https://doi.org/10.3389/frobt.2026.1752914","url":null,"abstract":"<p><p>Cloth unfolding and folding are fundamental tasks in autonomous robotic cloth manipulation as Physical AI. Driven by recent advances in deep learning, this area has developed rapidly in recent years. This review aims to systematically identify and summarize current progress in deep learning-based cloth unfolding and folding. Following the Systematic Reviews and Meta-Analyses (PRISMA) guidelines, 41 relevant papers from 2019 to 2024 were selected for analysis. We examines various factors influencing cloth manipulation and find that, while current methods show impressive performance, several challenges remain unaddressed. These challenges include irregular cloth sizes and diverse initial garment states. Concerning datasets, there is a need for improved real-world data collection systems and more realistic cloth simulators, and the Sim2Real gap must be carefully considered. Additionally, the review highlights the importance of incorporating multi-modal sensors into current platforms and the emergence of novel primitive actions that enhance performance. The need for more consistent comparison metrics is emphasized, and strategies for addressing failure modes are discussed to further advance the field. From an algorithmic perspective, we reorganize existing learning methods into six learning and control paradigms: perception-guided heuristics, goal-conditioned manipulation policies, predictive and model-based state representation methods, reward-driven reinforcement learning over primitive actions, demonstration-driven skill transfer methods, and emerging large language model-based planning methods. We discuss how each paradigm contributes to unfolding and folding, their respective strengths and limitations, and the open problems that arise. Finally, we summarize the remaining challenges and provide future perspectives for physical AI.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"13 ","pages":"1752914"},"PeriodicalIF":3.0,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12921407/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147272523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Helping or watching it happen: how participants respond to robot failures in a turn-taking game. 帮助还是看着它发生:参与者在轮流游戏中对机器人失败的反应。
IF 3 Q2 ROBOTICS Pub Date : 2026-02-06 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1664334
Samantha Stedtler, Katherine Harrison, Valentina Fantasia

Robot failures in Human-Robot Interaction (HRI), though often stemming from technical limitations, can have severe effects on the interactional dynamics between humans and robots. Prior empirical research has led to conflicting findings on how such failures influence user perceptions and the overall success of the interaction. In this study, we investigate how human participants respond to robot failures on a moment-to-moment basis, with a particular focus on how social roles, responsibilities, and agency are negotiated as these episodes unfold. We examine how responses and helping behaviors are instantiated, and which factors facilitate or hinder recovery strategies. We focus on kinematic failures, such as interruptions in motion, unsuccessful grasping, or dropping objects, that occurred during Tic-Tac-Toe games between human participants (n = 17) and the humanoid robot Epi. Our analysis combines multimodal conversation analysis (MCA) and thick description, drawing on our interdisciplinary backgrounds in cognitive science and feminist Science and Technology Studies (STS). We present selected interactional sequences that illustrate a range of participant responses, including physical repair and scaffolding, interpretive support, emotional care, sustained monitoring, and dynamic negotiation of agency. These observations demonstrate how humans co-construct interactional continuity and robot competence through distributed, multimodal, and affective forms of help. They also reveal how agency is dynamically reconfigured, and how roles and responsibilities are distributed across human and robotic actors. We show how the burden of repair often falls to the human participant and conclude by reflecting on the setting and methods used, specifically in regards to the role of the robot as a research tool.

人机交互(HRI)中的机器人故障,虽然通常源于技术限制,但可能对人与机器人之间的交互动力学产生严重影响。先前的实证研究在这些失败如何影响用户感知和交互的整体成功方面得出了相互矛盾的发现。在这项研究中,我们调查了人类参与者如何在机器人故障的时刻做出反应,特别关注社会角色、责任和代理是如何随着这些事件的展开而协商的。我们研究了反应和帮助行为是如何实例化的,哪些因素促进或阻碍了恢复策略。我们专注于运动学故障,例如运动中断,不成功的抓取或掉落物体,发生在人类参与者(n = 17)和人形机器人Epi之间的井字棋游戏中。我们的分析结合了多模态会话分析(MCA)和厚描述,借鉴了我们在认知科学和女权主义科学与技术研究(STS)方面的跨学科背景。我们选择了互动序列来说明一系列参与者的反应,包括物理修复和脚手架,解释支持,情感关怀,持续监测和代理的动态协商。这些观察结果表明,人类如何通过分布式、多模态和情感形式的帮助,共同构建交互连续性和机器人能力。它们还揭示了代理是如何动态重新配置的,以及角色和责任是如何在人类和机器人参与者之间分配的。我们展示了修理的负担往往落在人类参与者身上,并通过反思所使用的设置和方法,特别是关于机器人作为研究工具的作用,得出结论。
{"title":"Helping or watching it happen: how participants respond to robot failures in a turn-taking game.","authors":"Samantha Stedtler, Katherine Harrison, Valentina Fantasia","doi":"10.3389/frobt.2025.1664334","DOIUrl":"https://doi.org/10.3389/frobt.2025.1664334","url":null,"abstract":"<p><p>Robot failures in Human-Robot Interaction (HRI), though often stemming from technical limitations, can have severe effects on the interactional dynamics between humans and robots. Prior empirical research has led to conflicting findings on how such failures influence user perceptions and the overall success of the interaction. In this study, we investigate how human participants respond to robot failures on a moment-to-moment basis, with a particular focus on how social roles, responsibilities, and agency are negotiated as these episodes unfold. We examine how responses and helping behaviors are instantiated, and which factors facilitate or hinder recovery strategies. We focus on kinematic failures, such as interruptions in motion, unsuccessful grasping, or dropping objects, that occurred during Tic-Tac-Toe games between human participants (n = 17) and the humanoid robot Epi. Our analysis combines multimodal conversation analysis (MCA) and thick description, drawing on our interdisciplinary backgrounds in cognitive science and feminist Science and Technology Studies (STS). We present selected interactional sequences that illustrate a range of participant responses, including physical repair and scaffolding, interpretive support, emotional care, sustained monitoring, and dynamic negotiation of agency. These observations demonstrate how humans co-construct interactional continuity and robot competence through distributed, multimodal, and affective forms of help. They also reveal how agency is dynamically reconfigured, and how roles and responsibilities are distributed across human and robotic actors. We show how the burden of repair often falls to the human participant and conclude by reflecting on the setting and methods used, specifically in regards to the role of the robot as a research tool.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1664334"},"PeriodicalIF":3.0,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12921409/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147272478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Visio-verbal teleimpedance interface: enabling semi-autonomous control of physical interaction via eye tracking and speech. 视觉语言远程阻抗接口:通过眼动追踪和语音实现对物理交互的半自主控制。
IF 3 Q2 ROBOTICS Pub Date : 2026-02-05 eCollection Date: 2026-01-01 DOI: 10.3389/frobt.2026.1749105
Henk H A Jekel, Alejandro Díaz Rosales, Luka Peternel

The paper presents a visio-verbal teleimpedance interface for commanding 3D stiffness ellipsoids to the remote robot with a combination of the operator's gaze and verbal interaction. The gaze is detected by an eye-tracker, allowing the system to understand the context in terms of what the operator is currently looking at in the scene. Along with verbal interaction, a Vision-Language Model (VLM) processes this information, enabling the operator to communicate their intended action or provide corrections. Based on these inputs, the interface can then generate appropriate stiffness matrices for different physical interaction actions. To validate the proposed visio-verbal teleimpedance interface, we conducted a series of experiments on a setup including a Force Dimension Sigma.7 haptic device to control the motion of the remote Kuka LBR iiwa robotic arm. The human operator's gaze is tracked by Tobii Pro Glasses 2, while human verbal commands are processed by a VLM using GPT-4o. The first experiment explored the optimal prompt configuration for the interface. The second and third experiments demonstrated different functionalities of the interface on a slide-in-the-groove task.

提出了一种将操作者的注视与语言交互相结合的视觉-语言遥阻抗界面,用于向远程机器人指挥三维刚性椭球体。眼球追踪器检测到这种凝视,使系统能够根据操作员当前在场景中看到的内容来理解上下文。除了口头交互,视觉语言模型(VLM)还处理这些信息,使操作员能够传达他们的预期动作或提供纠正。基于这些输入,界面可以为不同的物理交互动作生成适当的刚度矩阵。为了验证所提出的视觉-语言远程阻抗接口,我们在一个包含Force Dimension Sigma.7触觉装置的装置上进行了一系列实验,以控制远程库卡LBR iiwa机械臂的运动。人类操作员的目光由Tobii Pro眼镜2跟踪,而人类的口头命令由使用gpt - 40的VLM处理。第一个实验探索了界面的最佳提示配置。第二和第三个实验展示了滑动槽任务中界面的不同功能。
{"title":"Visio-verbal teleimpedance interface: enabling semi-autonomous control of physical interaction via eye tracking and speech.","authors":"Henk H A Jekel, Alejandro Díaz Rosales, Luka Peternel","doi":"10.3389/frobt.2026.1749105","DOIUrl":"https://doi.org/10.3389/frobt.2026.1749105","url":null,"abstract":"<p><p>The paper presents a visio-verbal teleimpedance interface for commanding 3D stiffness ellipsoids to the remote robot with a combination of the operator's gaze and verbal interaction. The gaze is detected by an eye-tracker, allowing the system to understand the context in terms of what the operator is currently looking at in the scene. Along with verbal interaction, a Vision-Language Model (VLM) processes this information, enabling the operator to communicate their intended action or provide corrections. Based on these inputs, the interface can then generate appropriate stiffness matrices for different physical interaction actions. To validate the proposed visio-verbal teleimpedance interface, we conducted a series of experiments on a setup including a Force Dimension Sigma.7 haptic device to control the motion of the remote Kuka LBR iiwa robotic arm. The human operator's gaze is tracked by Tobii Pro Glasses 2, while human verbal commands are processed by a VLM using GPT-4o. The first experiment explored the optimal prompt configuration for the interface. The second and third experiments demonstrated different functionalities of the interface on a slide-in-the-groove task.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"13 ","pages":"1749105"},"PeriodicalIF":3.0,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12926544/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147285945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modelling C-arm fluoroscopy and operating table kinematics via machine learning. 通过机器学习建模c臂透视和手术台运动学。
IF 3 Q2 ROBOTICS Pub Date : 2026-02-05 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1691576
Faria Jaheen, Vinod Gutta, Pascal Fallavollita

This work presents a machine learning driven framework for data-efficient kinematic modeling and workspace optimization in modular C-arm fluoroscopy systems integrated with operating tables. A comprehensive dataset of joint configurations and end-effector poses annotated with voxelized collision status enables the training of predictive models across multiple system configurations ranging from 5 to 9 degrees of freedom. Leveraging expansive simulation-derived datasets, as well as clinical assessment through simulated X-ray generation, the models are trained and validated, achieving sub-millimetric positional accuracy and sub-degree angular precision while delivering real-time inference that surpasses conventional methods in scalability, robustness, and computational latency. The proposed framework demonstrates the viability of data-driven trajectory planning in multi-degree of freedom C-arm systems, providing a clinically relevant solution for improving imaging access and reducing intraoperative collision risks.

这项工作提出了一个机器学习驱动的框架,用于与手术台集成的模块化c臂透视系统的数据高效运动学建模和工作空间优化。关节构型和末端执行器姿态的综合数据集配有体素化碰撞状态注释,可以跨5到9个自由度的多个系统构型训练预测模型。利用广泛的模拟衍生数据集,以及通过模拟x射线生成的临床评估,对模型进行了训练和验证,实现了亚毫米级的位置精度和亚度角精度,同时提供了超越传统方法的可扩展性、鲁棒性和计算延迟的实时推断。所提出的框架证明了数据驱动轨迹规划在多自由度c臂系统中的可行性,为改善成像获取和降低术中碰撞风险提供了临床相关的解决方案。
{"title":"Modelling C-arm fluoroscopy and operating table kinematics via machine learning.","authors":"Faria Jaheen, Vinod Gutta, Pascal Fallavollita","doi":"10.3389/frobt.2025.1691576","DOIUrl":"https://doi.org/10.3389/frobt.2025.1691576","url":null,"abstract":"<p><p>This work presents a machine learning driven framework for data-efficient kinematic modeling and workspace optimization in modular C-arm fluoroscopy systems integrated with operating tables. A comprehensive dataset of joint configurations and end-effector poses annotated with voxelized collision status enables the training of predictive models across multiple system configurations ranging from 5 to 9 degrees of freedom. Leveraging expansive simulation-derived datasets, as well as clinical assessment through simulated X-ray generation, the models are trained and validated, achieving sub-millimetric positional accuracy and sub-degree angular precision while delivering real-time inference that surpasses conventional methods in scalability, robustness, and computational latency. The proposed framework demonstrates the viability of data-driven trajectory planning in multi-degree of freedom C-arm systems, providing a clinically relevant solution for improving imaging access and reducing intraoperative collision risks.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1691576"},"PeriodicalIF":3.0,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12917506/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147272542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Frontiers in Robotics and AI
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1