首页 > 最新文献

Frontiers in Robotics and AI最新文献

英文 中文
Conventions and research challenges in considering trust with socially assistive robots for older adults. 考虑对老年人社会辅助机器人的信任的惯例和研究挑战。
IF 3 Q2 ROBOTICS Pub Date : 2025-11-26 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1631206
Aisha Gul, Liam Turner, Carolina Fuentes

Introduction: The global ageing population rise creates a growing need for assistance and Socially Assistive robots (SARs) have the potential to support independence for older adults. However, to allow older adults to benefit from robots that will assist in daily life, it is important to better understand the role of trust in SARs.

Method: We present a Systematic Literature Review (SLR) aiming to identify the models, methods, and research settings used for measuring trust in SARs with older adults as population and analyse current factors in trust assessment.

Result: Our results reveal that previous studies were mostly conducted in lab settings and used subjective self-report measures like questionnaires, interviews, and surveys to measure the trust of older adults in SARs. Moreover, many of these studies focus on healthy older adults without age-related disabilities. We also examine different human-robot trust models that influence trust, and we discuss the lack of standardisation in the measurement of trust among older people in SARs.

Discussion: To address the standardisation gap, we developed a conceptual framework, Subjective Objective Trust Assessment HRI (SOTA-HRI), that incorporates subjective and objective measures to comprehensively evaluate trust in human-robot inter-actions. By combining these dimensions, our proposed framework provides a foundation for future research to design tailored interventions, enhance interaction quality, and ensure reliable trust assessment methods in this domain. Finally, we highlight key areas for future research, such as considering demographic sensitivity in trust-building strategies and further exploring contextual factors such as predictability and dependability that have not been thoroughly explored.

导言:全球老龄化人口的增长创造了对帮助和社会辅助机器人(sar)的需求不断增长,有可能支持老年人的独立。然而,为了让老年人受益于帮助他们日常生活的机器人,更好地理解信任在SARs中的作用是很重要的。方法:我们提出了一项系统文献综述(SLR),旨在确定用于测量老年人对SARs的信任的模型、方法和研究背景,并分析信任评估中的当前因素。结果:我们的研究结果表明,以前的研究大多是在实验室环境中进行的,并使用主观自我报告方法,如问卷调查、访谈和调查来衡量老年人对SARs的信任。此外,这些研究中的许多都集中在没有年龄相关残疾的健康老年人身上。我们还研究了影响信任的不同人机信任模型,并讨论了SARs中老年人信任测量缺乏标准化的问题。讨论:为了解决标准化差距,我们开发了一个概念框架,主观客观信任评估HRI (SOTA-HRI),它结合了主观和客观的措施来全面评估人机交互中的信任。通过结合这些维度,我们提出的框架为未来的研究提供了基础,以设计量身定制的干预措施,提高交互质量,并确保该领域可靠的信任评估方法。最后,我们强调了未来研究的关键领域,例如在建立信任战略中考虑人口敏感性,并进一步探索尚未彻底探索的背景因素,如可预测性和可靠性。
{"title":"Conventions and research challenges in considering trust with socially assistive robots for older adults.","authors":"Aisha Gul, Liam Turner, Carolina Fuentes","doi":"10.3389/frobt.2025.1631206","DOIUrl":"10.3389/frobt.2025.1631206","url":null,"abstract":"<p><strong>Introduction: </strong>The global ageing population rise creates a growing need for assistance and Socially Assistive robots (SARs) have the potential to support independence for older adults. However, to allow older adults to benefit from robots that will assist in daily life, it is important to better understand the role of trust in SARs.</p><p><strong>Method: </strong>We present a Systematic Literature Review (SLR) aiming to identify the models, methods, and research settings used for measuring trust in SARs with older adults as population and analyse current factors in trust assessment.</p><p><strong>Result: </strong>Our results reveal that previous studies were mostly conducted in lab settings and used subjective self-report measures like questionnaires, interviews, and surveys to measure the trust of older adults in SARs. Moreover, many of these studies focus on healthy older adults without age-related disabilities. We also examine different human-robot trust models that influence trust, and we discuss the lack of standardisation in the measurement of trust among older people in SARs.</p><p><strong>Discussion: </strong>To address the standardisation gap, we developed a conceptual framework, Subjective Objective Trust Assessment HRI (SOTA-HRI), that incorporates subjective and objective measures to comprehensively evaluate trust in human-robot inter-actions. By combining these dimensions, our proposed framework provides a foundation for future research to design tailored interventions, enhance interaction quality, and ensure reliable trust assessment methods in this domain. Finally, we highlight key areas for future research, such as considering demographic sensitivity in trust-building strategies and further exploring contextual factors such as predictability and dependability that have not been thoroughly explored.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1631206"},"PeriodicalIF":3.0,"publicationDate":"2025-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12690213/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial: Autonomous robotic systems in aquaculture: research challenges and industry needs. 社论:水产养殖中的自主机器人系统:研究挑战和行业需求。
IF 3 Q2 ROBOTICS Pub Date : 2025-11-21 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1740881
Eleni Kelasidi, Michael Triantafyllou, Sveinung Johan Ohrem
{"title":"Editorial: Autonomous robotic systems in aquaculture: research challenges and industry needs.","authors":"Eleni Kelasidi, Michael Triantafyllou, Sveinung Johan Ohrem","doi":"10.3389/frobt.2025.1740881","DOIUrl":"https://doi.org/10.3389/frobt.2025.1740881","url":null,"abstract":"","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1740881"},"PeriodicalIF":3.0,"publicationDate":"2025-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12678286/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145702492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial: Advancements in neural learning control for enhanced multi-robot coordination. 编辑:增强多机器人协调的神经学习控制的进展。
IF 3 Q2 ROBOTICS Pub Date : 2025-11-21 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1731356
Shude He, Shi-Lu Dai, Chengzhi Yuan, Haotian Shi
{"title":"Editorial: Advancements in neural learning control for enhanced multi-robot coordination.","authors":"Shude He, Shi-Lu Dai, Chengzhi Yuan, Haotian Shi","doi":"10.3389/frobt.2025.1731356","DOIUrl":"https://doi.org/10.3389/frobt.2025.1731356","url":null,"abstract":"","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1731356"},"PeriodicalIF":3.0,"publicationDate":"2025-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12678322/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145702448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A framework for semantics-based situational awareness during mobile robot deployments. 移动机器人部署过程中基于语义的态势感知框架。
IF 3 Q2 ROBOTICS Pub Date : 2025-11-19 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1694123
Tianshu Ruan, Aniketh Ramesh, Hao Wang, Alix Johnstone-Morfoisse, Gokcenur Altindal, Paul Norman, Grigoris Nikolaou, Rustam Stolkin, Manolis Chiou

Deployment of robots into hazardous environments typically involves a "human-robot teaming" (HRT) paradigm, in which a human supervisor interacts with a remotely operating robot inside the hazardous zone. Situational awareness (SA) is vital for enabling HRT, to support navigation, planning, and decision-making. In this paper, we explore issues of higher-level "semantic" information and understanding in SA. In semi-autonomous or variable-autonomy paradigms, different types of semantic information may be important, in different ways, for both the human operator and an autonomous agent controlling the robot. We propose a generalizable framework for acquiring and combining multiple modalities of semantic-level SA during remote deployments of mobile robots. We demonstrate the framework with an example application of search and rescue (SAR) in disaster-response robotics. We propose a set of "environment semantic indicators" that can reflect a variety of different types of semantic information, such as indicators of risk or signs of human activity (SHA), as the robot encounters different scenes. Based on these indicators, we propose a metric to describe the overall situation of the environment, called "Situational Semantic Richness" (SSR). This metric combines multiple semantic indicators to summarize the overall situation. The SSR indicates whether an information-rich, complex situation has been encountered, which may require advanced reasoning by robots and humans and, hence, the attention of the expert human operator. The framework is tested on a Jackal robot in a mock-up disaster-response environment. Experimental results demonstrate that the proposed semantic indicators are sensitive to changes in different modalities of semantic information in different scenes, and the SSR metric reflects the overall semantic changes in the situations encountered.

将机器人部署到危险环境中通常涉及“人-机器人团队”(HRT)范例,其中人类主管与危险区域内的远程操作机器人进行交互。态势感知(SA)对于实现HRT至关重要,可以支持导航、规划和决策。在本文中,我们探讨了在SA中更高层次的“语义”信息和理解问题。在半自主或可变自主范例中,不同类型的语义信息可能以不同的方式对人类操作员和控制机器人的自主代理都很重要。我们提出了一个可推广的框架,用于在移动机器人远程部署期间获取和组合语义级SA的多种模式。我们通过在灾难响应机器人中搜索和救援(SAR)的应用示例来演示该框架。我们提出了一套“环境语义指标”,可以反映机器人遇到不同场景时各种不同类型的语义信息,如风险指标或人类活动迹象(SHA)。基于这些指标,我们提出了一个描述环境总体情况的度量,称为“情景语义丰富度”(Situational Semantic abundance, SSR)。这个指标结合了多个语义指标来总结整体情况。SSR表明是否遇到了信息丰富、复杂的情况,这可能需要机器人和人类进行高级推理,因此需要专家操作员的注意。该框架在模拟灾难响应环境中的Jackal机器人上进行了测试。实验结果表明,所提出的语义指标对不同场景下语义信息的不同形态变化非常敏感,SSR度量反映了所遇到情境下的整体语义变化。
{"title":"A framework for semantics-based situational awareness during mobile robot deployments.","authors":"Tianshu Ruan, Aniketh Ramesh, Hao Wang, Alix Johnstone-Morfoisse, Gokcenur Altindal, Paul Norman, Grigoris Nikolaou, Rustam Stolkin, Manolis Chiou","doi":"10.3389/frobt.2025.1694123","DOIUrl":"10.3389/frobt.2025.1694123","url":null,"abstract":"<p><p>Deployment of robots into hazardous environments typically involves a \"human-robot teaming\" (HRT) paradigm, in which a human supervisor interacts with a remotely operating robot inside the hazardous zone. Situational awareness (SA) is vital for enabling HRT, to support navigation, planning, and decision-making. In this paper, we explore issues of higher-level \"semantic\" information and understanding in SA. In semi-autonomous or variable-autonomy paradigms, different types of semantic information may be important, in different ways, for both the human operator and an autonomous agent controlling the robot. We propose a generalizable framework for acquiring and combining multiple modalities of semantic-level SA during remote deployments of mobile robots. We demonstrate the framework with an example application of search and rescue (SAR) in disaster-response robotics. We propose a set of \"environment semantic indicators\" that can reflect a variety of different types of semantic information, such as indicators of risk or signs of human activity (SHA), as the robot encounters different scenes. Based on these indicators, we propose a metric to describe the overall situation of the environment, called \"Situational Semantic Richness\" (SSR). This metric combines multiple semantic indicators to summarize the overall situation. The SSR indicates whether an information-rich, complex situation has been encountered, which may require advanced reasoning by robots and humans and, hence, the attention of the expert human operator. The framework is tested on a Jackal robot in a mock-up disaster-response environment. Experimental results demonstrate that the proposed semantic indicators are sensitive to changes in different modalities of semantic information in different scenes, and the SSR metric reflects the overall semantic changes in the situations encountered.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1694123"},"PeriodicalIF":3.0,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12672245/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145679066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial: Human-in-the-loop paradigm for assistive robotics. 编辑:辅助机器人的人在环范式。
IF 3 Q2 ROBOTICS Pub Date : 2025-11-19 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1718326
Francesca Cordella, Dario Farina, Loredana Zollo
{"title":"Editorial: Human-in-the-loop paradigm for assistive robotics.","authors":"Francesca Cordella, Dario Farina, Loredana Zollo","doi":"10.3389/frobt.2025.1718326","DOIUrl":"https://doi.org/10.3389/frobt.2025.1718326","url":null,"abstract":"","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1718326"},"PeriodicalIF":3.0,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12672270/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145679076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Symbolic feedback for transparent fault anticipation in neuroergonomic brain-machine interfaces. 神经工效学脑机接口中透明故障预测的符号反馈。
IF 3 Q2 ROBOTICS Pub Date : 2025-11-18 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1656642
Abdelaali Mahrouk

Background: Brain-Machine Interfaces (BMIs) increasingly mediate human interaction with assistive systems, yet remain sensitive to internal cognitive divergence. Subtle shifts in user intention-due to fatigue, overload, or schema conflict-may affect system reliability. While decoding accuracy has improved, most systems still lack mechanisms to communicate internal uncertainty or reasoning dynamics in real time.

Objective: We present NECAP-Interaction, a neuro-symbolic architecture that explores the potential of symbolic feedback to support real-time human-AI alignment. The framework aims to improve neuroergonomic transparency by integrating symbolic trace generation into the BMI control pipeline.

Methods: All evaluations were conducted using high-fidelity synthetic agents across three simulation tasks (motor control, visual attention, cognitive inhibition). NECAP-Interaction generates symbolic descriptors of epistemic shifts, supporting co-adaptive human-system communication. We report trace clarity, response latency, and symbolic coverage using structured replay analysis and interpretability metrics.

Results: NECAP-Interaction anticipated behavioral divergence up to 2.3 ± 0.4 s before error onset and maintained over 90% symbolic trace interpretability across uncertainty tiers. In simulated overlays, symbolic feedback improved user comprehension of system states and reduced latency to trust collapse compared to baseline architectures (CNN, RNN).

Conclusion: Cognitive interpretability is not merely a technical concern-it is a design priority. By embedding symbolic introspection into BMI workflows, NECAP-Interaction supports user transparency and co-regulated interaction in cognitively demanding contexts. These findings contribute to the development of human-centered neurotechnologies where explainability is experienced in real time.

背景:脑机接口(bmi)越来越多地调解人类与辅助系统的互动,但仍然对内部认知分歧敏感。用户意图的细微变化(由于疲劳、过载或模式冲突)可能会影响系统可靠性。虽然解码精度有所提高,但大多数系统仍然缺乏实时沟通内部不确定性或推理动态的机制。目的:我们提出NECAP-Interaction,这是一个神经符号架构,探索符号反馈支持实时人机对齐的潜力。该框架旨在通过将符号轨迹生成集成到BMI控制管道中来提高神经工效学的透明度。方法:采用高保真合成试剂对三个模拟任务(运动控制、视觉注意、认知抑制)进行评估。NECAP-Interaction生成认知转移的符号描述符,支持共同适应的人类系统通信。我们使用结构化重播分析和可解释性指标报告跟踪清晰度、响应延迟和符号覆盖率。结果:NECAP-Interaction预测错误发生前2.3±0.4 s的行为差异,并在不确定性层保持超过90%的符号痕迹可解释性。在模拟叠加中,与基线架构相比,符号反馈提高了用户对系统状态的理解,减少了信任崩溃的延迟(CNN, RNN)。结论:认知可解释性不仅仅是一个技术问题——它是一个设计优先级。通过将符号自省嵌入到BMI工作流中,NECAP-Interaction在认知要求高的环境中支持用户透明度和协同调节交互。这些发现有助于以人为中心的神经技术的发展,在这种技术中,可解释性是实时体验的。
{"title":"Symbolic feedback for transparent fault anticipation in neuroergonomic brain-machine interfaces.","authors":"Abdelaali Mahrouk","doi":"10.3389/frobt.2025.1656642","DOIUrl":"10.3389/frobt.2025.1656642","url":null,"abstract":"<p><strong>Background: </strong>Brain-Machine Interfaces (BMIs) increasingly mediate human interaction with assistive systems, yet remain sensitive to internal cognitive divergence. Subtle shifts in user intention-due to fatigue, overload, or schema conflict-may affect system reliability. While decoding accuracy has improved, most systems still lack mechanisms to communicate internal uncertainty or reasoning dynamics in real time.</p><p><strong>Objective: </strong>We present NECAP-Interaction, a neuro-symbolic architecture that explores the potential of symbolic feedback to support real-time human-AI alignment. The framework aims to improve neuroergonomic transparency by integrating symbolic trace generation into the BMI control pipeline.</p><p><strong>Methods: </strong>All evaluations were conducted using high-fidelity synthetic agents across three simulation tasks (motor control, visual attention, cognitive inhibition). NECAP-Interaction generates symbolic descriptors of epistemic shifts, supporting co-adaptive human-system communication. We report trace clarity, response latency, and symbolic coverage using structured replay analysis and interpretability metrics.</p><p><strong>Results: </strong>NECAP-Interaction anticipated behavioral divergence up to 2.3 ± 0.4 s before error onset and maintained over 90% symbolic trace interpretability across uncertainty tiers. In simulated overlays, symbolic feedback improved user comprehension of system states and reduced latency to trust collapse compared to baseline architectures (CNN, RNN).</p><p><strong>Conclusion: </strong>Cognitive interpretability is not merely a technical concern-it is a design priority. By embedding symbolic introspection into BMI workflows, NECAP-Interaction supports user transparency and co-regulated interaction in cognitively demanding contexts. These findings contribute to the development of human-centered neurotechnologies where explainability is experienced in real time.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1656642"},"PeriodicalIF":3.0,"publicationDate":"2025-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12668942/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145670244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Benchmarking complete-to-partial point cloud registration techniques for laparoscopic surgery. 腹腔镜手术中完全到部分点云配准技术的标杆分析。
IF 3 Q2 ROBOTICS Pub Date : 2025-11-17 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1702360
Alberto Neri, Veronica Penza, Nazim Haouchine, Leonardo S Mattos

Objective: Registering a preoperative 3D model of an organ with its actual anatomy viewed from an intraoperative video is a fundamental challenge in computer-assisted surgery, especially for surgical augmented reality. To address this, we present a benchmark of state-of-the-art deep learning point-cloud registration methods, offering a transparent evaluation of their generalizability to surgical scenarios and establishing a robust guideline for developing advanced non-rigid algorithms.

Methods: We systematically evaluate traditional and deep learning GMM-based, correspondence-based, correspondence-free, matching-based, and liver-specific point cloud registration approaches on two surgical datasets: a deformed IRCAD liver set and DePoll dataset. We also propose our complete-to-partial point cloud registration framework that leverages keypoint extraction, overlap estimation, and a Transformer-based architecture, culminating in competitive registration results.

Results: Experimental evaluations on deformed IRCAD tests reveal that most deep learning methods achieve good registration performances with TRE<10 mm, MAE(R) < 4 and MAE(t)<5 mm. On DePoll, however, performance drops dramatically due to the large deformations.

Conclusion: In conclusion, deep-learning rigid registration methods remain reliable under small deformations and varying partiality but lose accuracy when faced with severe non-rigid changes. To overcome this, future work should focus on building non-rigid registration architectures that preserve the strengths of self-, cross-attention and overlap modules while enhancing correspondence estimation to handle large deformations in laparoscopic surgery.

目的:在计算机辅助手术中,尤其是增强现实手术中,通过术中视频记录器官的术前3D模型和实际解剖结构是一个基本的挑战。为了解决这个问题,我们提出了一个最先进的深度学习点云配准方法的基准,提供了一个透明的评估其在外科手术场景中的通用性,并为开发先进的非刚性算法建立了一个强大的指南。方法:我们在两个手术数据集上系统地评估了传统的和深度学习的基于gmm的、基于对应的、无对应的、基于匹配的和肝脏特异性的点云配准方法:一个变形的IRCAD肝脏集和DePoll数据集。我们还提出了我们的完整到部分点云配准框架,该框架利用关键点提取、重叠估计和基于transformer的架构,最终获得竞争性的配准结果。结果:对变形IRCAD测试的实验评估表明,大多数深度学习方法在trem下都能获得良好的配准性能。结论:综上所述,深度学习刚性配准方法在小变形和不同偏度下仍然是可靠的,但在面对严重的非刚性变化时失去准确性。为了克服这一问题,未来的工作应侧重于构建非刚性配准架构,以保留自关注、交叉关注和重叠模块的优势,同时增强对应估计以处理腹腔镜手术中的大变形。
{"title":"Benchmarking complete-to-partial point cloud registration techniques for laparoscopic surgery.","authors":"Alberto Neri, Veronica Penza, Nazim Haouchine, Leonardo S Mattos","doi":"10.3389/frobt.2025.1702360","DOIUrl":"10.3389/frobt.2025.1702360","url":null,"abstract":"<p><strong>Objective: </strong>Registering a preoperative 3D model of an organ with its actual anatomy viewed from an intraoperative video is a fundamental challenge in computer-assisted surgery, especially for surgical augmented reality. To address this, we present a benchmark of state-of-the-art deep learning point-cloud registration methods, offering a transparent evaluation of their generalizability to surgical scenarios and establishing a robust guideline for developing advanced non-rigid algorithms.</p><p><strong>Methods: </strong>We systematically evaluate traditional and deep learning GMM-based, correspondence-based, correspondence-free, matching-based, and liver-specific point cloud registration approaches on two surgical datasets: a deformed IRCAD liver set and DePoll dataset. We also propose our complete-to-partial point cloud registration framework that leverages keypoint extraction, overlap estimation, and a Transformer-based architecture, culminating in competitive registration results.</p><p><strong>Results: </strong>Experimental evaluations on deformed IRCAD tests reveal that most deep learning methods achieve good registration performances with TRE<10 mm, MAE(R) < 4 and MAE(t)<5 mm. On DePoll, however, performance drops dramatically due to the large deformations.</p><p><strong>Conclusion: </strong>In conclusion, deep-learning rigid registration methods remain reliable under small deformations and varying partiality but lose accuracy when faced with severe non-rigid changes. To overcome this, future work should focus on building non-rigid registration architectures that preserve the strengths of self-, cross-attention and overlap modules while enhancing correspondence estimation to handle large deformations in laparoscopic surgery.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1702360"},"PeriodicalIF":3.0,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12666070/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145662284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative analysis of deep Q-learning algorithms for object throwing using a robot manipulator. 基于机械臂抛物的深度q -学习算法对比分析。
IF 3 Q2 ROBOTICS Pub Date : 2025-11-14 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1567211
Mohammad Al Homsi, Maja Trumić, Adriano Fagiolini, Giansalvo Cirrincione

Recent advances in artificial intelligence (AI) have attracted significant attention due to AI's ability to solve complex problems and the rapid development of learning algorithms and computational power. Among the many AI techniques, transformers stand out for their flexible architectures and high computational capacity. Unlike traditional neural networks, transformers use mechanisms such as self-attention with positional encoding, which enable them to effectively capture long-range dependencies in sequential and spatial data. This paper presents a comparison of various deep Q-learning algorithms and proposes two original techniques that use self-attention into deep Q-learning. The first technique is structured self-attention with deep Q-learning, and the second uses multi-head attention with deep Q-learning. These methods are compared with different types of deep Q-learning and other temporal techniques in uncertain tasks, such as throwing objects to unknown targets. The performance of these algorithms is evaluated in a simplified environment, where the task involves throwing a ball using a robotic arm manipulator. This setup provides a controlled scenario to analyze the algorithms' efficiency and effectiveness in solving dynamic control problems. Additional constraints are introduced to evaluate performance under more complex conditions, such as a joint lock or the presence of obstacles like a wall near the robot or the target. The output of the algorithm includes the correct joint configurations and trajectories for throwing to unknown target positions. The use of multi-head attention has enhanced the robot's ability to prioritize and interact with critical environmental features. The paper also includes a comparison of temporal difference algorithms to address constraints on the robot's joints. These algorithms are capable of finding solutions within the limitations of existing hardware, enabling robots to interact intelligently and autonomously with their environment.

由于人工智能解决复杂问题的能力以及学习算法和计算能力的快速发展,人工智能(AI)的最新进展引起了人们的极大关注。在众多人工智能技术中,变压器以其灵活的架构和高计算能力脱颖而出。与传统的神经网络不同,变压器使用自我关注和位置编码等机制,使它们能够有效地捕获序列和空间数据中的远程依赖关系。本文对各种深度q学习算法进行了比较,并提出了两种将自注意应用于深度q学习的原始技术。第一种技术是深度q学习的结构化自我注意,第二种技术是深度q学习的多头注意。这些方法与不同类型的深度q学习和其他不确定任务中的时间技术进行了比较,例如向未知目标投掷物体。这些算法的性能在一个简化的环境中进行了评估,其中的任务涉及使用机械臂操纵器投掷球。该设置提供了一个受控场景来分析算法在解决动态控制问题方面的效率和有效性。在更复杂的条件下,例如关节锁或障碍物(如机器人或目标附近的墙壁),引入了额外的约束来评估性能。该算法的输出包括正确的关节构型和抛向未知目标位置的轨迹。多头注意力的使用增强了机器人对关键环境特征进行优先排序和交互的能力。本文还比较了用于解决机器人关节约束的时间差分算法。这些算法能够在现有硬件的限制下找到解决方案,使机器人能够智能自主地与环境进行交互。
{"title":"Comparative analysis of deep Q-learning algorithms for object throwing using a robot manipulator.","authors":"Mohammad Al Homsi, Maja Trumić, Adriano Fagiolini, Giansalvo Cirrincione","doi":"10.3389/frobt.2025.1567211","DOIUrl":"10.3389/frobt.2025.1567211","url":null,"abstract":"<p><p>Recent advances in artificial intelligence (AI) have attracted significant attention due to AI's ability to solve complex problems and the rapid development of learning algorithms and computational power. Among the many AI techniques, transformers stand out for their flexible architectures and high computational capacity. Unlike traditional neural networks, transformers use mechanisms such as self-attention with positional encoding, which enable them to effectively capture long-range dependencies in sequential and spatial data. This paper presents a comparison of various deep Q-learning algorithms and proposes two original techniques that use self-attention into deep Q-learning. The first technique is structured self-attention with deep Q-learning, and the second uses multi-head attention with deep Q-learning. These methods are compared with different types of deep Q-learning and other temporal techniques in uncertain tasks, such as throwing objects to unknown targets. The performance of these algorithms is evaluated in a simplified environment, where the task involves throwing a ball using a robotic arm manipulator. This setup provides a controlled scenario to analyze the algorithms' efficiency and effectiveness in solving dynamic control problems. Additional constraints are introduced to evaluate performance under more complex conditions, such as a joint lock or the presence of obstacles like a wall near the robot or the target. The output of the algorithm includes the correct joint configurations and trajectories for throwing to unknown target positions. The use of multi-head attention has enhanced the robot's ability to prioritize and interact with critical environmental features. The paper also includes a comparison of temporal difference algorithms to address constraints on the robot's joints. These algorithms are capable of finding solutions within the limitations of existing hardware, enabling robots to interact intelligently and autonomously with their environment.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1567211"},"PeriodicalIF":3.0,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12660075/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145649677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial: Dialogue with robots: constructive approaches for understanding communication. 社论:与机器人对话:理解沟通的建设性方法。
IF 3 Q2 ROBOTICS Pub Date : 2025-11-14 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1725162
Casey Kennington, Koji Inoue, Randy Gomez, Peter Ford-Dominey
{"title":"Editorial: Dialogue with robots: constructive approaches for understanding communication.","authors":"Casey Kennington, Koji Inoue, Randy Gomez, Peter Ford-Dominey","doi":"10.3389/frobt.2025.1725162","DOIUrl":"https://doi.org/10.3389/frobt.2025.1725162","url":null,"abstract":"","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1725162"},"PeriodicalIF":3.0,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12660074/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145649649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hypothesis on the functional advantages of the selection-broadcast cycle structure: global workspace theory and dealing with a real-time world. 选择-广播周期结构功能优势的假设:全局工作空间理论与实时世界的处理。
IF 3 Q2 ROBOTICS Pub Date : 2025-11-13 eCollection Date: 2025-01-01 DOI: 10.3389/frobt.2025.1607190
Junya Nakanishi, Jun Baba, Yuichiro Yoshikawa, Hiroko Kamide, Hiroshi Ishiguro

This paper discusses the functional advantages of the Selection-Broadcast Cycle structure proposed by Global Workspace Theory (GWT), inspired by human consciousness, particularly focusing on its applicability to artificial intelligence and robotics in dynamic, real-time scenarios. While previous studies often examined the Selection and Broadcast processes independently, this research emphasizes their combined cyclic structure and the resulting benefits for real-time cognitive systems. Specifically, the paper identifies three primary benefits: Dynamic Thinking Adaptation, Experience-Based Adaptation, and Immediate Real-Time Adaptation. This work highlights GWT's potential as a cognitive architecture suitable for sophisticated decision-making and adaptive performance in unsupervised, dynamic environments. It suggests new directions for the development and implementation of robust, general-purpose AI and robotics systems capable of managing complex, real-world tasks.

本文讨论了全球工作空间理论(GWT)提出的选择-广播周期结构的功能优势,该结构受到人类意识的启发,特别关注其在动态、实时场景下对人工智能和机器人的适用性。虽然以前的研究通常是单独考察选择和广播过程,但本研究强调了它们的组合循环结构及其对实时认知系统的好处。具体而言,本文确定了三个主要优势:动态思维适应、基于经验的适应和即时实时适应。这项工作突出了GWT作为一种认知架构的潜力,适合于在无监督的动态环境中进行复杂的决策和自适应性能。它为开发和实现强大的、通用的人工智能和机器人系统提供了新的方向,这些系统能够管理复杂的、现实世界的任务。
{"title":"Hypothesis on the functional advantages of the selection-broadcast cycle structure: global workspace theory and dealing with a real-time world.","authors":"Junya Nakanishi, Jun Baba, Yuichiro Yoshikawa, Hiroko Kamide, Hiroshi Ishiguro","doi":"10.3389/frobt.2025.1607190","DOIUrl":"10.3389/frobt.2025.1607190","url":null,"abstract":"<p><p>This paper discusses the functional advantages of the Selection-Broadcast Cycle structure proposed by Global Workspace Theory (GWT), inspired by human consciousness, particularly focusing on its applicability to artificial intelligence and robotics in dynamic, real-time scenarios. While previous studies often examined the Selection and Broadcast processes independently, this research emphasizes their combined cyclic structure and the resulting benefits for real-time cognitive systems. Specifically, the paper identifies three primary benefits: Dynamic Thinking Adaptation, Experience-Based Adaptation, and Immediate Real-Time Adaptation. This work highlights GWT's potential as a cognitive architecture suitable for sophisticated decision-making and adaptive performance in unsupervised, dynamic environments. It suggests new directions for the development and implementation of robust, general-purpose AI and robotics systems capable of managing complex, real-world tasks.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1607190"},"PeriodicalIF":3.0,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12657164/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145649691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Frontiers in Robotics and AI
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1