首页 > 最新文献

IEEE Robotics and Automation Letters最新文献

英文 中文
Closed-loop Control of Steerable Balloon Endoscopes for Robot-assisted Transcatheter Intracardiac Procedures. 机器人辅助心内导管手术中可操纵球囊内窥镜的闭环控制。
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-04-01 Epub Date: 2026-02-13 DOI: 10.1109/lra.2026.3664592
Max McCandless, Jonathan Hamid, Sammy Elmariah, Nathaniel Langer, Pierre E Dupont

To move away from open-heart surgery towards safer transcatheter procedures, there is a growing need for improved imaging techniques and robotic solutions to enable simple, accurate tool navigation. Common imaging modalities, such as fluoroscopy and ultrasound, have limitations that can be overcome using cardioscopy, i.e., direct optical visualization inside the beating heart. We present a cardioscope designed as a steerable balloon. As a balloon, it can be collapsed to pass through the vasculature and subsequently inflated inside the heart for visualization and tool delivery through an integrated working channel. Through careful design of balloon wall thickness, a single input, balloon inflation pressure, is used to sequentially independently control two outputs, balloon diameter (corresponding to field of view diameter) and balloon bending angle (enabling precise working channel positioning). This balloon technology can be tuned to produce cardioscopes designed for a range of intracardiac tasks. To illustrate this approach, a balloon design is presented for the specific task of aortic leaflet laceration. Image-based closed-loop control of bending angle is also demonstrated as a means of enabling stable orientation control during tool insertion and removal.

为了从心脏直视手术转向更安全的经导管手术,越来越需要改进成像技术和机器人解决方案,以实现简单、准确的工具导航。常见的成像方式,如透视和超声,有局限性,可以克服使用心镜检查,即直接光学可视化内部跳动的心脏。我们提出了一种设计成可操纵气球的心镜。作为一个气球,它可以塌陷穿过血管系统,随后在心脏内部膨胀,以便通过一个集成的工作通道进行可视化和工具输送。通过对球囊壁厚的精心设计,采用单输入球囊充气压力,依次独立控制球囊直径(对应视场直径)和球囊弯曲角度(实现精确的工作通道定位)两个输出。这种球囊技术可以调整为生产用于一系列心脏内任务的心镜。为了说明这种方法,提出了一种球囊设计用于主动脉小叶撕裂伤的特定任务。基于图像的弯曲角闭环控制也被证明是在刀具插入和取出过程中实现稳定方向控制的一种手段。
{"title":"Closed-loop Control of Steerable Balloon Endoscopes for Robot-assisted Transcatheter Intracardiac Procedures.","authors":"Max McCandless, Jonathan Hamid, Sammy Elmariah, Nathaniel Langer, Pierre E Dupont","doi":"10.1109/lra.2026.3664592","DOIUrl":"10.1109/lra.2026.3664592","url":null,"abstract":"<p><p>To move away from open-heart surgery towards safer transcatheter procedures, there is a growing need for improved imaging techniques and robotic solutions to enable simple, accurate tool navigation. Common imaging modalities, such as fluoroscopy and ultrasound, have limitations that can be overcome using cardioscopy, i.e., direct optical visualization inside the beating heart. We present a cardioscope designed as a steerable balloon. As a balloon, it can be collapsed to pass through the vasculature and subsequently inflated inside the heart for visualization and tool delivery through an integrated working channel. Through careful design of balloon wall thickness, a single input, balloon inflation pressure, is used to sequentially independently control two outputs, balloon diameter (corresponding to field of view diameter) and balloon bending angle (enabling precise working channel positioning). This balloon technology can be tuned to produce cardioscopes designed for a range of intracardiac tasks. To illustrate this approach, a balloon design is presented for the specific task of aortic leaflet laceration. Image-based closed-loop control of bending angle is also demonstrated as a means of enabling stable orientation control during tool insertion and removal.</p>","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 4","pages":"4211-4218"},"PeriodicalIF":5.3,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12994373/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147480592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic-ICP: Doppler-Aware Iterative Closest Point Registration for Dynamic Scenes 动态icp:动态场景的多普勒感知迭代最近点配准
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-03-03 DOI: 10.1109/LRA.2026.3669808
Dong Wang;Daniel Casado Herraez;Stefan May;Andreas Nüchter
Reliable odometry in highly dynamic environments remains challenging when it relies on ICP-based registration: ICP assumes near-static scenes and degrades in repetitive or low-texture geometry. We introduce Dynamic-ICP, a Doppler-aware registration framework. The method (i) estimates ego translational velocity from per-point Doppler velocity via robust regression and builds a velocity filter, (ii) clusters dynamic objects and reconstructs object-wise translational velocities from ego-compensated radial measurements, (iii) predicts dynamic points with a constant-velocity model, and (iv) aligns scans using a compact objective that combines point-to-plane geometry residual with a translation-invariant, rotation-only Doppler residual. The approach requires no external sensors or sensor–vehicle calibration and operates directly on FMCW LiDAR range and Doppler velocities. We evaluate Dynamic-ICP on three real-world datasets-HeRCULES, HeLiPR, AevaScenes-focusing on highly dynamic scenes. Dynamic-ICP consistently improves rotational stability and translation accuracy over the state-of-the-art methods.
在高度动态环境中,可靠的里程测量仍然具有挑战性,因为它依赖于基于ICP的配准:ICP假设接近静态的场景,并且在重复或低纹理几何中会退化。我们介绍动态icp,一个多普勒感知注册框架。该方法(i)通过鲁棒回归从每个点的多普勒速度估计自我平移速度,并建立速度滤波器,(ii)聚类动态物体,并从自我补偿的径向测量中重建物体方向的平移速度,(iii)用恒定速度模型预测动态点,(iv)使用紧凑的物镜将点到平面的几何残差与平移不变的、仅旋转的多普勒残差相结合,对扫描进行对齐。该方法不需要外部传感器或传感器车辆校准,直接根据FMCW激光雷达的距离和多普勒速度进行操作。我们在三个真实世界的数据集(hercules, HeLiPR, aevascense)上评估了dynamic - icp,重点关注高度动态的场景。动态icp不断提高旋转稳定性和翻译精度的国家的最先进的方法。
{"title":"Dynamic-ICP: Doppler-Aware Iterative Closest Point Registration for Dynamic Scenes","authors":"Dong Wang;Daniel Casado Herraez;Stefan May;Andreas Nüchter","doi":"10.1109/LRA.2026.3669808","DOIUrl":"https://doi.org/10.1109/LRA.2026.3669808","url":null,"abstract":"Reliable odometry in highly dynamic environments remains challenging when it relies on ICP-based registration: ICP assumes near-static scenes and degrades in repetitive or low-texture geometry. We introduce Dynamic-ICP, a Doppler-aware registration framework. The method (i) estimates ego translational velocity from per-point Doppler velocity via robust regression and builds a velocity filter, (ii) clusters dynamic objects and reconstructs object-wise translational velocities from ego-compensated radial measurements, (iii) predicts dynamic points with a constant-velocity model, and (iv) aligns scans using a compact objective that combines point-to-plane geometry residual with a translation-invariant, rotation-only Doppler residual. The approach requires no external sensors or sensor–vehicle calibration and operates directly on FMCW LiDAR range and Doppler velocities. We evaluate Dynamic-ICP on three real-world datasets-HeRCULES, HeLiPR, AevaScenes-focusing on highly dynamic scenes. Dynamic-ICP consistently improves rotational stability and translation accuracy over the state-of-the-art methods.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 4","pages":"5174-5181"},"PeriodicalIF":5.3,"publicationDate":"2026-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11419773","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147440656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Valve-Less Electro-Hydrostatic Powered Prosthetic Foot to Improve the Power Efficiency During Walking 一种无阀电-静压动力假肢足提高行走时的动力效率
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-03-02 DOI: 10.1109/LRA.2026.3668985
Bowen Li;Xin Li;Hongguang Xu;Qitao Huang
Hydraulic systems have been widely applied in lower-limb prostheses, primarily for their compact actuation and inherent damping capability. However, when applied to powered prosthetic feet, valves and other damping elements cause unavoidable energy dissipation, thereby constraining their power density. To address this limitation, we propose a valve-less electro-hydrostatic powered prosthetic foot aimed at enhancing power efficiency. In addition, a gas accumulator’s nonlinear elasticity is considered, forming a hydraulic series elastic actuator with passive stiffness comparable to the human ankle during walking. This configuration lowers the motor’s required speed, consequently reducing its actual electrical power. The proposed design was evaluated through a treadmill walking experiment with a non-impaired subject walking at 1.1 m/s. The results showed that the prosthetic foot provided sufficient positive power relative to human reference values. Moreover, the prosthesis achieved a peak output power of 206.3 $pm$ 15.0 W, while the corresponding motor electrical power was only 147.5 $pm$ 29.5 W. Our study demonstrates that the electro-hydrostatic system holds significant potential for enhancing the power density of powered prosthetic feet.
液压系统在下肢假肢中得到了广泛的应用,主要是由于其紧凑的驱动和固有的阻尼能力。然而,当应用于动力假肢脚时,阀门和其他阻尼元件不可避免地会产生能量耗散,从而限制了它们的功率密度。为了解决这一限制,我们提出了一种无阀电静压动力假肢足,旨在提高功率效率。此外,还考虑了蓄能器的非线性弹性,形成了一种液压串联弹性驱动器,其被动刚度可与人的踝关节相媲美。这种配置降低了电机所需的速度,从而降低了其实际电力。我们通过跑步机行走实验对所提出的设计进行了评估,实验对象为非损伤受试者,行走速度为1.1 m/s。结果表明,假肢足相对于人体参考值提供了足够的正功率。此外,假体的峰值输出功率为206.3 $pm$ 15.0 W,而相应的电机功率仅为147.5 $pm$ 29.5 W。我们的研究表明,电静液系统在提高动力假肢脚的功率密度方面具有重要的潜力。
{"title":"A Valve-Less Electro-Hydrostatic Powered Prosthetic Foot to Improve the Power Efficiency During Walking","authors":"Bowen Li;Xin Li;Hongguang Xu;Qitao Huang","doi":"10.1109/LRA.2026.3668985","DOIUrl":"https://doi.org/10.1109/LRA.2026.3668985","url":null,"abstract":"Hydraulic systems have been widely applied in lower-limb prostheses, primarily for their compact actuation and inherent damping capability. However, when applied to powered prosthetic feet, valves and other damping elements cause unavoidable energy dissipation, thereby constraining their power density. To address this limitation, we propose a valve-less electro-hydrostatic powered prosthetic foot aimed at enhancing power efficiency. In addition, a gas accumulator’s nonlinear elasticity is considered, forming a hydraulic series elastic actuator with passive stiffness comparable to the human ankle during walking. This configuration lowers the motor’s required speed, consequently reducing its actual electrical power. The proposed design was evaluated through a treadmill walking experiment with a non-impaired subject walking at 1.1 m/s. The results showed that the prosthetic foot provided sufficient positive power relative to human reference values. Moreover, the prosthesis achieved a peak output power of 206.3 <inline-formula><tex-math>$pm$</tex-math></inline-formula> 15.0 W, while the corresponding motor electrical power was only 147.5 <inline-formula><tex-math>$pm$</tex-math></inline-formula> 29.5 W. Our study demonstrates that the electro-hydrostatic system holds significant potential for enhancing the power density of powered prosthetic feet.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 4","pages":"4833-4840"},"PeriodicalIF":5.3,"publicationDate":"2026-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sim2Real Domain Shifting: Hyper-Realistic Data Generation for Object Segmentation Sim2Real领域转移:对象分割的超现实数据生成
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-02-27 DOI: 10.1109/LRA.2026.3668590
Han Zheng;Rong Xiong;Yue Wang;Jun Wu
Object segmentation is a critical prerequisite for robotic tasks such as grasping and assembly. While high accuracy and reliability typically require extensive real-world data, its collection and annotation are costly. Although synthetic data generated through physically-based rendering mitigates this need, a persistent domain gap hinders model performance. This letter introduces a novel hyper-realistic synthetic data generation method to mitigate this gap with minimal real-world data. By extracting domain information from limited real scenes, we shift synthetic data toward the target domain. Realistic backgrounds are synthesized using generative models, while a two-stage style transfer, guided by anchor image styles, adapts foregrounds. Our method achieves performance comparable to models trained on thousands of real images using as few as one real image, significantly reducing the reliance on large-scale data collection.
物体分割是机器人完成抓取和装配等任务的关键先决条件。虽然高准确性和可靠性通常需要大量的真实世界数据,但其收集和注释的成本很高。尽管通过基于物理的呈现生成的合成数据减轻了这种需求,但是持久的领域差距阻碍了模型的性能。这封信介绍了一种新的超现实合成数据生成方法,以最小的现实世界数据来缓解这种差距。通过从有限的真实场景中提取领域信息,将合成数据向目标领域转移。使用生成模型合成现实背景,而由锚图像样式引导的两阶段风格转移则适应前景。我们的方法达到了与使用少量真实图像在数千个真实图像上训练的模型相当的性能,显着减少了对大规模数据收集的依赖。
{"title":"Sim2Real Domain Shifting: Hyper-Realistic Data Generation for Object Segmentation","authors":"Han Zheng;Rong Xiong;Yue Wang;Jun Wu","doi":"10.1109/LRA.2026.3668590","DOIUrl":"https://doi.org/10.1109/LRA.2026.3668590","url":null,"abstract":"Object segmentation is a critical prerequisite for robotic tasks such as grasping and assembly. While high accuracy and reliability typically require extensive real-world data, its collection and annotation are costly. Although synthetic data generated through physically-based rendering mitigates this need, a persistent domain gap hinders model performance. This letter introduces a novel hyper-realistic synthetic data generation method to mitigate this gap with minimal real-world data. By extracting domain information from limited real scenes, we shift synthetic data toward the target domain. Realistic backgrounds are synthesized using generative models, while a two-stage style transfer, guided by anchor image styles, adapts foregrounds. Our method achieves performance comparable to models trained on thousands of real images using as few as one real image, significantly reducing the reliance on large-scale data collection.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 4","pages":"4761-4768"},"PeriodicalIF":5.3,"publicationDate":"2026-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
UNITE-NBV: Uncertainty-Driven and Information-Enhanced Gain Estimation for Next Best View UNITE-NBV:次优视图的不确定性驱动和信息增强增益估计
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-02-27 DOI: 10.1109/LRA.2026.3668579
Kaice Jiang;Qingxiao Wu;Sicong Li;Feng Zhu;Yingjian Fang;Jianxin Cai
Next Best View (NBV) algorithms are a critical area of research in 3D reconstruction. They aim to efficiently reconstruct 3D scenes by maximizing information gain from the next optimal viewpoint. However, current NBV methods often neglect the importance of high-quality candidate view sampling, leading to inconsistent quality of the candidate viewpoint set. Moreover, these methods frequently encounter difficulties in extracting effective information for accurate information gain estimation, especially when reconstructing complex objects or large-scale scenes. To address these challenges, we propose UNITE-NBV. Our method achieves accurate next best view selection across various scenes by effectively uniting the proposed high-quality candidate view sampling strategy and information gain estimation network. Specifically, we introduce Spherical Uncertainty Sampling (SUS), a novel candidate viewpoint sampling method. This method calculates the scene’s uncertainty field and maps it into a spherical sampling space. Within this space, the uncertainty distribution guides the sampling of high-quality candidate viewpoints. Additionally, we propose a Multi-Expert Information Gain Network (MEIGN) that performs disentangled encoding and expert encoder sequence processing on features extracted from the reconstructed scene. These processed features are then dynamically fused using sparse gating and Multi-Head Self-Attention, enabling accurate information gain estimation for candidate views. Extensive experimental results on both the small-object dataset ShapeNet and various large-scale 3D scene datasets demonstrate the effectiveness and superior performance of our proposed method. The code will be released.
次优视图(NBV)算法是三维重建中的一个重要研究领域。他们的目标是通过最大化下一个最佳视点的信息增益来有效地重建3D场景。然而,目前的NBV方法往往忽视了高质量候选视图采样的重要性,导致候选视点集的质量不一致。此外,这些方法在提取有效信息以获得准确的信息增益估计方面经常遇到困难,特别是在重建复杂物体或大规模场景时。为了应对这些挑战,我们提出了UNITE-NBV。该方法将提出的高质量候选视图采样策略和信息增益估计网络有效地结合起来,实现了在不同场景下的次优视图选择。具体来说,我们介绍了一种新的候选视点采样方法——球面不确定采样(SUS)。该方法计算场景的不确定性场,并将其映射到球面采样空间。在这个空间内,不确定性分布指导高质量候选视点的采样。此外,我们提出了一种多专家信息增益网络(MEIGN),该网络对从重构场景中提取的特征进行解纠缠编码和专家编码器序列处理。然后使用稀疏门控和多头自关注对这些处理后的特征进行动态融合,从而实现对候选视图的准确信息增益估计。在小目标数据集ShapeNet和各种大规模3D场景数据集上的大量实验结果证明了我们提出的方法的有效性和优越的性能。代码将被发布。
{"title":"UNITE-NBV: Uncertainty-Driven and Information-Enhanced Gain Estimation for Next Best View","authors":"Kaice Jiang;Qingxiao Wu;Sicong Li;Feng Zhu;Yingjian Fang;Jianxin Cai","doi":"10.1109/LRA.2026.3668579","DOIUrl":"https://doi.org/10.1109/LRA.2026.3668579","url":null,"abstract":"Next Best View (NBV) algorithms are a critical area of research in 3D reconstruction. They aim to efficiently reconstruct 3D scenes by maximizing information gain from the next optimal viewpoint. However, current NBV methods often neglect the importance of high-quality candidate view sampling, leading to inconsistent quality of the candidate viewpoint set. Moreover, these methods frequently encounter difficulties in extracting effective information for accurate information gain estimation, especially when reconstructing complex objects or large-scale scenes. To address these challenges, we propose UNITE-NBV. Our method achieves accurate next best view selection across various scenes by effectively uniting the proposed high-quality candidate view sampling strategy and information gain estimation network. Specifically, we introduce Spherical Uncertainty Sampling (SUS), a novel candidate viewpoint sampling method. This method calculates the scene’s uncertainty field and maps it into a spherical sampling space. Within this space, the uncertainty distribution guides the sampling of high-quality candidate viewpoints. Additionally, we propose a Multi-Expert Information Gain Network (MEIGN) that performs disentangled encoding and expert encoder sequence processing on features extracted from the reconstructed scene. These processed features are then dynamically fused using sparse gating and Multi-Head Self-Attention, enabling accurate information gain estimation for candidate views. Extensive experimental results on both the small-object dataset ShapeNet and various large-scale 3D scene datasets demonstrate the effectiveness and superior performance of our proposed method. The code will be released.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 4","pages":"5166-5173"},"PeriodicalIF":5.3,"publicationDate":"2026-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147440655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Quadrupedal Jumping and Walking for Dynamic Locomotion Using Reinforcement Learning 用强化学习研究动态运动的四足跳跃和行走
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-02-27 DOI: 10.1109/LRA.2026.3668467
Jørgen Anker Olsen;Lars Rønhaug Pettersen;Kostas Alexis
This paper presents a curriculum-based reinforcement learning framework for training precise and high-performance jumping policies for the robot Olympus. Separate policies are developed for vertical and horizontal jumps, leveraging a simple yet effective strategy. First, we densify the inherently sparse jumping reward using the laws of projectile motion. Next, a reference state initialization scheme is employed to accelerate the exploration of dynamic jumping behaviors. We also present a walking policy that, when combined with the jumping policies, unlocks versatile and dynamic locomotion capabilities. Comprehensive testing validates walking on varied terrain surfaces and jumping performance that exceeds previous works, effectively crossing the Sim2Real gap. Experimental validation demonstrates horizontal jumps up to 1.25 m with centimeter accuracy and vertical jumps up to 1.0 m. Additionally, we show that with only minor modifications, the proposed method can be used to learn omnidirectional jumping.
本文提出了一种基于课程的强化学习框架,用于训练机器人Olympus精确、高性能的跳跃策略。为垂直和水平跳跃开发了单独的策略,利用简单而有效的策略。首先,利用抛射运动规律对固有稀疏的跳跃奖励进行密集化。其次,采用参考状态初始化方案来加速动态跳跃行为的探索。我们还提出了一个步行策略,当与跳跃策略相结合时,解锁多功能和动态运动能力。综合测试验证了在不同地形表面的行走和跳跃性能超过以往的作品,有效地跨越了Sim2Real的差距。实验验证表明,水平跳跃可达1.25米,精度为厘米,垂直跳跃可达1.0米。此外,我们还表明,只要稍加修改,所提出的方法就可以用于学习全向跳跃。
{"title":"Towards Quadrupedal Jumping and Walking for Dynamic Locomotion Using Reinforcement Learning","authors":"Jørgen Anker Olsen;Lars Rønhaug Pettersen;Kostas Alexis","doi":"10.1109/LRA.2026.3668467","DOIUrl":"https://doi.org/10.1109/LRA.2026.3668467","url":null,"abstract":"This paper presents a curriculum-based reinforcement learning framework for training precise and high-performance jumping policies for the robot Olympus. Separate policies are developed for vertical and horizontal jumps, leveraging a simple yet effective strategy. First, we densify the inherently sparse jumping reward using the laws of projectile motion. Next, a reference state initialization scheme is employed to accelerate the exploration of dynamic jumping behaviors. We also present a walking policy that, when combined with the jumping policies, unlocks versatile and dynamic locomotion capabilities. Comprehensive testing validates walking on varied terrain surfaces and jumping performance that exceeds previous works, effectively crossing the Sim2Real gap. Experimental validation demonstrates horizontal jumps up to 1.25 m with centimeter accuracy and vertical jumps up to 1.0 m. Additionally, we show that with only minor modifications, the proposed method can be used to learn omnidirectional jumping.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 4","pages":"4809-4816"},"PeriodicalIF":5.3,"publicationDate":"2026-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GLAFE: A Global-Local Feature Learning Self-Attention Encoder for UAV Relocalization in Weak-Texture Environments 面向弱纹理环境下无人机再定位的全局-局部特征学习自关注编码器
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-02-27 DOI: 10.1109/LRA.2026.3668451
Yuan Chen;Jie Jiang
When uncrewed aerial vehicles (UAVs) conduct exploration tasks in weakly textured environments, such as planetary surfaces or outdoor scenes with sparse features, the absence of GPS typically necessitates the use of visual SLAM for localization. However, feature sparsity, motion blur caused by rapid camera movements, and viewpoint variations often lead to failures in feature-based pose tracking and relocalization. To address this issue, we propose a Global-Local feature learning Self-Attention Encoder (GLAFE), which simultaneously generates enhanced local and global feature descriptors by exploiting the correlations between local features, thereby improving robustness and efficiency in weakly textured scenes with viewpoint changes. A multi-objective optimization strategy based on shared samples is proposed to facilitate the joint learning of global and local features for GLAFE. Experiments on simulated Mars surface images and real-world flight data demonstrate that the proposed approach achieves better comprehensive performance in terms of robustness, accuracy, and efficiency compared with classical retrieval-based and other deep learning methods.$^{1}$
当无人驾驶飞行器(uav)在弱纹理环境中进行探测任务时,例如行星表面或具有稀疏特征的室外场景,缺乏GPS通常需要使用视觉SLAM进行定位。然而,特征稀疏性、相机快速运动引起的运动模糊以及视点变化往往导致基于特征的姿态跟踪和重新定位失败。为了解决这个问题,我们提出了一种全局-局部特征学习自注意编码器(GLAFE),该编码器通过利用局部特征之间的相关性同时生成增强的局部和全局特征描述符,从而提高了在视点变化的弱纹理场景中的鲁棒性和效率。为了实现全局特征和局部特征的联合学习,提出了一种基于共享样本的多目标优化策略。模拟火星表面图像和真实飞行数据的实验表明,与经典的基于检索的深度学习方法和其他深度学习方法相比,该方法在鲁棒性、准确性和效率方面具有更好的综合性能
{"title":"GLAFE: A Global-Local Feature Learning Self-Attention Encoder for UAV Relocalization in Weak-Texture Environments","authors":"Yuan Chen;Jie Jiang","doi":"10.1109/LRA.2026.3668451","DOIUrl":"https://doi.org/10.1109/LRA.2026.3668451","url":null,"abstract":"When uncrewed aerial vehicles (UAVs) conduct exploration tasks in weakly textured environments, such as planetary surfaces or outdoor scenes with sparse features, the absence of GPS typically necessitates the use of visual SLAM for localization. However, feature sparsity, motion blur caused by rapid camera movements, and viewpoint variations often lead to failures in feature-based pose tracking and relocalization. To address this issue, we propose a Global-Local feature learning Self-Attention Encoder (GLAFE), which simultaneously generates enhanced local and global feature descriptors by exploiting the correlations between local features, thereby improving robustness and efficiency in weakly textured scenes with viewpoint changes. A multi-objective optimization strategy based on shared samples is proposed to facilitate the joint learning of global and local features for GLAFE. Experiments on simulated Mars surface images and real-world flight data demonstrate that the proposed approach achieves better comprehensive performance in terms of robustness, accuracy, and efficiency compared with classical retrieval-based and other deep learning methods.<inline-formula><tex-math>$^{1}$</tex-math></inline-formula>","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 4","pages":"5151-5157"},"PeriodicalIF":5.3,"publicationDate":"2026-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147440657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning-Based Fourier Registration for Forward-Looking Sonar Odometry in Texture-Sparse Underwater Environments 纹理稀疏水下环境下基于深度学习的前视声纳测程傅立叶配准
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-02-27 DOI: 10.1109/LRA.2026.3668623
Peng Yao;Qiming Liu;Yingming Sun;Yalu Wang;Jiatao Yu
Robust forward-looking sonar (FLS) odometry is critical for underwater autonomous navigation but is hindered by severe noise and sparse textures in acoustic imaging. Traditional Fourier-based methods are susceptible to such degradations, while end-to-end deep learning approaches often struggle to learn intrinsic geometric relationships. We propose a novel deep learning framework that synergizes classical signal processing with learnable architectures. Our method decomposes pose estimation into rotation and translation stages, utilizing an improved Trans-UNet to enhance image feature interaction. Specifically, the rotation network leverages the Radon transform for noise filtering, combined with a multi-angle correlation layer to determine angular relationships. Following rotation correction, an improved learnable phase correlation module estimates translation within an end-to-end trainable system. Experiments on public datasets demonstrate that our method achieves outstanding odometry performance even without loop closure detection, and zero-shot evaluations on wetland datasets further validate its strong generalization capability.
强大的前视声呐(FLS)测程技术对于水下自主导航至关重要,但由于声成像中的严重噪声和稀疏纹理而受到阻碍。传统的基于傅里叶的方法容易受到这种退化的影响,而端到端深度学习方法往往难以学习内在的几何关系。我们提出了一种新的深度学习框架,将经典信号处理与可学习架构相结合。该方法将姿态估计分解为旋转和平移两个阶段,利用改进的Trans-UNet增强图像特征交互。具体来说,旋转网络利用Radon变换进行噪声滤波,并结合多角度相关层来确定角度关系。在旋转校正后,改进的可学习相位相关模块估计端到端可训练系统中的平移。在公共数据集上的实验表明,即使没有闭环检测,我们的方法也能取得出色的里程计性能,在湿地数据集上的零射击评估进一步验证了其强大的泛化能力。
{"title":"Deep Learning-Based Fourier Registration for Forward-Looking Sonar Odometry in Texture-Sparse Underwater Environments","authors":"Peng Yao;Qiming Liu;Yingming Sun;Yalu Wang;Jiatao Yu","doi":"10.1109/LRA.2026.3668623","DOIUrl":"https://doi.org/10.1109/LRA.2026.3668623","url":null,"abstract":"Robust forward-looking sonar (FLS) odometry is critical for underwater autonomous navigation but is hindered by severe noise and sparse textures in acoustic imaging. Traditional Fourier-based methods are susceptible to such degradations, while end-to-end deep learning approaches often struggle to learn intrinsic geometric relationships. We propose a novel deep learning framework that synergizes classical signal processing with learnable architectures. Our method decomposes pose estimation into rotation and translation stages, utilizing an improved Trans-UNet to enhance image feature interaction. Specifically, the rotation network leverages the Radon transform for noise filtering, combined with a multi-angle correlation layer to determine angular relationships. Following rotation correction, an improved learnable phase correlation module estimates translation within an end-to-end trainable system. Experiments on public datasets demonstrate that our method achieves outstanding odometry performance even without loop closure detection, and zero-shot evaluations on wetland datasets further validate its strong generalization capability.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 4","pages":"4817-4824"},"PeriodicalIF":5.3,"publicationDate":"2026-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Dual-Mode Hydraulic Actuator for a Quasi-Passive Load-Carrying Exoskeleton in Multiple Conditions 多工况下准被动负载外骨骼的双模液压致动器
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-02-26 DOI: 10.1109/LRA.2026.3668701
Bao Yingwei;Yang Bo;Wang Zezheng;Li Huilai;Jiang Haoyi;Sun Maowen;Ouyang Xiaoping
Lower limb exoskeleton robots have been widely researched for load-carrying assistance. Recently, quasi-passive exoskeletons using low-power elements to modulate mechanical characteristics have emerged. However, achieving effective damping and stiffness across varying tasks and loads remains challenging. This letter proposes a dual-mode actuator (DMA) inspired by knee joint impedances, achieving controllable damping force and high spring stiffness for assistance, with good backdrivability for human-robot transparency. Based on the DMA, a quasi-passive lower limb exoskeleton (QLLE) is proposed and evaluated during loaded walking and squatting. Experimental results demonstrated that the DMA achieved a sinusoidal damping force tracking error of 5.9% at 1 Hz, a spring stiffness of 21.7 N/mm, and an unassisted backdrive force of 14.3 N for compression. In addition, QLLE assistance reduced the maximum net metabolic cost by 8.5% during walking and 15.6% during squatting, with the load effectively transferred to the ground. These findings highlight the potential of QLLEs in real-world applications, such as manual material transportation.
下肢外骨骼机器人在负重辅助方面得到了广泛的研究。最近出现了使用低功率元件来调节机械特性的准被动外骨骼。然而,在不同的任务和负载下实现有效的阻尼和刚度仍然具有挑战性。这封信提出了一种受膝关节阻抗启发的双模驱动器(DMA),实现了可控的阻尼力和高弹簧刚度的辅助,具有良好的人机透明度的反向驾驶性。在此基础上,提出了一种准被动下肢外骨骼(QLLE),并对其负重行走和下蹲进行了评估。实验结果表明,在1 Hz下,DMA的正弦阻尼力跟踪误差为5.9%,弹簧刚度为21.7 N/mm,无辅助压缩反驱动力为14.3 N。此外,QLLE辅助在步行时减少了8.5%的最大净代谢成本,在深蹲时减少了15.6%,负荷有效地转移到地面。这些发现突出了QLLEs在实际应用中的潜力,例如人工材料运输。
{"title":"A Dual-Mode Hydraulic Actuator for a Quasi-Passive Load-Carrying Exoskeleton in Multiple Conditions","authors":"Bao Yingwei;Yang Bo;Wang Zezheng;Li Huilai;Jiang Haoyi;Sun Maowen;Ouyang Xiaoping","doi":"10.1109/LRA.2026.3668701","DOIUrl":"https://doi.org/10.1109/LRA.2026.3668701","url":null,"abstract":"Lower limb exoskeleton robots have been widely researched for load-carrying assistance. Recently, quasi-passive exoskeletons using low-power elements to modulate mechanical characteristics have emerged. However, achieving effective damping and stiffness across varying tasks and loads remains challenging. This letter proposes a dual-mode actuator (DMA) inspired by knee joint impedances, achieving controllable damping force and high spring stiffness for assistance, with good backdrivability for human-robot transparency. Based on the DMA, a quasi-passive lower limb exoskeleton (QLLE) is proposed and evaluated during loaded walking and squatting. Experimental results demonstrated that the DMA achieved a sinusoidal damping force tracking error of 5.9% at 1 Hz, a spring stiffness of 21.7 N/mm, and an unassisted backdrive force of 14.3 N for compression. In addition, QLLE assistance reduced the maximum net metabolic cost by 8.5% during walking and 15.6% during squatting, with the load effectively transferred to the ground. These findings highlight the potential of QLLEs in real-world applications, such as manual material transportation.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 4","pages":"5135-5142"},"PeriodicalIF":5.3,"publicationDate":"2026-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147440616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Strategic Shaping of Human Prosociality: A Latent-State POMDP Framework 人类亲社会性的战略塑造:一个潜在状态的POMDP框架
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-02-25 DOI: 10.1109/LRA.2026.3668141
Zahra Zahedi;Xinyue Hu;Shashank Mehrotra;Mark Steyvers;Kumar Akash
We propose a decision-theoretic framework in which a robot strategically can shape inferred human’s prosocial state during repeated interactions. Modeling the human’s prosociality as a latent state that evolves over time, the robot learns to infer and influence this state through its own actions, including helping and signaling. We formalize this as a latent-state POMDP with limited observations and learn the transition and observation dynamics using expectation maximization. The resulting belief-based policy balances task and social objectives, selecting actions that maximize long-term cooperative outcomes. We evaluate the model using data from user studies and show that the learned policy outperforms baseline strategies in both team performance and increasing observed human cooperative behavior.
我们提出了一个决策理论框架,在这个框架中,机器人可以在重复的互动中策略性地塑造推断出来的人类的亲社会状态。机器人将人类的亲社会性建模为一种随时间演变的潜在状态,并学会通过自己的行为(包括帮助和发信号)推断和影响这种状态。我们将其形式化为具有有限观测值的潜在状态POMDP,并使用期望最大化来学习过渡和观测动态。由此产生的基于信念的政策平衡了任务和社会目标,选择了使长期合作成果最大化的行动。我们使用来自用户研究的数据来评估模型,并表明学习策略在团队绩效和增加观察到的人类合作行为方面都优于基线策略。
{"title":"Strategic Shaping of Human Prosociality: A Latent-State POMDP Framework","authors":"Zahra Zahedi;Xinyue Hu;Shashank Mehrotra;Mark Steyvers;Kumar Akash","doi":"10.1109/LRA.2026.3668141","DOIUrl":"https://doi.org/10.1109/LRA.2026.3668141","url":null,"abstract":"We propose a decision-theoretic framework in which a robot strategically can shape inferred human’s prosocial state during repeated interactions. Modeling the human’s prosociality as a latent state that evolves over time, the robot learns to infer and influence this state through its own actions, including helping and signaling. We formalize this as a latent-state POMDP with limited observations and learn the transition and observation dynamics using expectation maximization. The resulting belief-based policy balances task and social objectives, selecting actions that maximize long-term cooperative outcomes. We evaluate the model using data from user studies and show that the learned policy outperforms baseline strategies in both team performance and increasing observed human cooperative behavior.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 4","pages":"4729-4736"},"PeriodicalIF":5.3,"publicationDate":"2026-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Robotics and Automation Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1