首页 > 最新文献

IEEE Robotics and Automation Letters最新文献

英文 中文
Multi-modal sensing in colonoscopy: a data-driven approach. 结肠镜检查中的多模态传感:数据驱动的方法。
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-02-01 Epub Date: 2025-12-18 DOI: 10.1109/lra.2025.3645700
Viola Del Bono, Emma Capaldi, Anushka Kelshiker, Ayhan Aktas, Hiroyuki Aihara, Sheila Russo

Soft optical sensors hold potential for enhancing minimally invasive procedures like colonoscopy, yet their complex, multi-modal responses pose significant challenges. This work introduces a machine learning (ML) framework for real-time estimation of 3D shape and contact force in a soft robotic sleeve for colonoscopy. To overcome limitations of manual calibration and collect large datasets for ML, we developed an automated platform for collecting data across a range of orientations, curvatures, and contact forces. A cascaded ML architecture was implemented for sequential estimation of contact force and 3D shape, enabling an accuracy with errors of 4.7% for curvature, 2.37% for orientation, and 5.5% for force tracking. We also explored the potential of ML for contact localization by training a model to estimate contact intensity and location across 16 indenters distributed along the sleeve. The force intensity was estimated with an error ranging from 0.06 N to 0.31 N throughout the indenters. Despite the proximity of the contact points, the system achieved high localization performances, with 8 indenters reaching over 80% accuracy, demonstrating promising spatial resolution.

软光学传感器具有增强结肠镜检查等微创手术的潜力,但其复杂的多模态响应构成了重大挑战。这项工作介绍了一种机器学习(ML)框架,用于实时估计用于结肠镜检查的软机器人套筒中的3D形状和接触力。为了克服人工校准和收集大型ML数据集的局限性,我们开发了一个自动化平台,用于收集一系列方向,曲率和接触力的数据。采用级联ML架构对接触力和3D形状进行顺序估计,曲率误差为4.7%,方向误差为2.37%,力跟踪误差为5.5%。我们还通过训练一个模型来估计沿套筒分布的16个压头的接触强度和位置,探索了机器学习在接触定位方面的潜力。整个压头的力强度估计误差范围为0.06 N至0.31 N。尽管接触点很接近,但该系统实现了很高的定位性能,8个压痕的精度超过80%,展示了有希望的空间分辨率。
{"title":"Multi-modal sensing in colonoscopy: a data-driven approach.","authors":"Viola Del Bono, Emma Capaldi, Anushka Kelshiker, Ayhan Aktas, Hiroyuki Aihara, Sheila Russo","doi":"10.1109/lra.2025.3645700","DOIUrl":"10.1109/lra.2025.3645700","url":null,"abstract":"<p><p>Soft optical sensors hold potential for enhancing minimally invasive procedures like colonoscopy, yet their complex, multi-modal responses pose significant challenges. This work introduces a machine learning (ML) framework for real-time estimation of 3D shape and contact force in a soft robotic sleeve for colonoscopy. To overcome limitations of manual calibration and collect large datasets for ML, we developed an automated platform for collecting data across a range of orientations, curvatures, and contact forces. A cascaded ML architecture was implemented for sequential estimation of contact force and 3D shape, enabling an accuracy with errors of 4.7% for curvature, 2.37% for orientation, and 5.5% for force tracking. We also explored the potential of ML for contact localization by training a model to estimate contact intensity and location across 16 indenters distributed along the sleeve. The force intensity was estimated with an error ranging from 0.06 N to 0.31 N throughout the indenters. Despite the proximity of the contact points, the system achieved high localization performances, with 8 indenters reaching over 80% accuracy, demonstrating promising spatial resolution.</p>","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 2","pages":"2018-2025"},"PeriodicalIF":5.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12811025/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145998055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Robotics and Automation Society Information IEEE机器人与自动化学会信息
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-01-28 DOI: 10.1109/LRA.2026.3656038
{"title":"IEEE Robotics and Automation Society Information","authors":"","doi":"10.1109/LRA.2026.3656038","DOIUrl":"https://doi.org/10.1109/LRA.2026.3656038","url":null,"abstract":"","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 2","pages":"C3-C3"},"PeriodicalIF":5.3,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11367139","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Robotics and Automation Letters Information for Authors IEEE机器人与自动化作者信函信息
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-01-28 DOI: 10.1109/LRA.2026.3656040
{"title":"IEEE Robotics and Automation Letters Information for Authors","authors":"","doi":"10.1109/LRA.2026.3656040","DOIUrl":"https://doi.org/10.1109/LRA.2026.3656040","url":null,"abstract":"","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 2","pages":"C4-C4"},"PeriodicalIF":5.3,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11367250","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Taxonomy for Evaluating Generalist Robot Manipulation Policies 评估通才机器人操作策略的分类学
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-01-22 DOI: 10.1109/LRA.2026.3656785
Jensen Gao;Suneel Belkhale;Sudeep Dasari;Ashwin Balakrishna;Dhruv Shah;Dorsa Sadigh
Machine learning for robot manipulation promises to unlock generalization to novel tasks and environments. But how should we measure the progress of these policies towards generalization? Evaluating and quantifying generalization is the Wild West of modern robotics, with each work proposing and measuring different types of generalization in their own, often difficult to reproduce settings. In this work, our goal is (1) to outline the forms of generalization we believe are important for robot manipulation in a comprehensive and fine-grained manner, and (2) to provide reproducible guidelines for measuring these notions of generalization. We first propose $bigstar$-Gen, a taxonomy of generalization for robot manipulation structured around visual, semantic, and behavioral generalization. Next, we instantiate $bigstar$-Gen with two case studies on real-world benchmarking: one based on open-source models and the Bridge V2 dataset, and another based on the bimanual ALOHA 2 platform that covers more dexterous and longer horizon tasks. Our case studies reveal many interesting insights: for example, we observe that open-source vision-language-action models often struggle with semantic generalization, despite pre-training on internet-scale language datasets.
用于机器人操作的机器学习有望解锁对新任务和环境的泛化。但是,我们应该如何衡量这些政策在泛化方面的进展呢?评估和量化泛化是现代机器人的狂野西部,每项工作都在自己的环境中提出和测量不同类型的泛化,通常很难重现。在这项工作中,我们的目标是(1)以全面和细粒度的方式概述我们认为对机器人操作很重要的泛化形式,以及(2)为测量这些泛化概念提供可重复的指导方针。我们首先提出$bigstar$-Gen,这是一个围绕视觉、语义和行为泛化构建的机器人操作泛化分类法。接下来,我们实例化$bigstar$-Gen与两个案例研究在现实世界的基准:一个基于开源模型和Bridge V2数据集,另一个基于双手动ALOHA 2平台,涵盖更灵活和更长远的任务。我们的案例研究揭示了许多有趣的见解:例如,我们观察到,尽管在互联网规模的语言数据集上进行了预训练,开源的视觉语言动作模型经常在语义泛化方面遇到困难。
{"title":"A Taxonomy for Evaluating Generalist Robot Manipulation Policies","authors":"Jensen Gao;Suneel Belkhale;Sudeep Dasari;Ashwin Balakrishna;Dhruv Shah;Dorsa Sadigh","doi":"10.1109/LRA.2026.3656785","DOIUrl":"https://doi.org/10.1109/LRA.2026.3656785","url":null,"abstract":"Machine learning for robot manipulation promises to unlock generalization to novel tasks and environments. But how should we measure the progress of these policies towards generalization? Evaluating and quantifying generalization is the Wild West of modern robotics, with each work proposing and measuring different types of generalization in their own, often difficult to reproduce settings. In this work, our goal is (1) to outline the forms of generalization we believe are important for robot manipulation in a comprehensive and fine-grained manner, and (2) to provide reproducible guidelines for measuring these notions of generalization. We first propose <inline-formula><tex-math>$bigstar$</tex-math></inline-formula>-Gen, a taxonomy of generalization for robot manipulation structured around visual, semantic, and behavioral generalization. Next, we instantiate <inline-formula><tex-math>$bigstar$</tex-math></inline-formula>-Gen with two case studies on real-world benchmarking: one based on open-source models and the Bridge V2 dataset, and another based on the bimanual ALOHA 2 platform that covers more dexterous and longer horizon tasks. Our case studies reveal many interesting insights: for example, we observe that open-source vision-language-action models often struggle with semantic generalization, despite pre-training on internet-scale language datasets.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 3","pages":"3182-3189"},"PeriodicalIF":5.3,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Energy-Efficient and Powerful Oscillator for Micro-Air Vehicles With Electronics-Free Flapping 无电子扑动微型飞行器的高能效强振子
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-01-22 DOI: 10.1109/LRA.2026.3656791
Yongjian Zhao;Yuyan Qi;Jiaqi Shao;Bin Sun;Min Wang;Songyi Zhong;Yang Yang
High power density and energy efficiency are critical for achieving agile locomotion and sustained operation in miniature flapping-wing robots. Here, a pneumatic linear reciprocating oscillator is developed as an actuation solution. The oscillator leverages the Bernoulli principle to establish a positive feedback mechanism through coordinated interactions among a soft membrane, a piston, and the airflow. Experimental validation demonstrates that the oscillator-based flapping-wing robot can generate a lift of 0.43 N to enable take-off and sustained flight in unstructured environments. The minimal oscillation unit exhibits maximum input and output specific power of 710.5 W/kg and 220.7 W/kg, respectively, with peak energy conversion efficiency reaching 41.9% . This design represents a paradigm shift from conventional electromechanical systems, offering two fundamental advancements: (i) simplified robotic drive architectures through an oscillator-based mechanism, and (ii) a foundation for hybrid energy systems that reduce reliance on electricity.
高功率密度和高能量效率是微型扑翼机器人实现敏捷运动和持续运行的关键。本文提出了一种气动直线往复振荡器作为驱动方案。振荡器利用伯努利原理,通过软膜、活塞和气流之间的协调相互作用,建立正反馈机制。实验验证表明,基于振子的扑翼机器人可以产生0.43 N的升力,能够在非结构化环境中起飞和持续飞行。最小振荡单元的最大输入比功率为710.5 W/kg,最大输出比功率为220.7 W/kg,最大能量转换效率达到41.9%。这种设计代表了传统机电系统的范式转变,提供了两个基本的进步:(i)通过基于振荡器的机制简化了机器人驱动架构;(ii)为减少对电力依赖的混合能源系统奠定了基础。
{"title":"An Energy-Efficient and Powerful Oscillator for Micro-Air Vehicles With Electronics-Free Flapping","authors":"Yongjian Zhao;Yuyan Qi;Jiaqi Shao;Bin Sun;Min Wang;Songyi Zhong;Yang Yang","doi":"10.1109/LRA.2026.3656791","DOIUrl":"https://doi.org/10.1109/LRA.2026.3656791","url":null,"abstract":"High power density and energy efficiency are critical for achieving agile locomotion and sustained operation in miniature flapping-wing robots. Here, a pneumatic linear reciprocating oscillator is developed as an actuation solution. The oscillator leverages the Bernoulli principle to establish a positive feedback mechanism through coordinated interactions among a soft membrane, a piston, and the airflow. Experimental validation demonstrates that the oscillator-based flapping-wing robot can generate a lift of 0.43 N to enable take-off and sustained flight in unstructured environments. The minimal oscillation unit exhibits maximum input and output specific power of 710.5 W/kg and 220.7 W/kg, respectively, with peak energy conversion efficiency reaching 41.9% . This design represents a paradigm shift from conventional electromechanical systems, offering two fundamental advancements: (i) simplified robotic drive architectures through an oscillator-based mechanism, and (ii) a foundation for hybrid energy systems that reduce reliance on electricity.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 3","pages":"3174-3181"},"PeriodicalIF":5.3,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Precise Mobile Manipulation of Small Everyday Objects 精确移动操作的小日常物品
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-01-22 DOI: 10.1109/LRA.2026.3656784
Arjun Gupta;Rishik Sathua;Saurabh Gupta
Many everyday mobile manipulation tasks require precise interaction with small objects, such as grasping a knob to open a cabinet or pressing a light switch. In this letter, we develop Visual Servoing with Vision Models (VSVM), a closed-loop framework that enables a mobile manipulator to tackle such precise tasks involving the manipulation of small objects. VSVM uses state-of-the-art vision foundation models to generate 3D targets for visual servoing to enable diverse tasks in novel environments. Naively doing so fails because of occlusion by the end-effector. VSVM mitigates this using vision models that out-paint the end-effector thereby significantly enhancing target localization. We demonstrate that aided by out-painting methods, open-vocabulary object detectors can serve as a drop-in module for VSVM to seek semantic targets (e.g. knobs) and point tracking methods can help VSVM reliably pursue interaction sites indicated by user clicks. We conduct a large-scale evaluation spanning experiments in 10 novel environments across 6 buildings including 72 different object instances. VSVM obtains a 71% zero-shot success rate on manipulating unseen objects in novel environments in the real world, outperforming an open-loop control method by an absolute 42% and an imitation learning baseline trained on 1000+ demonstrations also by an absolute success rate of 50% .
许多日常的移动操作任务需要与小物体进行精确的交互,例如抓住旋钮打开橱柜或按下电灯开关。在这封信中,我们开发了带有视觉模型的视觉伺服(VSVM),这是一个闭环框架,使移动机械手能够处理涉及操纵小物体的精确任务。VSVM使用最先进的视觉基础模型来生成用于视觉伺服的3D目标,以实现在新环境中的各种任务。由于末端执行器的遮挡,天真地这样做会失败。VSVM使用的视觉模型比末端执行器更清晰,从而显著增强了目标定位。我们证明了在外画方法的辅助下,开放词汇对象检测器可以作为VSVM的插入模块来寻找语义目标(如按钮),点跟踪方法可以帮助VSVM可靠地追踪用户点击所指示的交互站点。我们在6座建筑的10个新环境中进行了大规模的评估,包括72个不同的对象实例。在现实世界的新环境中,VSVM在操纵看不见的物体时获得了71%的零射击成功率,绝对优于开环控制方法42%,并且在1000多个演示中训练的模仿学习基线也获得了50%的绝对成功率。
{"title":"Precise Mobile Manipulation of Small Everyday Objects","authors":"Arjun Gupta;Rishik Sathua;Saurabh Gupta","doi":"10.1109/LRA.2026.3656784","DOIUrl":"https://doi.org/10.1109/LRA.2026.3656784","url":null,"abstract":"Many everyday mobile manipulation tasks require precise interaction with small objects, such as grasping a knob to open a cabinet or pressing a light switch. In this letter, we develop Visual Servoing with Vision Models (VSVM), a closed-loop framework that enables a mobile manipulator to tackle such precise tasks involving the manipulation of small objects. VSVM uses state-of-the-art vision foundation models to generate 3D targets for visual servoing to enable diverse tasks in novel environments. Naively doing so fails because of occlusion by the end-effector. VSVM mitigates this using vision models that out-paint the end-effector thereby significantly enhancing target localization. We demonstrate that aided by out-painting methods, open-vocabulary object detectors can serve as a drop-in module for VSVM to seek semantic targets (e.g. knobs) and point tracking methods can help VSVM reliably pursue interaction sites indicated by user clicks. We conduct a large-scale evaluation spanning experiments in 10 novel environments across 6 buildings including 72 different object instances. VSVM obtains a 71% zero-shot success rate on manipulating unseen objects in novel environments in the real world, outperforming an open-loop control method by an absolute 42% and an imitation learning baseline trained on 1000+ demonstrations also by an absolute success rate of 50% .","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 3","pages":"3214-3221"},"PeriodicalIF":5.3,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and Validation of Docking-Based Cooperative Strategies for Snake Robots in Complex Environments 复杂环境下基于对接的蛇形机器人协同策略设计与验证
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-01-21 DOI: 10.1109/LRA.2026.3656725
Xuan Xiao;Kefeng Zhang;Jiaqi Zhu;Jianming Wang;Runtian Zhu
Snake robots exhibit remarkable locomotion capabilities in complex environments as the degrees of freedom (DOFs) increase, but at the cost of energy consumption. To address this issue, this article proposes a cooperation strategy for snake robots based on a head-tail docking mechanism, which allows multiple short snake robots to combine into a longer one, enabling the execution of complex tasks. The mechanical design and the implementation of the dockable snake robots are introduced, featuring passive docking mechanisms at both the head and tail, an embedded controller and a vision camera mounted on the head, and a distributed power supply system. Furthermore, the control strategies for the combined robots have been developed to perform the crawler gait and the motion of spanning between parallel pipes. As a result, experiments are conducted to demonstrate the feasibility and performance of the proposed docking mechanism and cooperative control methods. Specifically, two snake robots can autonomously dock under visual guidance. After docking, the combined robots can rapidly traverse flat surfaces by performing the crawler gait at an average speed of 0.168 m/s. Additionally, the robots can perform spanning between parallel pipes and pipe inspection tasks concurrently by separating.
随着自由度的增加,蛇形机器人在复杂环境中表现出卓越的运动能力,但这是以能量消耗为代价的。针对这一问题,本文提出了一种基于头尾对接机制的蛇形机器人协作策略,该策略允许多个较短的蛇形机器人组合成一个较长的蛇形机器人,从而能够执行复杂的任务。介绍了可停靠蛇机器人的机械设计与实现,其头部和尾部均采用被动对接机构,头部安装嵌入式控制器和视觉摄像头,并采用分布式供电系统。在此基础上,研究了组合机器人的爬行步态和跨管运动控制策略。最后,通过实验验证了所提出的对接机制和协同控制方法的可行性和性能。具体来说,两个蛇形机器人可以在视觉引导下自主对接。对接后,组合机器人能够以平均速度0.168 m/s的履带式步态快速穿越平坦表面。此外,机器人可以通过分离的方式同时完成平行管道之间的跨越和管道检测任务。
{"title":"Design and Validation of Docking-Based Cooperative Strategies for Snake Robots in Complex Environments","authors":"Xuan Xiao;Kefeng Zhang;Jiaqi Zhu;Jianming Wang;Runtian Zhu","doi":"10.1109/LRA.2026.3656725","DOIUrl":"https://doi.org/10.1109/LRA.2026.3656725","url":null,"abstract":"Snake robots exhibit remarkable locomotion capabilities in complex environments as the degrees of freedom (DOFs) increase, but at the cost of energy consumption. To address this issue, this article proposes a cooperation strategy for snake robots based on a head-tail docking mechanism, which allows multiple short snake robots to combine into a longer one, enabling the execution of complex tasks. The mechanical design and the implementation of the dockable snake robots are introduced, featuring passive docking mechanisms at both the head and tail, an embedded controller and a vision camera mounted on the head, and a distributed power supply system. Furthermore, the control strategies for the combined robots have been developed to perform the crawler gait and the motion of spanning between parallel pipes. As a result, experiments are conducted to demonstrate the feasibility and performance of the proposed docking mechanism and cooperative control methods. Specifically, two snake robots can autonomously dock under visual guidance. After docking, the combined robots can rapidly traverse flat surfaces by performing the crawler gait at an average speed of 0.168 m/s. Additionally, the robots can perform spanning between parallel pipes and pipe inspection tasks concurrently by separating.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 3","pages":"3190-3197"},"PeriodicalIF":5.3,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and NMPC-Based Control of a Hybrid Sprawl-Tuned Vehicle With Flying and Driving Modes 具有飞行和驾驶模式的混合动力汽车的设计与nmpc控制
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-01-19 DOI: 10.1109/LRA.2026.3655281
Haoyu Wang;Zhiqiang Miao;Weiwei Zhan;Xiangke Wang;Wei He;Yaonan Wang
To address the limited maneuverability and low energy efficiency of autonomous aerial vehicles (AAVs) in confined spaces, we design and implement the Hybrid Sprawl-Tuned Vehicle (HSTV) - a deformable multi-modal robotic platform specifically engineered for operation in complex and spatially constrained environments. Based on the “FSTAR” platform, HSTV is equipped with passive front wheels and actively driven rear wheels. The gear transmission mechanism enables the actively driven wheels to be driven without the need for dedicated motors, simplifying the architecture of the system. For both flying and driving modes, detailed kinematics and dynamics models integrated with a mode switching strategy are constructed by using the Newton-Euler method. Based on the developed models, the constrained nonlinear model predictive controller is designed to achieve the accurate motion performance in flying and driving mode. Comprehensive experimental results and comparative analysis demonstrate that HSTV achieves significant trajectory tracking accuracy across both flying and driving modes, while saving energy by up to 70.9% with no significantly increasing structural complexity (maintained at 98.6% simplicity).
为了解决自主飞行器(aav)在密闭空间中有限的机动性和低能效问题,我们设计并实现了混合动力扩展调谐车辆(HSTV),这是一种可变形的多模态机器人平台,专门用于在复杂和空间受限的环境中运行。HSTV基于“FSTAR”平台,采用被动前轮和主动后轮。齿轮传动机构使主动从动轮无需专用电机即可驱动,简化了系统的结构。针对飞行和驾驶两种模式,采用牛顿-欧拉方法建立了包含模式切换策略的详细运动学和动力学模型。基于所建立的模型,设计了约束非线性模型预测控制器,以实现飞行和驱动模式下的精确运动性能。综合实验结果和对比分析表明,HSTV在飞行和驾驶两种模式下都实现了显著的轨迹跟踪精度,节能高达70.9%,且结构复杂度保持在98.6%不变。
{"title":"Design and NMPC-Based Control of a Hybrid Sprawl-Tuned Vehicle With Flying and Driving Modes","authors":"Haoyu Wang;Zhiqiang Miao;Weiwei Zhan;Xiangke Wang;Wei He;Yaonan Wang","doi":"10.1109/LRA.2026.3655281","DOIUrl":"https://doi.org/10.1109/LRA.2026.3655281","url":null,"abstract":"To address the limited maneuverability and low energy efficiency of autonomous aerial vehicles (AAVs) in confined spaces, we design and implement the Hybrid Sprawl-Tuned Vehicle (HSTV) - a deformable multi-modal robotic platform specifically engineered for operation in complex and spatially constrained environments. Based on the “FSTAR” platform, HSTV is equipped with passive front wheels and actively driven rear wheels. The gear transmission mechanism enables the actively driven wheels to be driven without the need for dedicated motors, simplifying the architecture of the system. For both flying and driving modes, detailed kinematics and dynamics models integrated with a mode switching strategy are constructed by using the Newton-Euler method. Based on the developed models, the constrained nonlinear model predictive controller is designed to achieve the accurate motion performance in flying and driving mode. Comprehensive experimental results and comparative analysis demonstrate that HSTV achieves significant trajectory tracking accuracy across both flying and driving modes, while saving energy by up to 70.9% with no significantly increasing structural complexity (maintained at 98.6% simplicity).","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 3","pages":"3222-3229"},"PeriodicalIF":5.3,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Safety-Critical Steering Control for Rubber-Tired Container Gantry Cranes: A State-Interlocked CBF Approach 橡胶轮胎集装箱门式起重机安全关键转向控制:一种状态互锁CBF方法
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-01-19 DOI: 10.1109/LRA.2026.3655311
Cong Li;Qin Rao;Zheng Tian;Jun Yang
The rubber-tired container gantry crane (RTG) is a type of heavy-duty lifting equipment commonly used in container yards, which is driven by two-side rubber tires and steered via differential drive. While moving along the desired path, the RTG must remain centered of the lane with restricted heading angle, as deviations may compromise the safety of subsequent yard operations. Due to its underactuated nature and the presence of external disturbances, achieving accurate lane-keeping poses a significant control challenge. To address this issue, a robust safety-critical steering control strategy integrating disturbance rejection vector field (VF) with a new state-interlocked control barrier function (SICBF) is proposed. The strategy initially employs a VF path-following method as the nominal controller. By strategically shrinking the safe set, the SICBF overcomes the limitations of traditional CBFs, such as state coupling in the inequality verification and infeasibility when the control coefficient tends to zero. Furthermore, by incorporating a disturbance observer (DOB) into the quadratic programming (QP) framework, the robustness and safety of the control system are significantly enhanced. Comprehensive simulation and experiment are conducted on a practical RTG with a 40-ton load capacity. To our best knowledge, the proposed method is one of the very few methods that have demonstrated successful application to the practical RTG systems.
橡胶轮胎集装箱门式起重机(RTG)是集装箱堆场常用的一种重型起重设备,它采用双面橡胶轮胎驱动,通过差动传动进行转向。在沿着预期路径移动时,RTG必须保持在车道的中心,并限制航向角,因为偏离可能会危及后续堆场操作的安全。由于其欠驱动特性和外部干扰的存在,实现准确的车道保持提出了重大的控制挑战。为了解决这一问题,提出了一种将干扰抑制向量场(VF)与一种新的状态互锁控制屏障函数(SICBF)相结合的鲁棒安全关键转向控制策略。该策略最初采用VF路径跟踪方法作为标称控制器。通过有策略地缩小安全集,SICBF克服了传统CBFs在不等式验证中存在状态耦合和控制系数趋于零时不可行的局限性。此外,通过将扰动观测器(DOB)引入二次规划(QP)框架,控制系统的鲁棒性和安全性得到了显著提高。在40吨载重的实际RTG上进行了综合仿真和试验。据我们所知,所提出的方法是极少数已经证明成功应用于实际RTG系统的方法之一。
{"title":"Safety-Critical Steering Control for Rubber-Tired Container Gantry Cranes: A State-Interlocked CBF Approach","authors":"Cong Li;Qin Rao;Zheng Tian;Jun Yang","doi":"10.1109/LRA.2026.3655311","DOIUrl":"https://doi.org/10.1109/LRA.2026.3655311","url":null,"abstract":"The rubber-tired container gantry crane (RTG) is a type of heavy-duty lifting equipment commonly used in container yards, which is driven by two-side rubber tires and steered via differential drive. While moving along the desired path, the RTG must remain centered of the lane with restricted heading angle, as deviations may compromise the safety of subsequent yard operations. Due to its underactuated nature and the presence of external disturbances, achieving accurate lane-keeping poses a significant control challenge. To address this issue, a robust safety-critical steering control strategy integrating disturbance rejection vector field (VF) with a new state-interlocked control barrier function (SICBF) is proposed. The strategy initially employs a VF path-following method as the nominal controller. By strategically shrinking the safe set, the SICBF overcomes the limitations of traditional CBFs, such as state coupling in the inequality verification and infeasibility when the control coefficient tends to zero. Furthermore, by incorporating a disturbance observer (DOB) into the quadratic programming (QP) framework, the robustness and safety of the control system are significantly enhanced. Comprehensive simulation and experiment are conducted on a practical RTG with a 40-ton load capacity. To our best knowledge, the proposed method is one of the very few methods that have demonstrated successful application to the practical RTG systems.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 3","pages":"3238-3245"},"PeriodicalIF":5.3,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Azimuth-LIO: Robust LiDAR-Inertial Odometry via Azimuth-Aware Voxelization and Probabilistic Fusion 方位角- lio:基于方位角感知体素化和概率融合的鲁棒激光雷达-惯性里程计
IF 5.3 2区 计算机科学 Q2 ROBOTICS Pub Date : 2026-01-19 DOI: 10.1109/LRA.2026.3655291
Zhongguan Liu;Wei Li;Honglei Che;Lu Pan;Shuaidong Yuan
Voxel-based LiDAR–inertial odometry (LIO) is accurate and efficient but can suffer from geometric inconsistencies when single-Gaussian voxel models indiscriminately merge observations from conflicting viewpoints. To address this limitation, we propose Azimuth-LIO, a robust voxel-based LIO framework that leverages azimuth-aware voxelization and probabilistic fusion. Instead of using a single distribution per voxel, we discretize each voxel into azimuth-sectorized substructures, each modeled by an anisotropic 3D Gaussian to preserve viewpoint-specific spatial features and uncertainties. We further introduce a direction-weighted distribution-to-distribution registration metric to adaptively quantify the contributions of different azimuth sectors, followed by a Bayesian fusion framework that exploits these confidence weights to ensure azimuth-consistent map updates. The performance and efficiency of the proposed method are evaluated on public benchmarks including the M2DGR, MCD, and SubT-MRS datasets, demonstrating superior accuracy and robustness compared to existing voxel-based algorithms.
基于体素的LiDAR-inertial odometry (LIO)准确高效,但当单高斯体素模型不加选择地合并来自冲突视点的观测结果时,可能会出现几何不一致性。为了解决这一限制,我们提出了Azimuth-LIO,这是一种鲁棒的基于体素的LIO框架,利用了方位感知体素化和概率融合。我们不再使用每个体素的单一分布,而是将每个体素离散为方位角分割的子结构,每个子结构都由各向异性3D高斯建模,以保留特定于视点的空间特征和不确定性。我们进一步引入了一个方向加权分布到分布的配准度量,以自适应地量化不同方位角扇区的贡献,然后使用贝叶斯融合框架利用这些置信度权重来确保方位角一致的地图更新。在M2DGR、MCD和SubT-MRS等公共基准数据集上对所提出方法的性能和效率进行了评估,与现有的基于体素的算法相比,显示出更高的准确性和鲁棒性。
{"title":"Azimuth-LIO: Robust LiDAR-Inertial Odometry via Azimuth-Aware Voxelization and Probabilistic Fusion","authors":"Zhongguan Liu;Wei Li;Honglei Che;Lu Pan;Shuaidong Yuan","doi":"10.1109/LRA.2026.3655291","DOIUrl":"https://doi.org/10.1109/LRA.2026.3655291","url":null,"abstract":"Voxel-based LiDAR–inertial odometry (LIO) is accurate and efficient but can suffer from geometric inconsistencies when single-Gaussian voxel models indiscriminately merge observations from conflicting viewpoints. To address this limitation, we propose Azimuth-LIO, a robust voxel-based LIO framework that leverages azimuth-aware voxelization and probabilistic fusion. Instead of using a single distribution per voxel, we discretize each voxel into azimuth-sectorized substructures, each modeled by an anisotropic 3D Gaussian to preserve viewpoint-specific spatial features and uncertainties. We further introduce a direction-weighted distribution-to-distribution registration metric to adaptively quantify the contributions of different azimuth sectors, followed by a Bayesian fusion framework that exploits these confidence weights to ensure azimuth-consistent map updates. The performance and efficiency of the proposed method are evaluated on public benchmarks including the M2DGR, MCD, and SubT-MRS datasets, demonstrating superior accuracy and robustness compared to existing voxel-based algorithms.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 3","pages":"3158-3165"},"PeriodicalIF":5.3,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Robotics and Automation Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1