3D mapping is vital for a broad range of applications that rely on a consistent and accurate representation of the environment. Change is an ever-present force in our world, and as a scene evolves its 3D map becomes outdated. Thus, a mapping framework that can adapt and refine 3D maps as the scene changes is necessary. In this letter, we propose a lifelong mapping framework where map maintenance serves two objectives: preserving static structures and refining the 3D map. To preserve only the static structures, we classify each object’s state and remove both dynamic objects and quasi-static objects, i.e., objects which temporarily appear static. For classifying the state of objects, we propose a discrete probabilistic solution utilizing a factor graph. Using this classification, we generate static maps from multiple sessions, which are used for map refinement. The refinement is based on change detection and map update, leveraging semantic and geometric information. For the evaluation, we collect a multi-campus lifelong dataset as an extension of the MCD datasets from the KTH and NTU campuses. The proposed approach accurately detects quasi-static objects even in highly dynamic environments, and our system demonstrates state-of-the-art performance in large-scale environments. Furthermore, our approach can handle both SLAM-generated and survey-grade maps.
ProbPer-LiLo: Probabilistic Persistency Modeling for Life-Long Mapping. Waqas Ali; Yixi Cai; Patric Jensfelt; Thien-Minh Nguyen. IEEE Robotics and Automation Letters, vol. 11, no. 3, pp. 2530–2537. Pub Date: 2026-01-12 | DOI: 10.1109/LRA.2026.3653311
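The factor-graph state classification described in the abstract above can be illustrated in miniature: a chain-structured factor graph over one object's discrete state reduces to a recursive Bayes filter. The sketch below is purely illustrative, not the paper's model; the three states are taken from the abstract, but the transition table and observation likelihoods are hypothetical values chosen for the example.

```python
# Illustrative sketch: a chain factor graph over a discrete object state,
# evaluated as a recursive Bayes filter across mapping sessions.
STATES = ["static", "quasi-static", "dynamic"]

# Hypothetical transition model: objects mostly keep their state between sessions.
TRANS = {
    "static":       {"static": 0.90, "quasi-static": 0.08, "dynamic": 0.02},
    "quasi-static": {"static": 0.15, "quasi-static": 0.70, "dynamic": 0.15},
    "dynamic":      {"static": 0.05, "quasi-static": 0.15, "dynamic": 0.80},
}

# Hypothetical observation likelihoods p(observed_moving | state).
OBS = {
    "static":       {True: 0.05, False: 0.95},
    "quasi-static": {True: 0.40, False: 0.60},
    "dynamic":      {True: 0.90, False: 0.10},
}

def update(belief, moved):
    """One predict + correct step for a boolean 'object seen moving' observation."""
    predicted = {s: sum(belief[t] * TRANS[t][s] for t in STATES) for s in STATES}
    unnorm = {s: predicted[s] * OBS[s][moved] for s in STATES}
    z = sum(unnorm.values())
    return {s: v / z for s, v in unnorm.items()}

belief = {s: 1.0 / len(STATES) for s in STATES}  # uniform prior
for moved in [False, False, True, False]:        # observations across sessions
    belief = update(belief, moved)
```

An object that never appears to move drives the belief toward "static", while a single "moved" observation shifts mass toward the quasi-static and dynamic states.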
Pub Date: 2026-01-12 | DOI: 10.1109/LRA.2026.3652073
Yixiang Chen;Yan Huang;Keji He;Peiyan Li;Liang Wang
When performing 3D manipulation tasks, robots have to execute action planning based on perception from multiple fixed cameras. The multi-camera setup introduces substantial redundancy and irrelevant information, which increases computational costs and forces the model to spend extra training time extracting crucial task-relevant details. To filter out redundant information and accurately extract task-relevant features, we propose the VERM (Virtual Eye for Robotic Manipulation) method, leveraging the knowledge in foundation models to imagine a virtual task-adaptive view from the constructed 3D point cloud, which efficiently captures necessary information and mitigates occlusion. To facilitate 3D action planning and fine-grained manipulation, we further design a depth-aware module and a dynamic coarse-to-fine procedure. Extensive experimental results on both the simulation benchmark RLBench and real-world evaluations demonstrate the effectiveness of our method, surpassing previous state-of-the-art methods while achieving a 1.89× speedup in training and a 1.54× speedup in inference.
VERM: Leveraging Foundation Models to Create a Virtual Eye for Efficient 3D Robotic Manipulation. IEEE Robotics and Automation Letters, vol. 11, no. 3, pp. 2482–2489.
Path planning in large-scale, complex 3D environments is fundamentally constrained by a trade-off between path quality and computational speed. This paper presents RUSH (Recursive and Scalable 3D Coarse To Fine Path Planning), a hierarchical framework that resolves this trade-off. RUSH decomposes the long-range planning task into a coarse plan followed by fine-grained, independent subproblems that can be solved in parallel. These subproblems are addressed by a unified, diffusion-based network that refines an initial path estimate by learning its residual to an optimal path. This approach allows RUSH to leverage rich geometric information directly from 3D voxel maps without being bottlenecked by the full map’s complexity. We validate our method on large-scale outdoor (KITTI, MulRan) and indoor (HM3D) datasets, each spanning a 200 m × 200 m × 6 m map. Experimental results demonstrate that RUSH generates feasible, high-quality paths with remarkable efficiency, achieving up to a 12.59× speedup over a hierarchically accelerated A* baseline, while maintaining a path cost within 24% of the optimal solution. This performance gain positions RUSH as a powerful and practical solution for applications requiring rapid global path planning in large-scale 3D maps.
RUSH: Recursive and Scalable 3D Coarse to Fine Path Planning. Hwajung Lee; Daegeol Ko; Jaehyuk Hur; Junwon Lee; Seongbo Ha; Jong Hwan Ko; Hyeonwoo Yu. IEEE Robotics and Automation Letters, vol. 11, no. 2, pp. 2346–2353. Pub Date: 2026-01-12 | DOI: 10.1109/LRA.2026.3653375
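The coarse-to-fine decomposition described above can be sketched on a toy 2D occupancy grid: plan on a conservatively downsampled grid, then refine each coarse leg as an independent subproblem (the paper solves these with a diffusion-based network, in parallel and in 3D; here a plain BFS stands in for both planner and refiner, and every name is illustrative).

```python
# Illustrative coarse-to-fine planning sketch (BFS stands in for the
# paper's diffusion-based subproblem refiner).
from collections import deque

def bfs(grid, start, goal):
    """Shortest 4-connected path on a 0/1 occupancy grid (1 = blocked)."""
    h, w = len(grid), len(grid[0])
    prev, q = {start: None}, deque([start])
    while q:
        node = q.popleft()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = prev[node]
            return path[::-1]
        x, y = node
        for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nxt = (x + dx, y + dy)
            if (0 <= nxt[0] < h and 0 <= nxt[1] < w
                    and grid[nxt[0]][nxt[1]] == 0 and nxt not in prev):
                prev[nxt] = node
                q.append(nxt)
    return None

def coarse_to_fine(grid, start, goal, factor=2):
    """Plan on a downsampled grid, then refine each coarse leg on the
    fine grid; the legs are independent and could run in parallel."""
    ch, cw = len(grid) // factor, len(grid[0]) // factor
    # A coarse cell is free only if every fine cell inside it is free.
    coarse = [[int(any(grid[i * factor + a][j * factor + b]
                       for a in range(factor) for b in range(factor)))
               for j in range(cw)] for i in range(ch)]
    cpath = bfs(coarse, (start[0] // factor, start[1] // factor),
                (goal[0] // factor, goal[1] // factor))
    wps = [start] + [(i * factor, j * factor) for i, j in cpath[1:-1]] + [goal]
    path = [start]
    for a, b in zip(wps, wps[1:]):
        path += bfs(grid, a, b)[1:]  # assumes each leg is feasible
    return path
```

Each fine BFS only sees the small region between two waypoints, which mirrors how the framework avoids being bottlenecked by the full map's complexity.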
Pub Date: 2026-01-12 | DOI: 10.1109/LRA.2026.3653306
Serhat İşcan;H. Iṣıl Bozma
This paper introduces a reliable and fast method for scene representation from a single RGB frame, even with human occlusion. Our goal is to enhance vision-based spatial reasoning in dynamic environments where human presence varies over time. Once humans are detected, the method addresses two key challenges: estimating the level of visual obstruction and generating a scene descriptor with humans removed. The first is handled via a novel visual obstruction measure that prevents descriptor generation under high occlusion. The second is addressed by adapting the previously presented bubble descriptor so that surface regions corresponding to detected humans are deformed using a modified spherical interpolation method—eliminating the need for inpainting or reconstruction and enabling rapid computation. We validate our approach through extensive comparisons across multiple datasets, including two new datasets collected using both stationary and mobile robots. Results show comparable representation quality with a 14–44× reduction in computation time.
Reliable and Fast Humans Removed Visual Scene Representation. IEEE Robotics and Automation Letters, vol. 11, no. 3, pp. 2730–2737.
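A minimal stand-in for the obstruction gate described above: the paper defines its measure on the bubble surface, but the idea of refusing descriptor generation under high occlusion can be sketched in image space. The threshold, the 0/1 mask format, and the mean-intensity "descriptor" are all hypothetical placeholders.

```python
# Illustrative sketch of an occlusion-gated, humans-removed descriptor.
def obstruction_ratio(human_mask):
    """human_mask: 2D list of 0/1, where 1 marks a detected-human pixel."""
    total = sum(len(row) for row in human_mask)
    covered = sum(sum(row) for row in human_mask)
    return covered / total

def maybe_descriptor(image, human_mask, max_obstruction=0.4):
    """Return a humans-removed descriptor, or None under heavy occlusion."""
    if obstruction_ratio(human_mask) > max_obstruction:
        return None  # too obstructed: any descriptor would be unreliable
    # Stand-in "descriptor": mean intensity over non-human pixels (the
    # paper instead deforms a bubble surface by spherical interpolation).
    vals = [p for i, row in enumerate(image)
              for j, p in enumerate(row) if not human_mask[i][j]]
    return sum(vals) / len(vals)
```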
Pub Date: 2026-01-12 | DOI: 10.1109/LRA.2026.3653334
Pablo E. Tortós-Vinocour;Shota Kokubu;Fuko Matsunaga;Yuxi Lu;Zhongchaou Zhou;Naoki Kamijo;María Cordero-Alvarado;Jose Gomez-Tames;Wenwei Yu
Soft actuators are safer than rigid robots for hand rehabilitation, yet their performance can be significantly affected by interactions that occur in multi-joint systems. Actuator–finger and actuator–actuator interactions can impact bending output and make actuator performance dependent on the actuation pattern. To address this, we developed and validated a finite element model of a three-joint modular actuator system attached to a dummy finger. The simulation revealed that the displacement between actuators and the contact area between the fingers and the actuators are key factors influencing actuator performance. We proposed a novel attachment method to enhance contact area and reduce actuator displacement and compared it against five existing designs across two actuator types and three actuation patterns. Our results demonstrate improved bending and reduced dependence of actuator performance on actuation pattern. This study makes a dual contribution to the area of soft robotics for hand rehabilitation by proposing a novel FEM framework for modeling soft actuators attached to multi-joint systems as well as providing insights on attachment method design for soft actuators in hand rehabilitation, emphasizing the importance of actuator–actuator and actuator–finger interactions.
Accounting for the Interaction Between a Dummy Finger and Joint Modular Soft Actuators for Multi-Joint Support Using a Novel FEM-Based Approach. IEEE Robotics and Automation Letters, vol. 11, no. 3, pp. 2578–2585. Open access: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11345949
Pub Date: 2026-01-12 | DOI: 10.1109/LRA.2026.3653374
Jiawei Zhang;Chengchao Bai;Wei Pan;Jifeng Guo
Multiple Peg-in-Hole (MPiH) assembly is one of the fundamental tasks in robotic assembly. In the MPiH tasks for large-size parts, it is challenging for a single manipulator to simultaneously align multiple distant pegs and holes, necessitating tightly coupled multi-manipulator systems. For such MPiH tasks using tightly coupled multiple manipulators, we propose a collaborative visual servo control framework that uses only the monocular in-hand cameras of each manipulator to reduce positioning errors. Initially, we train a state classification neural network and a positioning neural network. The former divides the states of the peg and hole in the image into three categories: obscured, separated, and overlapped, while the latter determines the position of the peg and hole in the image. Building on these networks, we propose a method to integrate the visual features of multiple manipulators using virtual forces, which can naturally combine with the cooperative controller of the multi-manipulator system. To generalize our approach to holes of different appearances, we varied the appearance of the holes during the dataset generation process. The results confirm that by considering the appearance of the holes, classification accuracy and positioning precision can be improved. Finally, the results show that our method achieves a 100% success rate in dual-manipulator dual peg-in-hole tasks with a clearance of 0.2 mm, while remaining robust to camera calibration errors.
Virtual-Force Based Visual Servo for Multiple Peg-in-Hole Assembly With Tightly Coupled Multi-Manipulator. IEEE Robotics and Automation Letters, vol. 11, no. 3, pp. 2586–2593.
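The virtual-force integration of per-camera visual features can be sketched as follows. This is a toy version with a hypothetical scalar spring gain: the paper's forces feed a cooperative controller of a tightly coupled multi-manipulator system, whereas this sketch only sums image-space forces and skips views the state classifier labels "obscured".

```python
# Illustrative sketch: peg-hole image offsets combined as virtual spring forces.
def virtual_force(peg_px, hole_px, gain=0.5):
    """2D virtual force pulling the peg toward the hole in image space."""
    return (gain * (hole_px[0] - peg_px[0]), gain * (hole_px[1] - peg_px[1]))

def combined_command(detections, gain=0.5):
    """Sum virtual forces over all in-hand cameras whose peg/hole pair is
    visible ('separated' or 'overlapped'), skipping 'obscured' views."""
    fx = fy = 0.0
    for state, peg, hole in detections:
        if state == "obscured":
            continue
        f = virtual_force(peg, hole, gain)
        fx, fy = fx + f[0], fy + f[1]
    return fx, fy
```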
High-definition (HD) map learning serves as an essential component of autonomous driving scene understanding, providing structured priors for planning and prediction. Recent transformer-based methods regress vectorized map elements via deformable attention over Bird’s-Eye View (BEV) features. They typically employ a single-pass paradigm, starting from a set of initial queries. However, these queries struggle to precisely localize map elements within the large-scale BEV space. This difficulty is severely amplified when using lightweight backbones that produce less distinctive features. To address this, we propose RefDiffMap, which recasts map construction as a progressive refinement process driven by a diffusion model. We introduce a novel denoising query generator that, at each step, leverages the intermediate noisy geometry to sample relevant features from adaptive BEV RoIs. These features are distilled into context-aware queries that guide the decoder’s next refinement. This creates a powerful geometry-feature co-evolution loop, allowing the model to iteratively correct localization errors. Comprehensive experiments show that RefDiffMap achieves competitive performance on the nuScenes and Argoverse 2 datasets. Notably, its robustness is highlighted with a ResNet-18 backbone, where it improves mAP by a significant 11.3% over our baseline MapTRv2. Further ablation studies validate the effectiveness of our approach.
RefDiffMap: Diffusion-Guided Progressive Refinement for Vectorized HD Map Construction. Wenjie Gao; Entao Chang; Jiawei Fu; Ziyu Zhu; Shitao Chen; Nanning Zheng. IEEE Robotics and Automation Letters, vol. 11, no. 3, pp. 2554–2561. Pub Date: 2026-01-12 | DOI: 10.1109/LRA.2026.3653402
Pub Date: 2026-01-12 | DOI: 10.1109/LRA.2026.3653335
Sanjeev Ramkumar Sudha;Marija Popović;Erlend M. Coates
Mobile robot platforms are increasingly being used to automate information gathering tasks such as environmental monitoring. Efficient target tracking in dynamic environments is critical for applications such as search and rescue and pollutant cleanups. In this letter, we study active mapping of floating targets that drift due to environmental disturbances such as wind and currents. This is a challenging problem as it involves predicting both spatial and temporal variations in the map due to changing conditions. We introduce an integrated framework combining dynamic occupancy grid mapping and an informative planning approach to actively map and track freely drifting targets with an autonomous surface vehicle. A key component of our adaptive planning approach is a spatiotemporal prediction network that predicts target position distributions over time. We further propose a planning objective for target tracking that leverages these predictions. Simulation experiments show that this planning objective improves target tracking performance compared to existing methods that consider only entropy reduction as the planning objective. Finally, we validate our approach in field tests, showcasing its ability to track targets in real-world monitoring scenarios.
An Informative Planning Framework for Target Tracking and Active Mapping in Dynamic Environments With ASVs. IEEE Robotics and Automation Letters, vol. 11, no. 3, pp. 2690–2697.
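The spatiotemporal prediction step described above can be caricatured on a toy occupancy grid. The paper uses a learned network to predict target position distributions; this sketch instead shifts probabilities by a known integer drift and decays them to model growing uncertainty, then picks the most probable predicted cell as a stand-in tracking objective. All parameters are hypothetical.

```python
# Illustrative sketch: constant-drift prediction of a target-occupancy grid.
def predict_grid(grid, drift, decay=0.9):
    """Shift target-occupancy probabilities by an integer drift (dx, dy)
    and decay them to reflect growing uncertainty over time."""
    h, w = len(grid), len(grid[0])
    out = [[0.0] * w for _ in range(h)]
    dx, dy = drift
    for x in range(h):
        for y in range(w):
            nx, ny = x + dx, y + dy
            if 0 <= nx < h and 0 <= ny < w:
                out[nx][ny] = decay * grid[x][y]
    return out

def best_cell(pred):
    """Toy tracking objective: steer toward the most probable predicted cell."""
    return max(((pred[x][y], (x, y))
                for x in range(len(pred)) for y in range(len(pred[0]))))[1]
```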
Pub Date: 2026-01-12 | DOI: 10.1109/LRA.2026.3653331
Ginga Kennis;Shogo Arai
Autonomous robotic handling requires accurate 3-D scene measurement followed by grasp planning. Conventional systems struggle with transparent or specular objects. Additionally, in hand–eye setups, moving through multiple viewpoints increases handling execution time. In this paper, we propose HEAPGrasp—Hand-Eye Active Perception to Grasp objects with diverse optical properties. To measure such objects, we focus on the ability to segment objects regardless of their optical properties in RGB images. We employ Shape from Silhouette based on the segmented images for 3-D measurement. To shorten the time required for multi-view capture with a hand-eye camera, we plan its trajectory using a cost function that balances 3-D measurement accuracy against its trajectory length. Real-robot experiments achieve a 96.0% grasp success rate on transparent, specular, and opaque objects, while reducing the hand-eye camera’s trajectory length by 52% and handling execution time by 19% relative to a baseline that circles around the scene for 3-D measurement.
HEAPGrasp: Hand-Eye Active Perception to Grasp Objects With Diverse Optical Properties. IEEE Robotics and Automation Letters, vol. 11, no. 3, pp. 3206–3213. Open access: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11345713
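A trajectory cost that balances measurement accuracy against path length, as in the abstract above, can be sketched generically. The weighting, the candidate viewpoint sequences, and the accuracy model below are all hypothetical stand-ins for whatever the system actually uses.

```python
# Illustrative sketch: choose a viewpoint sequence by trading off an
# accuracy score against the hand-eye camera's trajectory length.
import math

def trajectory_length(views):
    """Total Euclidean length of a sequence of viewpoint positions."""
    return sum(math.dist(a, b) for a, b in zip(views, views[1:]))

def plan_views(candidates, accuracy_of, weight=1.0):
    """Pick the candidate minimizing J = -accuracy + weight * length,
    a balance-style cost (accuracy_of is a caller-supplied model)."""
    return min(candidates,
               key=lambda vs: -accuracy_of(vs) + weight * trajectory_length(vs))
```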
Transparent objects are ubiquitous in laboratory automation settings, as liquids need to be visually controlled regularly. Automating laboratory processes would make the creation of small-batch medication feasible, thus making more personalized and better-targeted treatments more accessible. However, transparent objects present a major challenge for robust vision systems, in turn compromising their manipulation. Their appearance varies depending on the environment, and depth sensors fail to capture their measurements. These objects therefore break central assumptions made by depth-based as well as render-and-compare pose refinement strategies. To ensure reliable pose estimation, we propose Silhouette-based object pose Refinement (SilRef), a novel pose refinement approach leveraging object silhouette detection and geometric cues, circumventing the need for depth maps or realistic rendering, which makes it robust to environmental change. Our proposed formulation directly optimizes the poses by gradient descent based on rendering of 3D models and benefits from a large convergence basin. SilRef is evaluated on the Keypose dataset and the newly collected Tracebot In-Gripper dataset. Results show an improvement of 2.8x and 2.7x in Average Distance of Model Points-Symmetric (ADD-S@0.01 m) when the object is standing on a surface and when the object is already grasped, respectively, compared to Megapose6D and ICP (Iterative Closest Point).
{"title":"SilRef: Joint Visual Silhouette and Tactile Pose Optimization for Transparent Object Manipulation","authors":"Jean-Baptiste Weibel;Clemence Dubois;Negar Layegh Khavidaki;Saifeddine Aloui;Mathieu Grossard;Markus Vincze;Andreas Holzinger","doi":"10.1109/LRA.2026.3653340","DOIUrl":"https://doi.org/10.1109/LRA.2026.3653340","url":null,"abstract":"Transparent objects are ubiquitous in laboratory automation settings, as liquids need to be visually controlled regularly. Automating laboratory processes would make the creation of small-batch medication feasible, thus making more personalized and better-targeted treatments more accessible. However, transparent objects present a major challenge for robust vision systems, in turn compromising their manipulation. Their appearance varies depending on the environment and depth sensors fail to capture their measurements. These objects therefore break central assumptions made by depth-based as well as render-and-compare pose refinement strategies. To ensure reliable pose estimation, we propose Silhouette-based object pose Refinement (SilRef), a novel pose refinement approach leveraging object silhouette detection and geometric cues, circumventing the need for depth maps or realistic rendering making it robust to environment change. Our proposed formulation directly optimizes the poses by gradient descent based on 3D models rendering and benefits from a large convergence basin. SilRef is evaluated on the Keypose dataset and the newly collected Tracebot In-Gripper dataset. 
Results show an improvement of 2.8x and 2.7x in Average Distance of Model Points-Symmetric (ADD-S@0.01 m) when the object is standing on a surface and when the object is already grasped, respectively, compared to Megapose6D and ICP (Iterative Closest Point).","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"11 3","pages":"2490-2497"},"PeriodicalIF":5.3,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11346999","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146001866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
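The silhouette-driven gradient-descent refinement behind SilRef can be illustrated in a heavily simplified 2-D form: align a model contour to an observed silhouette contour by descending a chamfer-style loss. This toy uses a translation-only pose and numerical gradients instead of a differentiable renderer; all names and parameters are illustrative assumptions, not the paper's method.

```python
import numpy as np

def silhouette_loss(pose_xy, model_pts, target_pts):
    """Mean squared nearest-neighbour distance from the translated model
    contour points to the observed silhouette contour points."""
    moved = model_pts + pose_xy
    d = np.linalg.norm(moved[:, None, :] - target_pts[None, :, :], axis=2)
    return (d.min(axis=1) ** 2).mean()

def refine_pose(model_pts, target_pts, steps=200, lr=0.1, eps=1e-4):
    """Refine a 2-DoF translation by numerical gradient descent on the loss."""
    pose = np.zeros(2)
    for _ in range(steps):
        grad = np.zeros(2)
        for i in range(2):
            dp = np.zeros(2)
            dp[i] = eps
            # central-difference estimate of the partial derivative
            grad[i] = (silhouette_loss(pose + dp, model_pts, target_pts)
                       - silhouette_loss(pose - dp, model_pts, target_pts)) / (2 * eps)
        pose -= lr * grad
    return pose
```

Because the objective depends only on contour geometry, no depth map or photorealistic rendering is needed, mirroring the robustness argument in the abstract.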