首页 > 最新文献

IEEE Robotics and Automation Letters最新文献

英文 中文
A Fast and Accurate Visual Inertial Odometry Using Hybrid Point-Line Features 使用混合点-线特征的快速准确视觉惯性测距仪
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-11-01 DOI: 10.1109/LRA.2024.3490406
Zhenhang Chen;Zhiqiang Miao;Min Liu;Chengzhong Wu;Yaonan Wang
Mainstream visual-inertial SLAM systems use point features for motion estimation and localization. However, point features do not perform well in scenes such as weak texture and motion blur. Therefore, the introduction of line features has received a lot of attention. In this letter, we propose a point-line based real-time monocular visual inertial odometry. Aiming at the problem that most of the current works do not fully utilize the line feature properties, we derive the point-line based hybrid Multi-State Constraint Kalman Filter (hybrid MSCKF) in detail. To further improve the line feature initialization accuracy, we propose a two-step line triangulation method. Since filter-based methods are susceptible to visual outliers, we also propose a redundant line feature removal strategy suitable for the filtering framework. According to the experimental results in EuRoC data set and real environment, the proposed algorithm outperforms other state-of-the-art algorithms in accuracy and real-time performance.
主流的视觉惯性 SLAM 系统使用点特征进行运动估计和定位。然而,点特征在弱纹理和运动模糊等场景中表现不佳。因此,线特征的引入受到了广泛关注。在这封信中,我们提出了一种基于点-线的实时单目视觉惯性里程计。针对目前大多数研究没有充分利用线特征特性的问题,我们详细推导了基于点-线的混合多态约束卡尔曼滤波器(hybrid MSCKF)。为了进一步提高线特征初始化精度,我们提出了一种两步线三角测量法。由于基于滤波的方法容易受到视觉异常值的影响,我们还提出了适合滤波框架的冗余线条特征去除策略。根据在 EuRoC 数据集和真实环境中的实验结果,所提出的算法在准确性和实时性上都优于其他最先进的算法。
{"title":"A Fast and Accurate Visual Inertial Odometry Using Hybrid Point-Line Features","authors":"Zhenhang Chen;Zhiqiang Miao;Min Liu;Chengzhong Wu;Yaonan Wang","doi":"10.1109/LRA.2024.3490406","DOIUrl":"https://doi.org/10.1109/LRA.2024.3490406","url":null,"abstract":"Mainstream visual-inertial SLAM systems use point features for motion estimation and localization. However, point features do not perform well in scenes such as weak texture and motion blur. Therefore, the introduction of line features has received a lot of attention. In this letter, we propose a point-line based real-time monocular visual inertial odometry. Aiming at the problem that most of the current works do not fully utilize the line feature properties, we derive the point-line based hybrid Multi-State Constraint Kalman Filter (hybrid MSCKF) in detail. To further improve the line feature initialization accuracy, we propose a two-step line triangulation method. Since filter-based methods are susceptible to visual outliers, we also propose a redundant line feature removal strategy suitable for the filtering framework. According to the experimental results in EuRoC data set and real environment, the proposed algorithm outperforms other state-of-the-art algorithms in accuracy and real-time performance.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11345-11352"},"PeriodicalIF":4.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Public Hospital's Decision Analysis on Providing E-Visit Services 公立医院提供电子门诊服务的决策分析
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-31 DOI: 10.1109/LRA.2024.3490397
Na Li;Fanghao Yuan;Yan Li;Wenwen Lv
E-visits have emerged as a pivotal tool for enhancing both patients' medical experience and the efficient utilization of medical resources. By modeling stylized queueing models, this letter explores the optimal online medical resource allocation decision on providing e-visits in a public hospital, and examines the impact of the external Internet hospital on the public hospital's decision. The public hospital aims to maximize its utility by determining the optimal online medical resource allocation, while patients make their visit decisions based on utility. Results indicate that increased sensitivity of the revisit rate to resource allocation, or higher unit gain associated with e-visit patients (achieved from providing convenient access to medical services and minimizing cross-infection risks), prompts the public hospital to allocate more resources to e-visits. Notably, external Internet hospital price and capacity do not alter the basic influence patterns of the decision, but the resource allocation proportion increases with the external e-visit service price. Moreover, as the external e-visit service capacity expands, the optimal resource allocation proportion decreases when the sensitivity of the revisit rate is low (which indicates the quality is influenced slightly by resource allocation), and exhibits an initial decrease followed by an increase when the sensitivity is high.
电子门诊已成为改善患者就医体验和提高医疗资源利用效率的重要工具。本文通过建立风格化的排队模型,探讨了公立医院提供电子门诊的最优在线医疗资源配置决策,并研究了外部互联网医院对公立医院决策的影响。公立医院的目标是通过确定最优在线医疗资源分配来实现其效用最大化,而患者则根据效用做出就诊决策。结果表明,复诊率对资源分配的敏感度增加,或电子就诊患者的单位收益增加(通过提供便捷的医疗服务和最大限度地降低交叉感染风险实现),都会促使公立医院为电子就诊分配更多资源。值得注意的是,外部互联网医院的价格和能力并不改变决策的基本影响模式,但资源分配比例会随着外部电子就诊服务价格的提高而增加。此外,随着外部电子就诊服务能力的扩大,当复诊率的敏感度较低时(表明质量受资源分配的影响较小),最优资源分配比例会降低;当敏感度较高时,最优资源分配比例会先降低后提高。
{"title":"Public Hospital's Decision Analysis on Providing E-Visit Services","authors":"Na Li;Fanghao Yuan;Yan Li;Wenwen Lv","doi":"10.1109/LRA.2024.3490397","DOIUrl":"https://doi.org/10.1109/LRA.2024.3490397","url":null,"abstract":"E-visits have emerged as a pivotal tool for enhancing both patients' medical experience and the efficient utilization of medical resources. By modeling stylized queueing models, this letter explores the optimal online medical resource allocation decision on providing e-visits in a public hospital, and examines the impact of the external Internet hospital on the public hospital's decision. The public hospital aims to maximize its utility by determining the optimal online medical resource allocation, while patients make their visit decisions based on utility. Results indicate that increased sensitivity of the revisit rate to resource allocation, or higher unit gain associated with e-visit patients (achieved from providing convenient access to medical services and minimizing cross-infection risks), prompts the public hospital to allocate more resources to e-visits. Notably, external Internet hospital price and capacity do not alter the basic influence patterns of the decision, but the resource allocation proportion increases with the external e-visit service price. Moreover, as the external e-visit service capacity expands, the optimal resource allocation proportion decreases when the sensitivity of the revisit rate is low (which indicates the quality is influenced slightly by resource allocation), and exhibits an initial decrease followed by an increase when the sensitivity is high.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11577-11584"},"PeriodicalIF":4.6,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142672202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Passivity-Based Teleoperation With Variable Rotational Impedance Control 采用可变旋转阻抗控制的被动式遥控操作
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-31 DOI: 10.1109/LRA.2024.3490260
Youssef Michel;Youssef Abdelhalem;Gordon Cheng
In this work, we present a novel bilateral teleoperation architecture with variable impedance control for orientational contact tasks. We exploit Unit Quaternions and tools from Lie Theory to model and manipulate robot orientations, as well as Learning-from-Demonstration to learn a stiffness adaptation policy from the demonstrated task dynamics. The learnt policy then shapes the rotational stiffness of the remote robot during contact with the environment. We also present a passivity analysis where we use energy tanks to guarantee the passivity of the closed loop system, and hence the stable interaction. Our approach is validated on real robot hardware in a cutting task along a curve, and in a user study.
在这项工作中,我们针对方向接触任务提出了一种具有可变阻抗控制的新型双边远程操作架构。我们利用单位四元数和谎言理论工具来建模和操纵机器人的方向,并利用 "从演示中学习"(Learning-from-Demonstration)技术,从演示的任务动态中学习刚度适应策略。学习到的策略会在远程机器人与环境接触时改变其旋转刚度。我们还进行了钝化分析,利用能量槽来保证闭环系统的钝化,从而实现稳定的交互。我们的方法在沿曲线切割任务中的真实机器人硬件和用户研究中得到了验证。
{"title":"Passivity-Based Teleoperation With Variable Rotational Impedance Control","authors":"Youssef Michel;Youssef Abdelhalem;Gordon Cheng","doi":"10.1109/LRA.2024.3490260","DOIUrl":"https://doi.org/10.1109/LRA.2024.3490260","url":null,"abstract":"In this work, we present a novel bilateral teleoperation architecture with variable impedance control for orientational contact tasks. We exploit Unit Quaternions and tools from Lie Theory to model and manipulate robot orientations, as well as Learning-from-Demonstration to learn a stiffness adaptation policy from the demonstrated task dynamics. The learnt policy then shapes the rotational stiffness of the remote robot during contact with the environment. We also present a passivity analysis where we use energy tanks to guarantee the passivity of the closed loop system, and hence the stable interaction. Our approach is validated on real robot hardware in a cutting task along a curve, and in a user study.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11658-11665"},"PeriodicalIF":4.6,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10740678","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142679313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Safe Reinforcement Learning of Robot Trajectories in the Presence of Moving Obstacles 移动障碍物面前机器人轨迹的安全强化学习
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-30 DOI: 10.1109/LRA.2024.3488402
Jonas Kiemel;Ludovic Righetti;Torsten Kröger;Tamim Asfour
In this paper, we present an approach for learning collision-free robot trajectories in the presence of moving obstacles. As a first step, we train a backup policy to generate evasive movements from arbitrary initial robot states using model-free reinforcement learning. When learning policies for other tasks, the backup policy can be used to estimate the potential risk of a collision and to offer an alternative action if the estimated risk is considered too high. No matter which action is selected, our action space ensures that the kinematic limits of the robot joints are not violated. We analyze and evaluate two different methods for estimating the risk of a collision. A physics simulation performed in the background is computationally expensive but provides the best results in deterministic environments. If a data-based risk estimator is used instead, the computational effort is significantly reduced, but an additional source of error is introduced. For evaluation, we successfully learn a reaching task and a basketball task while keeping the risk of collisions low. The results demonstrate the effectiveness of our approach for deterministic and stochastic environments, including a human-robot scenario and a ball environment, where no state can be considered permanently safe. By conducting experiments with a real robot, we show that our approach can generate safe trajectories in real time.
在本文中,我们提出了一种在移动障碍物面前学习无碰撞机器人轨迹的方法。第一步,我们利用无模型强化学习技术训练一种后备策略,以便从任意的机器人初始状态生成规避动作。在为其他任务学习策略时,后备策略可用于估计碰撞的潜在风险,并在估计风险过高时提供替代行动。无论选择哪种行动,我们的行动空间都能确保不违反机器人关节的运动学极限。我们分析并评估了估算碰撞风险的两种不同方法。在后台进行的物理模拟计算成本高昂,但在确定性环境中却能提供最佳结果。如果改用基于数据的风险估算器,计算量会显著减少,但会引入额外的误差源。为了进行评估,我们成功地学习了一项伸手任务和一项篮球任务,同时保持了较低的碰撞风险。结果表明,我们的方法在确定性和随机性环境中都很有效,包括人机场景和球类环境,在这些环境中,没有任何状态可以被认为是永久安全的。通过使用真实机器人进行实验,我们证明我们的方法可以实时生成安全轨迹。
{"title":"Safe Reinforcement Learning of Robot Trajectories in the Presence of Moving Obstacles","authors":"Jonas Kiemel;Ludovic Righetti;Torsten Kröger;Tamim Asfour","doi":"10.1109/LRA.2024.3488402","DOIUrl":"https://doi.org/10.1109/LRA.2024.3488402","url":null,"abstract":"In this paper, we present an approach for learning collision-free robot trajectories in the presence of moving obstacles. As a first step, we train a backup policy to generate evasive movements from arbitrary initial robot states using model-free reinforcement learning. When learning policies for other tasks, the backup policy can be used to estimate the potential risk of a collision and to offer an alternative action if the estimated risk is considered too high. No matter which action is selected, our action space ensures that the kinematic limits of the robot joints are not violated. We analyze and evaluate two different methods for estimating the risk of a collision. A physics simulation performed in the background is computationally expensive but provides the best results in deterministic environments. If a data-based risk estimator is used instead, the computational effort is significantly reduced, but an additional source of error is introduced. For evaluation, we successfully learn a reaching task and a basketball task while keeping the risk of collisions low. The results demonstrate the effectiveness of our approach for deterministic and stochastic environments, including a human-robot scenario and a ball environment, where no state can be considered permanently safe. By conducting experiments with a real robot, we show that our approach can generate safe trajectories in real time.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11353-11360"},"PeriodicalIF":4.6,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Based Estimation of Tool-Tissue Interaction Forces for Stationary and Moving Environments 基于学习的静态和移动环境下工具与组织相互作用力的估计
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-30 DOI: 10.1109/LRA.2024.3488400
L. Nowakowski;R. V. Patel
Accurately estimating tool-tissue interaction forces during robotics-assisted minimally invasive surgery is an important aspect of enabling haptics-based teleoperation. By collecting data regarding the state of a robot in a variety of configurations, neural networks can be trained to predict this interaction force. This paper extends existing work in this domain based on collecting one of the largest known ground truth force datasets for stationary as well as moving phantoms that replicate tissue motions found in clinical procedures. Existing methods, and a new transformer-based architecture, are evaluated to demonstrate the domain gap between stationary and moving phantom tissue data and the impact that data scaling has on each architecture's ability to generalize the force estimation task. It was found that temporal networks were more sensitive to the moving domain than single-sample Feed Forward Networks (FFNs) that were trained on stationary tissue data. However, the transformer approach results in the lowest Root Mean Square Error (RMSE) when evaluating networks trained on examples of both stationary and moving phantom tissue samples. The results demonstrate the domain gap between stationary and moving surgical environments and the effectiveness of scaling datasets for increased accuracy of interaction force prediction.
准确估计机器人辅助微创手术过程中工具与组织的相互作用力是实现基于触觉的远程操作的一个重要方面。通过收集机器人在各种配置下的状态数据,可以训练神经网络来预测这种相互作用力。本文基于收集已知最大的地面真实力数据集之一,对该领域的现有工作进行了扩展,该数据集用于静止和移动模型,复制了临床手术中发现的组织运动。对现有方法和基于变压器的新架构进行了评估,以证明静态和移动模型组织数据之间的领域差距,以及数据缩放对每种架构概括力估算任务能力的影响。结果发现,与在静态组织数据上训练的单样本前馈网络(FFN)相比,时态网络对移动域更加敏感。不过,在评估根据静态和移动幻影组织样本训练的网络时,变换器方法的均方根误差(RMSE)最小。结果证明了静止和移动手术环境之间的领域差距,以及扩展数据集以提高相互作用力预测准确性的有效性。
{"title":"Learning Based Estimation of Tool-Tissue Interaction Forces for Stationary and Moving Environments","authors":"L. Nowakowski;R. V. Patel","doi":"10.1109/LRA.2024.3488400","DOIUrl":"https://doi.org/10.1109/LRA.2024.3488400","url":null,"abstract":"Accurately estimating tool-tissue interaction forces during robotics-assisted minimally invasive surgery is an important aspect of enabling haptics-based teleoperation. By collecting data regarding the state of a robot in a variety of configurations, neural networks can be trained to predict this interaction force. This paper extends existing work in this domain based on collecting one of the largest known ground truth force datasets for stationary as well as moving phantoms that replicate tissue motions found in clinical procedures. Existing methods, and a new transformer-based architecture, are evaluated to demonstrate the domain gap between stationary and moving phantom tissue data and the impact that data scaling has on each architecture's ability to generalize the force estimation task. It was found that temporal networks were more sensitive to the moving domain than single-sample Feed Forward Networks (FFNs) that were trained on stationary tissue data. However, the transformer approach results in the lowest Root Mean Square Error (RMSE) when evaluating networks trained on examples of both stationary and moving phantom tissue samples. The results demonstrate the domain gap between stationary and moving surgical environments and the effectiveness of scaling datasets for increased accuracy of interaction force prediction.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11266-11273"},"PeriodicalIF":4.6,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142598640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal Variational DeepMDP: An Efficient Approach for Industrial Assembly in High-Mix, Low-Volume Production 多模态变式 DeepMDP:用于多品种、小批量生产中的工业装配的高效方法
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-29 DOI: 10.1109/LRA.2024.3487490
Grzegorz Bartyzel
Transferability, along with sample efficiency, is a critical factor for a reinforcement learning (RL) agent's successful application in real-world contact-rich manipulation tasks, such as product assembly. For instance, in the case of the industrial insertion task on high-mix, low-volume (HMLV) production lines, transferability could eliminate the need for machine retooling, thus reducing production line downtimes. In our work, we introduce a method called Multimodal Variational DeepMDP (MVDeepMDP) that demonstrates the ability to generalize to various environmental variations not encountered during training. The key feature of our approach involves learning a multimodal latent dynamic representation. We demonstrate the effectiveness of our method in the context of an electronic parts insertion task, which is challenging for RL agents due to the diverse physical properties of the non-standardized components, as well as simple 3D printed blocks insertion. Furthermore, we evaluate the transferability of MVDeepMDP and analyze the impact of the balancing mechanism of the generalized Product-of-Experts (gPoE), which is used to combine observable modalities. Finally, we explore the influence of separately processing state modalities of different physical quantities, such as pose and 6D force/torque (F/T) data.
可转移性以及样本效率是强化学习(RL)代理成功应用于现实世界中产品组装等接触性操作任务的关键因素。例如,在多品种、小批量(HMLV)生产线上的工业插装任务中,可转移性可以消除机器重装的需要,从而减少生产线停机时间。在我们的工作中,我们引入了一种名为多模态变异 DeepMDP(MVDeepMDP)的方法,该方法展示了对训练期间未遇到的各种环境变化进行泛化的能力。我们方法的主要特点是学习多模态潜在动态表示。我们在电子零件插入任务中演示了该方法的有效性,由于非标准化组件的物理特性各不相同,该任务对 RL 代理以及简单的 3D 打印块插入具有挑战性。此外,我们还评估了 MVDeepMDP 的可移植性,并分析了广义专家产品(gPoE)平衡机制的影响,该机制用于结合可观察的模式。最后,我们探讨了分别处理不同物理量的状态模态(如姿势和 6D 力/力矩 (F/T) 数据)的影响。
{"title":"Multimodal Variational DeepMDP: An Efficient Approach for Industrial Assembly in High-Mix, Low-Volume Production","authors":"Grzegorz Bartyzel","doi":"10.1109/LRA.2024.3487490","DOIUrl":"https://doi.org/10.1109/LRA.2024.3487490","url":null,"abstract":"Transferability, along with sample efficiency, is a critical factor for a reinforcement learning (RL) agent's successful application in real-world contact-rich manipulation tasks, such as product assembly. For instance, in the case of the industrial insertion task on high-mix, low-volume (HMLV) production lines, transferability could eliminate the need for machine retooling, thus reducing production line downtimes. In our work, we introduce a method called Multimodal Variational DeepMDP (MVDeepMDP) that demonstrates the ability to generalize to various environmental variations not encountered during training. The key feature of our approach involves learning a multimodal latent dynamic representation. We demonstrate the effectiveness of our method in the context of an electronic parts insertion task, which is challenging for RL agents due to the diverse physical properties of the non-standardized components, as well as simple 3D printed blocks insertion. Furthermore, we evaluate the transferability of MVDeepMDP and analyze the impact of the balancing mechanism of the \u0000<italic>generalized Product-of-Experts</i>\u0000 (gPoE), which is used to combine observable modalities. Finally, we explore the influence of separately processing state modalities of different physical quantities, such as pose and 6D force/torque (F/T) data.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11297-11304"},"PeriodicalIF":4.6,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142645555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Safe and Efficient Multi-Agent Collision Avoidance With Physics-Informed Reinforcement Learning 利用物理信息强化学习实现安全高效的多代理碰撞规避
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-29 DOI: 10.1109/LRA.2024.3487491
Pu Feng;Rongye Shi;Size Wang;Junkang Liang;Xin Yu;Simin Li;Wenjun Wu
Reinforcement learning (RL) has shown great promise in addressing multi-agent collision avoidance challenges. However, existing RL-based methods often suffer from low training efficiency and poor action safety. To tackle these issues, we introduce a physics-informed reinforcement learning framework equipped with two modules: a Potential Field (PF) module and a Multi-Agent Multi-Level Safety (MAMLS) module. The PF module uses the Artificial Potential Field method to compute a regularization loss, adaptively integrating it into the critic's loss to enhance training efficiency. The MAMLS module formulates action safety as a constrained optimization problem, deriving safe actions by solving this optimization. Furthermore, to better address the characteristics of multi-agent collision avoidance tasks, multi-agent multi-level constraints are introduced. The results of simulations and real-world experiments showed that our physics-informed framework offers a significant improvement in terms of both the efficiency of training and safety-related metrics over advanced baseline methods.
强化学习(RL)在解决多机器人防撞难题方面已显示出巨大前景。然而,现有的基于 RL 的方法往往存在训练效率低和行动安全性差的问题。为了解决这些问题,我们引入了一个物理信息强化学习框架,该框架配备了两个模块:势场(PF)模块和多代理多级安全(MAMLS)模块。PF 模块使用人工势场方法计算正则化损失,并自适应地将其整合到批评者损失中,以提高训练效率。MAMLS 模块将行动安全视为一个约束优化问题,通过求解该优化问题得出安全行动。此外,为了更好地应对多机器人防碰撞任务的特点,还引入了多机器人多级约束。模拟和实际实验结果表明,与先进的基线方法相比,我们的物理信息框架在训练效率和安全相关指标方面都有显著提高。
{"title":"Safe and Efficient Multi-Agent Collision Avoidance With Physics-Informed Reinforcement Learning","authors":"Pu Feng;Rongye Shi;Size Wang;Junkang Liang;Xin Yu;Simin Li;Wenjun Wu","doi":"10.1109/LRA.2024.3487491","DOIUrl":"https://doi.org/10.1109/LRA.2024.3487491","url":null,"abstract":"Reinforcement learning (RL) has shown great promise in addressing multi-agent collision avoidance challenges. However, existing RL-based methods often suffer from low training efficiency and poor action safety. To tackle these issues, we introduce a physics-informed reinforcement learning framework equipped with two modules: a Potential Field (PF) module and a Multi-Agent Multi-Level Safety (MAMLS) module. The PF module uses the Artificial Potential Field method to compute a regularization loss, adaptively integrating it into the critic's loss to enhance training efficiency. The MAMLS module formulates action safety as a constrained optimization problem, deriving safe actions by solving this optimization. Furthermore, to better address the characteristics of multi-agent collision avoidance tasks, multi-agent multi-level constraints are introduced. The results of simulations and real-world experiments showed that our physics-informed framework offers a significant improvement in terms of both the efficiency of training and safety-related metrics over advanced baseline methods.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11138-11145"},"PeriodicalIF":4.6,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142598643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
D2S: Representing Sparse Descriptors and 3D Coordinates for Camera Relocalization D2S:表示稀疏描述符和三维坐标,实现相机重定位
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-29 DOI: 10.1109/LRA.2024.3487503
Bach-Thuan Bui;Huy-Hoang Bui;Dinh-Tuan Tran;Joo-Ho Lee
State-of-the-art visual localization methods mostly rely on complex procedures to match local descriptors and 3D point clouds. However, these procedures can incur significant costs in terms of inference, storage, and updates over time. In this study, we propose a direct learning-based approach that utilizes a simple network named D2S to represent complex local descriptors and their scene coordinates. Our method is characterized by its simplicity and cost-effectiveness. It solely leverages a single RGB image for localization during the testing phase and only requires a lightweight model to encode a complex sparse scene. The proposed D2S employs a combination of a simple loss function and graph attention to selectively focus on robust descriptors while disregarding areas such as clouds, trees, and several dynamic objects. This selective attention enables D2S to effectively perform a binary-semantic classification for sparse descriptors. Additionally, we propose a simple outdoor dataset to evaluate the capabilities of visual localization methods in scene-specific generalization and self-updating from unlabeled observations. Our approach outperforms the previous regression-based methods in both indoor and outdoor environments. It demonstrates the ability to generalize beyond training data, including scenarios involving transitions from day to night and adapting to domain shifts.
最先进的视觉定位方法大多依赖复杂的程序来匹配局部描述符和三维点云。然而,这些程序在推理、存储和随时间更新方面会产生巨大的成本。在本研究中,我们提出了一种基于直接学习的方法,利用名为 D2S 的简单网络来表示复杂的局部描述符及其场景坐标。我们的方法具有简单和成本效益高的特点。在测试阶段,它只需利用单张 RGB 图像进行定位,只需一个轻量级模型即可对复杂的稀疏场景进行编码。所提出的 D2S 结合使用了简单的损失函数和图注意,选择性地关注稳健描述符,而忽略云、树和一些动态物体等区域。这种选择性关注使 D2S 能够有效地对稀疏描述符进行二元语义分类。此外,我们还提出了一个简单的室外数据集,以评估视觉定位方法在特定场景泛化和从无标记观测中进行自我更新方面的能力。在室内和室外环境中,我们的方法都优于之前基于回归的方法。它展示了超越训练数据的泛化能力,包括从白天到夜晚的场景转换以及适应领域变化的能力。
{"title":"D2S: Representing Sparse Descriptors and 3D Coordinates for Camera Relocalization","authors":"Bach-Thuan Bui;Huy-Hoang Bui;Dinh-Tuan Tran;Joo-Ho Lee","doi":"10.1109/LRA.2024.3487503","DOIUrl":"https://doi.org/10.1109/LRA.2024.3487503","url":null,"abstract":"State-of-the-art visual localization methods mostly rely on complex procedures to match local descriptors and 3D point clouds. However, these procedures can incur significant costs in terms of inference, storage, and updates over time. In this study, we propose a direct learning-based approach that utilizes a simple network named D2S to represent complex local descriptors and their scene coordinates. Our method is characterized by its simplicity and cost-effectiveness. It solely leverages a single RGB image for localization during the testing phase and only requires a lightweight model to encode a complex sparse scene. The proposed D2S employs a combination of a simple loss function and graph attention to selectively focus on robust descriptors while disregarding areas such as clouds, trees, and several dynamic objects. This selective attention enables D2S to effectively perform a binary-semantic classification for sparse descriptors. Additionally, we propose a simple outdoor dataset to evaluate the capabilities of visual localization methods in scene-specific generalization and self-updating from unlabeled observations. Our approach outperforms the previous regression-based methods in both indoor and outdoor environments. It demonstrates the ability to generalize beyond training data, including scenarios involving transitions from day to night and adapting to domain shifts.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11449-11456"},"PeriodicalIF":4.6,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142636536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Art of Imitation: Learning Long-Horizon Manipulation Tasks From Few Demonstrations 模仿的艺术:从少量演示中学习远距离操作任务
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-29 DOI: 10.1109/LRA.2024.3487506
Jan Ole von Hartz;Tim Welschehold;Abhinav Valada;Joschka Boedecker
Task Parametrized Gaussian Mixture Models (TP-GMM) are a sample-efficient method for learning object-centric robot manipulation tasks. However, there are several open challenges to applying TP-GMMs in the wild. In this work, we tackle three crucial challenges synergistically. First, end-effector velocities are non-Euclidean and thus hard to model using standard GMMs. We thus propose to factorize the robot's end-effector velocity into its direction and magnitude, and model them using Riemannian GMMs. Second, we leverage the factorized velocities to segment and sequence skills from complex demonstration trajectories. Through the segmentation, we further align skill trajectories and hence leverage time as a powerful inductive bias. Third, we present a method to automatically detect relevant task parameters per skill from visual observations. Our approach enables learning complex manipulation tasks from just five demonstrations while using only RGB-D observations. Extensive experimental evaluations on RLBench demonstrate that our approach achieves state-of-the-art performance with 20-fold improved sample efficiency. Our policies generalize across different environments, object instances, and object positions, while the learned skills are reusable.
任务参数化高斯混合模型(TP-GMM)是学习以物体为中心的机器人操纵任务的一种样本高效方法。然而,在野外应用 TP-GMM 时还面临一些挑战。在这项工作中,我们协同应对了三个关键挑战。首先,末端执行器的速度是非欧几里得的,因此很难使用标准 GMM 建模。因此,我们建议将机器人的末端执行器速度因子化为方向和幅度,并使用黎曼 GMM 建立模型。其次,我们利用因子化速度对复杂的演示轨迹进行分割和技能排序。通过分割,我们进一步调整技能轨迹,从而利用时间作为强大的归纳偏倚。第三,我们提出了一种从视觉观察中自动检测每个技能的相关任务参数的方法。我们的方法只需使用 RGB-D 观察结果,就能从五个演示中学习复杂的操作任务。在 RLBench 上进行的广泛实验评估表明,我们的方法达到了最先进的性能,样本效率提高了 20 倍。我们的策略可在不同环境、对象实例和对象位置之间通用,同时所学技能可重复使用。
{"title":"The Art of Imitation: Learning Long-Horizon Manipulation Tasks From Few Demonstrations","authors":"Jan Ole von Hartz;Tim Welschehold;Abhinav Valada;Joschka Boedecker","doi":"10.1109/LRA.2024.3487506","DOIUrl":"https://doi.org/10.1109/LRA.2024.3487506","url":null,"abstract":"Task Parametrized Gaussian Mixture Models (TP-GMM) are a sample-efficient method for learning object-centric robot manipulation tasks. However, there are several open challenges to applying TP-GMMs in the wild. In this work, we tackle three crucial challenges synergistically. First, end-effector velocities are non-Euclidean and thus hard to model using standard GMMs. We thus propose to factorize the robot's end-effector velocity into its direction and magnitude, and model them using Riemannian GMMs. Second, we leverage the factorized velocities to segment and sequence skills from complex demonstration trajectories. Through the segmentation, we further align skill trajectories and hence leverage time as a powerful inductive bias. Third, we present a method to automatically detect relevant task parameters \u0000<italic>per</i>\u0000 skill from visual observations. Our approach enables learning complex manipulation tasks from just five demonstrations while using only RGB-D observations. Extensive experimental evaluations on RLBench demonstrate that our approach achieves state-of-the-art performance with 20-fold improved sample efficiency. Our policies generalize across different environments, object instances, and object positions, while the learned skills are reusable.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11369-11376"},"PeriodicalIF":4.6,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142636412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MiniTac: An Ultra-Compact $text{8 mm}$ Vision-Based Tactile Sensor for Enhanced Palpation in Robot-Assisted Minimally Invasive Surgery MiniTac:基于视觉的超小型触觉传感器($text{8 mm}$),用于增强机器人辅助微创手术中的触诊功能
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-29 DOI: 10.1109/LRA.2024.3487516
Wanlin Li;Zihang Zhao;Leiyao Cui;Weiyi Zhang;Hangxin Liu;Li-An Li;Yixin Zhu
Robot-assisted minimally invasive surgery (RAMIS) provides substantial benefits over traditional open and laparoscopic methods. However, a significant limitation of robot-assisted minimally invasive surgery (RAMIS) is the surgeon's inability to palpate tissues, a crucial technique for examining tissue properties and detecting abnormalities, restricting the widespread adoption of RAMIS. To overcome this obstacle, we introduce MiniTac, a novel vision-based tactile sensor with an ultra-compact cross-sectional diameter of 8mm, designed for seamless integration into mainstream RAMIS devices, particularly the Da Vinci surgical systems. MiniTac features a novel mechanoresponsive photonic elastomer membrane that changes color distribution under varying contact pressures. This color change is captured by an embedded miniature camera, allowing MiniTac to detect tumors both on the tissue surface and in deeper layers typically obscured from endoscopic view. MiniTac's efficacy has been rigorously tested on both phantoms and ex-vivo tissues. By leveraging advanced mechanoresponsive photonic materials, MiniTac represents a significant advancement in integrating tactile sensing into RAMIS, potentially expanding its applicability to a wider array of clinical scenarios that currently rely on traditional surgical approaches.
与传统的开腹和腹腔镜方法相比,机器人辅助微创手术(RAMIS)具有很大的优势。然而,机器人辅助微创手术(RAMIS)的一大局限是外科医生无法触诊组织,而触诊是检查组织特性和检测异常的关键技术,这限制了 RAMIS 的广泛应用。为了克服这一障碍,我们推出了基于视觉的新型触觉传感器 MiniTac,它的横截面直径只有 8 毫米,非常小巧,可无缝集成到主流 RAMIS 设备中,尤其是达芬奇手术系统。MiniTac 采用新型机械传导性光子弹性体膜,在不同的接触压力下会改变颜色分布。这种颜色变化由嵌入式微型摄像头捕捉,使 MiniTac 既能检测组织表面的肿瘤,也能检测通常被内窥镜遮挡的深层肿瘤。MiniTac 的功效已在模型和体外组织上进行了严格测试。通过利用先进的机械响应光子材料,MiniTac 在将触觉传感集成到 RAMIS 方面取得了重大进展,有可能将其应用范围扩大到目前依赖传统手术方法的更广泛的临床场景。
{"title":"MiniTac: An Ultra-Compact $text{8 mm}$ Vision-Based Tactile Sensor for Enhanced Palpation in Robot-Assisted Minimally Invasive Surgery","authors":"Wanlin Li;Zihang Zhao;Leiyao Cui;Weiyi Zhang;Hangxin Liu;Li-An Li;Yixin Zhu","doi":"10.1109/LRA.2024.3487516","DOIUrl":"https://doi.org/10.1109/LRA.2024.3487516","url":null,"abstract":"Robot-assisted minimally invasive surgery (RAMIS) provides substantial benefits over traditional open and laparoscopic methods. However, a significant limitation of robot-assisted minimally invasive surgery (RAMIS) is the surgeon's inability to palpate tissues, a crucial technique for examining tissue properties and detecting abnormalities, restricting the widespread adoption of RAMIS. To overcome this obstacle, we introduce MiniTac, a novel vision-based tactile sensor with an ultra-compact cross-sectional diameter of 8mm, designed for seamless integration into mainstream RAMIS devices, particularly the Da Vinci surgical systems. MiniTac features a novel mechanoresponsive photonic elastomer membrane that changes color distribution under varying contact pressures. This color change is captured by an embedded miniature camera, allowing MiniTac to detect tumors both on the tissue surface and in deeper layers typically obscured from endoscopic view. MiniTac's efficacy has been rigorously tested on both phantoms and ex-vivo tissues. By leveraging advanced mechanoresponsive photonic materials, MiniTac represents a significant advancement in integrating tactile sensing into RAMIS, potentially expanding its applicability to a wider array of clinical scenarios that currently rely on traditional surgical approaches.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11170-11177"},"PeriodicalIF":4.6,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142598672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Robotics and Automation Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1