首页 > 最新文献

IEEE Robotics and Automation Letters最新文献

英文 中文
Probabilistic Degeneracy Detection for Point-to-Plane Error Minimization 点到平面误差最小化的概率退化检测
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-21 DOI: 10.1109/LRA.2024.3484153
Johan Hatleskog;Kostas Alexis
Degeneracies arising from uninformative geometry are known to deteriorate LiDAR-based localization and mapping. This work introduces a new probabilistic method to detect and mitigate the effect of degeneracies in point-to-plane error minimization. The noise on the Hessian of the point-to-plane optimization problem is characterized by the noise on points and surface normals used in its construction. We exploit this characterization to quantify the probability of a direction being degenerate. The degeneracy-detection procedure is used in a new real-time degeneracy-aware iterative closest point algorithm for LiDAR registration, in which we smoothly attenuate updates in degenerate directions. The method's parameters are selected based on the noise characteristics provided in the LiDAR's datasheet. We validate the approach in four real-world experiments, demonstrating that it outperforms state-of-the-art methods at detecting and mitigating the adverse effects of degeneracies.
众所周知,无信息几何图形所产生的退化现象会恶化基于激光雷达的定位和测绘。这项工作引入了一种新的概率方法,用于检测和减轻点到平面误差最小化过程中的退化效应。点到面优化问题的 Hessian 上的噪声是由其构建过程中使用的点和表面法线上的噪声表征的。我们利用这一特征来量化某个方向退化的概率。退化检测程序被用于一种新的实时退化感知迭代最邻近点算法,该算法用于激光雷达注册,其中我们平滑地衰减退化方向上的更新。该方法的参数根据激光雷达数据表中提供的噪声特性进行选择。我们在四个真实世界的实验中验证了该方法,证明它在检测和减轻退化的不利影响方面优于最先进的方法。
{"title":"Probabilistic Degeneracy Detection for Point-to-Plane Error Minimization","authors":"Johan Hatleskog;Kostas Alexis","doi":"10.1109/LRA.2024.3484153","DOIUrl":"https://doi.org/10.1109/LRA.2024.3484153","url":null,"abstract":"Degeneracies arising from uninformative geometry are known to deteriorate LiDAR-based localization and mapping. This work introduces a new probabilistic method to detect and mitigate the effect of degeneracies in point-to-plane error minimization. The noise on the Hessian of the point-to-plane optimization problem is characterized by the noise on points and surface normals used in its construction. We exploit this characterization to quantify the probability of a direction being degenerate. The degeneracy-detection procedure is used in a new real-time degeneracy-aware iterative closest point algorithm for LiDAR registration, in which we smoothly attenuate updates in degenerate directions. The method's parameters are selected based on the noise characteristics provided in the LiDAR's datasheet. We validate the approach in four real-world experiments, demonstrating that it outperforms state-of-the-art methods at detecting and mitigating the adverse effects of degeneracies.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11234-11241"},"PeriodicalIF":4.6,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142598631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Planar Fabric Repositioning: Deformation-Aware RRT* for Non-Prehensile Fabric Manipulation 高效的平面织物重新定位:用于非拉伸织物操纵的变形感知 RRT* 技术
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-21 DOI: 10.1109/LRA.2024.3484131
Yunhai Wang;Lei Yang;Peng Zhou;Jiaming Qi;Liang Lu;Jihong Zhu;Jia Pan
Fabrics present significant challenges to robotic manipulation due to their complex dynamics and infinite degrees of freedom. This letter proposes a non-prehensile approach to aligning a fabric cut piece to a specified target pose, which is a common step for many garment manufacturing tasks. Compared to widely explored prehensile manipulation that grasps and lifts the fabric, the proposed approach uses pushing actions and allows for efficient fabric repositioning with a simpler end-effector. To handle the possible deformation caused by the pushing actions, we introduce the Deformation-aware Rapidly-exploring Random Tree Star (D-RRT*) algorithm that strategically plans the contact poses and actions to slide the fabric to desired configurations by leveraging the friction. Our D-RRT* algorithm enhances fabric manipulation by incorporating deformation-aware action sampling into the path planning process, enabling accurate predictions of non-prehensile actions and efficient navigation through the fabric's configuration space. Extensive simulations and real-world experiments on repositioning fabrics of various shapes and materials demonstrate the effectiveness of the proposed pipeline in achieving stable and efficient manipulation.
织物因其复杂的动力学特性和无限的自由度,给机器人操纵带来了巨大挑战。这封信提出了一种非拉伸方法,用于将布料裁片对准指定的目标姿势,这是许多服装制造任务的常见步骤。与广泛探索的抓取和抬起布料的前伸式操作相比,所提出的方法使用了推动动作,并允许使用更简单的末端执行器高效地重新定位布料。为了处理推动动作可能造成的变形,我们引入了变形感知快速探索随机树星(D-RRT*)算法,该算法可战略性地规划接触姿势和动作,利用摩擦力将织物滑动到所需的配置。我们的 D-RRT* 算法将变形感知动作采样纳入路径规划过程,从而提高了织物操纵能力,实现了对非拉伸动作的准确预测和织物配置空间的高效导航。在对各种形状和材料的织物进行重新定位时进行的大量模拟和实际实验证明了所提出的管道在实现稳定高效操纵方面的有效性。
{"title":"Efficient Planar Fabric Repositioning: Deformation-Aware RRT* for Non-Prehensile Fabric Manipulation","authors":"Yunhai Wang;Lei Yang;Peng Zhou;Jiaming Qi;Liang Lu;Jihong Zhu;Jia Pan","doi":"10.1109/LRA.2024.3484131","DOIUrl":"https://doi.org/10.1109/LRA.2024.3484131","url":null,"abstract":"Fabrics present significant challenges to robotic manipulation due to their complex dynamics and infinite degrees of freedom. This letter proposes a non-prehensile approach to aligning a fabric cut piece to a specified target pose, which is a common step for many garment manufacturing tasks. Compared to widely explored prehensile manipulation that grasps and lifts the fabric, the proposed approach uses pushing actions and allows for efficient fabric repositioning with a simpler end-effector. To handle the possible deformation caused by the pushing actions, we introduce the Deformation-aware Rapidly-exploring Random Tree Star (D-RRT*) algorithm that strategically plans the contact poses and actions to slide the fabric to desired configurations by leveraging the friction. Our D-RRT* algorithm enhances fabric manipulation by incorporating deformation-aware action sampling into the path planning process, enabling accurate predictions of non-prehensile actions and efficient navigation through the fabric's configuration space. Extensive simulations and real-world experiments on repositioning fabrics of various shapes and materials demonstrate the effectiveness of the proposed pipeline in achieving stable and efficient manipulation.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11258-11265"},"PeriodicalIF":4.6,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10723787","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142598677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Spiny Claws Climbing Robot Based on Self-Sensing Soles 基于自感应鞋底的棘爪攀爬机器人
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-21 DOI: 10.1109/LRA.2024.3484179
Rui Li;Changze Wu;Shuang Yan;Chuan Li;Xinglong Gong;Chul-Hee Lee;Mengjie Shou
Adaptive climbing on different surfaces is a great challenge for conventional robots due to a lack of self-sensing capabilities. Inspired by the exceptional sensing ability of feline soles, this study proposes a quadrupedal climbing robot based on self-sensing spiny-claw soles. First, a spiny-claw sole was designed by embedding stainless steel spines into a soft substrate. Next, a tensile strain sensor was designed based on carbon nanotubes and carbonyl iron powder through the squash method and then was integrated into the spiny-claw sole to fabricate the self-sensing sole. Then, a quadrupedal climbing robot was designed using four self-sensing spiny-claw soles. Subsequently, the control strategy of the self-sensing climbing robot was designed. Finally, the climbing performance of the self-sensing robot was experimentally tested. It is demonstrated that the robot can climb on different inclined surfaces with an angle of 0$^circ$ to 75$^circ$ and on three different rough surfaces. In addition, the maximum load of the robot is 175 g when climbing on a 45$^circ$ inclined surface. More importantly, the robot can detect whether there is an obstacle in the climbing path through the self-sensing soles and perform appropriate obstacle avoidance operations accordingly.
由于缺乏自感应能力,在不同表面上的自适应攀爬对于传统机器人来说是一个巨大的挑战。受猫科动物鞋底超强感应能力的启发,本研究提出了一种基于自感应棘爪鞋底的四足攀爬机器人。首先,通过将不锈钢刺嵌入软基底,设计出刺爪鞋底。接着,通过压扁法设计出基于碳纳米管和羰基铁粉的拉伸应变传感器,并将其集成到棘爪鞋底中,从而制作出自感应鞋底。然后,利用四个自感应棘爪鞋底设计了一个四足攀爬机器人。随后,设计了自感应攀爬机器人的控制策略。最后,对自感应机器人的攀爬性能进行了实验测试。实验证明,机器人可以在角度为 0$^circ$ 至 75$^circ$ 的不同倾斜表面和三种不同的粗糙表面上攀爬。此外,当机器人在 45$^circ$ 的倾斜表面上攀爬时,其最大负载为 175 g。更重要的是,机器人可以通过自感应鞋底检测攀爬路径上是否有障碍物,并相应地执行适当的避障操作。
{"title":"A Spiny Claws Climbing Robot Based on Self-Sensing Soles","authors":"Rui Li;Changze Wu;Shuang Yan;Chuan Li;Xinglong Gong;Chul-Hee Lee;Mengjie Shou","doi":"10.1109/LRA.2024.3484179","DOIUrl":"https://doi.org/10.1109/LRA.2024.3484179","url":null,"abstract":"Adaptive climbing on different surfaces is a great challenge for conventional robots due to a lack of self-sensing capabilities. Inspired by the exceptional sensing ability of feline soles, this study proposes a quadrupedal climbing robot based on self-sensing spiny-claw soles. First, a spiny-claw sole was designed by embedding stainless steel spines into a soft substrate. Next, a tensile strain sensor was designed based on carbon nanotubes and carbonyl iron powder through the squash method and then was integrated into the spiny-claw sole to fabricate the self-sensing sole. Then, a quadrupedal climbing robot was designed using four self-sensing spiny-claw soles. Subsequently, the control strategy of the self-sensing climbing robot was designed. Finally, the climbing performance of the self-sensing robot was experimentally tested. It is demonstrated that the robot can climb on different inclined surfaces with an angle of 0\u0000<inline-formula><tex-math>$^circ$</tex-math></inline-formula>\u0000 to 75\u0000<inline-formula><tex-math>$^circ$</tex-math></inline-formula>\u0000 and on three different rough surfaces. In addition, the maximum load of the robot is 175 g when climbing on a 45\u0000<inline-formula><tex-math>$^circ$</tex-math></inline-formula>\u0000 inclined surface. More importantly, the robot can detect whether there is an obstacle in the climbing path through the self-sensing soles and perform appropriate obstacle avoidance operations accordingly.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"10906-10913"},"PeriodicalIF":4.6,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142524208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rigid-Soft Hybrid Suction Cups for Enhanced Anti-Torque and Energy-Efficient Attachment 刚柔混合吸盘,增强抗扭矩和高能效附着力
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-21 DOI: 10.1109/LRA.2024.3484157
Qingkai Guo;Yu Sun;Zipan Zhao;Jiajia Ning;Ling Wang;Yuxin Lv;Xuefeng Chen;Laihao Yang
In the realm of robotics, suction-based adhesion plays a pivotal role in applications ranging from object transfer to wall-climbing robots. To improve the sealing and attachment stability of suction cups, researchers have employed state-of-the-art techniques, including the use of soft materials with better conformal properties or jamming mechanisms. However, soft materials can cause undesired deformation of the suction cup under load, and the use of jamming mechanisms has limitations in terms of size and weight. This letter introduces a novel Rigid-Soft Hybrid Suction Cup (RSH-SC) designed for enhanced stability and energy-efficient attachment in nonideal conditions (i.e., suction on irregular and inclined surfaces). To emulate the octopus's dexterous suction and the limpet's robust adhesion capabilities, the RSH-SC integrates a rigid shell for better sealing and torque resistance. Notably, compared to a soft suction cup made from Ecoflex 00-50, the RSH-SC's torque deformation resistance is 550 times greater. The RSH-SC's unique structure also allows it to maintain secure attachment without continuous vacuum pressure, thus conserving energy. A crawling robot utilizing RSH-SCs showcased stable movement on ceiling, which can significantly advance the capabilities of soft robots in complex environments, paving the way for broader applications in robotics and automated systems.
在机器人领域,基于吸力的附着力在从物体传送到爬墙机器人等各种应用中发挥着举足轻重的作用。为了提高吸盘的密封性和附着稳定性,研究人员采用了最先进的技术,包括使用保形性能更好的软材料或卡住机制。然而,软材料会导致吸盘在负载作用下发生不希望的变形,而使用卡紧机制在尺寸和重量方面也有限制。这封信介绍了一种新型刚柔混合吸盘 (RSH-SC),其设计目的是在非理想条件下(即在不规则和倾斜表面上吸附)提高稳定性和吸附能效。为了模仿章鱼灵巧的吸力和帽贝强大的粘附能力,RSH-SC 集成了一个刚性外壳,以获得更好的密封性和抗扭矩能力。值得注意的是,与使用 Ecoflex 00-50 材料制成的软吸盘相比,RSH-SC 的抗扭矩变形能力要高出 550 倍。RSH-SC 的独特结构还使其无需持续的真空压力就能保持稳固的吸附,从而节约能源。使用 RSH-SC 的爬行机器人展示了在天花板上的稳定运动,这可以大大提高软体机器人在复杂环境中的能力,为机器人和自动化系统的更广泛应用铺平道路。
{"title":"Rigid-Soft Hybrid Suction Cups for Enhanced Anti-Torque and Energy-Efficient Attachment","authors":"Qingkai Guo;Yu Sun;Zipan Zhao;Jiajia Ning;Ling Wang;Yuxin Lv;Xuefeng Chen;Laihao Yang","doi":"10.1109/LRA.2024.3484157","DOIUrl":"https://doi.org/10.1109/LRA.2024.3484157","url":null,"abstract":"In the realm of robotics, suction-based adhesion plays a pivotal role in applications ranging from object transfer to wall-climbing robots. To improve the sealing and attachment stability of suction cups, researchers have employed state-of-the-art techniques, including the use of soft materials with better conformal properties or jamming mechanisms. However, soft materials can cause undesired deformation of the suction cup under load, and the use of jamming mechanisms has limitations in terms of size and weight. This letter introduces a novel Rigid-Soft Hybrid Suction Cup (RSH-SC) designed for enhanced stability and energy-efficient attachment in nonideal conditions (i.e., suction on irregular and inclined surfaces). To emulate the octopus's dexterous suction and the limpet's robust adhesion capabilities, the RSH-SC integrates a rigid shell for better sealing and torque resistance. Notably, compared to a soft suction cup made from Ecoflex 00-50, the RSH-SC's torque deformation resistance is 550 times greater. The RSH-SC's unique structure also allows it to maintain secure attachment without continuous vacuum pressure, thus conserving energy. A crawling robot utilizing RSH-SCs showcased stable movement on ceiling, which can significantly advance the capabilities of soft robots in complex environments, paving the way for broader applications in robotics and automated systems.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"10970-10977"},"PeriodicalIF":4.6,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142565501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bimanual Long-Horizon Manipulation Via Temporal-Context Transformer RL 通过时空语境转换器 RL 实现双臂长视距操纵
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-21 DOI: 10.1109/LRA.2024.3484167
Ji-Heon Oh;Ismael Espinoza;Danbi Jung;Tae-Seong Kim
Dual-arm robots can perform bimanual long-horizon (LH) manipulation, surpassing the capabilities of single-arm robots. However, bimanual LH tasks are challenging for robot intelligence due to the complexity of long sequence variables and multi-agent interactions. While Multi-Agent Reinforcement Learning (MARL) has shown promising results in agent interactions, these models struggle with sequential LH tasks due to limitations in credit assignment, vanishing memory, and the exploration-exploitation trade-off. This paper introduces a novel dual-arm robot intelligence framework, Temporal-Context Transformer Reinforcement Learning (TC-TRL), which integrates both a hybrid offline-online policy and imitation learning. TC-TRL leverages the attention mechanism to identify relevant temporal-context information from the LH observations space, updating the encoder value function and generating an optimal actions sequence using a decoder module, which uses demonstration guidance during online training. TC-TRL is tested on six bimanual tasks, and its performance is compared against five baseline RLs: MAPPO, HAPPO, IPPO, MAT, and DA-MAT. The results show that TC-TRL outperforms the three PPO-based RLs with an average success rate of 63.46%, 42.23% against MAT, and 30.91% for DA-MAT.
双臂机器人可以执行双臂长视距(LH)操纵,超越了单臂机器人的能力。然而,由于长序列变量和多代理交互的复杂性,双臂长视距任务对机器人智能来说具有挑战性。虽然多代理强化学习(MARL)在代理互动方面取得了可喜的成果,但由于学分分配、记忆消失和探索-开发权衡等方面的限制,这些模型在处理顺序 LH 任务时却举步维艰。本文介绍了一种新颖的双臂机器人智能框架--时间-情境转换强化学习(Temporal-Context Transformer Reinforcement Learning,TC-TRL),它集成了离线-在线混合策略和模仿学习。TC-TRL 利用注意力机制从 LH 观察空间中识别相关的时间-语境信息,更新编码器值函数,并通过解码器模块生成最佳行动序列,解码器模块在在线训练期间使用示范指导。TC-TRL 在六项双臂任务中进行了测试,并将其性能与五种基准 RL 进行了比较:MAPPO、HAPPO、IPPO、MAT 和 DA-MAT。结果表明,TC-TRL 的平均成功率为 63.46%,优于三种基于 PPO 的 RL;与 MAT 相比,TC-TRL 的平均成功率为 42.23%,与 DA-MAT 相比,TC-TRL 的平均成功率为 30.91%。
{"title":"Bimanual Long-Horizon Manipulation Via Temporal-Context Transformer RL","authors":"Ji-Heon Oh;Ismael Espinoza;Danbi Jung;Tae-Seong Kim","doi":"10.1109/LRA.2024.3484167","DOIUrl":"https://doi.org/10.1109/LRA.2024.3484167","url":null,"abstract":"Dual-arm robots can perform bimanual long-horizon (LH) manipulation, surpassing the capabilities of single-arm robots. However, bimanual LH tasks are challenging for robot intelligence due to the complexity of long sequence variables and multi-agent interactions. While Multi-Agent Reinforcement Learning (MARL) has shown promising results in agent interactions, these models struggle with sequential LH tasks due to limitations in credit assignment, vanishing memory, and the exploration-exploitation trade-off. This paper introduces a novel dual-arm robot intelligence framework, Temporal-Context Transformer Reinforcement Learning (TC-TRL), which integrates both a hybrid offline-online policy and imitation learning. TC-TRL leverages the attention mechanism to identify relevant temporal-context information from the LH observations space, updating the encoder value function and generating an optimal actions sequence using a decoder module, which uses demonstration guidance during online training. TC-TRL is tested on six bimanual tasks, and its performance is compared against five baseline RLs: MAPPO, HAPPO, IPPO, MAT, and DA-MAT. The results show that TC-TRL outperforms the three PPO-based RLs with an average success rate of 63.46%, 42.23% against MAT, and 30.91% for DA-MAT.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"10898-10905"},"PeriodicalIF":4.6,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142524209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model Free Method of Screening Training Data for Adversarial Datapoints Through Local Lipschitz Quotient Analysis 通过局部 Lipschitz Quotient 分析筛选对抗数据点训练数据的无模型方法
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-17 DOI: 10.1109/LRA.2024.3483628
Emily Kamienski;Harry Asada
It is often challenging to pick suitable data features for learning problems. Sometimes certain regions of the data are harder to learn because they are not well characterized by the selected data features. The challenge is amplified when resources for sensing and computation are limited and time-critical, yet reliable decisions must be made. For example, a robotic system for preventing falls of elderly people needs a real-time fall predictor, with low false positive and false negative rates, using a simple wearable sensor to activate a fall prevention mechanism. Here we present a methodology for assessing the learnability of data based on the Lipschitz quotient. We develop a procedure for determining which regions of the dataset contain adversarial data points, input data that look similar but belong to different target classes. Regardless of the learning model, it will be hard to learn such data. We then present a method for determining which additional feature(s) are most effective in improving the predictability of each of these regions. This is a model-independent data analysis that can be executed before constructing a prediction model through machine learning or other techniques. We demonstrate this method on two synthetic datasets and a dataset of human falls, which uses inertial measurement unit signals. For the fall dataset, we identified two groups of adversarial data points and improved the predictability of each group over the baseline dataset, as assessed by Lipschitz, by using 2 different sets of features. This work offers a valuable tool for assessing data learnability that can be applied to not only fall prediction problems, but also other robotics applications that learn from data.
为学习问题挑选合适的数据特征往往具有挑战性。有时,数据的某些区域更难学习,因为所选的数据特征不能很好地描述这些区域。如果用于感知和计算的资源有限,而且必须做出时间紧迫但又可靠的决策,那么挑战就更大了。例如,用于防止老年人跌倒的机器人系统需要一个实时跌倒预测器,该预测器具有较低的假阳性和假阴性率,可使用简单的可穿戴传感器来激活跌倒预防机制。在此,我们介绍一种基于 Lipschitz 商数评估数据可学性的方法。我们开发了一种程序,用于确定数据集中哪些区域包含对抗数据点,即看起来相似但属于不同目标类别的输入数据。无论采用哪种学习模型,都很难学习到这类数据。然后,我们将介绍一种方法,用于确定哪些附加特征能最有效地提高这些区域的可预测性。这是一种独立于模型的数据分析,可以在通过机器学习或其他技术构建预测模型之前执行。我们在两个合成数据集和一个使用惯性测量单元信号的人体跌倒数据集上演示了这种方法。对于跌倒数据集,我们确定了两组对抗性数据点,并通过使用两组不同的特征,提高了每组数据集相对于基线数据集的可预测性(由 Lipschitz 评估)。这项工作为评估数据可学性提供了一个有价值的工具,不仅可用于跌倒预测问题,还可用于其他从数据中学习的机器人应用。
{"title":"Model Free Method of Screening Training Data for Adversarial Datapoints Through Local Lipschitz Quotient Analysis","authors":"Emily Kamienski;Harry Asada","doi":"10.1109/LRA.2024.3483628","DOIUrl":"https://doi.org/10.1109/LRA.2024.3483628","url":null,"abstract":"It is often challenging to pick suitable data features for learning problems. Sometimes certain regions of the data are harder to learn because they are not well characterized by the selected data features. The challenge is amplified when resources for sensing and computation are limited and time-critical, yet reliable decisions must be made. For example, a robotic system for preventing falls of elderly people needs a real-time fall predictor, with low false positive and false negative rates, using a simple wearable sensor to activate a fall prevention mechanism. Here we present a methodology for assessing the learnability of data based on the Lipschitz quotient. We develop a procedure for determining which regions of the dataset contain adversarial data points, input data that look similar but belong to different target classes. Regardless of the learning model, it will be hard to learn such data. We then present a method for determining which additional feature(s) are most effective in improving the predictability of each of these regions. This is a model-independent data analysis that can be executed before constructing a prediction model through machine learning or other techniques. We demonstrate this method on two synthetic datasets and a dataset of human falls, which uses inertial measurement unit signals. For the fall dataset, we identified two groups of adversarial data points and improved the predictability of each group over the baseline dataset, as assessed by Lipschitz, by using 2 different sets of features. This work offers a valuable tool for assessing data learnability that can be applied to not only fall prediction problems, but also other robotics applications that learn from data.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11122-11129"},"PeriodicalIF":4.6,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142598638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MPC Design of a Continuum Robot for Pulmonary Interventional Surgery Using Koopman Operators 使用 Koopman 运算器设计肺介入手术连续机器人的 MPC
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-17 DOI: 10.1109/LRA.2024.3483048
Yuhua Song;Lifeng Zhu;Jinfeng Li;Jiawei Deng;Cheng Wang;Aiguo Song
This letter focuses on the flexible tube of a bronchoscope robot used in pulmonary intervention surgery, which is considered as a continuum robot. The dynamics model is proposed based on the Koopman operator, leveraging real data to solve for the system matrix parameters accurately. To enhance control precision, we designed a model predictive control (MPC) algorithm aimed at tracking the desired curvature and deflection angles of the flexible tube. The MPC controller uses real-time data from electromagnetic sensors to adjust the tube shape, ensuring accurate and responsive manipulation. The effectiveness of the proposed algorithm are validated through extensive experiments conducted on the Binary experimental platform. Compared with the traditional open-loop control method, the proposed algorithm has achieved significant improvements in tracking performance and operational reliability. Each system state error of the proposed algorithm is smaller, which means that better tracking performance can be obtained under the proposed MPC scheme.
这封信的重点是用于肺介入手术的支气管镜机器人的柔性管,该机器人被视为连续体机器人。我们基于 Koopman 算子提出了动力学模型,利用真实数据精确求解系统矩阵参数。为了提高控制精度,我们设计了一种模型预测控制(MPC)算法,旨在跟踪柔性管所需的曲率和偏转角。MPC 控制器使用来自电磁传感器的实时数据来调整管道形状,从而确保精确和灵敏的操控。在二进制实验平台上进行的大量实验验证了所提算法的有效性。与传统的开环控制方法相比,所提出的算法在跟踪性能和运行可靠性方面都有显著提高。所提算法的每个系统状态误差更小,这意味着在所提的 MPC 方案下可以获得更好的跟踪性能。
{"title":"MPC Design of a Continuum Robot for Pulmonary Interventional Surgery Using Koopman Operators","authors":"Yuhua Song;Lifeng Zhu;Jinfeng Li;Jiawei Deng;Cheng Wang;Aiguo Song","doi":"10.1109/LRA.2024.3483048","DOIUrl":"https://doi.org/10.1109/LRA.2024.3483048","url":null,"abstract":"This letter focuses on the flexible tube of a bronchoscope robot used in pulmonary intervention surgery, which is considered as a continuum robot. The dynamics model is proposed based on the Koopman operator, leveraging real data to solve for the system matrix parameters accurately. To enhance control precision, we designed a model predictive control (MPC) algorithm aimed at tracking the desired curvature and deflection angles of the flexible tube. The MPC controller uses real-time data from electromagnetic sensors to adjust the tube shape, ensuring accurate and responsive manipulation. The effectiveness of the proposed algorithm are validated through extensive experiments conducted on the Binary experimental platform. Compared with the traditional open-loop control method, the proposed algorithm has achieved significant improvements in tracking performance and operational reliability. Each system state error of the proposed algorithm is smaller, which means that better tracking performance can be obtained under the proposed MPC scheme.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"10819-10826"},"PeriodicalIF":4.6,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142518206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Scene Understanding for Vision-and-Language Navigation by Knowledge Awareness 通过知识感知增强视觉语言导航的场景理解能力
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-17 DOI: 10.1109/LRA.2024.3483042
Fang Gao;Jingfeng Tang;Jiabao Wang;Shaodong Li;Jun Yu
Vision-and-Language Navigation (VLN) has garnered widespread attention and research interest due to its potential applications in real-world scenarios. Despite significant progress in the VLN field in recent years, limitations persist. Many agents struggle to make accurate decisions when faced with similar candidate views during navigation, relying solely on the overall features of these views. This challenge primarily arises from the lack of common-sense knowledge about room layouts. Recognizing that room knowledge can establish relationships between rooms and objects in the environment, we construct room layout knowledge described in natural language by leveraging BLIP-2, including relationships between rooms and individual objects, relationships between objects, attributes of individual objects (such as color), and room types, thus providing comprehensive room layout information to the agent. We propose a Knowledge-Enhanced Scene Understanding (KESU) model to augment the agent's understanding of the environment by leveraging room layout knowledge. The Instruction Augmentation Module (IA) and the Knowledge History Fusion Module (KHF) in KESU respectively provide room layout knowledge for instructions and vision-history features, thereby enhancing the agent's navigation abilities. To more effectively integrate knowledge information with instruction features, we introduce Dynamic Residual Fusion (DRF) in the IA module. Finally, we conduct extensive experiments on the R2R, REVERIE, and SOON datasets, demonstrating the effectiveness of the proposed approach.
视觉语言导航(VLN)因其在现实世界中的潜在应用而受到广泛关注和研究兴趣。尽管近年来视觉语言导航领域取得了重大进展,但其局限性依然存在。在导航过程中,面对类似的候选视图,许多代理都很难做出准确的决策,只能依赖于这些视图的整体特征。这一挑战主要源于缺乏有关房间布局的常识性知识。认识到房间知识可以建立房间与环境中物体之间的关系,我们利用 BLIP-2 构建了用自然语言描述的房间布局知识,包括房间与单个物体之间的关系、物体之间的关系、单个物体的属性(如颜色)以及房间类型,从而为代理提供全面的房间布局信息。我们提出了一个知识增强场景理解(KESU)模型,通过利用房间布局知识来增强代理对环境的理解。KESU 中的指令增强模块(IA)和知识历史融合模块(KHF)分别为指令和视觉历史特征提供房间布局知识,从而增强代理的导航能力。为了更有效地整合知识信息与指令特征,我们在 IA 模块中引入了动态残留融合(DRF)。最后,我们在 R2R、REVERIE 和 SOON 数据集上进行了大量实验,证明了所提方法的有效性。
{"title":"Enhancing Scene Understanding for Vision-and-Language Navigation by Knowledge Awareness","authors":"Fang Gao;Jingfeng Tang;Jiabao Wang;Shaodong Li;Jun Yu","doi":"10.1109/LRA.2024.3483042","DOIUrl":"https://doi.org/10.1109/LRA.2024.3483042","url":null,"abstract":"Vision-and-Language Navigation (VLN) has garnered widespread attention and research interest due to its potential applications in real-world scenarios. Despite significant progress in the VLN field in recent years, limitations persist. Many agents struggle to make accurate decisions when faced with similar candidate views during navigation, relying solely on the overall features of these views. This challenge primarily arises from the lack of common-sense knowledge about room layouts. Recognizing that room knowledge can establish relationships between rooms and objects in the environment, we construct room layout knowledge described in natural language by leveraging BLIP-2, including relationships between rooms and individual objects, relationships between objects, attributes of individual objects (such as color), and room types, thus providing comprehensive room layout information to the agent. We propose a Knowledge-Enhanced Scene Understanding (KESU) model to augment the agent's understanding of the environment by leveraging room layout knowledge. The Instruction Augmentation Module (IA) and the Knowledge History Fusion Module (KHF) in KESU respectively provide room layout knowledge for instructions and vision-history features, thereby enhancing the agent's navigation abilities. To more effectively integrate knowledge information with instruction features, we introduce Dynamic Residual Fusion (DRF) in the IA module. Finally, we conduct extensive experiments on the R2R, REVERIE, and SOON datasets, demonstrating the effectiveness of the proposed approach.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"10874-10881"},"PeriodicalIF":4.6,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142518184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
iBTC: An Image-Assisting Binary and Triangle Combined Descriptor for Place Recognition by Fusing LiDAR and Camera Measurements iBTC:通过融合激光雷达和相机测量数据进行地点识别的图像辅助二元和三角组合描述符
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-17 DOI: 10.1109/LRA.2024.3483032
Zuhao Zou;Chunran Zheng;Chongjian Yuan;Shunbo Zhou;Kaiwen Xue;Fu Zhang
In this work, we introduce a novel multimodal descriptor, the image-assisting binary and triangle combined (iBTC) descriptor, which fuses LiDAR (Light Detection and Ranging) and camera measurements for 3D place recognition. The inherent invariance of a triangle to rigid transformations inspires us to design triangle-based descriptors. We first extract distinct 3D key points from both LiDAR and camera measurements and organize them into triplets to form triangles. By utilizing the lengths of the sides of these triangles, we can create triangle descriptors, enabling the rapid retrieval of similar triangles from a database. By encoding the geometric and visual details at the triangle vertices into binary descriptors, we augment the triangle descriptors with richer local information. This enrichment process empowers our descriptors to reject mis-matched triangle pairs. Consequently, the remaining matched triangle pairs yield accurate loop closure place indices and relative poses. In our experiments, we conduct a thorough comparison of our proposed method with several SOTA methods across public and self-collected datasets. The results demonstrate that our method exhibits superior performance in place recognition and overcomes the limitations associated with the unimodal methods like BTC, RING++, ORB-DBoW2, and NetVLAD. Additionally, we perform a time cost benchmark experiment and the result indicates that our method's time consumption is reasonable, compared with baseline methods.
在这项工作中,我们引入了一种新颖的多模态描述符--图像辅助二进制和三角形组合(iBTC)描述符,它将激光雷达(LiDAR)和照相机测量融合在一起,用于三维地点识别。三角形对刚性变换的固有不变性启发我们设计基于三角形的描述符。我们首先从激光雷达和摄像头的测量结果中提取不同的三维关键点,并将它们组织成三联体,形成三角形。通过利用这些三角形的边长,我们可以创建三角形描述符,从而能够从数据库中快速检索类似的三角形。通过将三角形顶点的几何和视觉细节编码成二进制描述符,我们用更丰富的本地信息增强了三角形描述符。这一丰富过程使我们的描述符能够剔除错误匹配的三角形对。因此,剩余的匹配三角形对可以产生精确的闭合环位置指数和相对位置。在实验中,我们通过公共数据集和自收集数据集将我们提出的方法与几种 SOTA 方法进行了全面比较。结果表明,我们的方法在地点识别方面表现出卓越的性能,克服了 BTC、RING++、ORB-DBoW2 和 NetVLAD 等单模态方法的局限性。此外,我们还进行了时间成本基准实验,结果表明,与基线方法相比,我们的方法耗时合理。
{"title":"iBTC: An Image-Assisting Binary and Triangle Combined Descriptor for Place Recognition by Fusing LiDAR and Camera Measurements","authors":"Zuhao Zou;Chunran Zheng;Chongjian Yuan;Shunbo Zhou;Kaiwen Xue;Fu Zhang","doi":"10.1109/LRA.2024.3483032","DOIUrl":"https://doi.org/10.1109/LRA.2024.3483032","url":null,"abstract":"In this work, we introduce a novel multimodal descriptor, the image-assisting binary and triangle combined (iBTC) descriptor, which fuses LiDAR (Light Detection and Ranging) and camera measurements for 3D place recognition. The inherent invariance of a triangle to rigid transformations inspires us to design triangle-based descriptors. We first extract distinct 3D key points from both LiDAR and camera measurements and organize them into triplets to form triangles. By utilizing the lengths of the sides of these triangles, we can create triangle descriptors, enabling the rapid retrieval of similar triangles from a database. By encoding the geometric and visual details at the triangle vertices into binary descriptors, we augment the triangle descriptors with richer local information. This enrichment process empowers our descriptors to reject mis-matched triangle pairs. Consequently, the remaining matched triangle pairs yield accurate loop closure place indices and relative poses. In our experiments, we conduct a thorough comparison of our proposed method with several SOTA methods across public and self-collected datasets. The results demonstrate that our method exhibits superior performance in place recognition and overcomes the limitations associated with the unimodal methods like BTC, RING++, ORB-DBoW2, and NetVLAD. Additionally, we perform a time cost benchmark experiment and the result indicates that our method's time consumption is reasonable, compared with baseline methods.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"10858-10865"},"PeriodicalIF":4.6,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142518168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Versatile Perching Using a Passive Mechanism With Under Actuated Fingers for Multirotor AAV 为多旋翼无人飞行器设计的带有欠驱动手指的被动机构可实现多种栖息功能
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-10-17 DOI: 10.1109/LRA.2024.3483044
Takamasa Kominami;Kazuhiro Shimonomura
This research proposes a passively driven perching mechanism designed to land not only on flat surface but also on various shaped objects, such as bar, plate, and spherical object, without the use of additional actuators. The perching mechanism consists of three under actuated fingers, and utilizes two passive drive mechanisms: one for landing without bending the fingers but by pinching, and another for grasping by bending the fingers to land. As a result, a lightweight apparatus capable of landing on various objects without the need for additional CPUs, motors, or batteries for driving and controlling was achieved. The gripping force generated by this mechanism was calculated, and it was found that the gripping force varied depending on the size of the object. Finally, the actual forces exerted by the fingers during landing on objects like safety cones, boards, or pipes were measured and compared with theoretical values.
这项研究提出了一种被动式驱动的抓取机构,其设计不仅能在平面上着陆,还能在各种形状的物体上着陆,如条形、板形和球形物体,而无需使用额外的致动器。栖息机构由三个驱动力不足的手指组成,并利用两个被动驱动机构:一个用于在不弯曲手指的情况下通过捏合着陆,另一个用于通过弯曲手指着陆来抓取。因此,这种轻型装置无需额外的中央处理器、电机或电池来驱动和控制,就能在各种物体上着陆。我们计算了该机构产生的抓取力,发现抓取力随物体的大小而变化。最后,测量了手指在安全锥、木板或管道等物体上着地时所产生的实际力,并与理论值进行了比较。
{"title":"Versatile Perching Using a Passive Mechanism With Under Actuated Fingers for Multirotor AAV","authors":"Takamasa Kominami;Kazuhiro Shimonomura","doi":"10.1109/LRA.2024.3483044","DOIUrl":"https://doi.org/10.1109/LRA.2024.3483044","url":null,"abstract":"This research proposes a passively driven perching mechanism designed to land not only on flat surface but also on various shaped objects, such as bar, plate, and spherical object, without the use of additional actuators. The perching mechanism consists of three under actuated fingers, and utilizes two passive drive mechanisms: one for landing without bending the fingers but by pinching, and another for grasping by bending the fingers to land. As a result, a lightweight apparatus capable of landing on various objects without the need for additional CPUs, motors, or batteries for driving and controlling was achieved. The gripping force generated by this mechanism was calculated, and it was found that the gripping force varied depending on the size of the object. Finally, the actual forces exerted by the fingers during landing on objects like safety cones, boards, or pipes were measured and compared with theoretical values.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11226-11233"},"PeriodicalIF":4.6,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142598628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Robotics and Automation Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1