首页 > 最新文献

arXiv - CS - Robotics最新文献

英文 中文
Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching 开放集语义不确定性感知度量-语义图匹配
Pub Date : 2024-09-17 DOI: arxiv-2409.11555
Kurran Singh, John J. Leonard
Underwater object-level mapping requires incorporating visual foundationmodels to handle the uncommon and often previously unseen object classesencountered in marine scenarios. In this work, a metric of semantic uncertaintyfor open-set object detections produced by visual foundation models iscalculated and then incorporated into an object-level uncertainty trackingframework. Object-level uncertainties and geometric relationships betweenobjects are used to enable robust object-level loop closure detection forunknown object classes. The above loop closure detection problem is formulatedas a graph-matching problem. While graph matching, in general, is NP-Complete,a solver for an equivalent formulation of the proposed graph matching problemas a graph editing problem is tested on multiple challenging underwater scenes.Results for this solver as well as three other solvers demonstrate that theproposed methods are feasible for real-time use in marine environments for therobust, open-set, multi-object, semantic-uncertainty-aware loop closuredetection. Further experimental results on the KITTI dataset demonstrate thatthe method generalizes to large-scale terrestrial scenes.
水下物体级制图需要结合视觉基础模型,以处理海洋场景中遇到的不常见且经常是以前从未见过的物体类别。在这项工作中,计算了视觉基础模型产生的开放集物体检测的语义不确定性度量,然后将其纳入物体级不确定性跟踪框架。利用对象级不确定性和对象之间的几何关系,可对未知对象类别进行稳健的对象级闭环检测。上述闭环检测问题被表述为图形匹配问题。该求解器和其他三个求解器的结果表明,在海洋环境中实时使用所提出的方法进行稳健、开放集、多对象、语义不确定性感知的闭合回路检测是可行的。在 KITTI 数据集上的进一步实验结果表明,该方法适用于大规模陆地场景。
{"title":"Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching","authors":"Kurran Singh, John J. Leonard","doi":"arxiv-2409.11555","DOIUrl":"https://doi.org/arxiv-2409.11555","url":null,"abstract":"Underwater object-level mapping requires incorporating visual foundation\u0000models to handle the uncommon and often previously unseen object classes\u0000encountered in marine scenarios. In this work, a metric of semantic uncertainty\u0000for open-set object detections produced by visual foundation models is\u0000calculated and then incorporated into an object-level uncertainty tracking\u0000framework. Object-level uncertainties and geometric relationships between\u0000objects are used to enable robust object-level loop closure detection for\u0000unknown object classes. The above loop closure detection problem is formulated\u0000as a graph-matching problem. While graph matching, in general, is NP-Complete,\u0000a solver for an equivalent formulation of the proposed graph matching problem\u0000as a graph editing problem is tested on multiple challenging underwater scenes.\u0000Results for this solver as well as three other solvers demonstrate that the\u0000proposed methods are feasible for real-time use in marine environments for the\u0000robust, open-set, multi-object, semantic-uncertainty-aware loop closure\u0000detection. Further experimental results on the KITTI dataset demonstrate that\u0000the method generalizes to large-scale terrestrial scenes.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142266867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning UniLCD:通过强化学习进行统一本地云决策
Pub Date : 2024-09-17 DOI: arxiv-2409.11403
Kathakoli Sengupta, Zhongkai Shagguan, Sandesh Bharadwaj, Sanjay Arora, Eshed Ohn-Bar, Renato Mancuso
Embodied vision-based real-world systems, such as mobile robots, require acareful balance between energy consumption, compute latency, and safetyconstraints to optimize operation across dynamic tasks and contexts. As localcomputation tends to be restricted, offloading the computation, ie, to a remoteserver, can save local resources while providing access to high-qualitypredictions from powerful and large models. However, the resultingcommunication and latency overhead has led to limited usability of cloud modelsin dynamic, safety-critical, real-time settings. To effectively address thistrade-off, we introduce UniLCD, a novel hybrid inference framework for enablingflexible local-cloud collaboration. By efficiently optimizing a flexiblerouting module via reinforcement learning and a suitable multi-task objective,UniLCD is specifically designed to support the multiple constraints ofsafety-critical end-to-end mobile systems. We validate the proposed approachusing a challenging, crowded navigation task requiring frequent and timelyswitching between local and cloud operations. UniLCD demonstrates improvedoverall performance and efficiency, by over 35% compared to state-of-the-artbaselines based on various split computing and early exit strategies.
基于嵌入式视觉的真实世界系统(如移动机器人)需要在能耗、计算延迟和安全限制之间取得谨慎的平衡,以优化在动态任务和环境中的运行。由于本地计算往往受到限制,因此将计算卸载到远程服务器上可以节省本地资源,同时还能从强大的大型模型中获取高质量的预测结果。然而,由此产生的通信和延迟开销导致云模型在动态、安全关键、实时环境中的可用性受到限制。为了有效解决这一矛盾,我们引入了 UniLCD,这是一种新颖的混合推理框架,用于实现灵活的本地-云协作。通过强化学习和合适的多任务目标对灵活路由模块进行有效优化,UniLCD 专为支持安全关键型端到端移动系统的多重约束而设计。我们利用一项具有挑战性的拥挤导航任务验证了所提出的方法,该任务要求在本地操作和云操作之间频繁、及时地切换。与基于各种分离计算和早期退出策略的先进基线相比,UniLCD 的整体性能和效率提高了 35% 以上。
{"title":"UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning","authors":"Kathakoli Sengupta, Zhongkai Shagguan, Sandesh Bharadwaj, Sanjay Arora, Eshed Ohn-Bar, Renato Mancuso","doi":"arxiv-2409.11403","DOIUrl":"https://doi.org/arxiv-2409.11403","url":null,"abstract":"Embodied vision-based real-world systems, such as mobile robots, require a\u0000careful balance between energy consumption, compute latency, and safety\u0000constraints to optimize operation across dynamic tasks and contexts. As local\u0000computation tends to be restricted, offloading the computation, ie, to a remote\u0000server, can save local resources while providing access to high-quality\u0000predictions from powerful and large models. However, the resulting\u0000communication and latency overhead has led to limited usability of cloud models\u0000in dynamic, safety-critical, real-time settings. To effectively address this\u0000trade-off, we introduce UniLCD, a novel hybrid inference framework for enabling\u0000flexible local-cloud collaboration. By efficiently optimizing a flexible\u0000routing module via reinforcement learning and a suitable multi-task objective,\u0000UniLCD is specifically designed to support the multiple constraints of\u0000safety-critical end-to-end mobile systems. We validate the proposed approach\u0000using a challenging, crowded navigation task requiring frequent and timely\u0000switching between local and cloud operations. UniLCD demonstrates improved\u0000overall performance and efficiency, by over 35% compared to state-of-the-art\u0000baselines based on various split computing and early exit strategies.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142266869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MI-HGNN: Morphology-Informed Heterogeneous Graph Neural Network for Legged Robot Contact Perception MI-HGNN:用于腿部机器人接触感知的形态信息异构图神经网络
Pub Date : 2024-09-17 DOI: arxiv-2409.11146
Daniel Butterfield, Sandilya Sai Garimella, Nai-Jen Cheng, Lu Gan
We present a Morphology-Informed Heterogeneous Graph Neural Network (MI-HGNN)for learning-based contact perception. The architecture and connectivity of theMI-HGNN are constructed from the robot morphology, in which nodes and edges arerobot joints and links, respectively. By incorporating the morphology-informedconstraints into a neural network, we improve a learning-based approach usingmodel-based knowledge. We apply the proposed MI-HGNN to two contact perceptionproblems, and conduct extensive experiments using both real-world and simulateddata collected using two quadruped robots. Our experiments demonstrate thesuperiority of our method in terms of effectiveness, generalization ability,model efficiency, and sample efficiency. Our MI-HGNN improved the performanceof a state-of-the-art model that leverages robot morphological symmetry by 8.4%with only 0.21% of its parameters. Although MI-HGNN is applied to contactperception problems for legged robots in this work, it can be seamlesslyapplied to other types of multi-body dynamical systems and has the potential toimprove other robot learning frameworks. Our code is made publicly available athttps://github.com/lunarlab-gatech/Morphology-Informed-HGNN.
我们提出了一种用于基于学习的接触感知的形态信息异构图神经网络(MI-HGNN)。MI-HGNN的结构和连通性是根据机器人形态构建的,其中节点和边分别是机器人的关节和链接。通过将形态学信息约束纳入神经网络,我们利用基于模型的知识改进了基于学习的方法。我们将提出的 MI-HGNN 应用于两个接触感知问题,并使用两个四足机器人收集的真实世界数据和模拟数据进行了大量实验。实验证明,我们的方法在有效性、泛化能力、模型效率和样本效率方面都更胜一筹。我们的 MI-HGNN 仅用 0.21% 的参数就将利用机器人形态对称性的最先进模型的性能提高了 8.4%。虽然 MI-HGNN 在这项工作中应用于腿部机器人的接触感知问题,但它可以无缝应用于其他类型的多体动力学系统,并有可能改进其他机器人学习框架。我们的代码可在https://github.com/lunarlab-gatech/Morphology-Informed-HGNN。
{"title":"MI-HGNN: Morphology-Informed Heterogeneous Graph Neural Network for Legged Robot Contact Perception","authors":"Daniel Butterfield, Sandilya Sai Garimella, Nai-Jen Cheng, Lu Gan","doi":"arxiv-2409.11146","DOIUrl":"https://doi.org/arxiv-2409.11146","url":null,"abstract":"We present a Morphology-Informed Heterogeneous Graph Neural Network (MI-HGNN)\u0000for learning-based contact perception. The architecture and connectivity of the\u0000MI-HGNN are constructed from the robot morphology, in which nodes and edges are\u0000robot joints and links, respectively. By incorporating the morphology-informed\u0000constraints into a neural network, we improve a learning-based approach using\u0000model-based knowledge. We apply the proposed MI-HGNN to two contact perception\u0000problems, and conduct extensive experiments using both real-world and simulated\u0000data collected using two quadruped robots. Our experiments demonstrate the\u0000superiority of our method in terms of effectiveness, generalization ability,\u0000model efficiency, and sample efficiency. Our MI-HGNN improved the performance\u0000of a state-of-the-art model that leverages robot morphological symmetry by 8.4%\u0000with only 0.21% of its parameters. Although MI-HGNN is applied to contact\u0000perception problems for legged robots in this work, it can be seamlessly\u0000applied to other types of multi-body dynamical systems and has the potential to\u0000improve other robot learning frameworks. Our code is made publicly available at\u0000https://github.com/lunarlab-gatech/Morphology-Informed-HGNN.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142266988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SDP: Spiking Diffusion Policy for Robotic Manipulation with Learnable Channel-Wise Membrane Thresholds SDP:利用可学习的通道膜阈值实现机器人操纵的尖峰扩散策略
Pub Date : 2024-09-17 DOI: arxiv-2409.11195
Zhixing Hou, Maoxu Gao, Hang Yu, Mengyu Yang, Chio-In Ieong
This paper introduces a Spiking Diffusion Policy (SDP) learning method forrobotic manipulation by integrating Spiking Neurons and Learnable Channel-wiseMembrane Thresholds (LCMT) into the diffusion policy model, thereby enhancingcomputational efficiency and achieving high performance in evaluated tasks.Specifically, the proposed SDP model employs the U-Net architecture as thebackbone for diffusion learning within the Spiking Neural Network (SNN). Itstrategically places residual connections between the spike convolutionoperations and the Leaky Integrate-and-Fire (LIF) nodes, thereby preventingdisruptions to the spiking states. Additionally, we introduce a temporalencoding block and a temporal decoding block to transform static and dynamicdata with timestep $T_S$ into each other, enabling the transmission of datawithin the SNN in spike format. Furthermore, we propose LCMT to enable theadaptive acquisition of membrane potential thresholds, thereby matching theconditions of varying membrane potentials and firing rates across channels andavoiding the cumbersome process of manually setting and tuning hyperparameters.Evaluating the SDP model on seven distinct tasks with SNN timestep $T_S=4$, weachieve results comparable to those of the ANN counterparts, along with fasterconvergence speeds than the baseline SNN method. This improvement isaccompanied by a reduction of 94.3% in dynamic energy consumption estimated on45nm hardware.
本文介绍了一种用于机器人操纵的尖峰扩散策略(SDP)学习方法,它将尖峰神经元和可学习通道膜阈值(LCMT)集成到扩散策略模型中,从而提高了计算效率,并在评估任务中实现了高性能。具体来说,所提出的SDP模型采用U-Net架构作为尖峰神经网络(SNN)内扩散学习的骨干。它策略性地将残余连接置于尖峰卷积迭代和泄漏整合与发射(LIF)节点之间,从而防止尖峰状态受到破坏。此外,我们还引入了一个时序编码块和一个时序解码块,将时间步长为 $T_S$ 的静态数据和动态数据相互转换,从而在 SNN 中以尖峰格式传输数据。此外,我们还提出了 LCMT,以实现膜电位阈值的自适应采集,从而匹配不同通道的不同膜电位和发射率条件,并避免了手动设置和调整超参数的繁琐过程。在 SNN 时间步为 $T_S=4$ 的七个不同任务上评估了 SDP 模型,我们取得了与 ANN 对应模型相当的结果,而且收敛速度比基准 SNN 方法更快。同时,在 45 纳米硬件上估算的动态能耗降低了 94.3%。
{"title":"SDP: Spiking Diffusion Policy for Robotic Manipulation with Learnable Channel-Wise Membrane Thresholds","authors":"Zhixing Hou, Maoxu Gao, Hang Yu, Mengyu Yang, Chio-In Ieong","doi":"arxiv-2409.11195","DOIUrl":"https://doi.org/arxiv-2409.11195","url":null,"abstract":"This paper introduces a Spiking Diffusion Policy (SDP) learning method for\u0000robotic manipulation by integrating Spiking Neurons and Learnable Channel-wise\u0000Membrane Thresholds (LCMT) into the diffusion policy model, thereby enhancing\u0000computational efficiency and achieving high performance in evaluated tasks.\u0000Specifically, the proposed SDP model employs the U-Net architecture as the\u0000backbone for diffusion learning within the Spiking Neural Network (SNN). It\u0000strategically places residual connections between the spike convolution\u0000operations and the Leaky Integrate-and-Fire (LIF) nodes, thereby preventing\u0000disruptions to the spiking states. Additionally, we introduce a temporal\u0000encoding block and a temporal decoding block to transform static and dynamic\u0000data with timestep $T_S$ into each other, enabling the transmission of data\u0000within the SNN in spike format. Furthermore, we propose LCMT to enable the\u0000adaptive acquisition of membrane potential thresholds, thereby matching the\u0000conditions of varying membrane potentials and firing rates across channels and\u0000avoiding the cumbersome process of manually setting and tuning hyperparameters.\u0000Evaluating the SDP model on seven distinct tasks with SNN timestep $T_S=4$, we\u0000achieve results comparable to those of the ANN counterparts, along with faster\u0000convergence speeds than the baseline SNN method. This improvement is\u0000accompanied by a reduction of 94.3% in dynamic energy consumption estimated on\u000045nm hardware.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142266985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Annealed Winner-Takes-All for Motion Forecasting 运动预测中的退火胜者为王
Pub Date : 2024-09-17 DOI: arxiv-2409.11172
Yihong Xu, Victor Letzelter, Mickaël Chen, Éloi Zablocki, Matthieu Cord
In autonomous driving, motion prediction aims at forecasting the futuretrajectories of nearby agents, helping the ego vehicle to anticipate behaviorsand drive safely. A key challenge is generating a diverse set of futurepredictions, commonly addressed using data-driven models with Multiple ChoiceLearning (MCL) architectures and Winner-Takes-All (WTA) training objectives.However, these methods face initialization sensitivity and traininginstabilities. Additionally, to compensate for limited performance, someapproaches rely on training with a large set of hypotheses, requiring apost-selection step during inference to significantly reduce the number ofpredictions. To tackle these issues, we take inspiration from annealed MCL, arecently introduced technique that improves the convergence properties of MCLmethods through an annealed Winner-Takes-All loss (aWTA). In this paper, wedemonstrate how the aWTA loss can be integrated with state-of-the-art motionforecasting models to enhance their performance using only a minimal set ofhypotheses, eliminating the need for the cumbersome post-selection step. Ourapproach can be easily incorporated into any trajectory prediction modelnormally trained using WTA and yields significant improvements. To facilitatethe application of our approach to future motion forecasting models, the codewill be made publicly available upon acceptance:https://github.com/valeoai/MF_aWTA.
在自动驾驶中,运动预测旨在预测附近代理的未来轨迹,帮助自我车辆预测行为并安全驾驶。一个关键的挑战是如何生成一组多样化的未来预测,通常使用数据驱动模型、多选学习(MCL)架构和赢家通吃(WTA)训练目标来解决这一问题。此外,为了弥补有限的性能,一些方法依赖于使用大量假设集进行训练,这就需要在推理过程中进行后选择步骤,以大幅减少预测次数。为了解决这些问题,我们从退火 MCL 中汲取灵感。退火 MCL 是最近引入的一种技术,它通过退火赢家通吃损失(aWTA)改善了 MCL 方法的收敛特性。在本文中,我们演示了如何将 aWTA 损失与最新的运动预测模型相结合,以提高其性能,只需使用最小的假设集,而无需繁琐的后选步骤。我们的方法可以很容易地集成到任何通常使用 WTA 训练的轨迹预测模型中,并产生显著的改进。为了便于将我们的方法应用到未来的运动预测模型中,我们将在接受后公开代码:https://github.com/valeoai/MF_aWTA。
{"title":"Annealed Winner-Takes-All for Motion Forecasting","authors":"Yihong Xu, Victor Letzelter, Mickaël Chen, Éloi Zablocki, Matthieu Cord","doi":"arxiv-2409.11172","DOIUrl":"https://doi.org/arxiv-2409.11172","url":null,"abstract":"In autonomous driving, motion prediction aims at forecasting the future\u0000trajectories of nearby agents, helping the ego vehicle to anticipate behaviors\u0000and drive safely. A key challenge is generating a diverse set of future\u0000predictions, commonly addressed using data-driven models with Multiple Choice\u0000Learning (MCL) architectures and Winner-Takes-All (WTA) training objectives.\u0000However, these methods face initialization sensitivity and training\u0000instabilities. Additionally, to compensate for limited performance, some\u0000approaches rely on training with a large set of hypotheses, requiring a\u0000post-selection step during inference to significantly reduce the number of\u0000predictions. To tackle these issues, we take inspiration from annealed MCL, a\u0000recently introduced technique that improves the convergence properties of MCL\u0000methods through an annealed Winner-Takes-All loss (aWTA). In this paper, we\u0000demonstrate how the aWTA loss can be integrated with state-of-the-art motion\u0000forecasting models to enhance their performance using only a minimal set of\u0000hypotheses, eliminating the need for the cumbersome post-selection step. Our\u0000approach can be easily incorporated into any trajectory prediction model\u0000normally trained using WTA and yields significant improvements. To facilitate\u0000the application of our approach to future motion forecasting models, the code\u0000will be made publicly available upon acceptance:\u0000https://github.com/valeoai/MF_aWTA.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142266987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The 1st InterAI Workshop: Interactive AI for Human-centered Robotics 第 1 届 InterAI 研讨会:以人为中心的机器人交互式人工智能
Pub Date : 2024-09-17 DOI: arxiv-2409.11150
Yuchong Zhang, Elmira Yadollahi, Yong Ma, Di Fu, Iolanda Leite, Danica Kragic
The workshop is affiliated with 33nd IEEE International Conference on Robotand Human Interactive Communication (RO-MAN 2024) August 26~30, 2023 /Pasadena, CA, USA. It is designed as a half-day event, extending over fourhours from 9:00 to 12:30 PST time. It accommodates both in-person and virtualattendees (via Zoom), ensuring a flexible participation mode. The agenda isthoughtfully crafted to include a diverse range of sessions: two keynotespeeches that promise to provide insightful perspectives, two dedicated paperpresentation sessions, an interactive panel discussion to foster dialogue amongexperts which facilitates deeper dives into specific topics, and a 15-minutecoffee break. The workshop website:https://sites.google.com/view/interaiworkshops/home.
研讨会将于 2023 年 8 月 26 日至 30 日在美国加利福尼亚州帕萨迪纳市举行。会议为期半天,从太平洋标准时间 9:00 到 12:30 共四个小时。会议可同时容纳现场和虚拟与会者(通过 Zoom),确保灵活的参与模式。会议议程经过深思熟虑,包括各种会议:两场主题演讲,保证提供有见地的观点;两场专门的论文报告会;一场互动式小组讨论,促进专家之间的对话,以便深入探讨特定主题;以及 15 分钟的茶歇。研讨会网站:https://sites.google.com/view/interaiworkshops/home。
{"title":"The 1st InterAI Workshop: Interactive AI for Human-centered Robotics","authors":"Yuchong Zhang, Elmira Yadollahi, Yong Ma, Di Fu, Iolanda Leite, Danica Kragic","doi":"arxiv-2409.11150","DOIUrl":"https://doi.org/arxiv-2409.11150","url":null,"abstract":"The workshop is affiliated with 33nd IEEE International Conference on Robot\u0000and Human Interactive Communication (RO-MAN 2024) August 26~30, 2023 /\u0000Pasadena, CA, USA. It is designed as a half-day event, extending over four\u0000hours from 9:00 to 12:30 PST time. It accommodates both in-person and virtual\u0000attendees (via Zoom), ensuring a flexible participation mode. The agenda is\u0000thoughtfully crafted to include a diverse range of sessions: two keynote\u0000speeches that promise to provide insightful perspectives, two dedicated paper\u0000presentation sessions, an interactive panel discussion to foster dialogue among\u0000experts which facilitates deeper dives into specific topics, and a 15-minute\u0000coffee break. The workshop website:\u0000https://sites.google.com/view/interaiworkshops/home.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Air-FAR: Fast and Adaptable Routing for Aerial Navigation in Large-scale Complex Unknown Environments Air-FAR:在大规模复杂未知环境中为空中导航提供快速、自适应的路由选择
Pub Date : 2024-09-17 DOI: arxiv-2409.11188
Botao He, Guofei Chen, Cornelia Fermuller, Yiannis Aloimonos, Ji Zhang
This paper presents a novel method for real-time 3D navigation inlarge-scale, complex environments using a hierarchical 3D visibility graph(V-graph). The proposed algorithm addresses the computational challenges ofV-graph construction and shortest path search on the graph simultaneously. Byintroducing hierarchical 3D V-graph construction with heuristic visibilityupdate, the 3D V-graph is constructed in O(K*n^2logn) time, which guaranteesreal-time performance. The proposed iterative divide-and-conquer path searchmethod can achieve near-optimal path solutions within the constraints ofreal-time operations. The algorithm ensures efficient 3D V-graph constructionand path search. Extensive simulated and real-world environments validated thatour algorithm reduces the travel time by 42%, achieves up to 24.8% highertrajectory efficiency, and runs faster than most benchmarks by orders ofmagnitude in complex environments. The code and developed simulator have beenopen-sourced to facilitate future research.
本文提出了一种使用分层三维可见度图(V-graph)在大规模复杂环境中进行实时三维导航的新方法。所提出的算法同时解决了V图构建和图上最短路径搜索的计算难题。通过引入分层三维可见性图构建和启发式可见性更新,三维可见性图的构建只需O(K*n^2logn)时间,保证了实时性。所提出的迭代分而治之路径搜索方法可以在实时操作的约束条件下获得接近最优的路径解。该算法确保了高效的三维 V 型图构建和路径搜索。大量的模拟和实际环境验证了我们的算法可以减少 42% 的旅行时间,实现高达 24.8% 的高轨迹效率,并且在复杂环境中的运行速度比大多数基准快了几个数量级。代码和开发的模拟器已经开源,以方便未来的研究。
{"title":"Air-FAR: Fast and Adaptable Routing for Aerial Navigation in Large-scale Complex Unknown Environments","authors":"Botao He, Guofei Chen, Cornelia Fermuller, Yiannis Aloimonos, Ji Zhang","doi":"arxiv-2409.11188","DOIUrl":"https://doi.org/arxiv-2409.11188","url":null,"abstract":"This paper presents a novel method for real-time 3D navigation in\u0000large-scale, complex environments using a hierarchical 3D visibility graph\u0000(V-graph). The proposed algorithm addresses the computational challenges of\u0000V-graph construction and shortest path search on the graph simultaneously. By\u0000introducing hierarchical 3D V-graph construction with heuristic visibility\u0000update, the 3D V-graph is constructed in O(K*n^2logn) time, which guarantees\u0000real-time performance. The proposed iterative divide-and-conquer path search\u0000method can achieve near-optimal path solutions within the constraints of\u0000real-time operations. The algorithm ensures efficient 3D V-graph construction\u0000and path search. Extensive simulated and real-world environments validated that\u0000our algorithm reduces the travel time by 42%, achieves up to 24.8% higher\u0000trajectory efficiency, and runs faster than most benchmarks by orders of\u0000magnitude in complex environments. The code and developed simulator have been\u0000open-sourced to facilitate future research.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142266986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
3D Water Quality Mapping using Invariant Extended Kalman Filtering for Underwater Robot Localization 利用不变扩展卡尔曼滤波进行水下机器人定位的 3D 水质绘图
Pub Date : 2024-09-17 DOI: arxiv-2409.11578
Kaustubh Joshi, Tianchen Liu, Alan Williams, Matthew Gray, Xiaomin Lin, Nikhil Chopra
Water quality mapping for critical parameters such as temperature, salinity,and turbidity is crucial for assessing an aquaculture farm's health and yieldcapacity. Traditional approaches involve using boats or human divers, which aretime-constrained and lack depth variability. This work presents an innovativeapproach to 3D water quality mapping in shallow water environments using aBlueROV2 equipped with GPS and a water quality sensor. This system allows foraccurate location correction by resurfacing when errors occur. This study isbeing conducted at an oyster farm in the Chesapeake Bay, USA, providing a morecomprehensive and precise water quality analysis in aquaculture settings.
绘制温度、盐度和浊度等关键参数的水质图对于评估水产养殖场的健康状况和产量能力至关重要。传统方法需要使用船只或人力潜水员,这不仅受时间限制,而且缺乏深度可变性。这项工作提出了一种在浅水环境中使用配备全球定位系统和水质传感器的蓝鲸 2 号潜水器进行三维水质测绘的创新方法。该系统可在出现错误时通过重新浮出水面进行精确定位校正。这项研究正在美国切萨皮克湾的一个牡蛎养殖场进行,为水产养殖环境提供更全面、更精确的水质分析。
{"title":"3D Water Quality Mapping using Invariant Extended Kalman Filtering for Underwater Robot Localization","authors":"Kaustubh Joshi, Tianchen Liu, Alan Williams, Matthew Gray, Xiaomin Lin, Nikhil Chopra","doi":"arxiv-2409.11578","DOIUrl":"https://doi.org/arxiv-2409.11578","url":null,"abstract":"Water quality mapping for critical parameters such as temperature, salinity,\u0000and turbidity is crucial for assessing an aquaculture farm's health and yield\u0000capacity. Traditional approaches involve using boats or human divers, which are\u0000time-constrained and lack depth variability. This work presents an innovative\u0000approach to 3D water quality mapping in shallow water environments using a\u0000BlueROV2 equipped with GPS and a water quality sensor. This system allows for\u0000accurate location correction by resurfacing when errors occur. This study is\u0000being conducted at an oyster farm in the Chesapeake Bay, USA, providing a more\u0000comprehensive and precise water quality analysis in aquaculture settings.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142266864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Resilient and Adaptive Replanning for Multi-Robot Target Tracking with Sensing and Communication Danger Zones 利用传感和通信危险区为多机器人目标跟踪进行弹性和自适应重新规划
Pub Date : 2024-09-17 DOI: arxiv-2409.11230
Peihan Li, Yuwei Wu, Jiazhen Liu, Gaurav S. Sukhatme, Vijay Kumar, Lifeng Zhou
Multi-robot collaboration for target tracking presents significant challengesin hazardous environments, including addressing robot failures, dynamicpriority changes, and other unpredictable factors. Moreover, these challengesare increased in adversarial settings if the environment is unknown. In thispaper, we propose a resilient and adaptive framework for multi-robot,multi-target tracking in environments with unknown sensing and communicationdanger zones. The damages posed by these zones are temporary, allowing robotsto track targets while accepting the risk of entering dangerous areas. Weformulate the problem as an optimization with soft chance constraints, enablingreal-time adjustments to robot behavior based on varying types of dangers andfailures. An adaptive replanning strategy is introduced, featuring differenttriggers to improve group performance. This approach allows for dynamicprioritization of target tracking and risk aversion or resilience, depending onevolving resources and real-time conditions. To validate the effectiveness ofthe proposed method, we benchmark and evaluate it across multiple scenarios insimulation and conduct several real-world experiments.
在危险环境中,多机器人协作追踪目标面临着巨大挑战,包括处理机器人故障、优先级动态变化和其他不可预测因素。此外,在环境未知的对抗环境中,这些挑战会更加严峻。在本文中,我们提出了一种弹性自适应框架,用于在具有未知传感和通信危险区的环境中进行多机器人、多目标跟踪。这些区域造成的损害是暂时的,允许机器人在跟踪目标的同时接受进入危险区域的风险。我们将这一问题表述为一个带有软机会约束的优化问题,从而能够根据不同类型的危险和故障对机器人行为进行实时调整。我们引入了一种自适应重新规划策略,其特点是采用不同的触发器来提高群体性能。这种方法可以根据不断变化的资源和实时条件,对目标跟踪和风险规避或恢复能力进行动态优先排序。为了验证所提方法的有效性,我们在多个模拟场景中对其进行了基准测试和评估,并进行了多次真实世界实验。
{"title":"Resilient and Adaptive Replanning for Multi-Robot Target Tracking with Sensing and Communication Danger Zones","authors":"Peihan Li, Yuwei Wu, Jiazhen Liu, Gaurav S. Sukhatme, Vijay Kumar, Lifeng Zhou","doi":"arxiv-2409.11230","DOIUrl":"https://doi.org/arxiv-2409.11230","url":null,"abstract":"Multi-robot collaboration for target tracking presents significant challenges\u0000in hazardous environments, including addressing robot failures, dynamic\u0000priority changes, and other unpredictable factors. Moreover, these challenges\u0000are increased in adversarial settings if the environment is unknown. In this\u0000paper, we propose a resilient and adaptive framework for multi-robot,\u0000multi-target tracking in environments with unknown sensing and communication\u0000danger zones. The damages posed by these zones are temporary, allowing robots\u0000to track targets while accepting the risk of entering dangerous areas. We\u0000formulate the problem as an optimization with soft chance constraints, enabling\u0000real-time adjustments to robot behavior based on varying types of dangers and\u0000failures. An adaptive replanning strategy is introduced, featuring different\u0000triggers to improve group performance. This approach allows for dynamic\u0000prioritization of target tracking and risk aversion or resilience, depending on\u0000evolving resources and real-time conditions. To validate the effectiveness of\u0000the proposed method, we benchmark and evaluate it across multiple scenarios in\u0000simulation and conduct several real-world experiments.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142266982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Use the Force, Bot! -- Force-Aware ProDMP with Event-Based Replanning 使用原力,机器人-- 基于事件重新规划的力感知 ProDMP
Pub Date : 2024-09-17 DOI: arxiv-2409.11144
Paul Werner Lödige, Maximilian Xiling Li, Rudolf Lioutikov
Movement Primitives (MPs) are a well-established method for representing andgenerating modular robot trajectories. This work presents FA-ProDMP, a newapproach which introduces force awareness to Probabilistic Dynamic MovementPrimitives (ProDMP). FA-ProDMP adapts the trajectory during runtime to accountfor measured and desired forces. It offers smooth trajectories and capturesposition and force correlations over multiple trajectories, e.g. a set of humandemonstrations. FA-ProDMP supports multiple axes of force and is thus agnosticto cartesian or joint space control. This makes FA-ProDMP a valuable tool forlearning contact rich manipulation tasks such as polishing, cutting orindustrial assembly from demonstration. In order to reliably evaluateFA-ProDMP, this work additionally introduces a modular, 3D printed task suitecalled POEMPEL, inspired by the popular Lego Technic pins. POEMPEL mimicsindustrial peg-in-hole assembly tasks with force requirements. It offersmultiple parameters of adjustment, such as position, orientation and plugstiffness level, thus varying the direction and amount of required forces. Ourexperiments show that FA-ProDMP outperforms other MP formulations on thePOEMPEL setup and a electrical power plug insertion task, due to its replanningcapabilities based on the measured forces. These findings highlight howFA-ProDMP enhances the performance of robotic systems in contact-richmanipulation tasks.
运动原型(MP)是表示和生成模块化机器人轨迹的一种行之有效的方法。本研究提出的 FA-ProDMP 是一种新方法,它将力感知引入了概率动态运动原语 (ProDMP)。FA-ProDMP 可在运行时调整轨迹,以考虑测量到的力和期望的力。它提供平滑轨迹,并捕捉多个轨迹(例如一组人体演示)上的位置和力相关性。FA-ProDMP 支持多轴受力,因此与笛卡尔或关节空间控制无关。这使得 FA-ProDMP 成为学习丰富的接触操作任务(如抛光、切割或工业装配)的重要工具。为了对 FA-ProDMP 进行可靠的评估,这项工作还引入了一种名为 POEMPEL 的模块化 3D 打印任务套装,其灵感来源于广受欢迎的乐高 Technic 销钉。POEMPEL 模拟了具有力要求的工业钉入孔装配任务。它提供多种调节参数,如位置、方向和插头刚度水平,从而改变所需力的方向和大小。最近的实验表明,FA-ProDMP在POEMPEL设置和电源插头插入任务上的表现优于其他MP公式,这得益于它根据测量力重新规划的能力。这些发现凸显了FA-ProDMP是如何提高机器人系统在接触力操纵任务中的性能的。
{"title":"Use the Force, Bot! -- Force-Aware ProDMP with Event-Based Replanning","authors":"Paul Werner Lödige, Maximilian Xiling Li, Rudolf Lioutikov","doi":"arxiv-2409.11144","DOIUrl":"https://doi.org/arxiv-2409.11144","url":null,"abstract":"Movement Primitives (MPs) are a well-established method for representing and\u0000generating modular robot trajectories. This work presents FA-ProDMP, a new\u0000approach which introduces force awareness to Probabilistic Dynamic Movement\u0000Primitives (ProDMP). FA-ProDMP adapts the trajectory during runtime to account\u0000for measured and desired forces. It offers smooth trajectories and captures\u0000position and force correlations over multiple trajectories, e.g. a set of human\u0000demonstrations. FA-ProDMP supports multiple axes of force and is thus agnostic\u0000to cartesian or joint space control. This makes FA-ProDMP a valuable tool for\u0000learning contact rich manipulation tasks such as polishing, cutting or\u0000industrial assembly from demonstration. In order to reliably evaluate\u0000FA-ProDMP, this work additionally introduces a modular, 3D printed task suite\u0000called POEMPEL, inspired by the popular Lego Technic pins. POEMPEL mimics\u0000industrial peg-in-hole assembly tasks with force requirements. It offers\u0000multiple parameters of adjustment, such as position, orientation and plug\u0000stiffness level, thus varying the direction and amount of required forces. Our\u0000experiments show that FA-ProDMP outperforms other MP formulations on the\u0000POEMPEL setup and a electrical power plug insertion task, due to its replanning\u0000capabilities based on the measured forces. These findings highlight how\u0000FA-ProDMP enhances the performance of robotic systems in contact-rich\u0000manipulation tasks.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Robotics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1