Jose Andres Millan-Romera, Hriday Bavle, Muhammad Shaheer, Holger Voos, Jose Luis Sanchez-Lopez
Understanding the relationships between geometric structures and semantic concepts is crucial for building accurate models of complex environments. Indoors, certain spatial constraints, such as the relative positioning of planes, remain consistent despite variations in layout. This paper explores how these invariant relationships can be captured in a graph SLAM framework by representing high-level concepts like rooms and walls and linking them to geometric elements like planes through an optimizable factor graph. Previous efforts have tackled this issue with ad-hoc solutions for each concept generation and with manually defined factors. This paper proposes a novel method for metric-semantic factor graph generation that includes defining a semantic scene graph, integrating geometric information, and learning the interconnecting factors, all based on Graph Neural Networks (GNNs). An edge classification network (G-GNN) classifies the edges between planes as same-room, same-wall, or none. The resulting relations are clustered, generating a room or wall for each cluster. A second family of networks (F-GNN) infers the geometric origin of the new nodes. The definition of the factors employs the same F-GNN used for the metric attribute of the generated nodes. Furthermore, the new factor graph is shared with the S-Graphs+ algorithm, extending its graph expressiveness and scene representation with the ultimate goal of improving SLAM performance. The complexity of the environments is increased to N-plane rooms by training the networks on L-shaped rooms. The framework is evaluated in synthetic and simulated scenarios, as no real datasets of the required complex layouts are available.
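The clustering step described above can be sketched without the learned components: given plane-pair edges already labeled by a classifier, connected planes are grouped, and each multi-plane group becomes a candidate room or wall node. The labels and helper names below are illustrative, not the paper's API.

```python
# Hypothetical sketch: after an edge-classification GNN labels plane-plane
# edges as "same_room", "same_wall", or "none", connected components over the
# positively-labeled edges become candidate room/wall nodes. Union-find groups
# the planes.

class UnionFind:
    def __init__(self, n):
        self.parent = list(range(n))

    def find(self, x):
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]  # path halving
            x = self.parent[x]
        return x

    def union(self, a, b):
        ra, rb = self.find(a), self.find(b)
        if ra != rb:
            self.parent[rb] = ra

def cluster_planes(num_planes, labeled_edges, label):
    """Group plane indices connected by edges carrying `label`."""
    uf = UnionFind(num_planes)
    for i, j, lab in labeled_edges:
        if lab == label:
            uf.union(i, j)
    clusters = {}
    for p in range(num_planes):
        clusters.setdefault(uf.find(p), []).append(p)
    # keep only multi-plane clusters: each one would spawn a room/wall node
    return [c for c in clusters.values() if len(c) > 1]
```

For example, edges `[(0, 1, "same_room"), (1, 2, "same_room"), (3, 4, "same_wall")]` over five planes yield one three-plane room cluster and one two-plane wall cluster.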
{"title":"Metric-Semantic Factor Graph Generation based on Graph Neural Networks","authors":"Jose Andres Millan-Romera, Hriday Bavle, Muhammad Shaheer, Holger Voos, Jose Luis Sanchez-Lopez","doi":"arxiv-2409.11972","DOIUrl":"https://doi.org/arxiv-2409.11972","url":null,"abstract":"Understanding the relationships between geometric structures and semantic\u0000concepts is crucial for building accurate models of complex environments. In\u0000indoors, certain spatial constraints, such as the relative positioning of\u0000planes, remain consistent despite variations in layout. This paper explores how\u0000these invariant relationships can be captured in a graph SLAM framework by\u0000representing high-level concepts like rooms and walls, linking them to\u0000geometric elements like planes through an optimizable factor graph. Several\u0000efforts have tackled this issue with add-hoc solutions for each concept\u0000generation and with manually-defined factors. This paper proposes a novel method for metric-semantic factor graph\u0000generation which includes defining a semantic scene graph, integrating\u0000geometric information, and learning the interconnecting factors, all based on\u0000Graph Neural Networks (GNNs). An edge classification network (G-GNN) sorts the\u0000edges between planes into same room, same wall or none types. The resulting\u0000relations are clustered, generating a room or wall for each cluster. A second\u0000family of networks (F-GNN) infers the geometrical origin of the new nodes. The\u0000definition of the factors employs the same F-GNN used for the metric attribute\u0000of the generated nodes. Furthermore, share the new factor graph with the\u0000S-Graphs+ algorithm, extending its graph expressiveness and scene\u0000representation with the ultimate goal of improving the SLAM performance. The\u0000complexity of the environments is increased to N-plane rooms by training the\u0000networks on L-shaped rooms. 
The framework is evaluated in synthetic and\u0000simulated scenarios as no real datasets of the required complex layouts are\u0000available.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Safety is a critical concern for urban flights of autonomous Unmanned Aerial Vehicles. In populated environments, risk must be accounted for to produce an effective and safe path, a task known as risk-aware path planning. Risk-aware path planning can be modeled as a Constrained Shortest Path (CSP) problem, which aims to identify the shortest route that adheres to specified safety thresholds. CSP is NP-hard and poses significant computational challenges. Although many traditional methods can solve it accurately, they are computationally expensive. Our method introduces an additional safety dimension to the traditional A* algorithm (called ASD A*), enabling A* to handle CSP. Furthermore, we develop a custom learning-based heuristic using transformer-based neural networks, which significantly reduces the computational load and improves the performance of the ASD A* algorithm. The proposed method is validated in both random and realistic simulation scenarios.
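The core idea of adding a safety dimension to A* can be sketched as a search over (node, accumulated-risk) states that prunes any extension exceeding a risk budget. The graph, costs, and risks below are invented for illustration, and the zero heuristic stands in for the paper's learned transformer heuristic.

```python
import heapq

def asd_astar(graph, risk, start, goal, risk_budget, h=lambda n: 0.0):
    """A* over augmented states (node, accumulated risk).

    graph[n] -> list of (neighbor, edge_length); risk[(n, m)] -> edge risk.
    Returns (path_length, path_risk), or None if no path fits the budget.
    """
    # frontier entries: (f = g + h, g, node, accumulated risk)
    frontier = [(h(start), 0.0, start, 0.0)]
    best = {}  # (node, risk) -> lowest cost seen for that augmented state
    while frontier:
        f, g, node, r = heapq.heappop(frontier)
        if node == goal:
            return g, r
        key = (node, round(r, 9))
        if key in best and best[key] <= g:
            continue
        best[key] = g
        for nxt, length in graph.get(node, []):
            nr = r + risk[(node, nxt)]
            if nr > risk_budget:  # the safety dimension: prune unsafe extensions
                continue
            heapq.heappush(frontier, (g + length + h(nxt), g + length, nxt, nr))
    return None
```

On a toy graph where the geometrically shortest path is too risky, a tight budget forces the safer detour, while a loose budget recovers the ordinary shortest path.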
{"title":"Learning-accelerated A* Search for Risk-aware Path Planning","authors":"Jun Xiang, Junfei Xie, Jun Chen","doi":"arxiv-2409.11634","DOIUrl":"https://doi.org/arxiv-2409.11634","url":null,"abstract":"Safety is a critical concern for urban flights of autonomous Unmanned Aerial\u0000Vehicles. In populated environments, risk should be accounted for to produce an\u0000effective and safe path, known as risk-aware path planning. Risk-aware path\u0000planning can be modeled as a Constrained Shortest Path (CSP) problem, aiming to\u0000identify the shortest possible route that adheres to specified safety\u0000thresholds. CSP is NP-hard and poses significant computational challenges.\u0000Although many traditional methods can solve it accurately, all of them are very\u0000slow. Our method introduces an additional safety dimension to the traditional\u0000A* (called ASD A*), enabling A* to handle CSP. Furthermore, we develop a custom\u0000learning-based heuristic using transformer-based neural networks, which\u0000significantly reduces the computational load and improves the performance of\u0000the ASD A* algorithm. The proposed method is well-validated with both random\u0000and realistic simulation scenarios.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142269797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
We present Residual Descent Differential Dynamic Game (RD3G), a Newton-based solver for constrained multi-agent game-control problems. The proposed solver seeks a local Nash equilibrium for problems where agents are coupled through their rewards and state constraints. We compare the proposed method against competing state-of-the-art techniques and showcase the computational benefits of the RD3G algorithm on several example problems.
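As a much simpler stand-in for the Newton-based RD3G solver, the notion of a local Nash equilibrium in a game with coupled rewards can be illustrated with iterated best response on a two-player quadratic game, where each player's best response has closed form. The costs below are invented for illustration.

```python
# Illustrative sketch only (not the RD3G solver): seeking a Nash equilibrium
# of a two-player game with coupled quadratic costs via iterated best response.
#   J1(x, y) = x**2 + x*y - 2*x   ->  best response x = (2 - y) / 2
#   J2(x, y) = y**2 + x*y - 4*y   ->  best response y = (4 - x) / 2

def nash_best_response(x0=1.0, y0=1.0, iters=50):
    x, y = x0, y0
    for _ in range(iters):
        x = (2 - y) / 2   # player 1 minimizes J1 given y
        y = (4 - x) / 2   # player 2 minimizes J2 given x
    return x, y
```

The iteration contracts to the fixed point (0, 2), where neither player can lower their own cost by unilaterally deviating, which is the defining property of a Nash equilibrium.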
{"title":"Residual Descent Differential Dynamic Game (RD3G) -- A Fast Newton Solver for Constrained General Sum Games","authors":"Zhiyuan Zhang, Panagiotis Tsiotras","doi":"arxiv-2409.12152","DOIUrl":"https://doi.org/arxiv-2409.12152","url":null,"abstract":"We present Residual Descent Differential Dynamic Game (RD3G), a Newton-based\u0000solver for constrained multi-agent game-control problems. The proposed solver\u0000seeks a local Nash equilibrium for problems where agents are coupled through\u0000their rewards and state constraints. We compare the proposed method against\u0000competing state-of-the-art techniques and showcase the computational benefits\u0000of the RD3G algorithm on several example problems.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The problem of safety for robotic systems has been extensively studied. However, little attention has been given to security issues for three-dimensional systems, such as quadrotors. Malicious adversaries can compromise robot sensors and communication networks, causing incidents, achieving illegal objectives, or even injuring people. This study first designs an intelligent control system for autonomous quadrotors. Then, it investigates the problems of optimal false data injection attack scheduling and countermeasure design for unmanned aerial vehicles. Using a state-of-the-art deep learning-based approach, an optimal false data injection attack scheme is proposed to deteriorate a quadrotor's tracking performance with limited attack energy. Subsequently, an optimal tracking control strategy is learned to mitigate attacks and recover the quadrotor's tracking performance. We base our work on Agilicious, a state-of-the-art quadrotor recently deployed for autonomous settings. This paper is the first in the United Kingdom to deploy this quadrotor and implement reinforcement learning on its platform. Therefore, to promote easy reproducibility with minimal engineering overhead, we further provide (1) a comprehensive breakdown of this quadrotor, including software stacks and hardware alternatives; (2) a detailed reinforcement-learning framework to train autonomous controllers on Agilicious agents; and (3) a new open-source environment that builds upon PyFlyt for future reinforcement learning research on Agilicious platforms. Both simulated and real-world experiments are conducted to show the effectiveness of the proposed frameworks in section 5.2.
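The effect of a constant false-data-injection attack on tracking can be illustrated with a toy first-order plant and proportional controller: a bias added to the measurement shifts the steady state away from the reference by exactly the bias. The dynamics and gains here are illustrative, not the paper's quadrotor model.

```python
# Hedged sketch of the false-data-injection idea: a proportional tracking
# controller acts on a corrupted measurement y = x + bias, so the true state
# settles offset from the reference by the injected bias.

def track(ref=1.0, bias=0.0, steps=200, k=0.5):
    x = 0.0
    for _ in range(steps):
        y = x + bias          # attacker injects a constant measurement bias
        u = k * (ref - y)     # controller trusts the corrupted reading
        x = x + u             # simple first-order plant
    return x
```

Without an attack the state converges to the reference; with a 0.3 bias it settles 0.3 below it, which is the kind of tracking degradation the optimal-attack and countermeasure design target.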
{"title":"Secure Control Systems for Autonomous Quadrotors against Cyber-Attacks","authors":"Samuel Belkadi","doi":"arxiv-2409.11897","DOIUrl":"https://doi.org/arxiv-2409.11897","url":null,"abstract":"The problem of safety for robotic systems has been extensively studied.\u0000However, little attention has been given to security issues for\u0000three-dimensional systems, such as quadrotors. Malicious adversaries can\u0000compromise robot sensors and communication networks, causing incidents,\u0000achieving illegal objectives, or even injuring people. This study first designs\u0000an intelligent control system for autonomous quadrotors. Then, it investigates\u0000the problems of optimal false data injection attack scheduling and\u0000countermeasure design for unmanned aerial vehicles. Using a state-of-the-art\u0000deep learning-based approach, an optimal false data injection attack scheme is\u0000proposed to deteriorate a quadrotor's tracking performance with limited attack\u0000energy. Subsequently, an optimal tracking control strategy is learned to\u0000mitigate attacks and recover the quadrotor's tracking performance. We base our\u0000work on Agilicious, a state-of-the-art quadrotor recently deployed for\u0000autonomous settings. This paper is the first in the United Kingdom to deploy\u0000this quadrotor and implement reinforcement learning on its platform. Therefore,\u0000to promote easy reproducibility with minimal engineering overhead, we further\u0000provide (1) a comprehensive breakdown of this quadrotor, including software\u0000stacks and hardware alternatives; (2) a detailed reinforcement-learning\u0000framework to train autonomous controllers on Agilicious agents; and (3) a new\u0000open-source environment that builds upon PyFlyt for future reinforcement\u0000learning research on Agilicious platforms. 
Both simulated and real-world\u0000experiments are conducted to show the effectiveness of the proposed frameworks\u0000in section 5.2.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142266827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Stefano Ferraro, Pietro Mazzaglia, Tim Verbelen, Bart Dhoedt, Sai Rajeswar
Object manipulation capabilities are essential skills that set apart embodied agents engaging with the world, especially in the realm of robotics. The ability to predict the outcomes of interactions with objects is paramount in this setting. While model-based control methods have started to be employed for tackling manipulation tasks, they have faced challenges in accurately manipulating objects. Analyzing the causes of this limitation, we trace the underperformance to the way current world models represent crucial positional information, especially the target's goal specification for object-positioning tasks. We introduce a general approach that empowers world model-based agents to effectively solve object-positioning tasks. We propose two variants of this approach for generative world models: position-conditioned (PCP) and latent-conditioned (LCP) policy learning. In particular, LCP employs object-centric latent representations that explicitly capture object positional information for goal specification. This naturally leads to the emergence of multimodal capabilities, enabling goals to be specified through spatial coordinates or a visual goal. Our methods are rigorously evaluated across several manipulation environments, showing favorable performance compared to current model-based control approaches.
{"title":"Representing Positional Information in Generative World Models for Object Manipulation","authors":"Stefano Ferraro, Pietro Mazzaglia, Tim Verbelen, Bart Dhoedt, Sai Rajeswar","doi":"arxiv-2409.12005","DOIUrl":"https://doi.org/arxiv-2409.12005","url":null,"abstract":"Object manipulation capabilities are essential skills that set apart embodied\u0000agents engaging with the world, especially in the realm of robotics. The\u0000ability to predict outcomes of interactions with objects is paramount in this\u0000setting. While model-based control methods have started to be employed for\u0000tackling manipulation tasks, they have faced challenges in accurately\u0000manipulating objects. As we analyze the causes of this limitation, we identify\u0000the cause of underperformance in the way current world models represent crucial\u0000positional information, especially about the target's goal specification for\u0000object positioning tasks. We introduce a general approach that empowers world\u0000model-based agents to effectively solve object-positioning tasks. We propose\u0000two declinations of this approach for generative world models:\u0000position-conditioned (PCP) and latent-conditioned (LCP) policy learning. In\u0000particular, LCP employs object-centric latent representations that explicitly\u0000capture object positional information for goal specification. This naturally\u0000leads to the emergence of multimodal capabilities, enabling the specification\u0000of goals through spatial coordinates or a visual goal. 
Our methods are\u0000rigorously evaluated across several manipulation environments, showing\u0000favorable performance compared to current model-based control approaches.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
With the emergence of new flapping-wing micro aerial vehicle (FWMAV) designs, a need for extensive and advanced mission capabilities arises. FWMAVs try to adapt and emulate the flight features of birds and flying insects. While current designs already achieve high manoeuvrability, they still almost entirely lack perching and take-off abilities. These capabilities could, for instance, enable long-term monitoring and surveillance missions, and operations in cluttered environments or in proximity to humans and animals. We present the development and testing of a framework that enables repeatable perching and take-off for small to medium-sized FWMAVs, utilising soft, non-damaging grippers. Thanks to its novel active-passive actuation system, an energy-conserving state can be achieved and indefinitely maintained while the vehicle is perched. A prototype of the proposed system weighing under 39 g was manufactured and extensively tested on a 110 g flapping-wing robot. Successful free-flight tests demonstrated the full mission cycle of landing, perching and subsequent take-off. The telemetry data recorded during the flights yields extensive insight into the system's behaviour and is a valuable step towards full automation and optimisation of the entire take-off and landing cycle.
{"title":"Repeatable Energy-Efficient Perching for Flapping-Wing Robots Using Soft Grippers","authors":"Krispin C. V. Broers, Sophie F. Armanini","doi":"arxiv-2409.11921","DOIUrl":"https://doi.org/arxiv-2409.11921","url":null,"abstract":"With the emergence of new flapping-wing micro aerial vehicle (FWMAV) designs,\u0000a need for extensive and advanced mission capabilities arises. FWMAVs try to\u0000adapt and emulate the flight features of birds and flying insects. While\u0000current designs already achieve high manoeuvrability, they still almost\u0000entirely lack perching and take-off abilities. These capabilities could, for\u0000instance, enable long-term monitoring and surveillance missions, and operations\u0000in cluttered environments or in proximity to humans and animals. We present the\u0000development and testing of a framework that enables repeatable perching and\u0000take-off for small to medium-sized FWMAVs, utilising soft, non-damaging\u0000grippers. Thanks to its novel active-passive actuation system, an\u0000energy-conserving state can be achieved and indefinitely maintained while the\u0000vehicle is perched. A prototype of the proposed system weighing under 39 g was\u0000manufactured and extensively tested on a 110 g flapping-wing robot. Successful\u0000free-flight tests demonstrated the full mission cycle of landing, perching and\u0000subsequent take-off. 
The telemetry data recorded during the flights yields\u0000extensive insight into the system's behaviour and is a valuable step towards\u0000full automation and optimisation of the entire take-off and landing cycle.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142266826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Gang Chen, Zhaoying Wang, Wei Dong, Javier Alonso-Mora
Representing the 3D environment with instance-aware semantic and geometric information is crucial for interaction-aware robots in dynamic environments. Nonetheless, creating such a representation poses challenges due to sensor noise, instance segmentation and tracking errors, and the objects' dynamic motion. This paper introduces a novel particle-based instance-aware semantic occupancy map to tackle these challenges. Particles with an augmented instance state are used to estimate the Probability Hypothesis Density (PHD) of the objects and implicitly model the environment. Utilizing a State-augmented Sequential Monte Carlo PHD (S$^2$MC-PHD) filter, these particles are updated to jointly estimate occupancy status, semantic labels, and instance IDs, mitigating noise. Additionally, a memory module is adopted to enhance the map's responsiveness to previously observed objects. Experimental results on the Virtual KITTI 2 dataset demonstrate that the proposed approach surpasses state-of-the-art methods across multiple metrics under different noise conditions. Subsequent tests using real-world data further validate the effectiveness of the proposed approach.
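A minimal sketch of the underlying sequential Monte Carlo step (not the S$^2$MC-PHD filter itself): particles carrying a semantic label are reweighted by an observation likelihood and resampled, the basic mechanism the augmented-state particles build on. The labels and likelihood values are invented for illustration.

```python
import random

def smc_step(particles, likelihood, rng):
    """One reweight-and-resample step over labeled particles."""
    weights = [likelihood(p) for p in particles]
    total = sum(weights)
    weights = [w / total for w in weights]      # normalize to a distribution
    # multinomial resampling: high-weight particles tend to be duplicated,
    # low-weight ones tend to die out
    resampled = rng.choices(particles, weights=weights, k=len(particles))
    return resampled, weights
```

For instance, with three "chair" and three "table" particles and an observation likelihood favoring "chair" (0.9 vs 0.1), each chair particle carries normalized weight 0.3 and dominates the resampled set on average.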
{"title":"Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments","authors":"Gang Chen, Zhaoying Wang, Wei Dong, Javier Alonso-Mora","doi":"arxiv-2409.11975","DOIUrl":"https://doi.org/arxiv-2409.11975","url":null,"abstract":"Representing the 3D environment with instance-aware semantic and geometric\u0000information is crucial for interaction-aware robots in dynamic environments.\u0000Nonetheless, creating such a representation poses challenges due to sensor\u0000noise, instance segmentation and tracking errors, and the objects' dynamic\u0000motion. This paper introduces a novel particle-based instance-aware semantic\u0000occupancy map to tackle these challenges. Particles with an augmented instance\u0000state are used to estimate the Probability Hypothesis Density (PHD) of the\u0000objects and implicitly model the environment. Utilizing a State-augmented\u0000Sequential Monte Carlo PHD (S$^2$MC-PHD) filter, these particles are updated to\u0000jointly estimate occupancy status, semantic, and instance IDs, mitigating\u0000noise. Additionally, a memory module is adopted to enhance the map's\u0000responsiveness to previously observed objects. Experimental results on the\u0000Virtual KITTI 2 dataset demonstrate that the proposed approach surpasses\u0000state-of-the-art methods across multiple metrics under different noise\u0000conditions. 
Subsequent tests using real-world data further validate the\u0000effectiveness of the proposed approach.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Environments with large terrain height variations present great challenges for legged robot locomotion. Drawing inspiration from fire ants' collective assembly behavior, we study strategies that enable two ``connectable'' robots to collectively navigate over bumpy terrains with height variations larger than the robot leg length. Each robot was designed to be extremely simple, with a cubical body and one rotary motor actuating four vertical peg legs that move in pairs. Two or more robots can physically connect to one another to enhance collective mobility. We performed locomotion experiments with a two-robot group across an obstacle field filled with uniformly distributed semi-spherical ``boulders''. Experimentally measured robot speed suggested that the connection length between the robots has a significant effect on collective mobility: connection lengths C in [0.86, 0.9] robot unit body lengths (UBL) produced sustainable movement across the obstacle field, whereas connection lengths C in [0.63, 0.84] and [0.92, 1.1] UBL resulted in low traversability. An energy-landscape-based model revealed the underlying mechanism of how connection length modulates collective mobility through the system's potential energy landscape, and informed strategies for the two-robot system to adapt its connection length for traversing obstacle fields with varying spatial frequencies. Our results demonstrated that by varying the connection configuration between the robots, the two-robot system could leverage mechanical intelligence to better utilize obstacle interaction forces and produce improved locomotion. Going forward, we envision that generalized principles of robot-environment coupling can inform design and control strategies for large groups of small robots to achieve ant-like collective environment negotiation.
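The energy-landscape mechanism can be illustrated with a toy model: approximating the terrain as a sinusoid and the pair's potential energy as the sum of the two body heights, the connection length C sets the phase offset between the two height profiles and thereby the energy barrier the pair must overcome. The sinusoidal terrain and unit parameters are simplifying assumptions, not the paper's boulder-field model.

```python
import math

def energy_barrier(C, wavelength=1.0, samples=1000):
    """Barrier (max - min) of the pair's potential energy over one period."""
    def h(x):  # terrain height, simplified to a sinusoid
        return math.sin(2 * math.pi * x / wavelength)
    # rigid connection: body 2 sits a fixed distance C behind body 1,
    # so system potential energy is the sum of the two body heights
    energies = [h(i * wavelength / samples) + h(i * wavelength / samples + C)
                for i in range(samples)]
    return max(energies) - min(energies)
```

With C equal to half the terrain wavelength the two bodies' height variations cancel and the barrier vanishes, while C equal to a full wavelength doubles it, echoing the finding that particular connection lengths sustain movement while others do not.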
{"title":"Multi-robot connection towards collective obstacle field traversal","authors":"Haodi Hu, Xingjue Liao, Wuhao Du, Feifei Qian","doi":"arxiv-2409.11709","DOIUrl":"https://doi.org/arxiv-2409.11709","url":null,"abstract":"Environments with large terrain height variations present great challenges\u0000for legged robot locomotion. Drawing inspiration from fire ants' collective\u0000assembly behavior, we study strategies that can enable two ``connectable''\u0000robots to collectively navigate over bumpy terrains with height variations\u0000larger than robot leg length. Each robot was designed to be extremely simple,\u0000with a cubical body and one rotary motor actuating four vertical peg legs that\u0000move in pairs. Two or more robots could physically connect to one another to\u0000enhance collective mobility. We performed locomotion experiments with a\u0000two-robot group, across an obstacle field filled with uniformly-distributed\u0000semi-spherical ``boulders''. Experimentally-measured robot speed suggested that\u0000the connection length between the robots has a significant effect on collective\u0000mobility: connection length C in [0.86, 0.9] robot unit body length (UBL) were\u0000able to produce sustainable movements across the obstacle field, whereas\u0000connection length C in [0.63, 0.84] and [0.92, 1.1] UBL resulted in low\u0000traversability. An energy landscape based model revealed the underlying\u0000mechanism of how connection length modulated collective mobility through the\u0000system's potential energy landscape, and informed adaptation strategies for the\u0000two-robot system to adapt their connection length for traversing obstacle\u0000fields with varying spatial frequencies. Our results demonstrated that by\u0000varying the connection configuration between the robots, the two-robot system\u0000could leverage mechanical intelligence to better utilize obstacle interaction\u0000forces and produce improved locomotion. 
Going forward, we envision that\u0000generalized principles of robot-environment coupling can inform design and\u0000control strategies for a large group of small robots to achieve ant-like\u0000collective environment negotiation.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142266857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Imitation-based robot learning has recently gained significant attention in the robotics field due to its theoretical potential for transferability and generalizability. However, it remains notoriously costly in terms of both hardware and data collection, and deploying it in real-world environments demands meticulous setup of robots and precise experimental conditions. In this paper, we present a low-cost robot learning framework that is both easily reproducible and transferable to various robots and environments. We demonstrate that deployable imitation learning can be successfully applied even to industrial-grade robots, not just expensive collaborative robotic arms. Furthermore, our results show that multi-task robot learning is achievable with simple network architectures and fewer demonstrations than previously thought necessary. As current evaluation methods are largely subjective when it comes to real-world manipulation tasks, we propose Voting Positive Rate (VPR), a novel evaluation strategy that provides a more objective assessment of performance. We conduct an extensive comparison of success rates across various self-designed tasks to validate our approach. To foster collaboration and support the robot learning community, we have open-sourced all relevant datasets and model checkpoints, available at huggingface.co/ZhiChengAI.
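The abstract names Voting Positive Rate but does not give its formula; a plausible minimal reading, assumed here, scores a trial positive when a strict majority of evaluators votes it a success, and reports the fraction of positive trials.

```python
# Hedged sketch of a Voting Positive Rate style metric (the exact definition
# is not in the abstract; strict-majority voting per trial is an assumption).

def voting_positive_rate(trial_votes):
    """trial_votes: list of per-trial vote lists, each vote True/False."""
    positives = sum(
        1 for votes in trial_votes
        if sum(votes) * 2 > len(votes)   # strict majority votes "success"
    )
    return positives / len(trial_votes)
```

For example, four trials judged by three evaluators, of which two reach a success majority, give a VPR of 0.5.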
{"title":"Generalized Robot Learning Framework","authors":"Jiahuan Yan, Zhouyang Hong, Yu Zhao, Yu Tian, Yunxin Liu, Travis Davies, Luhui Hu","doi":"arxiv-2409.12061","DOIUrl":"https://doi.org/arxiv-2409.12061","url":null,"abstract":"Imitation based robot learning has recently gained significant attention in\u0000the robotics field due to its theoretical potential for transferability and\u0000generalizability. However, it remains notoriously costly, both in terms of\u0000hardware and data collection, and deploying it in real-world environments\u0000demands meticulous setup of robots and precise experimental conditions. In this\u0000paper, we present a low-cost robot learning framework that is both easily\u0000reproducible and transferable to various robots and environments. We\u0000demonstrate that deployable imitation learning can be successfully applied even\u0000to industrial-grade robots, not just expensive collaborative robotic arms.\u0000Furthermore, our results show that multi-task robot learning is achievable with\u0000simple network architectures and fewer demonstrations than previously thought\u0000necessary. As the current evaluating method is almost subjective when it comes\u0000to real-world manipulation tasks, we propose Voting Positive Rate (VPR) - a\u0000novel evaluation strategy that provides a more objective assessment of\u0000performance. We conduct an extensive comparison of success rates across various\u0000self-designed tasks to validate our approach. 
To foster collaboration and\u0000support the robot learning community, we have open-sourced all relevant\u0000datasets and model checkpoints, available at huggingface.co/ZhiChengAI.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jiawei Sun, Jiahui Li, Tingchen Liu, Chengran Yuan, Shuo Sun, Zefan Huang, Anthony Wong, Keng Peng Tee, Marcelo H. Ang Jr
We introduce RMP-YOLO, a unified framework designed to provide robust motion predictions even with incomplete input data. Our key insight stems from the observation that complete and reliable historical trajectory data plays a pivotal role in ensuring accurate motion prediction. Therefore, we propose a new paradigm that prioritizes the reconstruction of intact historical trajectories before feeding them into the prediction modules. Our approach introduces a novel scene tokenization module to enhance the extraction and fusion of spatial and temporal features. Following this, our proposed recovery module reconstructs agents' incomplete historical trajectories by leveraging local map topology and interactions with nearby agents. The reconstructed, clean historical data is then integrated into the downstream prediction modules. Our framework is able to effectively handle missing data of varying lengths and remains robust against observation noise, while maintaining high prediction accuracy. Furthermore, our recovery module is compatible with existing prediction models, ensuring seamless integration. Extensive experiments validate the effectiveness of our approach, and deployment in real-world autonomous vehicles confirms its practical utility. In the 2024 Waymo Motion Prediction Competition, our method, RMP-YOLO, achieves state-of-the-art performance, securing third place.
RMP-YOLO: A Robust Motion Predictor for Partially Observable Scenarios even if You Only Look Once
(arXiv - CS - Robotics, arxiv-2409.11696, published 2024-09-18)
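The recover-then-predict paradigm above can be illustrated with a toy stand-in: fill the missing steps of a partially observed 2D trajectory before handing it to any downstream predictor. The linear interpolation below is only a placeholder for RMP-YOLO's learned, map-aware recovery module, and every name in it is illustrative:

```python
# Toy illustration of the recover-then-predict paradigm: missing steps
# (None) in an agent's 2D history are filled before prediction. Linear
# interpolation stands in for the paper's learned recovery module that
# also uses map topology and nearby agents; this is not their method.
from typing import List, Optional, Tuple

Point = Tuple[float, float]

def recover_history(history: List[Optional[Point]]) -> List[Point]:
    """Fill None gaps by interpolating between observed neighbours,
    clamping to the nearest observation at either end."""
    known = [(i, p) for i, p in enumerate(history) if p is not None]
    if not known:
        raise ValueError("no observed points to recover from")
    out: List[Point] = []
    for i, p in enumerate(history):
        if p is not None:
            out.append(p)
            continue
        # nearest observed neighbour on each side (clamped at the ends)
        left = max((k for k in known if k[0] < i), default=known[0],
                   key=lambda k: k[0])
        right = min((k for k in known if k[0] > i), default=known[-1],
                    key=lambda k: k[0])
        if left[0] == right[0]:
            out.append(left[1])  # gap outside observed span: clamp
            continue
        t = (i - left[0]) / (right[0] - left[0])
        out.append((left[1][0] + t * (right[1][0] - left[1][0]),
                    left[1][1] + t * (right[1][1] - left[1][1])))
    return out

hist = [(0.0, 0.0), None, (2.0, 2.0), None]
print(recover_history(hist))  # -> [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0), (2.0, 2.0)]
```

The point of the paradigm is the separation of concerns: once the history is made whole, an unmodified prediction model can consume it, which is why the paper's recovery module can be bolted onto existing predictors.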