In many modern astronomical facilities, multi-object telescopes are crucial instruments. Most of these telescopes have thousands of robotic fiber positioners (RFPs) installed on their focal plane, sharing an overlapping workspace. Collisions between RFPs during their movement can leave some targets unreachable and can cause structural damage. It is therefore necessary to assess the collision probability of the RFPs in a principled way. In this study, we propose a mathematical model of collision probability and validate its results using Monte Carlo simulations. In addition, we propose a new collision calculation method that is substantially faster, requiring only about 0.15% of the original computation time. Simulation experiments verify that our method can evaluate the collision probability between RFPs with both equal and unequal arm lengths. We also find that adopting a target distribution based on a Poisson distribution reduces the collision probability by approximately 2.6% on average.
{"title":"General Methods for Evaluating Collision Probability of Different Types of Theta-phi Positioners","authors":"Baolong Chen, Jianping Wang, Zhigang Liu, Zengxiang Zhou, Hongzhuan Hu, Feifan Zhang","doi":"arxiv-2409.07288","DOIUrl":"https://doi.org/arxiv-2409.07288","url":null,"abstract":"In many modern astronomical facilities, multi-object telescopes are crucial\u0000instruments. Most of these telescopes have thousands of robotic fiber\u0000positioners(RFPs) installed on their focal plane, sharing an overlapping\u0000workspace. Collisions between RFPs during their movement can result in some\u0000targets becoming unreachable and cause structural damage. Therefore, it is\u0000necessary to reasonably assess and evaluate the collision probability of the\u0000RFPs. In this study, we propose a mathematical models of collision probability\u0000and validate its results using Monte Carlo simulations. In addition, a new\u0000collision calculation method is proposed for faster calculation(nearly 0.15% of\u0000original time). Simulation experiments have verified that our method can\u0000evaluate the collision probability between RFPs with both equal and unequal arm\u0000lengths. Additionally, we found that adopting a target distribution based on a\u0000Poisson distribution can reduce the collision probability by approximately 2.6%\u0000on average.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lucas C. Hanson, William H. Reinhardt, Scott Shrager, Tarunyaa Sivakumar, Marc Z. Miskin
Robots too small to see with the naked eye have evolved rapidly in recent years thanks to the incorporation of on-board microelectronics. Semiconductor circuits have been used in microrobots capable of executing controlled wireless steering, prescribed legged gait patterns, and user-triggered transitions between digital states. Yet these promising new capabilities have come at the steep price of complicated fabrication. Even though circuit components can be reliably built by semiconductor foundries, currently available actuators for electronically integrated microrobots are built with intricate multi-step cleanroom protocols and use mechanisms like articulated legs or bubble generators that are hard to design and control. Here, we present a propulsion system for electronically integrated microrobots that can be built with a single step of lithographic processing, readily integrates with microelectronics thanks to low-current, low-voltage operation (1 V, 10 nA), and yields robots that swim at speeds over one body length per second. Inspired by work on micromotors, these robots generate electric fields in the surrounding fluid, and by extension propulsive electrokinetic flows. The underlying physics is captured by a model in which robot speed is proportional to applied current, making design and control straightforward. As proof, we build basic robots that use on-board circuits and a closed-loop optical control scheme to navigate waypoints and move in coordinated swarms. Broadly, solid-state propulsion clears the way for robust, easy-to-manufacture, electronically controlled microrobots that operate reliably over months to years.
{"title":"Electrokinetic Propulsion for Electronically Integrated Microscopic Robots","authors":"Lucas C. Hanson, William H. Reinhardt, Scott Shrager, Tarunyaa Sivakumar, Marc Z. Miskin","doi":"arxiv-2409.07293","DOIUrl":"https://doi.org/arxiv-2409.07293","url":null,"abstract":"Robots too small to see by eye have rapidly evolved in recent years thanks to\u0000the incorporation of on-board microelectronics. Semiconductor circuits have\u0000been used in microrobots capable of executing controlled wireless steering,\u0000prescribed legged gait patterns, and user-triggered transitions between digital\u0000states. Yet these promising new capabilities have come at the steep price of\u0000complicated fabrication. Even though circuit components can be reliably built\u0000by semiconductor foundries, currently available actuators for electronically\u0000integrated microrobots are built with intricate multi-step cleanroom protocols\u0000and use mechanisms like articulated legs or bubble generators that are hard to\u0000design and control. Here, we present a propulsion system for electronically\u0000integrated microrobots that can be built with a single step of lithographic\u0000processing, readily integrates with microelectronics thanks to low current/low\u0000voltage operation (1V, 10nA), and yields robots that swim at speeds over one\u0000body length per second. Inspired by work on micromotors, these robots generate\u0000electric fields in a surrounding fluid, and by extension propulsive\u0000electrokinetic flows. The underlying physics is captured by a model in which\u0000robot speed is proportional to applied current, making design and control\u0000straightforward. As proof, we build basic robots that use on-board circuits and\u0000a closed-loop optical control scheme to navigate waypoints and move in\u0000coordinated swarms. Broadly, solid-state propulsion clears the way for robust,\u0000easy to manufacture, electronically controlled microrobots that operate\u0000reliably over months to years.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Interactive artificial intelligence in the motion control field is an interesting topic, especially when universal knowledge can adapt to multiple tasks and universal environments. Despite increasing efforts to apply transformers in Reinforcement Learning (RL), most existing approaches are limited by an offline training pipeline, which inhibits exploration and generalization. To address this limitation, we propose the Online Decision MetaMorphFormer (ODM) framework, which aims to achieve self-awareness, environment recognition, and action planning through a unified model architecture. Motivated by cognitive and behavioral psychology, an ODM agent is able to learn from others, recognize the world, and practice itself based on its own experience. ODM can be applied to an arbitrary agent with a multi-joint body, located in different environments, and trained with different types of tasks using large-scale pre-trained datasets. Through the use of pre-trained datasets, ODM can quickly warm up and learn the necessary knowledge to perform the desired task, while the target environment continues to reinforce the universal policy. Extensive online experiments as well as few-shot and zero-shot environmental tests are used to verify ODM's performance and generalization ability. Our results contribute to the study of general artificial intelligence in embodied and cognitive fields. Code, results, and video examples can be found at https://rlodm.github.io/odm/.
{"title":"Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence","authors":"Luo Ji, Runji Lin","doi":"arxiv-2409.07341","DOIUrl":"https://doi.org/arxiv-2409.07341","url":null,"abstract":"Interactive artificial intelligence in the motion control field is an\u0000interesting topic, especially when universal knowledge is adaptive to multiple\u0000tasks and universal environments. Despite there being increasing efforts in the\u0000field of Reinforcement Learning (RL) with the aid of transformers, most of them\u0000might be limited by the offline training pipeline, which prohibits exploration\u0000and generalization abilities. To address this limitation, we propose the\u0000framework of Online Decision MetaMorphFormer (ODM) which aims to achieve\u0000self-awareness, environment recognition, and action planning through a unified\u0000model architecture. Motivated by cognitive and behavioral psychology, an ODM\u0000agent is able to learn from others, recognize the world, and practice itself\u0000based on its own experience. ODM can also be applied to any arbitrary agent\u0000with a multi-joint body, located in different environments, and trained with\u0000different types of tasks using large-scale pre-trained datasets. Through the\u0000use of pre-trained datasets, ODM can quickly warm up and learn the necessary\u0000knowledge to perform the desired task, while the target environment continues\u0000to reinforce the universal policy. Extensive online experiments as well as\u0000few-shot and zero-shot environmental tests are used to verify ODM's performance\u0000and generalization ability. The results of our study contribute to the study of\u0000general artificial intelligence in embodied and cognitive fields. Code,\u0000results, and video examples can be found on the website\u0000url{https://rlodm.github.io/odm/}.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Specifying tasks for robotic systems traditionally requires coding expertise, deep domain knowledge, and significant time investment. While learning from demonstration offers a promising alternative, existing methods often struggle with longer-horizon tasks. To address this limitation, we introduce a computationally efficient approach for learning probabilistic deterministic finite automata (PDFA) that capture task structures and expert preferences directly from demonstrations. Our approach infers sub-goals and their temporal dependencies, producing an interpretable task specification that domain experts can easily understand and adjust. We validate the method through experiments involving object manipulation tasks, showcasing how it enables a robot arm to effectively replicate diverse expert strategies while adapting to changing conditions.
{"title":"Learning Task Specifications from Demonstrations as Probabilistic Automata","authors":"Mattijs Baert, Sam Leroux, Pieter Simoens","doi":"arxiv-2409.07091","DOIUrl":"https://doi.org/arxiv-2409.07091","url":null,"abstract":"Specifying tasks for robotic systems traditionally requires coding expertise,\u0000deep domain knowledge, and significant time investment. While learning from\u0000demonstration offers a promising alternative, existing methods often struggle\u0000with tasks of longer horizons. To address this limitation, we introduce a\u0000computationally efficient approach for learning probabilistic deterministic\u0000finite automata (PDFA) that capture task structures and expert preferences\u0000directly from demonstrations. Our approach infers sub-goals and their temporal\u0000dependencies, producing an interpretable task specification that domain experts\u0000can easily understand and adjust. We validate our method through experiments\u0000involving object manipulation tasks, showcasing how our method enables a robot\u0000arm to effectively replicate diverse expert strategies while adapting to\u0000changing conditions.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Christian Löwens, Thorben Funke, André Wagner, Alexandru Paul Condurache
Rigid point cloud registration is a fundamental problem that is highly relevant in robotics and autonomous driving. Nowadays, deep learning methods can be trained to match a pair of point clouds, given the transformation between them. However, this training is often not scalable due to the high cost of collecting ground-truth poses. Therefore, we present a self-distillation approach to learn point cloud registration in an unsupervised fashion. Here, each sample is passed to a teacher network while an augmented view is passed to a student network. The teacher includes a trainable feature extractor and a learning-free robust solver such as RANSAC. The solver forces consistency among correspondences and optimizes for the unsupervised inlier ratio, eliminating the need for ground-truth labels. Our approach simplifies the training procedure by removing the need for initial hand-crafted features or consecutive point cloud frames, as required by related methods. We show that our method not only surpasses them on the RGB-D benchmark 3DMatch but also generalizes well to automotive radar, where classical features adopted by others fail. The code is available at https://github.com/boschresearch/direg.
{"title":"Unsupervised Point Cloud Registration with Self-Distillation","authors":"Christian Löwens, Thorben Funke, André Wagner, Alexandru Paul Condurache","doi":"arxiv-2409.07558","DOIUrl":"https://doi.org/arxiv-2409.07558","url":null,"abstract":"Rigid point cloud registration is a fundamental problem and highly relevant\u0000in robotics and autonomous driving. Nowadays deep learning methods can be\u0000trained to match a pair of point clouds, given the transformation between them.\u0000However, this training is often not scalable due to the high cost of collecting\u0000ground truth poses. Therefore, we present a self-distillation approach to learn\u0000point cloud registration in an unsupervised fashion. Here, each sample is\u0000passed to a teacher network and an augmented view is passed to a student\u0000network. The teacher includes a trainable feature extractor and a learning-free\u0000robust solver such as RANSAC. The solver forces consistency among\u0000correspondences and optimizes for the unsupervised inlier ratio, eliminating\u0000the need for ground truth labels. Our approach simplifies the training\u0000procedure by removing the need for initial hand-crafted features or consecutive\u0000point cloud frames as seen in related methods. We show that our method not only\u0000surpasses them on the RGB-D benchmark 3DMatch but also generalizes well to\u0000automotive radar, where classical features adopted by others fail. The code is\u0000available at https://github.com/boschresearch/direg .","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Collision avoidance and trajectory planning are crucial in multi-robot systems, particularly in environments with numerous obstacles. Although extensive research has been conducted in this field, the challenge of rapid traversal through such environments has not been fully addressed. This paper tackles that challenge by proposing a novel real-time scheduling scheme designed to optimize the passage of multi-robot systems through complex, obstacle-rich maps. Inspired by network flow optimization, our scheme decomposes the environment into a network structure, enabling the efficient allocation of robots to paths based on real-time congestion data. The proposed scheduling planner operates on top of existing collision avoidance algorithms, focusing on minimizing traversal time by balancing robot detours and waiting times. Our simulation results demonstrate the efficiency of the proposed scheme, and we validated its effectiveness through real-world flight tests using ten quadrotors. This work contributes a lightweight, effective scheduling planner capable of meeting the real-time demands of multi-robot systems in obstacle-rich environments.
{"title":"Flow-Inspired Lightweight Multi-Robot Real-Time Scheduling Planner","authors":"Han Liu, Yu Jin, Tianjiang Hu, Kai Huang","doi":"arxiv-2409.06952","DOIUrl":"https://doi.org/arxiv-2409.06952","url":null,"abstract":"Collision avoidance and trajectory planning are crucial in multi-robot\u0000systems, particularly in environments with numerous obstacles. Although\u0000extensive research has been conducted in this field, the challenge of rapid\u0000traversal through such environments has not been fully addressed. This paper\u0000addresses this problem by proposing a novel real-time scheduling scheme\u0000designed to optimize the passage of multi-robot systems through complex,\u0000obstacle-rich maps. Inspired from network flow optimization, our scheme\u0000decomposes the environment into a network structure, enabling the efficient\u0000allocation of robots to paths based on real-time congestion data. The proposed\u0000scheduling planner operates on top of existing collision avoidance algorithms,\u0000focusing on minimizing traversal time by balancing robot detours and waiting\u0000times. Our simulation results demonstrate the efficiency of the proposed\u0000scheme. Additionally, we validated its effectiveness through real world flight\u0000tests using ten quadrotors. This work contributes a lightweight, effective\u0000scheduling planner capable of meeting the real-time demands of multi-robot\u0000systems in obstacle-rich environments.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xiaomin Lin, Vivek Mange, Arjun Suresh, Bernhard Neuberger, Aadi Palnitkar, Brendan Campbell, Alan Williams, Kleio Baxevani, Jeremy Mallette, Alhim Vera, Markus Vincze, Ioannis Rekleitis, Herbert G. Tanner, Yiannis Aloimonos
Oysters are a keystone species in coastal ecosystems, offering significant economic, environmental, and cultural benefits. However, current monitoring systems are often destructive, typically involving dredging to physically collect and count oysters. A nondestructive alternative is manual identification from video footage collected by divers, which is time-consuming, labor-intensive, and requires expert input. An alternative to human monitoring is the deployment of trained object detection models that perform real-time, on-edge oyster detection in the field. One such platform is the Aqua2 robot. Effective training of these models requires extensive high-quality data, which is difficult to obtain in marine settings. To address these complications, we introduce a novel method that leverages Stable Diffusion to generate high-quality synthetic data for the marine domain. We exploit diffusion models to create photorealistic marine imagery, using ControlNet inputs to ensure consistency with the segmentation ground-truth mask, the geometry of the scene, and the target domain of real underwater oyster images. The resulting dataset is used to train a YOLOv10-based vision model, achieving a state-of-the-art 0.657 mAP@50 for oyster detection on the Aqua2 platform. The system we introduce not only improves oyster habitat monitoring but also paves the way for autonomous surveillance across a range of marine tasks, improving aquaculture and conservation efforts.
{"title":"ODYSSEE: Oyster Detection Yielded by Sensor Systems on Edge Electronics","authors":"Xiaomin Lin, Vivek Mange, Arjun Suresh, Bernhard Neuberger, Aadi Palnitkar, Brendan Campbell, Alan Williams, Kleio Baxevani, Jeremy Mallette, Alhim Vera, Markus Vincze, Ioannis Rekleitis, Herbert G. Tanner, Yiannis Aloimonos","doi":"arxiv-2409.07003","DOIUrl":"https://doi.org/arxiv-2409.07003","url":null,"abstract":"Oysters are a keystone species in coastal ecosystems, offering significant\u0000economic, environmental, and cultural benefits. However, current monitoring\u0000systems are often destructive, typically involving dredging to physically\u0000collect and count oysters. A nondestructive alternative is manual\u0000identification from video footage collected by divers, which is time-consuming\u0000and labor-intensive with expert input. An alternative to human monitoring is the deployment of a system with trained\u0000object detection models that performs real-time, on edge oyster detection in\u0000the field. One such platform is the Aqua2 robot. Effective training of these\u0000models requires extensive high-quality data, which is difficult to obtain in\u0000marine settings. To address these complications, we introduce a novel method\u0000that leverages stable diffusion to generate high-quality synthetic data for the\u0000marine domain. We exploit diffusion models to create photorealistic marine\u0000imagery, using ControlNet inputs to ensure consistency with the segmentation\u0000ground-truth mask, the geometry of the scene, and the target domain of real\u0000underwater images for oysters. The resulting dataset is used to train a\u0000YOLOv10-based vision model, achieving a state-of-the-art 0.657 mAP@50 for\u0000oyster detection on the Aqua2 platform. The system we introduce not only\u0000improves oyster habitat monitoring, but also paves the way to autonomous\u0000surveillance for various tasks in marine contexts, improving aquaculture and\u0000conservation efforts.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper presents a Human-Robot Blind Handover architecture within the context of Human-Robot Collaboration (HRC). The focus lies on a blind handover scenario in which the operator intentionally faces away, focused on a task, and requires an object from the robot. In this context, it is imperative for the robot to autonomously manage the entire handover process. Key considerations include ensuring safety while handing the object to the operator's hand and detecting the proper timing to release the object. The article explores strategies to navigate these challenges, emphasizing the need for a robot to operate safely and independently in facilitating blind handovers, thereby contributing to the advancement of HRC protocols and fostering natural and efficient collaboration between humans and robots.
{"title":"Compliant Blind Handover Control for Human-Robot Collaboration","authors":"Davide Ferrari, Andrea Pupa, Cristian Secchi","doi":"arxiv-2409.07155","DOIUrl":"https://doi.org/arxiv-2409.07155","url":null,"abstract":"This paper presents a Human-Robot Blind Handover architecture within the\u0000context of Human-Robot Collaboration (HRC). The focus lies on a blind handover\u0000scenario where the operator is intentionally faced away, focused in a task, and\u0000requires an object from the robot. In this context, it is imperative for the\u0000robot to autonomously manage the entire handover process. Key considerations\u0000include ensuring safety while handing the object to the operator's hand, and\u0000detect the proper timing to release the object. The article explores strategies\u0000to navigate these challenges, emphasizing the need for a robot to operate\u0000safely and independently in facilitating blind handovers, thereby contributing\u0000to the advancement of HRC protocols and fostering a natural and efficient\u0000collaboration between humans and robots.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Eugenio Chisari, Nick Heppert, Max Argus, Tim Welschehold, Thomas Brox, Abhinav Valada
Learning from expert demonstrations is a promising approach for training robotic manipulation policies from limited data. However, imitation learning algorithms require a number of design choices, ranging from the input modality and training objective to the 6-DoF end-effector pose representation. Diffusion-based methods have gained popularity as they enable predicting long-horizon trajectories and handle multimodal action distributions. Recently, Conditional Flow Matching (CFM), also known as Rectified Flow, has been proposed as a more flexible generalization of diffusion models. In this paper, we investigate the application of CFM in the context of robotic policy learning and specifically study its interplay with the other design choices required to build an imitation learning algorithm. We show that CFM gives the best performance when combined with point cloud input observations. Additionally, we study the feasibility of a CFM formulation on the SO(3) manifold and evaluate its suitability with a simplified example. We perform extensive experiments on RLBench which demonstrate that our proposed PointFlowMatch approach achieves a state-of-the-art average success rate of 67.8% over eight tasks, double the performance of the next-best method.
{"title":"Learning Robotic Manipulation Policies from Point Clouds with Conditional Flow Matching","authors":"Eugenio Chisari, Nick Heppert, Max Argus, Tim Welschehold, Thomas Brox, Abhinav Valada","doi":"arxiv-2409.07343","DOIUrl":"https://doi.org/arxiv-2409.07343","url":null,"abstract":"Learning from expert demonstrations is a promising approach for training\u0000robotic manipulation policies from limited data. However, imitation learning\u0000algorithms require a number of design choices ranging from the input modality,\u0000training objective, and 6-DoF end-effector pose representation. Diffusion-based\u0000methods have gained popularity as they enable predicting long-horizon\u0000trajectories and handle multimodal action distributions. Recently, Conditional\u0000Flow Matching (CFM) (or Rectified Flow) has been proposed as a more flexible\u0000generalization of diffusion models. In this paper, we investigate the\u0000application of CFM in the context of robotic policy learning and specifically\u0000study the interplay with the other design choices required to build an\u0000imitation learning algorithm. We show that CFM gives the best performance when\u0000combined with point cloud input observations. Additionally, we study the\u0000feasibility of a CFM formulation on the SO(3) manifold and evaluate its\u0000suitability with a simplified example. We perform extensive experiments on\u0000RLBench which demonstrate that our proposed PointFlowMatch approach achieves a\u0000state-of-the-art average success rate of 67.8% over eight tasks, double the\u0000performance of the next best method.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Diffusion models have been widely employed in the field of 3D manipulation due to their ability to learn distributions efficiently, allowing for precise prediction of action trajectories. However, diffusion models typically rely on UNet backbones with large parameter counts as policy networks, which can be challenging to deploy on resource-constrained devices. Recently, the Mamba model has emerged as a promising solution for efficient modeling, offering low computational complexity and strong performance in sequence modeling. In this work, we propose the Mamba Policy, a lighter but stronger policy that reduces the parameter count by over 80% compared to the original policy network while achieving superior performance. Specifically, we introduce the XMamba Block, which effectively integrates input information with conditional features and leverages a combination of Mamba and attention mechanisms for deep feature extraction. Extensive experiments demonstrate that the Mamba Policy excels on the Adroit, DexArt, and MetaWorld datasets while requiring significantly fewer computational resources. Additionally, we highlight the Mamba Policy's enhanced robustness in long-horizon scenarios compared to baseline methods and explore the performance of various Mamba variants within the Mamba Policy framework. Our project page is at https://andycao1125.github.io/mamba_policy/.
{"title":"Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models","authors":"Jiahang Cao, Qiang Zhang, Jingkai Sun, Jiaxu Wang, Hao Cheng, Yulin Li, Jun Ma, Yecheng Shao, Wen Zhao, Gang Han, Yijie Guo, Renjing Xu","doi":"arxiv-2409.07163","DOIUrl":"https://doi.org/arxiv-2409.07163","url":null,"abstract":"Diffusion models have been widely employed in the field of 3D manipulation\u0000due to their efficient capability to learn distributions, allowing for precise\u0000prediction of action trajectories. However, diffusion models typically rely on\u0000large parameter UNet backbones as policy networks, which can be challenging to\u0000deploy on resource-constrained devices. Recently, the Mamba model has emerged\u0000as a promising solution for efficient modeling, offering low computational\u0000complexity and strong performance in sequence modeling. In this work, we\u0000propose the Mamba Policy, a lighter but stronger policy that reduces the\u0000parameter count by over 80% compared to the original policy network while\u0000achieving superior performance. Specifically, we introduce the XMamba Block,\u0000which effectively integrates input information with conditional features and\u0000leverages a combination of Mamba and Attention mechanisms for deep feature\u0000extraction. Extensive experiments demonstrate that the Mamba Policy excels on\u0000the Adroit, Dexart, and MetaWorld datasets, requiring significantly fewer\u0000computational resources. Additionally, we highlight the Mamba Policy's enhanced\u0000robustness in long-horizon scenarios compared to baseline methods and explore\u0000the performance of various Mamba variants within the Mamba Policy framework.\u0000Our project page is in https://andycao1125.github.io/mamba_policy/.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}