首页 > 最新文献

Autonomous Robots最新文献

英文 中文
ProgPrompt: program generation for situated robot task planning using large language models ProgPrompt:使用大型语言模型生成定位机器人任务规划的程序
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-28 DOI: 10.1007/s10514-023-10135-3
Ishika Singh, Valts Blukis, Arsalan Mousavian, Ankit Goyal, Danfei Xu, Jonathan Tremblay, Dieter Fox, Jesse Thomason, Animesh Garg

Task planning can require defining myriad domain knowledge about the world in which a robot needs to act. To ameliorate that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even generate action sequences directly, given an instruction in natural language with no additional domain information. However, such methods either require enumerating all possible next steps for scoring, or generate free-form text that may contain actions not possible on a given robot in its current context. We present a programmatic LLM prompt structure that enables plan generation functional across situated environments, robot capabilities, and tasks. Our key insight is to prompt the LLM with program-like specifications of the available actions and objects in an environment, as well as with example programs that can be executed. We make concrete recommendations about prompt structure and generation constraints through ablation experiments, demonstrate state of the art success rates in VirtualHome household tasks, and deploy our method on a physical robot arm for tabletop tasks. Website and code at progprompt.github.io

任务规划可能需要定义关于机器人需要在其中行动的世界的无数领域知识。为了改进这种工作,可以使用大型语言模型(llm)在任务规划期间对潜在的下一步动作进行评分,甚至可以直接生成动作序列,在没有附加领域信息的自然语言中给出指令。然而,这种方法要么需要列举所有可能的下一步得分,要么生成自由格式的文本,其中可能包含给定机器人在当前上下文中不可能执行的动作。我们提出了一个程序化的LLM提示结构,使计划生成功能能够跨越环境、机器人能力和任务。我们的主要见解是用环境中可用操作和对象的类似程序的规范,以及可以执行的示例程序来提示LLM。我们通过消融实验提出了关于提示结构和生成约束的具体建议,展示了VirtualHome家庭任务中最先进的成功率,并将我们的方法部署在用于桌面任务的物理机械臂上。网站和代码在progprompt.github.io
{"title":"ProgPrompt: program generation for situated robot task planning using large language models","authors":"Ishika Singh,&nbsp;Valts Blukis,&nbsp;Arsalan Mousavian,&nbsp;Ankit Goyal,&nbsp;Danfei Xu,&nbsp;Jonathan Tremblay,&nbsp;Dieter Fox,&nbsp;Jesse Thomason,&nbsp;Animesh Garg","doi":"10.1007/s10514-023-10135-3","DOIUrl":"10.1007/s10514-023-10135-3","url":null,"abstract":"<div><p>Task planning can require defining myriad domain knowledge about the world in which a robot needs to act. To ameliorate that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even generate action sequences directly, given an instruction in natural language with no additional domain information. However, such methods either require enumerating all possible next steps for scoring, or generate free-form text that may contain actions not possible on a given robot in its current context. We present a programmatic LLM prompt structure that enables plan generation functional across situated environments, robot capabilities, and tasks. Our key insight is to prompt the LLM with program-like specifications of the available actions and objects in an environment, as well as with example <span>programs</span> that can be executed. We make concrete recommendations about prompt structure and generation constraints through ablation experiments, demonstrate state of the art success rates in VirtualHome household tasks, and deploy our method on a physical robot arm for tabletop tasks. Website and code at progprompt.github.io</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 8","pages":"999 - 1012"},"PeriodicalIF":3.5,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-023-10135-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48320797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Learning scalable and efficient communication policies for multi-robot collision avoidance 学习可扩展且高效的多机器人防撞通信策略
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-19 DOI: 10.1007/s10514-023-10127-3
Álvaro Serra-Gómez, Hai Zhu, Bruno Brito, Wendelin Böhmer, Javier Alonso-Mora

Decentralized multi-robot systems typically perform coordinated motion planning by constantly broadcasting their intentions to avoid collisions. However, the risk of collision between robots varies as they move and communication may not always be needed. This paper presents an efficient communication method that addresses the problem of “when” and “with whom” to communicate in multi-robot collision avoidance scenarios. In this approach, each robot learns to reason about other robots’ states and considers the risk of future collisions before asking for the trajectory plans of other robots. We introduce a new neural architecture for the learned communication policy which allows our method to be scalable. We evaluate and verify the proposed communication strategy in simulation with up to twelve quadrotors, and present results on the zero-shot generalization/robustness capabilities of the policy in different scenarios. We demonstrate that our policy (learned in a simulated environment) can be successfully transferred to real robots.

分散式多机器人系统通常通过不断地广播它们的意图来执行协调运动规划,以避免碰撞。然而,机器人之间发生碰撞的风险随着它们的移动而变化,而且通信可能并不总是需要的。本文提出了一种有效的通信方法,解决了多机器人避碰场景中“何时”和“与谁”进行通信的问题。在这种方法中,每个机器人学习推理其他机器人的状态,并在询问其他机器人的轨迹计划之前考虑未来碰撞的风险。我们为学习到的通信策略引入了一种新的神经结构,使我们的方法具有可扩展性。我们在多达12个四旋翼机的仿真中评估和验证了所提出的通信策略,并给出了该策略在不同场景下的零射击泛化/鲁棒性能力的结果。我们证明了我们的策略(在模拟环境中学习)可以成功地转移到真实的机器人上。
{"title":"Learning scalable and efficient communication policies for multi-robot collision avoidance","authors":"Álvaro Serra-Gómez,&nbsp;Hai Zhu,&nbsp;Bruno Brito,&nbsp;Wendelin Böhmer,&nbsp;Javier Alonso-Mora","doi":"10.1007/s10514-023-10127-3","DOIUrl":"10.1007/s10514-023-10127-3","url":null,"abstract":"<div><p>Decentralized multi-robot systems typically perform coordinated motion planning by constantly broadcasting their intentions to avoid collisions. However, the risk of collision between robots varies as they move and communication may not always be needed. This paper presents an efficient communication method that addresses the problem of “when” and “with whom” to communicate in multi-robot collision avoidance scenarios. In this approach, each robot learns to reason about other robots’ states and considers the risk of future collisions before asking for the trajectory plans of other robots. We introduce a new neural architecture for the learned communication policy which allows our method to be scalable. We evaluate and verify the proposed communication strategy in simulation with up to twelve quadrotors, and present results on the zero-shot generalization/robustness capabilities of the policy in different scenarios. We demonstrate that our policy (learned in a simulated environment) can be successfully transferred to real robots.\u0000</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 8","pages":"1275 - 1297"},"PeriodicalIF":3.5,"publicationDate":"2023-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-023-10127-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46045076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Humans as path-finders for mobile robots using teach-by-showing navigation 人类作为移动机器人的寻路者,使用指示式导航
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-17 DOI: 10.1007/s10514-023-10125-5
Alessandro Antonucci, Paolo Bevilacqua, Stefano Leonardi, Luigi Paolopoli, Daniele Fontanelli

One of the most important barriers towards a widespread use of mobile robots in unstructured, human populated and possibly a-priori unknown work environments is the ability to plan a safe path. In this paper, we propose to delegate this activity to a human operator that walks in front of the robot marking with her/his footsteps the path to be followed. The implementation of this approach requires a high degree of robustness in locating the specific person to be followed (the path-finder). We propose a three phases approach to fulfil this goal: 1. Identification and tracking of the person in the image space, 2. Sensor fusion between camera data and laser sensors, 3. Point interpolation with continuous curvature paths. The approach is described in the paper and extensively validated with experimental results.

在非结构化、人口密集和可能先验未知的工作环境中广泛使用移动机器人的最重要障碍之一是规划安全路径的能力。在本文中,我们建议将这项活动委托给一名人类操作员,该操作员走在机器人前面,用她/他的脚步标记要遵循的路径。这种方法的实现需要在定位要跟随的特定人员(寻路者)方面具有高度的健壮性。我们建议分三个阶段实现这一目标:1 .图像空间中人的识别与跟踪;2 .相机数据与激光传感器的融合;具有连续曲率路径的点插值。本文对该方法进行了描述,并用实验结果进行了广泛的验证。
{"title":"Humans as path-finders for mobile robots using teach-by-showing navigation","authors":"Alessandro Antonucci,&nbsp;Paolo Bevilacqua,&nbsp;Stefano Leonardi,&nbsp;Luigi Paolopoli,&nbsp;Daniele Fontanelli","doi":"10.1007/s10514-023-10125-5","DOIUrl":"10.1007/s10514-023-10125-5","url":null,"abstract":"<div><p>One of the most important barriers towards a widespread use of mobile robots in unstructured, human populated and possibly a-priori unknown work environments is the ability to plan a safe path. In this paper, we propose to delegate this activity to a human operator that walks in front of the robot marking with her/his footsteps the path to be followed. The implementation of this approach requires a high degree of robustness in locating the specific person to be followed (the <i>path-finder</i>). We propose a three phases approach to fulfil this goal: 1. Identification and tracking of the person in the image space, 2. Sensor fusion between camera data and laser sensors, 3. Point interpolation with continuous curvature paths. The approach is described in the paper and extensively validated with experimental results.\u0000</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 8","pages":"1255 - 1273"},"PeriodicalIF":3.5,"publicationDate":"2023-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-023-10125-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43207995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pseudo-trilateral adversarial training for domain adaptive traversability prediction 伪三边对抗训练域自适应可穿越性预测
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-02 DOI: 10.1007/s10514-023-10123-7
Zheng Chen, Durgakant Pushp, Jason M. Gregory, Lantao Liu

Traversability prediction is a fundamental perception capability for autonomous navigation. Deep neural networks (DNNs) have been widely used to predict traversability during the last decade. The performance of DNNs is significantly boosted by exploiting a large amount of data. However, the diversity of data in different domains imposes significant gaps in the prediction performance. In this work, we make efforts to reduce the gaps by proposing a novel pseudo-trilateral adversarial model that adopts a coarse-to-fine alignment (CALI) to perform unsupervised domain adaptation (UDA). Our aim is to transfer the perception model with high data efficiency, eliminate the prohibitively expensive data labeling, and improve the generalization capability during the adaptation from easy-to-access source domains to various challenging target domains. Existing UDA methods usually adopt a bilateral zero-sum game structure. We prove that our CALI model—a pseudo-trilateral game structure is advantageous over existing bilateral game structures. This proposed work bridges theoretical analyses and algorithm designs, leading to an efficient UDA model with easy and stable training. We further develop a variant of CALI—Informed CALI, which is inspired by the recent success of mixup data augmentation techniques and mixes informative regions based on the results of CALI. This mixture step provides an explicit bridging between the two domains and exposes under-performing classes more during training. We show the superiorities of our proposed models over multiple baselines in several challenging domain adaptation setups. To further validate the effectiveness of our proposed models, we then combine our perception model with a visual planner to build a navigation system and show the high reliability of our model in complex natural environments.

可穿越性预测是自主导航的一项基本感知能力。在过去的十年中,深度神经网络(dnn)被广泛用于预测可穿越性。通过利用大量数据,深度神经网络的性能得到了显著提高。然而,不同领域数据的多样性在预测性能上造成了很大的差距。在这项工作中,我们通过提出一种新的伪三边对抗模型来减少差距,该模型采用粗精对齐(CALI)来执行无监督域自适应(UDA)。我们的目标是以高数据效率转移感知模型,消除过于昂贵的数据标记,并提高从易于访问的源域到各种具有挑战性的目标域的适应过程中的泛化能力。现有的UDA方法通常采用双边零和博弈结构。我们证明了我们的CALI模型——一个伪三边博弈结构比现有的双边博弈结构更有优势。本文将理论分析与算法设计相结合,得到了一个训练简单、稳定、高效的UDA模型。我们进一步开发了一种CALI - informed CALI的变体,它受到最近混合数据增强技术的成功启发,并基于CALI的结果混合了信息区域。这个混合步骤提供了两个领域之间的显式桥梁,并在训练期间更多地暴露了表现不佳的类。我们在几个具有挑战性的领域适应设置中展示了我们提出的模型在多个基线上的优势。为了进一步验证我们提出的模型的有效性,我们将我们的感知模型与视觉规划器结合起来构建导航系统,并在复杂的自然环境中展示了我们的模型的高可靠性。
{"title":"Pseudo-trilateral adversarial training for domain adaptive traversability prediction","authors":"Zheng Chen,&nbsp;Durgakant Pushp,&nbsp;Jason M. Gregory,&nbsp;Lantao Liu","doi":"10.1007/s10514-023-10123-7","DOIUrl":"10.1007/s10514-023-10123-7","url":null,"abstract":"<div><p>Traversability prediction is a fundamental perception capability for autonomous navigation. Deep neural networks (DNNs) have been widely used to predict traversability during the last decade. The performance of DNNs is significantly boosted by exploiting a large amount of data. However, the diversity of data in different domains imposes significant gaps in the prediction performance. In this work, we make efforts to reduce the gaps by proposing a novel pseudo-trilateral adversarial model that adopts a coarse-to-fine alignment (CALI) to perform <i>unsupervised domain adaptation</i> (UDA). Our aim is to transfer the perception model with high data efficiency, eliminate the prohibitively expensive data labeling, and improve the generalization capability during the adaptation from easy-to-access <i>source domains</i> to various challenging <i>target domains</i>. Existing UDA methods usually adopt a bilateral zero-sum game structure. We prove that our CALI model—a pseudo-trilateral game structure is advantageous over existing bilateral game structures. This proposed work bridges theoretical analyses and algorithm designs, leading to an efficient UDA model with easy and stable training. We further develop a variant of CALI—Informed CALI, which is inspired by the recent success of mixup data augmentation techniques and mixes informative regions based on the results of CALI. This mixture step provides an explicit bridging between the two domains and exposes under-performing classes more during training. We show the superiorities of our proposed models over multiple baselines in several challenging domain adaptation setups. To further validate the effectiveness of our proposed models, we then combine our perception model with a visual planner to build a navigation system and show the high reliability of our model in complex natural environments.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 8","pages":"1155 - 1174"},"PeriodicalIF":3.5,"publicationDate":"2023-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135016477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Design and control of BRAVER: a bipedal robot actuated via proprioceptive electric motors 本体感觉电机驱动的两足机器人BRAVER的设计与控制
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-23 DOI: 10.1007/s10514-023-10117-5
Zhengguo Zhu, Weiliang Zhu, Guoteng Zhang, Teng Chen, Yibin Li, Xuewen Rong, Rui Song, Daoling Qin, Qiang Hua, Shugen Ma

This paper presents the design and control of a high-speed running bipedal robot, BRAVER. The robot, which weighs 8.6 kg and is 0.36 m tall, has six active degrees, all of which are driven by custom back-driveable modular actuators, which enable high-bandwidth force control and proprioceptive torque feedback. We present the details of the hardware design, including the actuator, leg, foot, and onboard control systems, as well as the locomotion controller design for high dynamic tasks and improving robustness. We have demonstrated the performance of BRAVER using a series of experiments, including multi-terrains walking, up and down 15(^{circ }) slopes, pushing recovery, and running. The maximum running speed of BRAVER reaches 1.75 m/s.

本文介绍了一种高速奔跑双足机器人BRAVER的设计与控制。该机器人重8.6公斤,高0.36米,有6个主动度,全部由定制的可反向驱动的模块化致动器驱动,从而实现高带宽力控制和本体感觉扭矩反馈。我们介绍了硬件设计的细节,包括执行器、腿、脚和车载控制系统,以及高动态任务和提高鲁棒性的运动控制器设计。我们已经通过一系列实验证明了BRAVER的性能,包括多地形行走,上下15个(^{circ })斜坡,推动恢复和跑步。BRAVER的最大运行速度可达1.75 m/s。
{"title":"Design and control of BRAVER: a bipedal robot actuated via proprioceptive electric motors","authors":"Zhengguo Zhu,&nbsp;Weiliang Zhu,&nbsp;Guoteng Zhang,&nbsp;Teng Chen,&nbsp;Yibin Li,&nbsp;Xuewen Rong,&nbsp;Rui Song,&nbsp;Daoling Qin,&nbsp;Qiang Hua,&nbsp;Shugen Ma","doi":"10.1007/s10514-023-10117-5","DOIUrl":"10.1007/s10514-023-10117-5","url":null,"abstract":"<div><p>This paper presents the design and control of a high-speed running bipedal robot, BRAVER. The robot, which weighs 8.6 kg and is 0.36 m tall, has six active degrees, all of which are driven by custom back-driveable modular actuators, which enable high-bandwidth force control and proprioceptive torque feedback. We present the details of the hardware design, including the actuator, leg, foot, and onboard control systems, as well as the locomotion controller design for high dynamic tasks and improving robustness. We have demonstrated the performance of BRAVER using a series of experiments, including multi-terrains walking, up and down 15<span>(^{circ })</span> slopes, pushing recovery, and running. The maximum running speed of BRAVER reaches 1.75 m/s.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 8","pages":"1229 - 1243"},"PeriodicalIF":3.5,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43379062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An empirical characterization of ODE models of swarm behaviors in common foraging scenarios 常见觅食情景下群体行为的ODE模型的经验表征
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-23 DOI: 10.1007/s10514-023-10121-9
John Harwell, Angel Sylvester, Maria Gini

There is a large class of real-world problems, such as warehouse transport, at different scales, swarm densities, etc., that can be characterized as Central Place Foraging Problems (CPFPs). We contribute to swarm engineering by designing an Ordinary Differential Equation (ODE) model that strives to capture the underlying behavioral dynamics of the CPFP in these application areas. Our simulation results show that a hybrid ODE modeling approach combining analytic parameter calculations and post-hoc (i.e., after running experiments) parameter fitting can be just as effective as a purely post-hoc approach to computing parameters via simulations, while requiring less tuning and iterative refinement. This makes it easier to design systems with provable bounds on behavior. Additionally, the resulting model parameters are more understandable because their values can be traced back to problem features, such as system size, robot control algorithm, etc. Finally, we perform real-robot experiments to further understand the limits of our model from an engineering standpoint.

在现实世界中,有大量的问题,如仓库运输,在不同的规模,群体密度等,可以被描述为中心地点觅食问题(CPFPs)。我们通过设计一个常微分方程(ODE)模型来为群体工程做出贡献,该模型努力捕捉CPFP在这些应用领域的潜在行为动态。我们的模拟结果表明,结合分析参数计算和事后(即运行实验后)参数拟合的混合ODE建模方法可以与通过模拟计算参数的纯粹事后方法一样有效,同时需要更少的调整和迭代改进。这使得设计具有可证明行为界限的系统变得更加容易。此外,得到的模型参数更容易理解,因为它们的值可以追溯到问题特征,如系统大小、机器人控制算法等。最后,我们进行了真实的机器人实验,从工程的角度进一步了解我们的模型的局限性。
{"title":"An empirical characterization of ODE models of swarm behaviors in common foraging scenarios","authors":"John Harwell,&nbsp;Angel Sylvester,&nbsp;Maria Gini","doi":"10.1007/s10514-023-10121-9","DOIUrl":"10.1007/s10514-023-10121-9","url":null,"abstract":"<div><p>There is a large class of real-world problems, such as warehouse transport, at different scales, swarm densities, etc., that can be characterized as Central Place Foraging Problems (CPFPs). We contribute to swarm engineering by designing an Ordinary Differential Equation (ODE) model that strives to capture the underlying behavioral dynamics of the CPFP in these application areas. Our simulation results show that a hybrid ODE modeling approach combining analytic parameter calculations and post-hoc (i.e., after running experiments) parameter fitting can be just as effective as a purely post-hoc approach to computing parameters via simulations, while requiring less tuning and iterative refinement. This makes it easier to design systems with provable bounds on behavior. Additionally, the resulting model parameters are more understandable because their values can be traced back to problem features, such as system size, robot control algorithm, etc. Finally, we perform real-robot experiments to further understand the limits of our model from an engineering standpoint.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 7","pages":"963 - 977"},"PeriodicalIF":3.5,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43611429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-frame rate homography and visual odometry by tracking binary features from the focal plane 通过焦平面跟踪二进制特征实现高帧率单应性和视觉里程计
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-22 DOI: 10.1007/s10514-023-10122-8
Riku Murai, Sajad Saeedi, Paul H. J. Kelly

Robotics faces a long-standing obstacle in which the speed of the vision system’s scene understanding is insufficient, impeding the robot’s ability to perform agile tasks. Consequently, robots must often rely on interpolation and extrapolation of the vision data to accomplish tasks in a timely and effective manner. One of the primary reasons for these delays is the analog-to-digital conversion that occurs on a per-pixel basis across the image sensor, along with the transfer of pixel-intensity information to the host device. This results in significant delays and power consumption in modern visual processing pipelines. The SCAMP-5—a general-purpose Focal-plane Sensor-processor array (FPSP)—used in this research performs computations in the analog domain prior to analog-to-digital conversion. By extracting features from the image on the focal plane, the amount of data that needs to be digitised and transferred is reduced. This allows for a high frame rate and low energy consumption for the SCAMP-5. The focus of our work is on localising the camera within the scene, which is crucial for scene understanding and for any downstream robotics tasks. We present a localisation system that utilise the FPSP in two parts. First, a 6-DoF odometry system is introduced, which efficiently estimates its position against a known marker at over 400 FPS. Second, our work is extended to implement BIT-VO—6-DoF visual odometry system which operates under an unknown natural environment at 300 FPS.

机器人技术长期面临着视觉系统对场景理解速度不足的问题,这阻碍了机器人执行敏捷任务的能力。因此,机器人必须经常依靠视觉数据的插值和外推来及时有效地完成任务。造成这些延迟的主要原因之一是在图像传感器上以每个像素为基础发生的模数转换,以及将像素强度信息传输到主机设备。这在现代视觉处理管道中导致了显著的延迟和功耗。在本研究中使用的scamp -5 -一种通用焦平面传感器处理器阵列(FPSP) -在模数转换之前执行模拟域的计算。通过在焦平面上提取图像的特征,减少了需要数字化和传输的数据量。这使得SCAMP-5具有高帧率和低能耗。我们的工作重点是在场景中定位相机,这对于场景理解和任何下游机器人任务至关重要。我们提出了一个利用FPSP的定位系统,分为两部分。首先,引入了一个6自由度测程系统,该系统可以以超过400 FPS的速度根据已知标记有效地估计其位置。其次,我们的工作扩展到实现BIT-VO-6-DoF视觉里程计系统,该系统在未知自然环境下以300 FPS运行。
{"title":"High-frame rate homography and visual odometry by tracking binary features from the focal plane","authors":"Riku Murai,&nbsp;Sajad Saeedi,&nbsp;Paul H. J. Kelly","doi":"10.1007/s10514-023-10122-8","DOIUrl":"10.1007/s10514-023-10122-8","url":null,"abstract":"<div><p>Robotics faces a long-standing obstacle in which the speed of the vision system’s scene understanding is insufficient, impeding the robot’s ability to perform agile tasks. Consequently, robots must often rely on interpolation and extrapolation of the vision data to accomplish tasks in a timely and effective manner. One of the primary reasons for these delays is the analog-to-digital conversion that occurs on a per-pixel basis across the image sensor, along with the transfer of pixel-intensity information to the host device. This results in significant delays and power consumption in modern visual processing pipelines. The SCAMP-5—a general-purpose Focal-plane Sensor-processor array (FPSP)—used in this research performs computations in the analog domain prior to analog-to-digital conversion. By extracting features from the image on the focal plane, the amount of data that needs to be digitised and transferred is reduced. This allows for a high frame rate and low energy consumption for the SCAMP-5. The focus of our work is on localising the camera within the scene, which is crucial for scene understanding and for any downstream robotics tasks. We present a localisation system that utilise the FPSP in two parts. First, a 6-DoF odometry system is introduced, which efficiently estimates its position against a known marker at over 400 FPS. Second, our work is extended to implement BIT-VO—6-DoF visual odometry system which operates under an unknown natural environment at 300 FPS.\u0000</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 8","pages":"1579 - 1592"},"PeriodicalIF":3.5,"publicationDate":"2023-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-023-10122-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47385185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Robust bounded control scheme for quadrotor vehicles under high dynamic disturbances 高动态扰动下四旋翼飞行器的鲁棒有界控制方法
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-21 DOI: 10.1007/s10514-023-10124-6
J. Betancourt, P. Castillo, P. García, V. Balaguer, R. Lozano

In this paper, an optimal bounded robust control algorithm for secure autonomous navigation in quadcopter vehicles is proposed. The controller is developed combining two parts; one dedicated to stabilize the closed-loop system and the second one for dealing and estimating external disturbances as well unknown nonlinearities inherent to the real system’s operations. For bounding the energy used by the system during a mission and, without losing its robustness properties, the quadratic problem formulation is used considering the actuators system constraints. The resulting optimal bounded control scheme improves considerably the stability and robustness of the closed-loop system and at the same time bounds the motor control inputs. The controller is validated in real-time flights and in unconventional conditions for high wind-gusts and Loss of Effectiveness in two rotors. The experimental results demonstrate the good performance of the proposed controller in both scenarios.

提出了一种四轴飞行器安全自主导航的最优有界鲁棒控制算法。控制器由两部分组成;一种致力于稳定闭环系统,另一种用于处理和估计外部干扰以及实际系统运行中固有的未知非线性。考虑致动器系统的约束条件,采用二次问题的形式,在保证系统鲁棒性的前提下限定系统在执行任务时的能量消耗。所得到的最优有界控制方案大大提高了闭环系统的稳定性和鲁棒性,同时对电机控制输入有界。该控制器在实时飞行和非常规条件下进行了验证,以应对大风和双旋翼的有效性损失。实验结果表明,该控制器在两种情况下都具有良好的性能。
{"title":"Robust bounded control scheme for quadrotor vehicles under high dynamic disturbances","authors":"J. Betancourt,&nbsp;P. Castillo,&nbsp;P. García,&nbsp;V. Balaguer,&nbsp;R. Lozano","doi":"10.1007/s10514-023-10124-6","DOIUrl":"10.1007/s10514-023-10124-6","url":null,"abstract":"<div><p>In this paper, an optimal bounded robust control algorithm for secure autonomous navigation in quadcopter vehicles is proposed. The controller is developed combining two parts; one dedicated to stabilize the closed-loop system and the second one for dealing and estimating external disturbances as well unknown nonlinearities inherent to the real system’s operations. For bounding the energy used by the system during a mission and, without losing its robustness properties, the quadratic problem formulation is used considering the actuators system constraints. The resulting optimal bounded control scheme improves considerably the stability and robustness of the closed-loop system and at the same time bounds the motor control inputs. The controller is validated in real-time flights and in unconventional conditions for high wind-gusts and Loss of Effectiveness in two rotors. The experimental results demonstrate the good performance of the proposed controller in both scenarios.\u0000</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 8","pages":"1245 - 1254"},"PeriodicalIF":3.5,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42302811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards neuromorphic FPGA-based infrastructures for a robotic arm 面向基于神经形态FPGA的机械臂基础设施
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-14 DOI: 10.1007/s10514-023-10111-x
Salvador Canas-Moreno, Enrique Piñero-Fuentes, Antonio Rios-Navarro, Daniel Cascado-Caballero, Fernando Perez-Peña, Alejandro Linares-Barranco

Muscles are stretched with bursts of spikes that come from motor neurons connected to the cerebellum through the spinal cord. Then, alpha motor neurons directly innervate the muscles to complete the motor command coming from upper biological structures. Nevertheless, classical robotic systems usually require complex computational capabilities and relative high-power consumption to process their control algorithm, which requires information from the robot’s proprioceptive sensors. The way in which the information is encoded and transmitted is an important difference between biological systems and robotic machines. Neuromorphic engineering mimics these behaviors found in biology into engineering solutions to produce more efficient systems and for a better understanding of neural systems. This paper presents the application of a Spike-based Proportional-Integral-Derivative controller to a 6-DoF Scorbot ER-VII robotic arm, feeding the motors with Pulse-Frequency-Modulation instead of Pulse-Width-Modulation, mimicking the way in which motor neurons act over muscles. The presented frameworks allow the robot to be commanded and monitored locally or remotely from both a Python software running on a computer or from a spike-based neuromorphic hardware. Multi-FPGA and single-PSoC solutions are compared. These frameworks are intended for experimental use of the neuromorphic community as a testbed platform and for dataset recording for machine learning purposes.

通过脊髓与小脑相连的运动神经元发出的脉冲会拉伸肌肉。然后,α运动神经元直接支配肌肉,完成来自上层生物结构的运动指令。然而,传统的机器人系统通常需要复杂的计算能力和相对高的功耗来处理它们的控制算法,这需要机器人本体感觉传感器的信息。信息编码和传输的方式是生物系统和机器人之间的一个重要区别。神经形态工程将生物学中的这些行为模仿成工程解决方案,以生产更高效的系统,并更好地理解神经系统。本文介绍了一种基于脉冲的比例-积分-导数控制器在6自由度Scorbot ER-VII机械臂上的应用,模拟运动神经元在肌肉上的作用方式,为电机提供脉冲频率调制而不是脉冲宽度调制。所提出的框架允许从计算机上运行的Python软件或基于spike的神经形态硬件对机器人进行本地或远程命令和监控。比较了多fpga和单psoc解决方案。这些框架旨在用于神经形态社区作为测试平台的实验使用,以及用于机器学习目的的数据集记录。
{"title":"Towards neuromorphic FPGA-based infrastructures for a robotic arm","authors":"Salvador Canas-Moreno,&nbsp;Enrique Piñero-Fuentes,&nbsp;Antonio Rios-Navarro,&nbsp;Daniel Cascado-Caballero,&nbsp;Fernando Perez-Peña,&nbsp;Alejandro Linares-Barranco","doi":"10.1007/s10514-023-10111-x","DOIUrl":"10.1007/s10514-023-10111-x","url":null,"abstract":"<div><p>Muscles are stretched with bursts of spikes that come from motor neurons connected to the cerebellum through the spinal cord. Then, alpha motor neurons directly innervate the muscles to complete the motor command coming from upper biological structures. Nevertheless, classical robotic systems usually require complex computational capabilities and relative high-power consumption to process their control algorithm, which requires information from the robot’s proprioceptive sensors. The way in which the information is encoded and transmitted is an important difference between biological systems and robotic machines. Neuromorphic engineering mimics these behaviors found in biology into engineering solutions to produce more efficient systems and for a better understanding of neural systems. This paper presents the application of a Spike-based Proportional-Integral-Derivative controller to a 6-DoF Scorbot ER-VII robotic arm, feeding the motors with Pulse-Frequency-Modulation instead of Pulse-Width-Modulation, mimicking the way in which motor neurons act over muscles. The presented frameworks allow the robot to be commanded and monitored locally or remotely from both a Python software running on a computer or from a spike-based neuromorphic hardware. Multi-FPGA and single-PSoC solutions are compared. These frameworks are intended for experimental use of the neuromorphic community as a testbed platform and for dataset recording for machine learning purposes.\u0000</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 7","pages":"947 - 961"},"PeriodicalIF":3.5,"publicationDate":"2023-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-023-10111-x.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45151320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning rewards from exploratory demonstrations using probabilistic temporal ranking 使用概率时间排序从探索性演示中学习奖励
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-10 DOI: 10.1007/s10514-023-10120-w
Michael Burke, Katie Lu, Daniel Angelov, Artūras Straižys, Craig Innes, Kartic Subr, Subramanian Ramamoorthy

Informative path-planning is a well established approach to visual-servoing and active viewpoint selection in robotics, but typically assumes that a suitable cost function or goal state is known. This work considers the inverse problem, where the goal of the task is unknown, and a reward function needs to be inferred from exploratory example demonstrations provided by a demonstrator, for use in a downstream informative path-planning policy. Unfortunately, many existing reward inference strategies are unsuited to this class of problems, due to the exploratory nature of the demonstrations. In this paper, we propose an alternative approach to cope with the class of problems where these sub-optimal, exploratory demonstrations occur. We hypothesise that, in tasks which require discovery, successive states of any demonstration are progressively more likely to be associated with a higher reward, and use this hypothesis to generate time-based binary comparison outcomes and infer reward functions that support these ranks, under a probabilistic generative model. We formalise this probabilistic temporal ranking approach and show that it improves upon existing approaches to perform reward inference for autonomous ultrasound scanning, a novel application of learning from demonstration in medical imaging while also being of value across a broad range of goal-oriented learning from demonstration tasks.

信息路径规划是机器人视觉伺服和主动视点选择的一种成熟方法,但通常假设已知合适的成本函数或目标状态。这项工作考虑了反问题,其中任务的目标是未知的,并且需要从演示者提供的探索性示例演示中推断出奖励函数,以用于下游信息路径规划策略。不幸的是,由于演示的探索性,许多现有的奖励推理策略不适合这类问题。在本文中,我们提出了一种替代方法来处理出现这些次优探索性演示的这类问题。我们假设,在需要发现的任务中,任何演示的连续状态都越来越有可能与更高的奖励相关联,并使用该假设在概率生成模型下生成基于时间的二元比较结果,并推断支持这些排名的奖励函数。我们将这种概率时间排序方法正式化,并表明它改进了现有的方法来执行自主超声扫描的奖励推理,这是一种从演示中学习在医学成像中的新应用,同时在从演示任务中进行广泛的目标导向学习方面也有价值。
{"title":"Learning rewards from exploratory demonstrations using probabilistic temporal ranking","authors":"Michael Burke,&nbsp;Katie Lu,&nbsp;Daniel Angelov,&nbsp;Artūras Straižys,&nbsp;Craig Innes,&nbsp;Kartic Subr,&nbsp;Subramanian Ramamoorthy","doi":"10.1007/s10514-023-10120-w","DOIUrl":"10.1007/s10514-023-10120-w","url":null,"abstract":"<div><p>Informative path-planning is a well established approach to visual-servoing and active viewpoint selection in robotics, but typically assumes that a suitable cost function or goal state is known. This work considers the inverse problem, where the goal of the task is unknown, and a reward function needs to be inferred from exploratory example demonstrations provided by a demonstrator, for use in a downstream informative path-planning policy. Unfortunately, many existing reward inference strategies are unsuited to this class of problems, due to the exploratory nature of the demonstrations. In this paper, we propose an alternative approach to cope with the class of problems where these sub-optimal, exploratory demonstrations occur. We hypothesise that, in tasks which require discovery, successive states of any demonstration are progressively more likely to be associated with a higher reward, and use this hypothesis to generate time-based binary comparison outcomes and infer reward functions that support these ranks, under a probabilistic generative model. We formalise this <i>probabilistic temporal ranking</i> approach and show that it improves upon existing approaches to perform reward inference for autonomous ultrasound scanning, a novel application of learning from demonstration in medical imaging while also being of value across a broad range of goal-oriented learning from demonstration tasks.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 6","pages":"733 - 751"},"PeriodicalIF":3.5,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-023-10120-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48400024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Autonomous Robots
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1