Pub Date : 2024-10-17DOI: 10.1007/s10514-024-10178-0
Luisa Fairfax, Patricio Vela
This paper explores range and bearing angle regulation of a leader–follower using monocular vision. The main challenge is that monocular vision does not directly provide a range measurement. The contribution is a novel concurrent learning (CL) approach, called CL Subtended Angle and Bearing Estimator for Relative pose (CL-SABER), which achieves range regulation without communication, persistency of excitation or known geometry and is demonstrated on a physical, robot platform. A history stack estimates target size which augments the Kalman filter (KF) with a range pseudomeasurement. The target is followed to scale without drift, persistency of excitation requirements, prior knowledge, or additional measurements. Finite excitation is required to achieve parameter convergence and perform steady-state regulation using CL-SABER. Evaluation using simulation and mobile robot experiments in special Euclidean planar space (SE(2)) show that the new method provides stable and consistent range regulation, as demonstrated by the inter-rater reliability, including in noisy and high leader acceleration environments.
本文探讨了利用单目视觉对领航员-追随者进行测距和方位角调节的问题。主要挑战在于单目视觉无法直接提供距离测量。本文的贡献在于采用了一种新颖的并发学习(CL)方法,称为 "CL-SABER"(CL Subtended Angle and Bearing Estimator for Relative pose)。历史堆栈可估算目标大小,并通过范围伪测量来增强卡尔曼滤波器(KF)。跟踪目标时,无需漂移、持续激励要求、先验知识或额外测量。利用 CL-SABER 实现参数收敛和稳态调节需要有限的激励。在特殊欧几里得平面空间(SE(2))中使用模拟和移动机器人实验进行的评估表明,新方法可提供稳定一致的测距调节,这一点已通过评分者之间的可靠性得到证明,包括在嘈杂和高领导加速度环境中。
{"title":"A concurrent learning approach to monocular vision range regulation of leader/follower systems","authors":"Luisa Fairfax, Patricio Vela","doi":"10.1007/s10514-024-10178-0","DOIUrl":"10.1007/s10514-024-10178-0","url":null,"abstract":"<div><p>This paper explores range and bearing angle regulation of a leader–follower using monocular vision. The main challenge is that monocular vision does not directly provide a range measurement. The contribution is a novel concurrent learning (CL) approach, called CL Subtended Angle and Bearing Estimator for Relative pose (CL-SABER), which achieves range regulation without communication, persistency of excitation or known geometry and is demonstrated on a physical, robot platform. A history stack estimates target size which augments the Kalman filter (KF) with a range pseudomeasurement. The target is followed <i>to scale without drift, persistency of excitation requirements, prior knowledge, or additional measurements</i>. <i>Finite</i> excitation is required to achieve parameter convergence and perform steady-state regulation using CL-SABER. Evaluation using simulation and mobile robot experiments in special Euclidean planar space (<i>SE</i>(2)) show that the new method provides stable and consistent range regulation, as demonstrated by the inter-rater reliability, including in noisy and high leader acceleration environments.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 8","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-024-10178-0.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142447364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-12DOI: 10.1007/s10514-024-10180-6
Mitchell Usayiwevu, Fouad Sukkar, Chanyeol Yoo, Robert Fitch, Teresa Vidal-Calleja
Inertial-aided systems require continuous motion excitation among other reasons to characterize the measurement biases that will enable accurate integration required for localization frameworks. This paper proposes the use of informative path planning to find the best trajectory for minimizing the uncertainty of IMU biases and an adaptive traces method to guide the planner towards trajectories that aid convergence. The key contribution is a novel regression method based on Gaussian Process (GP) to enforce continuity and differentiability between waypoints from a variant of the (hbox {RRT}^*) planning algorithm. We employ linear operators applied to the GP kernel function to infer not only continuous position trajectories, but also velocities and accelerations. The use of linear functionals enable velocity and acceleration constraints given by the IMU measurements to be imposed on the position GP model. The results from both simulation and real-world experiments show that planning for IMU bias convergence helps minimize localization errors in state estimation frameworks.
惯性辅助系统需要持续的运动激励,以确定测量偏差的特征,从而实现定位框架所需的精确整合。本文提出使用信息路径规划来寻找最佳轨迹,以最大限度地减少 IMU 偏差的不确定性,并提出一种自适应轨迹方法,以引导规划者找到有助于收敛的轨迹。该方法的主要贡献是基于高斯过程(GP)的新型回归方法,以强制执行 (hbox {RRT}^*)规划算法变体的航点之间的连续性和可区分性。我们采用应用于 GP 核函数的线性算子,不仅能推断连续的位置轨迹,还能推断速度和加速度。通过使用线性函数,可以将 IMU 测量给出的速度和加速度约束施加到位置 GP 模型上。模拟和实际实验的结果表明,对 IMU 偏差收敛进行规划有助于最大限度地减少状态估计框架中的定位误差。
{"title":"Continuous planning for inertial-aided systems","authors":"Mitchell Usayiwevu, Fouad Sukkar, Chanyeol Yoo, Robert Fitch, Teresa Vidal-Calleja","doi":"10.1007/s10514-024-10180-6","DOIUrl":"10.1007/s10514-024-10180-6","url":null,"abstract":"<div><p>Inertial-aided systems require continuous motion excitation among other reasons to characterize the measurement biases that will enable accurate integration required for localization frameworks. This paper proposes the use of informative path planning to find the best trajectory for minimizing the uncertainty of IMU biases and an adaptive traces method to guide the planner towards trajectories that aid convergence. The key contribution is a novel regression method based on Gaussian Process (GP) to enforce continuity and differentiability between waypoints from a variant of the <span>(hbox {RRT}^*)</span> planning algorithm. We employ linear operators applied to the GP kernel function to infer not only continuous position trajectories, but also velocities and accelerations. The use of linear functionals enable velocity and acceleration constraints given by the IMU measurements to be imposed on the position GP model. The results from both simulation and real-world experiments show that planning for IMU bias convergence helps minimize localization errors in state estimation frameworks.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 8","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-024-10180-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142411505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-12DOI: 10.1007/s10514-024-10177-1
Yingke Li, Mengxue Hou, Enlu Zhou, Fumin Zhang
The process-aware source seeking (PASS) problem in flow fields aims to find an informative trajectory to reach an unknown source location while taking the energy consumption in the flow fields into consideration. Taking advantage of the dynamic flow field partition technique, this paper formulates this problem as a task and motion planning (TAMP) problem and proposes a bi-level hierarchical planning framework to decouple the planning of inter-region transition and inner-region trajectory by introducing inter-region junctions. An integrated strategy is developed to enable efficient upper-level planning by investigating the optimal solution of the lower-level planner. In order to leverage the information acquisition and computational burden, a dynamic event-triggered mechanism is introduced to enable asynchronized estimation, region partitioning and re-plans. The proposed algorithm provides guaranteed convergence of the trajectory, and achieves automatic trade-offs of both exploration-exploitation and accuracy-efficiency. Simulations in a highly complicated and realistic ocean surface flow field validate the merits of the proposed algorithm, which demonstrates a significant reduction in computational burden without compromising planning optimality.
{"title":"Dynamic event-triggered integrated task and motion planning for process-aware source seeking","authors":"Yingke Li, Mengxue Hou, Enlu Zhou, Fumin Zhang","doi":"10.1007/s10514-024-10177-1","DOIUrl":"10.1007/s10514-024-10177-1","url":null,"abstract":"<div><p>The process-aware source seeking (PASS) problem in flow fields aims to find an informative trajectory to reach an unknown source location while taking the energy consumption in the flow fields into consideration. Taking advantage of the dynamic flow field partition technique, this paper formulates this problem as a task and motion planning (TAMP) problem and proposes a bi-level hierarchical planning framework to decouple the planning of inter-region transition and inner-region trajectory by introducing inter-region junctions. An integrated strategy is developed to enable efficient upper-level planning by investigating the optimal solution of the lower-level planner. In order to leverage the information acquisition and computational burden, a dynamic event-triggered mechanism is introduced to enable asynchronized estimation, region partitioning and re-plans. The proposed algorithm provides guaranteed convergence of the trajectory, and achieves automatic trade-offs of both exploration-exploitation and accuracy-efficiency. Simulations in a highly complicated and realistic ocean surface flow field validate the merits of the proposed algorithm, which demonstrates a significant reduction in computational burden without compromising planning optimality.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 8","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-024-10177-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142411504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-06DOI: 10.1007/s10514-024-10174-4
Hann Woei Ho, Ye Zhou, Yiting Feng, Guido C. H. E. de Croon
This paper proposes an innovative approach for optical flow-based control of micro air vehicles (MAVs), addressing challenges inherent in the nonlinearity of optical flow observables. The proposed incremental nonlinear dynamic inversion (INDI) control scheme employs an efficient data-driven approach to directly estimate the inverse of the time-varying INDI control effectiveness in real-time. This method eliminates the constant effectiveness assumption typically made by traditional INDI methods and reduces the computational burden associated with inverting this variable at each time step. It effectively handles rapidly changing system dynamics, often encountered in optical flow-based control, particularly height-dependent control variables. Stability analysis of the proposed control scheme is conducted, and its robustness and efficiency are demonstrated through both numerical simulations and real-world flight tests. These tests include multiple landings of an MAV on a static, flat surface with several different tracking setpoints, as well as hovering and landings on moving and undulating surfaces. Despite the challenges posed by noisy optical flow estimates and lateral or vertical movements of the landing surfaces, the MAV successfully tracks or lands on the surface with an exponential decay of both height and vertical velocity almost simultaneously, aligning with the desired performance.
本文针对微型飞行器(MAVs)基于光流的控制提出了一种创新方法,以解决光流观测值的非线性所固有的挑战。所提出的增量非线性动态反演(INDI)控制方案采用了一种高效的数据驱动方法,可直接实时估算时变 INDI 控制效果的逆值。这种方法消除了传统 INDI 方法通常采用的恒定有效性假设,并减轻了在每个时间步长反演该变量的计算负担。它能有效处理基于光流的控制中经常遇到的快速变化的系统动态,尤其是与高度相关的控制变量。对提出的控制方案进行了稳定性分析,并通过数值模拟和实际飞行测试证明了其稳健性和效率。这些测试包括飞行器在静态、平坦的表面上以多个不同的跟踪设定点进行多次着陆,以及在移动和起伏的表面上悬停和着陆。尽管存在噪声光流估计和着陆表面横向或纵向移动带来的挑战,但飞行器成功地在表面跟踪或着陆,高度和垂直速度几乎同时呈指数衰减,符合预期性能。
{"title":"Optical flow-based control for micro air vehicles: an efficient data-driven incremental nonlinear dynamic inversion approach","authors":"Hann Woei Ho, Ye Zhou, Yiting Feng, Guido C. H. E. de Croon","doi":"10.1007/s10514-024-10174-4","DOIUrl":"10.1007/s10514-024-10174-4","url":null,"abstract":"<div><p>This paper proposes an innovative approach for optical flow-based control of micro air vehicles (MAVs), addressing challenges inherent in the nonlinearity of optical flow observables. The proposed incremental nonlinear dynamic inversion (INDI) control scheme employs an efficient data-driven approach to directly estimate the inverse of the time-varying INDI control effectiveness in real-time. This method eliminates the constant effectiveness assumption typically made by traditional INDI methods and reduces the computational burden associated with inverting this variable at each time step. It effectively handles rapidly changing system dynamics, often encountered in optical flow-based control, particularly height-dependent control variables. Stability analysis of the proposed control scheme is conducted, and its robustness and efficiency are demonstrated through both numerical simulations and real-world flight tests. These tests include multiple landings of an MAV on a static, flat surface with several different tracking setpoints, as well as hovering and landings on moving and undulating surfaces. Despite the challenges posed by noisy optical flow estimates and lateral or vertical movements of the landing surfaces, the MAV successfully tracks or lands on the surface with an exponential decay of both height and vertical velocity almost simultaneously, aligning with the desired performance.\u0000</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 8","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142410300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-01DOI: 10.1007/s10514-024-10173-5
Xili Yi, An Dang, Nima Fazeli
In this paper, we present models and planning algorithms to slide an object on a planar surface via frictional patch contact made with its top surface, whether the surface is horizontal or inclined. The core of our approach is the asymmetric dual limit surfaces model that determines slip boundary conditions for both the top and support patch contacts made with the object. This model enables us to compute a range of twists that can keep the object in sticking contact with the robot end-effector while slipping on the supporting plane. Based on these constraints, we derive a planning algorithm to slide objects with only top contact to arbitrary goal poses without slippage between end effector and the object. We fit the proposed model and demonstrate its predictive accuracy on a variety of object geometries and motions. We also evaluate the planning algorithm over a variety of objects and goals, demonstrating an orientation error improvement of 90% when compared to methods naive to linear path planners. For more results and information, please visit https://www.mmintlab.com/dual-limit-surfaces/.
{"title":"Dual asymmetric limit surfaces and their applications to planar manipulation","authors":"Xili Yi, An Dang, Nima Fazeli","doi":"10.1007/s10514-024-10173-5","DOIUrl":"10.1007/s10514-024-10173-5","url":null,"abstract":"<div><p>In this paper, we present models and planning algorithms to slide an object on a planar surface via frictional patch contact made with its <i>top surface</i>, whether the surface is horizontal or inclined. The core of our approach is the asymmetric dual limit surfaces model that determines slip boundary conditions for both the top and support patch contacts made with the object. This model enables us to compute a range of twists that can keep the object in sticking contact with the robot end-effector while slipping on the supporting plane. Based on these constraints, we derive a planning algorithm to slide objects with only top contact to arbitrary goal poses without slippage between end effector and the object. We fit the proposed model and demonstrate its predictive accuracy on a variety of object geometries and motions. We also evaluate the planning algorithm over a variety of objects and goals, demonstrating an orientation error improvement of 90% when compared to methods naive to linear path planners. For more results and information, please visit https://www.mmintlab.com/dual-limit-surfaces/.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 7","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142409297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-30DOI: 10.1007/s10514-024-10175-3
Yaohui Guo, X. Jessie Yang, Cong Shi
Trust is a crucial factor for effective human–robot teaming. Existing literature on trust modeling predominantly focuses on dyadic human-autonomy teams where one human agent interacts with one robot. There is little, if not no, research on trust modeling in teams consisting of multiple human and robotic agents. To fill this important research gap, we present the Trust Inference and Propagation (TIP) model to model and estimate human trust in multi-human multi-robot teams. In a multi-human multi-robot team, we postulate that there exist two types of experiences that a human agent has with a robot: direct and indirect experiences. The TIP model presents a novel mathematical framework that explicitly accounts for both types of experiences. To evaluate the model, we conducted a human-subject experiment with 15 pairs of participants ((N=30)). Each pair performed a search and detection task with two drones. Results show that our TIP model successfully captured the underlying trust dynamics and significantly outperformed a baseline model. To the best of our knowledge, the TIP model is the first mathematical framework for computational trust modeling in multi-human multi-robot teams.
信任是人类与机器人有效合作的关键因素。关于信任建模的现有文献主要集中在二元人类-自主团队,即一个人类代理与一个机器人互动。关于由多个人类和机器人代理组成的团队的信任建模的研究很少,甚至没有。为了填补这一重要的研究空白,我们提出了信任推理与传播(TIP)模型,用于模拟和估算多人多机器人团队中的人类信任度。在一个多人多机器人团队中,我们假设人类代理与机器人之间存在两类经验:直接经验和间接经验。TIP 模型提出了一个新颖的数学框架,明确地说明了这两种类型的体验。为了评估该模型,我们用 15 对参与者((N=30))进行了人类-主体实验。每对参与者使用两架无人机完成搜索和探测任务。结果表明,我们的 TIP 模型成功捕捉到了潜在的信任动态,其表现明显优于基线模型。据我们所知,TIP 模型是第一个用于多人多机器人团队信任建模的数学框架。
{"title":"TIP: A trust inference and propagation model in multi-human multi-robot teams","authors":"Yaohui Guo, X. Jessie Yang, Cong Shi","doi":"10.1007/s10514-024-10175-3","DOIUrl":"10.1007/s10514-024-10175-3","url":null,"abstract":"<div><p>Trust is a crucial factor for effective human–robot teaming. Existing literature on trust modeling predominantly focuses on dyadic human-autonomy teams where one human agent interacts with one robot. There is little, if not no, research on trust modeling in teams consisting of multiple human and robotic agents. To fill this important research gap, we present the Trust Inference and Propagation (TIP) model to model and estimate human trust in multi-human multi-robot teams. In a multi-human multi-robot team, we postulate that there exist two types of experiences that a human agent has with a robot: direct and indirect experiences. The TIP model presents a novel mathematical framework that explicitly accounts for both types of experiences. To evaluate the model, we conducted a human-subject experiment with 15 pairs of participants (<span>(N=30)</span>). Each pair performed a search and detection task with two drones. Results show that our TIP model successfully captured the underlying trust dynamics and significantly outperformed a baseline model. To the best of our knowledge, the TIP model is the first mathematical framework for computational trust modeling in multi-human multi-robot teams.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 7","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-024-10175-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142415249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-30DOI: 10.1007/s10514-024-10169-1
Micha Sende, Christian Raffelsberger, Christian Bettstetter
Swarm algorithms promise to solve certain problems in large multi-robot systems. The evaluation of large swarms is however challenging as simulations alone often lack some properties of real systems whereas real-world experiments are costly and complex. We present a mixed reality (MR) system that connects simulated and physical robots though a 5G network, facilitating MR experiments to evaluate communication-based swarm algorithms. The effectiveness of the system is demonstrated through extensive experiments with unmanned aerial vehicles. Measurements show that the communication requirements of swarm coordination are well met by 5G but the computing power of the simulation server can be a bottleneck. However, even when the simulation slows down, communication and coordination take place in real time. In conclusion, 5G-enabled MR experiments are a feasible tool for bridging the reality gap in the development and evaluation of robot swarms.
{"title":"Bridging the reality gap in drone swarm development through mixed reality","authors":"Micha Sende, Christian Raffelsberger, Christian Bettstetter","doi":"10.1007/s10514-024-10169-1","DOIUrl":"10.1007/s10514-024-10169-1","url":null,"abstract":"<div><p>Swarm algorithms promise to solve certain problems in large multi-robot systems. The evaluation of large swarms is however challenging as simulations alone often lack some properties of real systems whereas real-world experiments are costly and complex. We present a mixed reality (MR) system that connects simulated and physical robots though a 5G network, facilitating MR experiments to evaluate communication-based swarm algorithms. The effectiveness of the system is demonstrated through extensive experiments with unmanned aerial vehicles. Measurements show that the communication requirements of swarm coordination are well met by 5G but the computing power of the simulation server can be a bottleneck. However, even when the simulation slows down, communication and coordination take place in real time. In conclusion, 5G-enabled MR experiments are a feasible tool for bridging the reality gap in the development and evaluation of robot swarms.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 7","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-024-10169-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142415175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-28DOI: 10.1007/s10514-024-10172-6
Alex Stephens, Matthew Budd, Michal Staniaszek, Benoit Casseau, Paul Duckworth, Maurice Fallon, Nick Hawes, Bruno Lacerda
The exploration of new environments is a crucial challenge for mobile robots. This task becomes even more complex with the added requirement of ensuring safety. Here, safety refers to the robot staying in regions where the values of certain environmental conditions (such as terrain steepness or radiation levels) are within a predefined threshold. We consider two types of safe exploration problems. First, the robot has a map of its workspace, but the values of the environmental features relevant to safety are unknown beforehand and must be explored. Second, both the map and the environmental features are unknown, and the robot must build a map whilst remaining safe. Our proposed framework uses a Gaussian process to predict the value of the environmental features in unvisited regions. We then build a Markov decision process that integrates the Gaussian process predictions with the transition probabilities of the environmental model. The Markov decision process is then incorporated into an exploration algorithm that decides which new region of the environment to explore based on information value, predicted safety, and distance from the current position of the robot. We empirically evaluate the effectiveness of our framework through simulations and its application on a physical robot in an underground environment.
{"title":"Planning under uncertainty for safe robot exploration using Gaussian process prediction","authors":"Alex Stephens, Matthew Budd, Michal Staniaszek, Benoit Casseau, Paul Duckworth, Maurice Fallon, Nick Hawes, Bruno Lacerda","doi":"10.1007/s10514-024-10172-6","DOIUrl":"10.1007/s10514-024-10172-6","url":null,"abstract":"<div><p>The exploration of new environments is a crucial challenge for mobile robots. This task becomes even more complex with the added requirement of ensuring safety. Here, safety refers to the robot staying in regions where the values of certain environmental conditions (such as terrain steepness or radiation levels) are within a predefined threshold. We consider two types of safe exploration problems. First, the robot has a map of its workspace, but the values of the environmental features relevant to safety are unknown beforehand and must be explored. Second, both the map and the environmental features are unknown, and the robot must build a map whilst remaining safe. Our proposed framework uses a Gaussian process to predict the value of the environmental features in unvisited regions. We then build a Markov decision process that integrates the Gaussian process predictions with the transition probabilities of the environmental model. The Markov decision process is then incorporated into an exploration algorithm that decides which new region of the environment to explore based on information value, predicted safety, and distance from the current position of the robot. We empirically evaluate the effectiveness of our framework through simulations and its application on a physical robot in an underground environment.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 7","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-024-10172-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-27DOI: 10.1007/s10514-024-10170-8
Gagan Khandate, Tristan L. Saidi, Siqi Shang, Eric T. Chang, Yang Liu, Seth Dennis, Johnson Adams, Matei Ciocarlie
We present a method for enabling Reinforcement Learning of motor control policies for complex skills such as dexterous manipulation. We posit that a key difficulty for training such policies is the difficulty of exploring the problem state space, as the accessible and useful regions of this space form a complex structure along manifolds of the original high-dimensional state space. This work presents a method to enable and support exploration with Sampling-based Planning. We use a generally applicable non-holonomic Rapidly-exploring Random Trees algorithm and present multiple methods to use the resulting structure to bootstrap model-free Reinforcement Learning. Our method is effective at learning various challenging dexterous motor control skills of higher difficulty than previously shown. In particular, we achieve dexterous in-hand manipulation of complex objects while simultaneously securing the object without the use of passive support surfaces. These policies also transfer effectively to real robots. A number of example videos can also be found on the project website: sbrl.cs.columbia.edu
{"title":"R (times ) R: Rapid eXploration for Reinforcement learning via sampling-based reset distributions and imitation pre-training","authors":"Gagan Khandate, Tristan L. Saidi, Siqi Shang, Eric T. Chang, Yang Liu, Seth Dennis, Johnson Adams, Matei Ciocarlie","doi":"10.1007/s10514-024-10170-8","DOIUrl":"10.1007/s10514-024-10170-8","url":null,"abstract":"<div><p>We present a method for enabling Reinforcement Learning of motor control policies for complex skills such as dexterous manipulation. We posit that a key difficulty for training such policies is the difficulty of exploring the problem state space, as the accessible and useful regions of this space form a complex structure along manifolds of the original high-dimensional state space. This work presents a method to enable and support exploration with Sampling-based Planning. We use a generally applicable non-holonomic Rapidly-exploring Random Trees algorithm and present multiple methods to use the resulting structure to bootstrap model-free Reinforcement Learning. Our method is effective at learning various challenging dexterous motor control skills of higher difficulty than previously shown. In particular, we achieve dexterous in-hand manipulation of complex objects while simultaneously securing the object without the use of passive support surfaces. These policies also transfer effectively to real robots. A number of example videos can also be found on the project website: sbrl.cs.columbia.edu</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 7","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Programmable matter refers to material that can be programmed to alter its physical properties, including its shape. Such matter can be built as a lattice of attached robotic modules, each seen as an autonomous agent with communication and motion capabilities. Self-reconfiguration consists in changing the initial arrangement of modules to form a desired goal shape, and is known to be a complex problem due to its algorithmic complexity and motion constraints. In this paper, we propose to use a max-flow algorithm as a centralized global planner to determine the concurrent paths to be traversed by modules through a porous structure composed of 3D Catoms meta-modules with the aim of increasing the parallelism of motions, and hence decreasing the self-reconfiguration time. We implement a traffic light system as a distributed asynchronous local planning algorithm to control the motions to avoid collisions. We evaluated our algorithm using VisibleSim simulator on different self-reconfiguration scenarios and compared the performance with an existing fully distributed synchronous self-reconfiguration algorithm for similar structures. The results show that the new method provides a significant gain in self-reconfiguration time and energy efficiency.
可编程物质是指可以通过编程改变其物理特性(包括形状)的材料。可编程物质是指可以通过编程改变其物理特性(包括形状)的材料。这种物质可以构建成一个由附加机器人模块组成的晶格,每个模块都是一个具有通信和运动能力的自主代理。众所周知,由于算法的复杂性和运动限制,自我重新配置是一个复杂的问题。在本文中,我们建议使用最大流算法作为集中式全局规划器,以确定模块通过由 3D Catoms 元模块组成的多孔结构的并发路径,从而提高运动的并行性,进而缩短自我重新配置时间。我们采用交通灯系统作为分布式异步局部规划算法,控制运动以避免碰撞。我们使用 VisibleSim 模拟器在不同的自重新配置场景中评估了我们的算法,并将其性能与现有的针对类似结构的全分布式同步自重新配置算法进行了比较。结果表明,新方法显著缩短了自重新配置时间,提高了能效。
{"title":"ASAPs: asynchronous hybrid self-reconfiguration algorithm for porous modular robotic structures","authors":"Jad Bassil, Benoît Piranda, Abdallah Makhoul, Julien Bourgeois","doi":"10.1007/s10514-024-10171-7","DOIUrl":"10.1007/s10514-024-10171-7","url":null,"abstract":"<div><p>Programmable matter refers to material that can be programmed to alter its physical properties, including its shape. Such matter can be built as a lattice of attached robotic modules, each seen as an autonomous agent with communication and motion capabilities. Self-reconfiguration consists in changing the initial arrangement of modules to form a desired goal shape, and is known to be a complex problem due to its algorithmic complexity and motion constraints. In this paper, we propose to use a max-flow algorithm as a centralized global planner to determine the concurrent paths to be traversed by modules through a porous structure composed of <i>3D Catoms</i> meta-modules with the aim of increasing the parallelism of motions, and hence decreasing the self-reconfiguration time. We implement a traffic light system as a distributed asynchronous local planning algorithm to control the motions to avoid collisions. We evaluated our algorithm using <i>VisibleSim</i> simulator on different self-reconfiguration scenarios and compared the performance with an existing fully distributed synchronous self-reconfiguration algorithm for similar structures. The results show that the new method provides a significant gain in self-reconfiguration time and energy efficiency.\u0000</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 7","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}