Deep Sequential Models for Sampling-Based Planning

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Pub Date : 2018-10-01 DOI:10.1109/IROS.2018.8593947

Yen-Ling Kuo, Andrei Barbu, Boris Katz

{"title":"Deep Sequential Models for Sampling-Based Planning","authors":"Yen-Ling Kuo, Andrei Barbu, Boris Katz","doi":"10.1109/IROS.2018.8593947","DOIUrl":null,"url":null,"abstract":"We demonstrate how a sequence model and a sampling-based planner can influence each other to produce efficient plans and how such a model can automatically learn to take advantage of observations of the environment. Sampling-based planners such as RRT generally know nothing of their environments even if they have traversed similar spaces many times. A sequence model, such as an HMM or LSTM, guides the search for good paths. The resulting model, called DeRRT*, observes the state of the planner and the local environment to bias the next move and next planner state. The neural-network-based models avoid manual feature engineering by co-training a convolutional network which processes map features and observations from sensors. We incorporate this sequence model in a manner that combines its likelihood with the existing bias for searching large unexplored Voronoi regions. This leads to more efficient trajectories with fewer rejected samples even in difficult domains such as when escaping bug traps. This model can also be used for dimensionality reduction in multi-agent environments with dynamic obstacles. Instead of planning in a high-dimensional space that includes the configurations of the other agents, we plan in a low-dimensional subspace relying on the sequence model to bias samples using the observed behavior of the other agents. The techniques presented here are general, include both graphical models and deep learning approaches, and can be adapted to a range of planners.","PeriodicalId":6640,"journal":{"name":"2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)","volume":"24 8","pages":"6490-6497"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IROS.2018.8593947","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 23

Abstract

We demonstrate how a sequence model and a sampling-based planner can influence each other to produce efficient plans and how such a model can automatically learn to take advantage of observations of the environment. Sampling-based planners such as RRT generally know nothing of their environments even if they have traversed similar spaces many times. A sequence model, such as an HMM or LSTM, guides the search for good paths. The resulting model, called DeRRT*, observes the state of the planner and the local environment to bias the next move and next planner state. The neural-network-based models avoid manual feature engineering by co-training a convolutional network which processes map features and observations from sensors. We incorporate this sequence model in a manner that combines its likelihood with the existing bias for searching large unexplored Voronoi regions. This leads to more efficient trajectories with fewer rejected samples even in difficult domains such as when escaping bug traps. This model can also be used for dimensionality reduction in multi-agent environments with dynamic obstacles. Instead of planning in a high-dimensional space that includes the configurations of the other agents, we plan in a low-dimensional subspace relying on the sequence model to bias samples using the observed behavior of the other agents. The techniques presented here are general, include both graphical models and deep learning approaches, and can be adapted to a range of planners.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于抽样规划的深度序列模型

我们演示了序列模型和基于抽样的计划器如何相互影响以产生有效的计划，以及这样的模型如何自动学习以利用对环境的观察。像RRT这样的基于抽样的规划者通常对他们的环境一无所知，即使他们已经多次穿越类似的空间。序列模型，如HMM或LSTM，指导搜索好的路径。由此产生的模型称为DeRRT*，它观察计划者的状态和局部环境，从而判断下一步行动和下一个计划者的状态。基于神经网络的模型通过共同训练卷积网络来避免人工特征工程，卷积网络处理地图特征和来自传感器的观察。我们以一种将其可能性与搜索大型未勘探Voronoi区域的现有偏差相结合的方式合并该序列模型。这导致更有效的轨迹和更少的拒绝样本，即使在困难的领域，如当逃避bug陷阱。该模型也可用于具有动态障碍物的多智能体环境的降维。我们不是在包含其他智能体配置的高维空间中进行规划，而是在依赖序列模型的低维子空间中进行规划，利用观察到的其他智能体的行为对样本进行偏置。这里介绍的技术是通用的，包括图形模型和深度学习方法，可以适用于一系列计划者。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

自引率

0.00%

发文量