DiPCAN: Distilling Privileged Information for Crowd-Aware Navigation

G. Monaci, Michel Aractingi, T. Silander
{"title":"DiPCAN: Distilling Privileged Information for Crowd-Aware Navigation","authors":"G. Monaci, Michel Aractingi, T. Silander","doi":"10.15607/rss.2022.xviii.045","DOIUrl":null,"url":null,"abstract":"—Mobile robots need to navigate in crowded environments to provide services to humans. Traditional approaches to crowd-aware navigation decouple people motion prediction from robot motion planning, leading to undesired robot behaviours. Recent deep learning-based methods integrate crowd forecasting in the planner, assuming precise tracking of the agents in the scene. To do this they require expensive LiDAR sensors and tracking algorithms that are complex and brittle. In this paper we propose a two-step approach to first learn a robot navigation policy based on privileged information about exact pedestrian locations available in simulation. A second learning step distills the knowledge acquired by the first network into an adaptation network that uses only narrow field-of-view image data from the robot camera. While the navigation policy is trained in simulation without any expert supervision such as trajectories computed by a planner, it exhibits state-of-the-art performance on a broad range of dense crowd simulations and real-world experiments. Video results at https://europe.naverlabs.com/research/dipcan.","PeriodicalId":340265,"journal":{"name":"Robotics: Science and Systems XVIII","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics: Science and Systems XVIII","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15607/rss.2022.xviii.045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

—Mobile robots need to navigate in crowded environments to provide services to humans. Traditional approaches to crowd-aware navigation decouple people motion prediction from robot motion planning, leading to undesired robot behaviours. Recent deep learning-based methods integrate crowd forecasting in the planner, assuming precise tracking of the agents in the scene. To do this they require expensive LiDAR sensors and tracking algorithms that are complex and brittle. In this paper we propose a two-step approach to first learn a robot navigation policy based on privileged information about exact pedestrian locations available in simulation. A second learning step distills the knowledge acquired by the first network into an adaptation network that uses only narrow field-of-view image data from the robot camera. While the navigation policy is trained in simulation without any expert supervision such as trajectories computed by a planner, it exhibits state-of-the-art performance on a broad range of dense crowd simulations and real-world experiments. Video results at https://europe.naverlabs.com/research/dipcan.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
DiPCAN:群体感知导航的特权信息提取
-移动机器人需要在拥挤的环境中导航,为人类提供服务。传统的群体感知导航方法将人的运动预测与机器人的运动规划分离开来,导致机器人产生不期望的行为。最近基于深度学习的方法在规划器中集成了人群预测,假设对场景中的智能体进行精确跟踪。要做到这一点,他们需要昂贵的激光雷达传感器和复杂而脆弱的跟踪算法。在本文中,我们提出了一种两步的方法,首先根据仿真中提供的准确行人位置的特权信息学习机器人导航策略。第二个学习步骤将第一个网络获得的知识提炼成一个适应网络,该网络仅使用机器人相机的窄视场图像数据。虽然导航策略是在模拟中训练的,没有任何专家监督,如由规划器计算的轨迹,但它在大范围的密集人群模拟和现实世界实验中表现出最先进的性能。视频结果见https://europe.naverlabs.com/research/dipcan。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Underwater Robot-To-Human Communication Via Motion: Implementation and Full-Loop Human Interface Evaluation Meta Value Learning for Fast Policy-Centric Optimal Motion Planning A Learning-based Iterative Control Framework for Controlling a Robot Arm with Pneumatic Artificial Muscles Aerial Layouting: Design and Control of a Compliant and Actuated End-Effector for Precise In-flight Marking on Ceilings Occupancy-SLAM: Simultaneously Optimizing Robot Poses and Continuous Occupancy Map
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1