Automatic Discovery of Motion Patterns that Improve Learning Rate in Communication-Limited Multi-Robot Systems

2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI) Pub Date : 2020-09-14 DOI:10.1109/MFI49285.2020.9235218

Taeyeong Choi, Theodore P. Pavlic

{"title":"Automatic Discovery of Motion Patterns that Improve Learning Rate in Communication-Limited Multi-Robot Systems","authors":"Taeyeong Choi, Theodore P. Pavlic","doi":"10.1109/MFI49285.2020.9235218","DOIUrl":null,"url":null,"abstract":"Learning in robotic systems is largely constrained by the quality of the training data available to a robot learner. Robots may have to make multiple, repeated expensive excursions to gather this data or have humans in the loop to perform demonstrations to ensure reliable performance. The cost can be much higher when a robot embedded within a multi-robot system must learn from the complex aggregate of the many robots that surround it and may react to the learner’s motions. In our previous work [1], [2], we considered the problem of Remote Teammate Localization (ReTLo), where a single robot in a team uses passive observations of a nearby neighbor to accurately infer the position of robots outside of its sensory range even when robot-to-robot communication is not allowed in the system. We demonstrated a communication-free approach to show that the rearmost robot can use motion information of a single robot within its sensory range to predict the positions of all robots in the convoy. Here, we expand on that work with Selective Random Sampling (SRS), a framework that improves the ReTLo learning process by enabling the learner to actively deviate from its trajectory in ways that are likely to lead to better training samples and consequently gain accurate localization ability with fewer observations. By adding diversity to the learner’s motion, SRS simultaneously improves the learner’s predictions of all other teammates and thus can achieve similar performance as prior methods with less data.","PeriodicalId":446154,"journal":{"name":"2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MFI49285.2020.9235218","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Learning in robotic systems is largely constrained by the quality of the training data available to a robot learner. Robots may have to make multiple, repeated expensive excursions to gather this data or have humans in the loop to perform demonstrations to ensure reliable performance. The cost can be much higher when a robot embedded within a multi-robot system must learn from the complex aggregate of the many robots that surround it and may react to the learner’s motions. In our previous work [1], [2], we considered the problem of Remote Teammate Localization (ReTLo), where a single robot in a team uses passive observations of a nearby neighbor to accurately infer the position of robots outside of its sensory range even when robot-to-robot communication is not allowed in the system. We demonstrated a communication-free approach to show that the rearmost robot can use motion information of a single robot within its sensory range to predict the positions of all robots in the convoy. Here, we expand on that work with Selective Random Sampling (SRS), a framework that improves the ReTLo learning process by enabling the learner to actively deviate from its trajectory in ways that are likely to lead to better training samples and consequently gain accurate localization ability with fewer observations. By adding diversity to the learner’s motion, SRS simultaneously improves the learner’s predictions of all other teammates and thus can achieve similar performance as prior methods with less data.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

在通信受限的多机器人系统中提高学习率的运动模式自动发现

机器人系统中的学习在很大程度上受到机器人学习者可用的训练数据质量的限制。机器人可能需要进行多次，重复的昂贵的短途旅行来收集这些数据，或者让人类在循环中进行演示以确保可靠的性能。当嵌入在多机器人系统中的机器人必须从周围许多机器人的复杂集合中学习，并可能对学习者的动作做出反应时，成本可能会高得多。在我们之前的工作[1]，[2]中，我们考虑了远程队友定位(Remote队友Localization, ReTLo)问题，即团队中的单个机器人使用对附近邻居的被动观察来准确推断其感知范围之外的机器人的位置，即使系统中不允许机器人之间的通信。我们演示了一种无需通信的方法，表明最后面的机器人可以使用其感知范围内单个机器人的运动信息来预测车队中所有机器人的位置。在这里，我们扩展了选择性随机抽样(SRS)的工作，这是一个框架，通过使学习者主动偏离其轨迹的方式，可能导致更好的训练样本，从而以更少的观察值获得准确的定位能力，从而改进了ReTLo学习过程。通过增加学习者动作的多样性，SRS同时提高了学习者对所有其他队友的预测，从而可以在数据较少的情况下获得与先前方法相似的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)

自引率

0.00%

发文量