Automatic Discovery of Motion Patterns that Improve Learning Rate in Communication-Limited Multi-Robot Systems

Taeyeong Choi, Theodore P. Pavlic
{"title":"Automatic Discovery of Motion Patterns that Improve Learning Rate in Communication-Limited Multi-Robot Systems","authors":"Taeyeong Choi, Theodore P. Pavlic","doi":"10.1109/MFI49285.2020.9235218","DOIUrl":null,"url":null,"abstract":"Learning in robotic systems is largely constrained by the quality of the training data available to a robot learner. Robots may have to make multiple, repeated expensive excursions to gather this data or have humans in the loop to perform demonstrations to ensure reliable performance. The cost can be much higher when a robot embedded within a multi-robot system must learn from the complex aggregate of the many robots that surround it and may react to the learner’s motions. In our previous work [1], [2], we considered the problem of Remote Teammate Localization (ReTLo), where a single robot in a team uses passive observations of a nearby neighbor to accurately infer the position of robots outside of its sensory range even when robot-to-robot communication is not allowed in the system. We demonstrated a communication-free approach to show that the rearmost robot can use motion information of a single robot within its sensory range to predict the positions of all robots in the convoy. Here, we expand on that work with Selective Random Sampling (SRS), a framework that improves the ReTLo learning process by enabling the learner to actively deviate from its trajectory in ways that are likely to lead to better training samples and consequently gain accurate localization ability with fewer observations. By adding diversity to the learner’s motion, SRS simultaneously improves the learner’s predictions of all other teammates and thus can achieve similar performance as prior methods with less data.","PeriodicalId":446154,"journal":{"name":"2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MFI49285.2020.9235218","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Learning in robotic systems is largely constrained by the quality of the training data available to a robot learner. Robots may have to make multiple, repeated expensive excursions to gather this data or have humans in the loop to perform demonstrations to ensure reliable performance. The cost can be much higher when a robot embedded within a multi-robot system must learn from the complex aggregate of the many robots that surround it and may react to the learner’s motions. In our previous work [1], [2], we considered the problem of Remote Teammate Localization (ReTLo), where a single robot in a team uses passive observations of a nearby neighbor to accurately infer the position of robots outside of its sensory range even when robot-to-robot communication is not allowed in the system. We demonstrated a communication-free approach to show that the rearmost robot can use motion information of a single robot within its sensory range to predict the positions of all robots in the convoy. Here, we expand on that work with Selective Random Sampling (SRS), a framework that improves the ReTLo learning process by enabling the learner to actively deviate from its trajectory in ways that are likely to lead to better training samples and consequently gain accurate localization ability with fewer observations. By adding diversity to the learner’s motion, SRS simultaneously improves the learner’s predictions of all other teammates and thus can achieve similar performance as prior methods with less data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在通信受限的多机器人系统中提高学习率的运动模式自动发现
机器人系统中的学习在很大程度上受到机器人学习者可用的训练数据质量的限制。机器人可能需要进行多次,重复的昂贵的短途旅行来收集这些数据,或者让人类在循环中进行演示以确保可靠的性能。当嵌入在多机器人系统中的机器人必须从周围许多机器人的复杂集合中学习,并可能对学习者的动作做出反应时,成本可能会高得多。在我们之前的工作[1],[2]中,我们考虑了远程队友定位(Remote队友Localization, ReTLo)问题,即团队中的单个机器人使用对附近邻居的被动观察来准确推断其感知范围之外的机器人的位置,即使系统中不允许机器人之间的通信。我们演示了一种无需通信的方法,表明最后面的机器人可以使用其感知范围内单个机器人的运动信息来预测车队中所有机器人的位置。在这里,我们扩展了选择性随机抽样(SRS)的工作,这是一个框架,通过使学习者主动偏离其轨迹的方式,可能导致更好的训练样本,从而以更少的观察值获得准确的定位能力,从而改进了ReTLo学习过程。通过增加学习者动作的多样性,SRS同时提高了学习者对所有其他队友的预测,从而可以在数据较少的情况下获得与先前方法相似的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
OAFuser: Online Adaptive Extended Object Tracking and Fusion using automotive Radar Detections Observability driven Multi-modal Line-scan Camera Calibration Localization and velocity estimation based on multiple bistatic measurements A Continuous Probabilistic Origin Association Filter for Extended Object Tracking Towards Automatic Classification of Fragmented Rock Piles via Proprioceptive Sensing and Wavelet Analysis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1