Sheida Nozari, L. Marcenaro, David Martín, C. Regazzoni
{"title":"Observational Learning: Imitation Through an Adaptive Probabilistic Approach","authors":"Sheida Nozari, L. Marcenaro, David Martín, C. Regazzoni","doi":"10.1109/ICAS49788.2021.9551152","DOIUrl":null,"url":null,"abstract":"This paper proposes an adaptive method to enable imitation learning from expert demonstrations in a multi-agent context. The proposed system employs the inverse reinforcement learning method to a coupled Dynamic Bayesian Network to facilitate dynamic learning in an interactive system. This method studies the interaction at both discrete and continuous levels by identifying inter-relationships between the objects to facilitate the prediction of an expert agent. We evaluate the learning procedure in the scene of learner agent based on probabilistic reward function. Our goal is to estimate policies that predict matched trajectories with the observed one by minimizing the Kullback-Leiber divergence. The reward policies provide a probabilistic dynamic structure to minimise the abnormalities.","PeriodicalId":287105,"journal":{"name":"2021 IEEE International Conference on Autonomous Systems (ICAS)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Autonomous Systems (ICAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAS49788.2021.9551152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 1
Abstract
This paper proposes an adaptive method that enables imitation learning from expert demonstrations in a multi-agent context. The proposed system applies inverse reinforcement learning to a coupled Dynamic Bayesian Network to facilitate dynamic learning in an interactive system. The method studies the interaction at both discrete and continuous levels, identifying inter-relationships between the objects to facilitate prediction of the expert agent. We evaluate the learning procedure in the learner agent's scene based on a probabilistic reward function. Our goal is to estimate policies whose predicted trajectories match the observed ones by minimizing the Kullback-Leibler divergence. The reward policies provide a probabilistic dynamic structure that minimizes abnormalities.
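A minimal sketch of the abstract's core objective, selecting the policy whose predicted trajectory distribution is closest to the observed expert distribution in Kullback-Leibler divergence. The discretization, function names, and toy distributions below are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def kl_divergence(p, q, eps=1e-12):
    """D_KL(p || q) for discrete distributions over the same support.
    A small epsilon guards against log(0) on zero-probability bins."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    return float(np.sum(p * np.log((p + eps) / (q + eps))))

def best_matching_policy(expert_dist, candidate_dists):
    """Return the index of the candidate trajectory distribution
    closest to the expert's in KL divergence."""
    scores = [kl_divergence(expert_dist, q) for q in candidate_dists]
    return int(np.argmin(scores))

# Toy trajectory distributions over 4 discretized states (assumed data).
expert = [0.7, 0.2, 0.05, 0.05]
candidates = [
    [0.25, 0.25, 0.25, 0.25],  # uniform policy, far from the expert
    [0.65, 0.25, 0.05, 0.05],  # policy close to the expert
]
print(best_matching_policy(expert, candidates))  # → 1
```

In the paper's setting the distributions would come from the coupled Dynamic Bayesian Network rather than fixed arrays; this sketch only shows the KL-matching criterion itself.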