Learning drivers for TORCS through imitation using supervised methods

2009 IEEE Symposium on Computational Intelligence and Games Pub Date : 2009-09-07 DOI:10.1109/CIG.2009.5286480

L. Cardamone, D. Loiacono, P. Lanzi

引用次数: 60

Abstract

In this paper, we apply imitation learning to develop drivers for The Open Racing Car Simulator (TORCS). Our approach can be classified as a direct method in that it applies supervised learning to learn car racing behaviors from the data collected from other drivers. In the literature, this approach is known to have led to extremely poor performance with drivers capable of completing only very small parts of a track. In this paper we show that, by using high-level information about the track ahead of the car and by predicting high-level actions, it is possible to develop drivers with performances that in some cases are only 15% lower than the performance of the fastest driver available in TORCS. Our experimental results suggest that our approach can be effective in developing drivers with good performance in non-trivial tracks using a very limited amount of data and computational resources. We analyze the driving behavior of the controllers developed using our approach and identify perceptual aliasing as one of the factors which can limit performance of our approach.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

使用监督方法通过模仿学习TORCS驱动程序

在本文中，我们将模仿学习应用于开放式赛车模拟器(TORCS)的车手开发。我们的方法可以被归类为直接方法，因为它应用监督学习从其他车手收集的数据中学习赛车行为。在文献中，这种方法被认为会导致车手的表现非常差，他们只能完成赛道很小的一部分。在本文中，我们表明，通过使用有关汽车前方赛道的高级信息并通过预测高级动作，可以开发出在某些情况下性能仅比TORCS中可用的最快驾驶员性能低15%的驾驶员。我们的实验结果表明，我们的方法可以在使用非常有限的数据和计算资源的情况下，有效地开发出在非平凡赛道上表现良好的驱动程序。我们分析了使用我们的方法开发的控制器的驾驶行为，并将感知混叠识别为限制我们方法性能的因素之一。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2009 IEEE Symposium on Computational Intelligence and Games

自引率

0.00%

发文量