Comparison of Control Methods Based on Imitation Learning for Autonomous Driving

Yinfeng Gao, Yuqi Liu, Qichao Zhang, Yu Wang, Dongbin Zhao, Dawei Ding, Zhonghua Pang, Yueming Zhang
{"title":"Comparison of Control Methods Based on Imitation Learning for Autonomous Driving","authors":"Yinfeng Gao, Yuqi Liu, Qichao Zhang, Yu Wang, Dongbin Zhao, Dawei Ding, Zhonghua Pang, Yueming Zhang","doi":"10.1109/ICICIP47338.2019.9012185","DOIUrl":null,"url":null,"abstract":"Recently, some learning-based methods such as reinforcement learning and imitation learning have been used to address the control problem for autonomous driving. Note that reinforcement learning has strong reliance on the simulation environment and requires a handcraft design of the reward function. Considering different factors in autonomous driving, a general evaluation method is still being explored. The purpose of imitation learning is to learn the control policy through human demonstrations. It is meaningful to compare the control performances of current main imitation learning methods based on the provided dataset. In this paper, we compare three typical imitation learning algorithms: Behavior cloning, Dataset Aggregation (DAgger) and Information maximizing Generative Adversarial Imitation Learning (InfoGAIL) in the The Open Racing Car Simulator (TORCS) and Car Learning to Act (CARLA) simulators, respectively. The performance of algorithms is evaluated on lane-keeping task in racing and urban environment. The experiment results show DAgger performs best in simple lane keeping problem, and InfoGAIL has the unique advantage of distinguishing different driving styles from expert demonstrations.","PeriodicalId":431872,"journal":{"name":"2019 Tenth International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Tenth International Conference on Intelligent Control and Information Processing (ICICIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICIP47338.2019.9012185","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Recently, some learning-based methods such as reinforcement learning and imitation learning have been used to address the control problem for autonomous driving. Note that reinforcement learning has strong reliance on the simulation environment and requires a handcraft design of the reward function. Considering different factors in autonomous driving, a general evaluation method is still being explored. The purpose of imitation learning is to learn the control policy through human demonstrations. It is meaningful to compare the control performances of current main imitation learning methods based on the provided dataset. In this paper, we compare three typical imitation learning algorithms: Behavior cloning, Dataset Aggregation (DAgger) and Information maximizing Generative Adversarial Imitation Learning (InfoGAIL) in the The Open Racing Car Simulator (TORCS) and Car Learning to Act (CARLA) simulators, respectively. The performance of algorithms is evaluated on lane-keeping task in racing and urban environment. The experiment results show DAgger performs best in simple lane keeping problem, and InfoGAIL has the unique advantage of distinguishing different driving styles from expert demonstrations.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于模仿学习的自动驾驶控制方法比较
近年来,一些基于学习的方法如强化学习和模仿学习已被用于解决自动驾驶的控制问题。注意,强化学习对模拟环境有很强的依赖性,需要手工设计奖励函数。考虑到自动驾驶中的不同因素,一种通用的评估方法仍在探索中。模仿学习的目的是通过人的示范来学习控制策略。基于所提供的数据集,比较目前主要的模仿学习方法的控制性能是有意义的。在本文中,我们比较了三种典型的模仿学习算法:行为克隆,数据集聚合(DAgger)和信息最大化生成对抗模仿学习(InfoGAIL)分别在开放赛车模拟器(TORCS)和汽车学习行动(CARLA)模拟器。对算法在赛车和城市环境下的车道保持任务进行了性能评价。实验结果表明,DAgger在简单的车道保持问题上表现最好,而InfoGAIL在区分专家演示的不同驾驶风格方面具有独特优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Mobile Robot Autonomous Exploration and Navigation in Large-scale Indoor Environments Cross Spectral-Spatial Convolutional Network for Hyperspectral Image Classification Sparse Coding with Outliers A Novel Fuzzy Logic Control on the FVVT Lift of Internal Combustion Engine Adaptive Fuzzy Compensation Control of MIMO Stochastic Nonlinear Systems with Input Hysteresis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1