{"title":"基于模仿学习的自动驾驶控制方法比较","authors":"Yinfeng Gao, Yuqi Liu, Qichao Zhang, Yu Wang, Dongbin Zhao, Dawei Ding, Zhonghua Pang, Yueming Zhang","doi":"10.1109/ICICIP47338.2019.9012185","DOIUrl":null,"url":null,"abstract":"Recently, some learning-based methods such as reinforcement learning and imitation learning have been used to address the control problem for autonomous driving. Note that reinforcement learning has strong reliance on the simulation environment and requires a handcraft design of the reward function. Considering different factors in autonomous driving, a general evaluation method is still being explored. The purpose of imitation learning is to learn the control policy through human demonstrations. It is meaningful to compare the control performances of current main imitation learning methods based on the provided dataset. In this paper, we compare three typical imitation learning algorithms: Behavior cloning, Dataset Aggregation (DAgger) and Information maximizing Generative Adversarial Imitation Learning (InfoGAIL) in the The Open Racing Car Simulator (TORCS) and Car Learning to Act (CARLA) simulators, respectively. The performance of algorithms is evaluated on lane-keeping task in racing and urban environment. 
The experiment results show DAgger performs best in simple lane keeping problem, and InfoGAIL has the unique advantage of distinguishing different driving styles from expert demonstrations.","PeriodicalId":431872,"journal":{"name":"2019 Tenth International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Comparison of Control Methods Based on Imitation Learning for Autonomous Driving\",\"authors\":\"Yinfeng Gao, Yuqi Liu, Qichao Zhang, Yu Wang, Dongbin Zhao, Dawei Ding, Zhonghua Pang, Yueming Zhang\",\"doi\":\"10.1109/ICICIP47338.2019.9012185\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, some learning-based methods such as reinforcement learning and imitation learning have been used to address the control problem for autonomous driving. Note that reinforcement learning has strong reliance on the simulation environment and requires a handcraft design of the reward function. Considering different factors in autonomous driving, a general evaluation method is still being explored. The purpose of imitation learning is to learn the control policy through human demonstrations. It is meaningful to compare the control performances of current main imitation learning methods based on the provided dataset. In this paper, we compare three typical imitation learning algorithms: Behavior cloning, Dataset Aggregation (DAgger) and Information maximizing Generative Adversarial Imitation Learning (InfoGAIL) in the The Open Racing Car Simulator (TORCS) and Car Learning to Act (CARLA) simulators, respectively. The performance of algorithms is evaluated on lane-keeping task in racing and urban environment. 
The experiment results show DAgger performs best in simple lane keeping problem, and InfoGAIL has the unique advantage of distinguishing different driving styles from expert demonstrations.\",\"PeriodicalId\":431872,\"journal\":{\"name\":\"2019 Tenth International Conference on Intelligent Control and Information Processing (ICICIP)\",\"volume\":\"84 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Tenth International Conference on Intelligent Control and Information Processing (ICICIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICICIP47338.2019.9012185\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Tenth International Conference on Intelligent Control and Information Processing (ICICIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICIP47338.2019.9012185","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparison of Control Methods Based on Imitation Learning for Autonomous Driving
Recently, learning-based methods such as reinforcement learning and imitation learning have been applied to the control problem in autonomous driving. Reinforcement learning, however, relies heavily on the simulation environment and requires a handcrafted reward function. Given the many factors involved in autonomous driving, a general evaluation method is still being explored. Imitation learning instead aims to learn the control policy directly from human demonstrations. It is therefore meaningful to compare the control performance of the current main imitation learning methods on a common dataset. In this paper, we compare three typical imitation learning algorithms: behavior cloning, Dataset Aggregation (DAgger), and Information-maximizing Generative Adversarial Imitation Learning (InfoGAIL), in The Open Racing Car Simulator (TORCS) and the Car Learning to Act (CARLA) simulator, respectively. The algorithms are evaluated on lane-keeping tasks in racing and urban environments. The experimental results show that DAgger performs best on the simple lane-keeping problem, while InfoGAIL has the unique advantage of distinguishing different driving styles from expert demonstrations.
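The DAgger algorithm compared above can be sketched in a few lines. The following is a minimal illustrative loop on toy one-dimensional data, not the paper's TORCS/CARLA setup; the `expert_policy` and the linear least-squares "learner" are hypothetical stand-ins for the expert driver and the neural-network policy:

```python
import numpy as np

rng = np.random.default_rng(0)

def expert_policy(obs):
    # Hypothetical expert: steer proportionally against the lane offset.
    return -0.5 * obs

def fit_policy(observations, actions):
    # Stand-in for supervised learning: least-squares fit of a = w * obs.
    X = np.asarray(observations)
    y = np.asarray(actions)
    w = float(X @ y / (X @ X))
    return lambda obs: w * obs

def rollout(policy, n_steps=20):
    # Collect the states visited by the *learner's* own policy --
    # this is the key difference between DAgger and behavior cloning.
    obs, trajectory = 1.0, []
    for _ in range(n_steps):
        trajectory.append(obs)
        obs = obs + 0.1 * policy(obs) + 0.01 * rng.standard_normal()
    return trajectory

# Initial dataset from expert demonstrations (the behavior-cloning step).
dataset_obs = list(rng.standard_normal(20))
dataset_act = [expert_policy(o) for o in dataset_obs]

for _ in range(5):  # DAgger iterations
    policy = fit_policy(dataset_obs, dataset_act)
    # Roll out the learner, then relabel its visited states with expert actions
    # and aggregate them into the training set.
    for o in rollout(policy):
        dataset_obs.append(o)
        dataset_act.append(expert_policy(o))

policy = fit_policy(dataset_obs, dataset_act)
print(round(policy(1.0), 3))  # learned gain approaches the expert's -0.5
```

Behavior cloning corresponds to stopping after the initial fit; DAgger's aggregation of expert-relabeled learner states is what corrects the distribution shift between training and rollout.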