Model-Based Reinforcement Learning for Advanced Adaptive Cruise Control: A Hybrid Car Following Policy

M. U. Yavas, T. Kumbasar, N. K. Ure
{"title":"Model-Based Reinforcement Learning for Advanced Adaptive Cruise Control: A Hybrid Car Following Policy","authors":"M. U. Yavas, T. Kumbasar, N. K. Ure","doi":"10.1109/iv51971.2022.9827279","DOIUrl":null,"url":null,"abstract":"Adaptive cruise control (ACC) is one of the frontier functionality for highly automated vehicles and has been widely studied by both academia and industry. However, previous ACC approaches are reactive and rely on precise information about the current state of a single lead vehicle. With the advancement in the field of artificial intelligence, particularly in reinforcement learning, there is a big opportunity to enhance the current functionality. This paper presents an advanced ACC concept with unique environment representation and model-based reinforcement learning (MBRL) technique which enables predictive driving. By being predictive, we refer to the capability to handle multiple lead vehicles and have internal predictions about the traffic environment which avoids reactive short-term policies. Moreover, we propose a hybrid policy that combines classical car following policies with MBRL policy to avoid accidents by monitoring the internal model of the MBRL policy. Our extensive evaluation in a realistic simulation environment shows that the proposed approach is superior to the reference model-based and model-free algorithms. The MBRL agent requires only 150k samples (approximately 50 hours driving) to converge, which is x4 more sample efficient than model-free methods.","PeriodicalId":184622,"journal":{"name":"2022 IEEE Intelligent Vehicles Symposium (IV)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Intelligent Vehicles Symposium (IV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iv51971.2022.9827279","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

Adaptive cruise control (ACC) is one of the frontier functionality for highly automated vehicles and has been widely studied by both academia and industry. However, previous ACC approaches are reactive and rely on precise information about the current state of a single lead vehicle. With the advancement in the field of artificial intelligence, particularly in reinforcement learning, there is a big opportunity to enhance the current functionality. This paper presents an advanced ACC concept with unique environment representation and model-based reinforcement learning (MBRL) technique which enables predictive driving. By being predictive, we refer to the capability to handle multiple lead vehicles and have internal predictions about the traffic environment which avoids reactive short-term policies. Moreover, we propose a hybrid policy that combines classical car following policies with MBRL policy to avoid accidents by monitoring the internal model of the MBRL policy. Our extensive evaluation in a realistic simulation environment shows that the proposed approach is superior to the reference model-based and model-free algorithms. The MBRL agent requires only 150k samples (approximately 50 hours driving) to converge, which is x4 more sample efficient than model-free methods.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于模型强化学习的高级自适应巡航控制:混合动力汽车跟随策略
自适应巡航控制(ACC)是高度自动驾驶汽车的前沿功能之一,受到了学术界和工业界的广泛研究。然而,以前的ACC方法是被动的,并且依赖于单个导联车辆当前状态的精确信息。随着人工智能领域的进步,特别是在强化学习方面,有很大的机会来增强当前的功能。本文提出了一种先进的ACC概念,该概念具有独特的环境表示和基于模型的强化学习(MBRL)技术,可实现预测驾驶。通过预测性,我们指的是处理多个领先车辆的能力,以及对交通环境的内部预测,从而避免被动的短期政策。此外,我们还提出了一种将经典汽车跟随策略与MBRL策略相结合的混合策略,通过监控MBRL策略的内部模型来避免事故的发生。我们在真实仿真环境中的广泛评估表明,该方法优于参考的基于模型和无模型算法。MBRL代理只需要150k个样本(大约50小时的驾驶时间)就可以收敛,这比无模型方法的样本效率高4倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Dynamic Conflict Mitigation for Cooperative Driving Control of Intelligent Vehicles Detecting vehicles in the dark in urban environments - A human benchmark A Sequential Decision-theoretic Method for Detecting Mobile Robots Localization Failures Scene Spatio-Temporal Graph Convolutional Network for Pedestrian Intention Estimation What Can be Seen is What You Get: Structure Aware Point Cloud Augmentation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1