机动规划与学习:高速公路场景下高度自动驾驶车辆的车道选择方法。

Cristina Menendez-Romero, F. Winkler, C. Dornhege, Wolfram Burgard
{"title":"机动规划与学习:高速公路场景下高度自动驾驶车辆的车道选择方法。","authors":"Cristina Menendez-Romero, F. Winkler, C. Dornhege, Wolfram Burgard","doi":"10.1109/ITSC45102.2020.9294190","DOIUrl":null,"url":null,"abstract":"Highway scenarios are highly dynamic environments where several vehicles interact following their own goal, leading to different combinations of scenes that also change over time. Human drivers adapt their driving behavior integrating current information with their former experiences. In a similar way, an autonomous system performing any driving activity should be able to integrate information learned from former interactions. Reinforcement Learning has shown promising results, but it should only be applied to autonomous vehicles if the system is also able to fulfill safety and integrity requirements on a deterministic and reproducible way. This paper presents a planning system that is able to learn over time, always complying to the safety requirements. Our planner integrates several layers interacting with each other, combining the advantages of Reinforcement Learning based systems and reactive systems. We present a planner that ensures driving safety on short horizons and integrates previous experiences to optimize the expected reward. We evaluate our method in simulation comparing different learning techniques. Our results show that the planning system is able to adaptively integrate this experience outperforming rule-based strategies.","PeriodicalId":394538,"journal":{"name":"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Maneuver Planning and Learning: a Lane Selection Approach for Highly Automated Vehicles in Highway Scenarios.\",\"authors\":\"Cristina Menendez-Romero, F. Winkler, C. Dornhege, Wolfram Burgard\",\"doi\":\"10.1109/ITSC45102.2020.9294190\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Highway scenarios are highly dynamic environments where several vehicles interact following their own goal, leading to different combinations of scenes that also change over time. Human drivers adapt their driving behavior integrating current information with their former experiences. In a similar way, an autonomous system performing any driving activity should be able to integrate information learned from former interactions. Reinforcement Learning has shown promising results, but it should only be applied to autonomous vehicles if the system is also able to fulfill safety and integrity requirements on a deterministic and reproducible way. This paper presents a planning system that is able to learn over time, always complying to the safety requirements. Our planner integrates several layers interacting with each other, combining the advantages of Reinforcement Learning based systems and reactive systems. We present a planner that ensures driving safety on short horizons and integrates previous experiences to optimize the expected reward. We evaluate our method in simulation comparing different learning techniques. Our results show that the planning system is able to adaptively integrate this experience outperforming rule-based strategies.\",\"PeriodicalId\":394538,\"journal\":{\"name\":\"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITSC45102.2020.9294190\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITSC45102.2020.9294190","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

高速公路场景是高度动态的环境,其中几辆车按照各自的目标相互作用,导致不同的场景组合也会随着时间的推移而变化。人类驾驶员将当前信息与以前的经验结合起来,调整自己的驾驶行为。类似地,执行任何驾驶活动的自动驾驶系统应该能够整合从以前的交互中学习到的信息。强化学习已经显示出有希望的结果,但只有在系统能够以确定性和可复制的方式满足安全性和完整性要求的情况下,它才能应用于自动驾驶汽车。本文提出了一个能够随着时间的推移不断学习,始终符合安全要求的规划系统。我们的规划器集成了几个相互作用的层,结合了基于强化学习的系统和反应系统的优点。我们提出了一个计划,确保短期内的驾驶安全,并整合以往的经验来优化预期奖励。我们在模拟中比较了不同的学习技术来评估我们的方法。我们的研究结果表明,规划系统能够自适应地整合这些经验,优于基于规则的策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Maneuver Planning and Learning: a Lane Selection Approach for Highly Automated Vehicles in Highway Scenarios.
Highway scenarios are highly dynamic environments where several vehicles interact following their own goal, leading to different combinations of scenes that also change over time. Human drivers adapt their driving behavior integrating current information with their former experiences. In a similar way, an autonomous system performing any driving activity should be able to integrate information learned from former interactions. Reinforcement Learning has shown promising results, but it should only be applied to autonomous vehicles if the system is also able to fulfill safety and integrity requirements on a deterministic and reproducible way. This paper presents a planning system that is able to learn over time, always complying to the safety requirements. Our planner integrates several layers interacting with each other, combining the advantages of Reinforcement Learning based systems and reactive systems. We present a planner that ensures driving safety on short horizons and integrates previous experiences to optimize the expected reward. We evaluate our method in simulation comparing different learning techniques. Our results show that the planning system is able to adaptively integrate this experience outperforming rule-based strategies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
CR-TMS: Connected Vehicles enabled Road Traffic Congestion Mitigation System using Virtual Road Capacity Inflation A novel concept for validation of pre-crash perception sensor information using contact sensor Space-time Map based Path Planning Scheme in Large-scale Intelligent Warehouse System Weakly-supervised Road Condition Classification Using Automatically Generated Labels Studying the Impact of Public Transport on Disaster Evacuation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1