机动规划与学习:高速公路场景下高度自动驾驶车辆的车道选择方法。

2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC) Pub Date : 2020-09-20 DOI:10.1109/ITSC45102.2020.9294190

Cristina Menendez-Romero, F. Winkler, C. Dornhege, Wolfram Burgard

{"title":"机动规划与学习:高速公路场景下高度自动驾驶车辆的车道选择方法。","authors":"Cristina Menendez-Romero, F. Winkler, C. Dornhege, Wolfram Burgard","doi":"10.1109/ITSC45102.2020.9294190","DOIUrl":null,"url":null,"abstract":"Highway scenarios are highly dynamic environments where several vehicles interact following their own goal, leading to different combinations of scenes that also change over time. Human drivers adapt their driving behavior integrating current information with their former experiences. In a similar way, an autonomous system performing any driving activity should be able to integrate information learned from former interactions. Reinforcement Learning has shown promising results, but it should only be applied to autonomous vehicles if the system is also able to fulfill safety and integrity requirements on a deterministic and reproducible way. This paper presents a planning system that is able to learn over time, always complying to the safety requirements. Our planner integrates several layers interacting with each other, combining the advantages of Reinforcement Learning based systems and reactive systems. We present a planner that ensures driving safety on short horizons and integrates previous experiences to optimize the expected reward. We evaluate our method in simulation comparing different learning techniques. Our results show that the planning system is able to adaptively integrate this experience outperforming rule-based strategies.","PeriodicalId":394538,"journal":{"name":"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Maneuver Planning and Learning: a Lane Selection Approach for Highly Automated Vehicles in Highway Scenarios.\",\"authors\":\"Cristina Menendez-Romero, F. Winkler, C. Dornhege, Wolfram Burgard\",\"doi\":\"10.1109/ITSC45102.2020.9294190\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Highway scenarios are highly dynamic environments where several vehicles interact following their own goal, leading to different combinations of scenes that also change over time. Human drivers adapt their driving behavior integrating current information with their former experiences. In a similar way, an autonomous system performing any driving activity should be able to integrate information learned from former interactions. Reinforcement Learning has shown promising results, but it should only be applied to autonomous vehicles if the system is also able to fulfill safety and integrity requirements on a deterministic and reproducible way. This paper presents a planning system that is able to learn over time, always complying to the safety requirements. Our planner integrates several layers interacting with each other, combining the advantages of Reinforcement Learning based systems and reactive systems. We present a planner that ensures driving safety on short horizons and integrates previous experiences to optimize the expected reward. We evaluate our method in simulation comparing different learning techniques. Our results show that the planning system is able to adaptively integrate this experience outperforming rule-based strategies.\",\"PeriodicalId\":394538,\"journal\":{\"name\":\"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITSC45102.2020.9294190\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITSC45102.2020.9294190","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

高速公路场景是高度动态的环境，其中几辆车按照各自的目标相互作用，导致不同的场景组合也会随着时间的推移而变化。人类驾驶员将当前信息与以前的经验结合起来，调整自己的驾驶行为。类似地，执行任何驾驶活动的自动驾驶系统应该能够整合从以前的交互中学习到的信息。强化学习已经显示出有希望的结果，但只有在系统能够以确定性和可复制的方式满足安全性和完整性要求的情况下，它才能应用于自动驾驶汽车。本文提出了一个能够随着时间的推移不断学习，始终符合安全要求的规划系统。我们的规划器集成了几个相互作用的层，结合了基于强化学习的系统和反应系统的优点。我们提出了一个计划，确保短期内的驾驶安全，并整合以往的经验来优化预期奖励。我们在模拟中比较了不同的学习技术来评估我们的方法。我们的研究结果表明，规划系统能够自适应地整合这些经验，优于基于规则的策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Maneuver Planning and Learning: a Lane Selection Approach for Highly Automated Vehicles in Highway Scenarios.

Highway scenarios are highly dynamic environments where several vehicles interact following their own goal, leading to different combinations of scenes that also change over time. Human drivers adapt their driving behavior integrating current information with their former experiences. In a similar way, an autonomous system performing any driving activity should be able to integrate information learned from former interactions. Reinforcement Learning has shown promising results, but it should only be applied to autonomous vehicles if the system is also able to fulfill safety and integrity requirements on a deterministic and reproducible way. This paper presents a planning system that is able to learn over time, always complying to the safety requirements. Our planner integrates several layers interacting with each other, combining the advantages of Reinforcement Learning based systems and reactive systems. We present a planner that ensures driving safety on short horizons and integrates previous experiences to optimize the expected reward. We evaluate our method in simulation comparing different learning techniques. Our results show that the planning system is able to adaptively integrate this experience outperforming rule-based strategies.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)

自引率

0.00%

发文量