Cristina Menendez-Romero, F. Winkler, C. Dornhege, Wolfram Burgard
{"title":"机动规划与学习:高速公路场景下高度自动驾驶车辆的车道选择方法。","authors":"Cristina Menendez-Romero, F. Winkler, C. Dornhege, Wolfram Burgard","doi":"10.1109/ITSC45102.2020.9294190","DOIUrl":null,"url":null,"abstract":"Highway scenarios are highly dynamic environments where several vehicles interact following their own goal, leading to different combinations of scenes that also change over time. Human drivers adapt their driving behavior integrating current information with their former experiences. In a similar way, an autonomous system performing any driving activity should be able to integrate information learned from former interactions. Reinforcement Learning has shown promising results, but it should only be applied to autonomous vehicles if the system is also able to fulfill safety and integrity requirements on a deterministic and reproducible way. This paper presents a planning system that is able to learn over time, always complying to the safety requirements. Our planner integrates several layers interacting with each other, combining the advantages of Reinforcement Learning based systems and reactive systems. We present a planner that ensures driving safety on short horizons and integrates previous experiences to optimize the expected reward. We evaluate our method in simulation comparing different learning techniques. Our results show that the planning system is able to adaptively integrate this experience outperforming rule-based strategies.","PeriodicalId":394538,"journal":{"name":"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Maneuver Planning and Learning: a Lane Selection Approach for Highly Automated Vehicles in Highway Scenarios.\",\"authors\":\"Cristina Menendez-Romero, F. Winkler, C. Dornhege, Wolfram Burgard\",\"doi\":\"10.1109/ITSC45102.2020.9294190\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Highway scenarios are highly dynamic environments where several vehicles interact following their own goal, leading to different combinations of scenes that also change over time. Human drivers adapt their driving behavior integrating current information with their former experiences. In a similar way, an autonomous system performing any driving activity should be able to integrate information learned from former interactions. Reinforcement Learning has shown promising results, but it should only be applied to autonomous vehicles if the system is also able to fulfill safety and integrity requirements on a deterministic and reproducible way. This paper presents a planning system that is able to learn over time, always complying to the safety requirements. Our planner integrates several layers interacting with each other, combining the advantages of Reinforcement Learning based systems and reactive systems. We present a planner that ensures driving safety on short horizons and integrates previous experiences to optimize the expected reward. We evaluate our method in simulation comparing different learning techniques. Our results show that the planning system is able to adaptively integrate this experience outperforming rule-based strategies.\",\"PeriodicalId\":394538,\"journal\":{\"name\":\"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITSC45102.2020.9294190\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITSC45102.2020.9294190","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Maneuver Planning and Learning: a Lane Selection Approach for Highly Automated Vehicles in Highway Scenarios.
Highway scenarios are highly dynamic environments where several vehicles interact following their own goal, leading to different combinations of scenes that also change over time. Human drivers adapt their driving behavior integrating current information with their former experiences. In a similar way, an autonomous system performing any driving activity should be able to integrate information learned from former interactions. Reinforcement Learning has shown promising results, but it should only be applied to autonomous vehicles if the system is also able to fulfill safety and integrity requirements on a deterministic and reproducible way. This paper presents a planning system that is able to learn over time, always complying to the safety requirements. Our planner integrates several layers interacting with each other, combining the advantages of Reinforcement Learning based systems and reactive systems. We present a planner that ensures driving safety on short horizons and integrates previous experiences to optimize the expected reward. We evaluate our method in simulation comparing different learning techniques. Our results show that the planning system is able to adaptively integrate this experience outperforming rule-based strategies.