{"title":"Pygame中基于深度强化学习的自动驾驶训练模型重用","authors":"Youtian Guo, Qi Gao, Feng Pan","doi":"10.23919/CCC50068.2020.9188547","DOIUrl":null,"url":null,"abstract":"Autonomous-Driving technology has begun to bring great convenience to daily trip, transportation, and surveying harsh environment. Considering that deep reinforcement learning has requirements for the convergence performance of the training results, and the actual training results sometimes cannot converge steadily or fail to reach the training goals, in this paper, the trained model reuse method was proposed, which can use the trained model generates Q(St, At) and can be used as a part of Deep Reinforcement Learning model, and this model was built based on the value function that could predict the Q value corresponding to the various actions performed in the environment state. In the Pygame platform, a simplified traffic simulation environment was set, it is observed that the Autonomous-Driving vehicle could run smoothly without collision in a fixed-length test simulation environment, and this trained model reuse method could help autonomous vehicle accelerate the learning process, obtain better simulation results during most of the training process, save simulation time and computing resources.","PeriodicalId":255872,"journal":{"name":"2020 39th Chinese Control Conference (CCC)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Trained Model Reuse of Autonomous-Driving in Pygame with Deep Reinforcement Learning\",\"authors\":\"Youtian Guo, Qi Gao, Feng Pan\",\"doi\":\"10.23919/CCC50068.2020.9188547\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Autonomous-Driving technology has begun to bring great convenience to daily trip, transportation, and surveying harsh environment. Considering that deep reinforcement learning has requirements for the convergence performance of the training results, and the actual training results sometimes cannot converge steadily or fail to reach the training goals, in this paper, the trained model reuse method was proposed, which can use the trained model generates Q(St, At) and can be used as a part of Deep Reinforcement Learning model, and this model was built based on the value function that could predict the Q value corresponding to the various actions performed in the environment state. In the Pygame platform, a simplified traffic simulation environment was set, it is observed that the Autonomous-Driving vehicle could run smoothly without collision in a fixed-length test simulation environment, and this trained model reuse method could help autonomous vehicle accelerate the learning process, obtain better simulation results during most of the training process, save simulation time and computing resources.\",\"PeriodicalId\":255872,\"journal\":{\"name\":\"2020 39th Chinese Control Conference (CCC)\",\"volume\":\"94 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 39th Chinese Control Conference (CCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/CCC50068.2020.9188547\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 39th Chinese Control Conference (CCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/CCC50068.2020.9188547","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Trained Model Reuse of Autonomous-Driving in Pygame with Deep Reinforcement Learning
Autonomous-Driving technology has begun to bring great convenience to daily trip, transportation, and surveying harsh environment. Considering that deep reinforcement learning has requirements for the convergence performance of the training results, and the actual training results sometimes cannot converge steadily or fail to reach the training goals, in this paper, the trained model reuse method was proposed, which can use the trained model generates Q(St, At) and can be used as a part of Deep Reinforcement Learning model, and this model was built based on the value function that could predict the Q value corresponding to the various actions performed in the environment state. In the Pygame platform, a simplified traffic simulation environment was set, it is observed that the Autonomous-Driving vehicle could run smoothly without collision in a fixed-length test simulation environment, and this trained model reuse method could help autonomous vehicle accelerate the learning process, obtain better simulation results during most of the training process, save simulation time and computing resources.