基于开环训练强化学习的Actor-Critic牵引控制

IF 0.8 Q3 ENGINEERING, MULTIDISCIPLINARY Modelling and Simulation in Engineering Pub Date : 2021-12-07 DOI:10.1155/2021/4641450

M. Drechsler, T. Fiorentin, H. Göllinger

{"title":"基于开环训练强化学习的Actor-Critic牵引控制","authors":"M. Drechsler, T. Fiorentin, H. Göllinger","doi":"10.1155/2021/4641450","DOIUrl":null,"url":null,"abstract":"The use of actor-critic algorithms can improve the controllers currently implemented in automotive applications. This method combines reinforcement learning (RL) and neural networks to achieve the possibility of controlling nonlinear systems with real-time capabilities. Actor-critic algorithms were already applied with success in different controllers including autonomous driving, antilock braking system (ABS), and electronic stability control (ESC). However, in the current researches, virtual environments are implemented for the training process instead of using real plants to obtain the datasets. This limitation is given by trial and error methods implemented for the training process, which generates considerable risks in case the controller directly acts on the real plant. In this way, the present research proposes and evaluates an open-loop training process, which permits the data acquisition without the control interaction and an open-loop training of the neural networks. The performance of the trained controllers is evaluated by a design of experiments (DOE) to understand how it is affected by the generated dataset. The results present a successful application of open-loop training architecture. The controller can maintain the slip ratio under adequate levels during maneuvers on different floors, including grounds that are not applied during the training process. The actor neural network is also able to identify the different floors and change the acceleration profile according to the characteristics of each ground.","PeriodicalId":45541,"journal":{"name":"Modelling and Simulation in Engineering","volume":"20 1","pages":""},"PeriodicalIF":0.8000,"publicationDate":"2021-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Actor-Critic Traction Control Based on Reinforcement Learning with Open-Loop Training\",\"authors\":\"M. Drechsler, T. Fiorentin, H. Göllinger\",\"doi\":\"10.1155/2021/4641450\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of actor-critic algorithms can improve the controllers currently implemented in automotive applications. This method combines reinforcement learning (RL) and neural networks to achieve the possibility of controlling nonlinear systems with real-time capabilities. Actor-critic algorithms were already applied with success in different controllers including autonomous driving, antilock braking system (ABS), and electronic stability control (ESC). However, in the current researches, virtual environments are implemented for the training process instead of using real plants to obtain the datasets. This limitation is given by trial and error methods implemented for the training process, which generates considerable risks in case the controller directly acts on the real plant. In this way, the present research proposes and evaluates an open-loop training process, which permits the data acquisition without the control interaction and an open-loop training of the neural networks. The performance of the trained controllers is evaluated by a design of experiments (DOE) to understand how it is affected by the generated dataset. The results present a successful application of open-loop training architecture. The controller can maintain the slip ratio under adequate levels during maneuvers on different floors, including grounds that are not applied during the training process. The actor neural network is also able to identify the different floors and change the acceleration profile according to the characteristics of each ground.\",\"PeriodicalId\":45541,\"journal\":{\"name\":\"Modelling and Simulation in Engineering\",\"volume\":\"20 1\",\"pages\":\"\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2021-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Modelling and Simulation in Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1155/2021/4641450\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Modelling and Simulation in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1155/2021/4641450","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 1

摘要

使用actor-critic算法可以改进目前在汽车应用中实现的控制器。该方法将强化学习(RL)与神经网络相结合，实现了具有实时性的非线性系统控制。Actor-critic算法已经成功应用于不同的控制器，包括自动驾驶、防抱死制动系统(ABS)和电子稳定控制(ESC)。然而，在目前的研究中，在训练过程中采用虚拟环境，而不是使用真实的植物来获取数据集。这种限制是由在训练过程中实施的试错方法给出的，如果控制器直接作用于实际设备，则会产生相当大的风险。通过这种方式，本研究提出并评估了一种开环训练过程，该过程允许在没有控制交互的情况下进行数据采集和神经网络的开环训练。通过实验设计(DOE)评估训练控制器的性能，以了解它如何受到生成数据集的影响。结果表明了开环训练体系结构的成功应用。控制器可以在不同楼层(包括训练过程中未使用的地面)的机动过程中保持适当的滑移率。行动者神经网络还能够识别不同的楼层，并根据每个楼层的特征改变加速度剖面。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Actor-Critic Traction Control Based on Reinforcement Learning with Open-Loop Training

The use of actor-critic algorithms can improve the controllers currently implemented in automotive applications. This method combines reinforcement learning (RL) and neural networks to achieve the possibility of controlling nonlinear systems with real-time capabilities. Actor-critic algorithms were already applied with success in different controllers including autonomous driving, antilock braking system (ABS), and electronic stability control (ESC). However, in the current researches, virtual environments are implemented for the training process instead of using real plants to obtain the datasets. This limitation is given by trial and error methods implemented for the training process, which generates considerable risks in case the controller directly acts on the real plant. In this way, the present research proposes and evaluates an open-loop training process, which permits the data acquisition without the control interaction and an open-loop training of the neural networks. The performance of the trained controllers is evaluated by a design of experiments (DOE) to understand how it is affected by the generated dataset. The results present a successful application of open-loop training architecture. The controller can maintain the slip ratio under adequate levels during maneuvers on different floors, including grounds that are not applied during the training process. The actor neural network is also able to identify the different floors and change the acceleration profile according to the characteristics of each ground.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Modelling and Simulation in Engineering ENGINEERING, MULTIDISCIPLINARY-

CiteScore

2.70

自引率

3.10%

发文量

审稿时长

18 weeks

期刊介绍： Modelling and Simulation in Engineering aims at providing a forum for the discussion of formalisms, methodologies and simulation tools that are intended to support the new, broader interpretation of Engineering. Competitive pressures of Global Economy have had a profound effect on the manufacturing in Europe, Japan and the USA with much of the production being outsourced. In this context the traditional interpretation of engineering profession linked to the actual manufacturing needs to be broadened to include the integration of outsourced components and the consideration of logistic, economical and human factors in the design of engineering products and services.