基于开环训练强化学习的Actor-Critic牵引控制

IF 0.8 Q3 ENGINEERING, MULTIDISCIPLINARY Modelling and Simulation in Engineering Pub Date : 2021-12-07 DOI:10.1155/2021/4641450
M. Drechsler, T. Fiorentin, H. Göllinger
{"title":"基于开环训练强化学习的Actor-Critic牵引控制","authors":"M. Drechsler, T. Fiorentin, H. Göllinger","doi":"10.1155/2021/4641450","DOIUrl":null,"url":null,"abstract":"The use of actor-critic algorithms can improve the controllers currently implemented in automotive applications. This method combines reinforcement learning (RL) and neural networks to achieve the possibility of controlling nonlinear systems with real-time capabilities. Actor-critic algorithms were already applied with success in different controllers including autonomous driving, antilock braking system (ABS), and electronic stability control (ESC). However, in the current researches, virtual environments are implemented for the training process instead of using real plants to obtain the datasets. This limitation is given by trial and error methods implemented for the training process, which generates considerable risks in case the controller directly acts on the real plant. In this way, the present research proposes and evaluates an open-loop training process, which permits the data acquisition without the control interaction and an open-loop training of the neural networks. The performance of the trained controllers is evaluated by a design of experiments (DOE) to understand how it is affected by the generated dataset. The results present a successful application of open-loop training architecture. The controller can maintain the slip ratio under adequate levels during maneuvers on different floors, including grounds that are not applied during the training process. The actor neural network is also able to identify the different floors and change the acceleration profile according to the characteristics of each ground.","PeriodicalId":45541,"journal":{"name":"Modelling and Simulation in Engineering","volume":"20 1","pages":""},"PeriodicalIF":0.8000,"publicationDate":"2021-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Actor-Critic Traction Control Based on Reinforcement Learning with Open-Loop Training\",\"authors\":\"M. Drechsler, T. Fiorentin, H. Göllinger\",\"doi\":\"10.1155/2021/4641450\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of actor-critic algorithms can improve the controllers currently implemented in automotive applications. This method combines reinforcement learning (RL) and neural networks to achieve the possibility of controlling nonlinear systems with real-time capabilities. Actor-critic algorithms were already applied with success in different controllers including autonomous driving, antilock braking system (ABS), and electronic stability control (ESC). However, in the current researches, virtual environments are implemented for the training process instead of using real plants to obtain the datasets. This limitation is given by trial and error methods implemented for the training process, which generates considerable risks in case the controller directly acts on the real plant. In this way, the present research proposes and evaluates an open-loop training process, which permits the data acquisition without the control interaction and an open-loop training of the neural networks. The performance of the trained controllers is evaluated by a design of experiments (DOE) to understand how it is affected by the generated dataset. The results present a successful application of open-loop training architecture. The controller can maintain the slip ratio under adequate levels during maneuvers on different floors, including grounds that are not applied during the training process. The actor neural network is also able to identify the different floors and change the acceleration profile according to the characteristics of each ground.\",\"PeriodicalId\":45541,\"journal\":{\"name\":\"Modelling and Simulation in Engineering\",\"volume\":\"20 1\",\"pages\":\"\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2021-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Modelling and Simulation in Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1155/2021/4641450\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Modelling and Simulation in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1155/2021/4641450","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 1

摘要

使用actor-critic算法可以改进目前在汽车应用中实现的控制器。该方法将强化学习(RL)与神经网络相结合,实现了具有实时性的非线性系统控制。Actor-critic算法已经成功应用于不同的控制器,包括自动驾驶、防抱死制动系统(ABS)和电子稳定控制(ESC)。然而,在目前的研究中,在训练过程中采用虚拟环境,而不是使用真实的植物来获取数据集。这种限制是由在训练过程中实施的试错方法给出的,如果控制器直接作用于实际设备,则会产生相当大的风险。通过这种方式,本研究提出并评估了一种开环训练过程,该过程允许在没有控制交互的情况下进行数据采集和神经网络的开环训练。通过实验设计(DOE)评估训练控制器的性能,以了解它如何受到生成数据集的影响。结果表明了开环训练体系结构的成功应用。控制器可以在不同楼层(包括训练过程中未使用的地面)的机动过程中保持适当的滑移率。行动者神经网络还能够识别不同的楼层,并根据每个楼层的特征改变加速度剖面。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Actor-Critic Traction Control Based on Reinforcement Learning with Open-Loop Training
The use of actor-critic algorithms can improve the controllers currently implemented in automotive applications. This method combines reinforcement learning (RL) and neural networks to achieve the possibility of controlling nonlinear systems with real-time capabilities. Actor-critic algorithms were already applied with success in different controllers including autonomous driving, antilock braking system (ABS), and electronic stability control (ESC). However, in the current researches, virtual environments are implemented for the training process instead of using real plants to obtain the datasets. This limitation is given by trial and error methods implemented for the training process, which generates considerable risks in case the controller directly acts on the real plant. In this way, the present research proposes and evaluates an open-loop training process, which permits the data acquisition without the control interaction and an open-loop training of the neural networks. The performance of the trained controllers is evaluated by a design of experiments (DOE) to understand how it is affected by the generated dataset. The results present a successful application of open-loop training architecture. The controller can maintain the slip ratio under adequate levels during maneuvers on different floors, including grounds that are not applied during the training process. The actor neural network is also able to identify the different floors and change the acceleration profile according to the characteristics of each ground.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Modelling and Simulation in Engineering
Modelling and Simulation in Engineering ENGINEERING, MULTIDISCIPLINARY-
CiteScore
2.70
自引率
3.10%
发文量
42
审稿时长
18 weeks
期刊介绍: Modelling and Simulation in Engineering aims at providing a forum for the discussion of formalisms, methodologies and simulation tools that are intended to support the new, broader interpretation of Engineering. Competitive pressures of Global Economy have had a profound effect on the manufacturing in Europe, Japan and the USA with much of the production being outsourced. In this context the traditional interpretation of engineering profession linked to the actual manufacturing needs to be broadened to include the integration of outsourced components and the consideration of logistic, economical and human factors in the design of engineering products and services.
期刊最新文献
Setting Safe Operation Conditions for Acetyl Chloride Hydrolysis through Dynamic Modelling and Bifurcation Analysis Mathematical-Based CFD Modelling and Simulation of Mushroom Drying in Tray Dryer Evaluation of Electric Vehicle-Dependent Strategy in Addis Ababa, Ethiopia Transport System Two-Dimensional FEM Approach of Metabolic Effect on Thermoregulation in Human Dermal Parts During Walking and Marathon Progressive Collapse Resistance Mechanism of RC Frame Structure Considering Reinforcement Corrosion
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1