Multi-Kernel Online Reinforcement Learning for Path Tracking Control of Intelligent Vehicles

IEEE Transactions on Systems Man and Cybernetics Part A-Systems and Humans Pub Date : 2021-11-01 DOI:10.1109/tsmc.2020.2966631

Jiahang Liu, Zhenhua Huang, Xin Xu, Xinglong Zhang, Shiliang Sun, Dazi Li

{"title":"Multi-Kernel Online Reinforcement Learning for Path Tracking Control of Intelligent Vehicles","authors":"Jiahang Liu, Zhenhua Huang, Xin Xu, Xinglong Zhang, Shiliang Sun, Dazi Li","doi":"10.1109/tsmc.2020.2966631","DOIUrl":null,"url":null,"abstract":"Path tracking control of intelligent vehicles has to deal with the difficulties of model uncertainties and nonlinearities. As a class of adaptive optimal control methods, reinforcement learning (RL) has received increasing attention in solving difficult control problems. However, feature representation and online learning ability are two major problems to be solved for learning control of uncertain dynamic systems. In this article, we propose a multi-kernel online RL approach for path tracking control of intelligent vehicles. In the proposed approach, a multiple kernel feature learning framework is designed for online learning control based on dual heuristic programming (DHP) and the new online learning control algorithm is called multi-kernel DHP (MKDHP). In MKDHP, instead of the expert knowledge for selecting and fine-tuning of a suitable kernel function, only a set of basic kernel functions is required to be predefined and the multi-kernel features can be learned for value function approximation in the critic. The simulation studies on path tracking control for intelligent vehicles have been conducted under $S$ -curve and urban road conditions. The results demonstrated that compared with other typical path tracking controllers for intelligent vehicles, such as the linear quadratic regulator (LQR), the pure pursuit controller and the ribbon-based controller, the proposed multi-kernel learning controller can achieve better performance in terms of tracking precision and smoothness.","PeriodicalId":55007,"journal":{"name":"IEEE Transactions on Systems Man and Cybernetics Part A-Systems and Humans","volume":"43 1","pages":"6962-6975"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Systems Man and Cybernetics Part A-Systems and Humans","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/tsmc.2020.2966631","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 18

Abstract

Path tracking control of intelligent vehicles has to deal with the difficulties of model uncertainties and nonlinearities. As a class of adaptive optimal control methods, reinforcement learning (RL) has received increasing attention in solving difficult control problems. However, feature representation and online learning ability are two major problems to be solved for learning control of uncertain dynamic systems. In this article, we propose a multi-kernel online RL approach for path tracking control of intelligent vehicles. In the proposed approach, a multiple kernel feature learning framework is designed for online learning control based on dual heuristic programming (DHP) and the new online learning control algorithm is called multi-kernel DHP (MKDHP). In MKDHP, instead of the expert knowledge for selecting and fine-tuning of a suitable kernel function, only a set of basic kernel functions is required to be predefined and the multi-kernel features can be learned for value function approximation in the critic. The simulation studies on path tracking control for intelligent vehicles have been conducted under $S$ -curve and urban road conditions. The results demonstrated that compared with other typical path tracking controllers for intelligent vehicles, such as the linear quadratic regulator (LQR), the pure pursuit controller and the ribbon-based controller, the proposed multi-kernel learning controller can achieve better performance in terms of tracking precision and smoothness.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于多核在线强化学习的智能车辆路径跟踪控制

智能车辆的路径跟踪控制必须解决模型不确定性和非线性等问题。强化学习(reinforcement learning, RL)作为一类自适应最优控制方法，在解决复杂控制问题方面受到越来越多的关注。然而，特征表示和在线学习能力是不确定动态系统学习控制需要解决的两个主要问题。本文提出了一种用于智能车辆路径跟踪控制的多核在线强化学习方法。该方法设计了一种基于双启发式规划(dual heuristic programming, DHP)的在线学习控制多核特征学习框架，并将其称为多核DHP (multi-kernel DHP, MKDHP)。在MKDHP中，不需要专家知识来选择和微调合适的核函数，只需要预先定义一组基本核函数，并可以学习多核特征来逼近临界中的值函数。对智能车辆在S曲线和城市道路条件下的路径跟踪控制进行了仿真研究。结果表明，与线性二次型调节器(LQR)、纯追踪控制器和基于带状的控制器等典型的智能车辆路径跟踪控制器相比，所提出的多核学习控制器在跟踪精度和平滑度方面都具有更好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Systems Man and Cybernetics Part A-Systems and Humans 工程技术-计算机：控制论

自引率

0.00%

发文量

审稿时长

6.0 months

期刊介绍： The scope of the IEEE Transactions on Systems, Man, and Cybernetics: Systems includes the fields of systems engineering. It includes issue formulation, analysis and modeling, decision making, and issue interpretation for any of the systems engineering lifecycle phases associated with the definition, development, and deployment of large systems. In addition, it includes systems management, systems engineering processes, and a variety of systems engineering methods such as optimization, modeling and simulation.