Rongxin Cui, Chenguang Yang, Y. Li, Sanjay K. Sharma
{"title":"控制输入饱和的自主水下航行器神经网络强化学习控制","authors":"Rongxin Cui, Chenguang Yang, Y. Li, Sanjay K. Sharma","doi":"10.1109/CONTROL.2014.6915114","DOIUrl":null,"url":null,"abstract":"In this paper, the trajectory tracking control of the autonomous underwater vehicle (AUV) has been investigated in discrete time, for ease of digital computer calculation. A reinforcement learning scheme is employed using two neural networks, whereas the first one is to compensate for uncertainties for the controller, and the second one is to estimate the evaluation function, such that optimal tracking performance could be achieve for the AUV. Simulation results show that the errors convergence to a adjustable neighborhood around zero, and optimization has been achieved in the sense of reinforcement learning.","PeriodicalId":269044,"journal":{"name":"2014 UKACC International Conference on Control (CONTROL)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":"{\"title\":\"Neural network based reinforcement learning control of autonomous underwater vehicles with control input saturation\",\"authors\":\"Rongxin Cui, Chenguang Yang, Y. Li, Sanjay K. Sharma\",\"doi\":\"10.1109/CONTROL.2014.6915114\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, the trajectory tracking control of the autonomous underwater vehicle (AUV) has been investigated in discrete time, for ease of digital computer calculation. A reinforcement learning scheme is employed using two neural networks, whereas the first one is to compensate for uncertainties for the controller, and the second one is to estimate the evaluation function, such that optimal tracking performance could be achieve for the AUV. Simulation results show that the errors convergence to a adjustable neighborhood around zero, and optimization has been achieved in the sense of reinforcement learning.\",\"PeriodicalId\":269044,\"journal\":{\"name\":\"2014 UKACC International Conference on Control (CONTROL)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-07-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"18\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 UKACC International Conference on Control (CONTROL)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CONTROL.2014.6915114\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 UKACC International Conference on Control (CONTROL)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONTROL.2014.6915114","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Neural network based reinforcement learning control of autonomous underwater vehicles with control input saturation
In this paper, the trajectory tracking control of the autonomous underwater vehicle (AUV) has been investigated in discrete time, for ease of digital computer calculation. A reinforcement learning scheme is employed using two neural networks, whereas the first one is to compensate for uncertainties for the controller, and the second one is to estimate the evaluation function, such that optimal tracking performance could be achieve for the AUV. Simulation results show that the errors convergence to a adjustable neighborhood around zero, and optimization has been achieved in the sense of reinforcement learning.