{"title":"基于ADP算法的多输入增强系统数据驱动跟踪控制","authors":"Y. Lv, X. Ren, Shuangyi Hu, Linwei Li, J. Na","doi":"10.1109/DDCLS.2019.8909070","DOIUrl":null,"url":null,"abstract":"The data-driven optimal tracking controls (OTC) for the unknown multi-input system are proposed in this paper, and a novel tuning law is used to update NN weights in the learning scheme. First, the formula of the OTC for the multi-input NZS game is presented. A three-layer neural network (NN) data-driven model is introduced to approximate the unknown system, and the input dynamics are obtained. Then, to solve the OTC as a regulation optimal problem, an augmentation multi-input system is constructed with the tracking error and command trajectory. Moreover, we use a reinforcement learning based data-driven NN method to online learn the optimal value functions for each input, which is directly used to calculate the optimal tracking control associated with each performance index function. The convergence of the NN weights is proved. Finally, a simulation is presented to verify the feasibility of our algorithm in this paper.","PeriodicalId":6699,"journal":{"name":"2019 IEEE 8th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"54 1","pages":"534-538"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data-Driven Tracking Controls of Multi-input Augmented Systems Based on ADP Algorithm\",\"authors\":\"Y. Lv, X. Ren, Shuangyi Hu, Linwei Li, J. Na\",\"doi\":\"10.1109/DDCLS.2019.8909070\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The data-driven optimal tracking controls (OTC) for the unknown multi-input system are proposed in this paper, and a novel tuning law is used to update NN weights in the learning scheme. First, the formula of the OTC for the multi-input NZS game is presented. A three-layer neural network (NN) data-driven model is introduced to approximate the unknown system, and the input dynamics are obtained. Then, to solve the OTC as a regulation optimal problem, an augmentation multi-input system is constructed with the tracking error and command trajectory. Moreover, we use a reinforcement learning based data-driven NN method to online learn the optimal value functions for each input, which is directly used to calculate the optimal tracking control associated with each performance index function. The convergence of the NN weights is proved. Finally, a simulation is presented to verify the feasibility of our algorithm in this paper.\",\"PeriodicalId\":6699,\"journal\":{\"name\":\"2019 IEEE 8th Data Driven Control and Learning Systems Conference (DDCLS)\",\"volume\":\"54 1\",\"pages\":\"534-538\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 8th Data Driven Control and Learning Systems Conference (DDCLS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DDCLS.2019.8909070\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 8th Data Driven Control and Learning Systems Conference (DDCLS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DDCLS.2019.8909070","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Data-Driven Tracking Controls of Multi-input Augmented Systems Based on ADP Algorithm
The data-driven optimal tracking controls (OTC) for the unknown multi-input system are proposed in this paper, and a novel tuning law is used to update NN weights in the learning scheme. First, the formula of the OTC for the multi-input NZS game is presented. A three-layer neural network (NN) data-driven model is introduced to approximate the unknown system, and the input dynamics are obtained. Then, to solve the OTC as a regulation optimal problem, an augmentation multi-input system is constructed with the tracking error and command trajectory. Moreover, we use a reinforcement learning based data-driven NN method to online learn the optimal value functions for each input, which is directly used to calculate the optimal tracking control associated with each performance index function. The convergence of the NN weights is proved. Finally, a simulation is presented to verify the feasibility of our algorithm in this paper.