Adaptive Reinforcement Learning Tracking Control for Second-Order Multi-Agent Systems
Weiwei Bai, Liang Cao, Guowei Dong, Hongyi Li
2019 IEEE 8th Data Driven Control and Learning Systems Conference (DDCLS), pp. 202-207, May 2019
DOI: 10.1109/DDCLS.2019.8908978
In this paper, the adaptive reinforcement learning tracking control problem is studied for second-order pure-feedback multi-agent systems (MASs). The pure-feedback MASs are transformed into strict-feedback form by using the mean value theorem. The reinforcement learning approach is applied to handle the unknown functions and the system control performance index. Moreover, error terms are introduced into the controller, which improves the robustness of the control scheme. The theoretical analysis indicates that all the signals and tracking errors in the closed-loop system are semi-globally uniformly ultimately bounded (SGUUB), and numerical simulations are conducted to verify the superiority of this scheme.
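The mean value theorem step mentioned in the abstract can be illustrated with a short worked derivation. This is only a sketch under assumptions: the abstract does not give the system model, so the generic second-order pure-feedback dynamics below, the symbols f_{i,1}, f_{i,2}, g_{i,1}, g_{i,2}, \mu_{i,1}, \mu_{i,2}, and the choice of expanding about zero are illustrative and not taken from the paper. It also assumes each f is continuously differentiable in its last argument.

```latex
% Assumed generic pure-feedback dynamics of agent i (not taken from the paper):
% the nonlinearities depend non-affinely on x_{i,2} and on the input u_i.
\begin{align}
\dot{x}_{i,1} &= f_{i,1}(x_{i,1}, x_{i,2}), \\
\dot{x}_{i,2} &= f_{i,2}(x_{i,1}, x_{i,2}, u_i), \qquad y_i = x_{i,1}.
\end{align}

% Mean value theorem applied to the last argument of each function,
% expanding about zero. The intermediate points are
% \mu_{i,1} = \lambda_{i,1} x_{i,2} and \mu_{i,2} = \lambda_{i,2} u_i
% for some \lambda_{i,1}, \lambda_{i,2} \in (0, 1).
\begin{align}
f_{i,1}(x_{i,1}, x_{i,2}) &= f_{i,1}(x_{i,1}, 0) + g_{i,1}\, x_{i,2},
& g_{i,1} &= \left. \frac{\partial f_{i,1}(x_{i,1}, s)}{\partial s} \right|_{s = \mu_{i,1}}, \\
f_{i,2}(x_{i,1}, x_{i,2}, u_i) &= f_{i,2}(x_{i,1}, x_{i,2}, 0) + g_{i,2}\, u_i,
& g_{i,2} &= \left. \frac{\partial f_{i,2}(x_{i,1}, x_{i,2}, s)}{\partial s} \right|_{s = \mu_{i,2}}.
\end{align}

% Substituting gives a form that is affine in x_{i,2} and u_i (strict-feedback):
\begin{align}
\dot{x}_{i,1} &= f_{i,1}(x_{i,1}, 0) + g_{i,1}\, x_{i,2}, \\
\dot{x}_{i,2} &= f_{i,2}(x_{i,1}, x_{i,2}, 0) + g_{i,2}\, u_i.
\end{align}
```

In this form x_{i,2} and u_i enter affinely, which is what allows a backstepping-style design. The gains g_{i,1} and g_{i,2} remain unknown and state-dependent; designs of this kind commonly assume they are bounded and keep a fixed sign, though the abstract does not state which assumptions the paper makes.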