{"title":"A reinforcement learning control scheme for nonlinear systems with multiple actions","authors":"C. Chen, C. Jou","doi":"10.1109/AFSS.1996.583552","DOIUrl":null,"url":null,"abstract":"In this paper an attempt is made to apply reinforcement learning schemes to the adaptive control of nonlinear systems with multiple continuous control actions. The control task is formulated into a sequential optimization problem. A learning algorithm is developed based on the concepts of dynamic programming and stochastic approximation and the techniques of random search and parameter estimation. The proposed algorithm is complete and general enough so that the controller can be constituted by various computing models, e.g., neural networks. The efficiency of the proposed algorithm is demonstrated by applying the methods to the nonlinear control problems with multiple control actions.","PeriodicalId":197019,"journal":{"name":"Soft Computing in Intelligent Systems and Information Processing. Proceedings of the 1996 Asian Fuzzy Systems Symposium","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soft Computing in Intelligent Systems and Information Processing. Proceedings of the 1996 Asian Fuzzy Systems Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AFSS.1996.583552","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In this paper an attempt is made to apply reinforcement learning schemes to the adaptive control of nonlinear systems with multiple continuous control actions. The control task is formulated into a sequential optimization problem. A learning algorithm is developed based on the concepts of dynamic programming and stochastic approximation and the techniques of random search and parameter estimation. The proposed algorithm is complete and general enough so that the controller can be constituted by various computing models, e.g., neural networks. The efficiency of the proposed algorithm is demonstrated by applying the methods to the nonlinear control problems with multiple control actions.