Juan Fang, Qiangang Zheng, Wei-ming Liu, Haibo Zhang
{"title":"Optimization control with multi-constraint of aeroengine acceleration process based on reinforcement learning","authors":"Juan Fang, Qiangang Zheng, Wei-ming Liu, Haibo Zhang","doi":"10.1117/12.2671152","DOIUrl":null,"url":null,"abstract":"With the development of Reinforcement Learning (RL), it becomes able to solve the continuous action space problem and shows strong ability in dealing with complex nonlinear control problem. Based on the Deep Deterministic Policy Gradient (DDPG) algorithm, a novel scheme of aeroengine acceleration controller is proposed in this paper. According to the characteristics of the engine acceleration stage, the reward function is constructed, and the state parameters are updated in the form of sliding window to reduce the sensitivity of the network to noise. DDPG adopts actor-critic framework, critic calculates value function by the deep neural network, actor outputs action command and forms a closed-loop control system with the engine. The method is verified by digital simulation at ground condition and the results demonstrate that compared with the traditional PID controller, the acceleration time of DDPG controller is reduced by 41.56%. Additionally, the network converges within 400 steps.","PeriodicalId":227528,"journal":{"name":"International Conference on Artificial Intelligence and Computer Engineering (ICAICE 2022)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Artificial Intelligence and Computer Engineering (ICAICE 2022)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2671152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the development of Reinforcement Learning (RL), it becomes able to solve the continuous action space problem and shows strong ability in dealing with complex nonlinear control problem. Based on the Deep Deterministic Policy Gradient (DDPG) algorithm, a novel scheme of aeroengine acceleration controller is proposed in this paper. According to the characteristics of the engine acceleration stage, the reward function is constructed, and the state parameters are updated in the form of sliding window to reduce the sensitivity of the network to noise. DDPG adopts actor-critic framework, critic calculates value function by the deep neural network, actor outputs action command and forms a closed-loop control system with the engine. The method is verified by digital simulation at ground condition and the results demonstrate that compared with the traditional PID controller, the acceleration time of DDPG controller is reduced by 41.56%. Additionally, the network converges within 400 steps.