基于涡轮螺旋桨发动机动力管理系统和深度强化学习的单杆控制方法设计

IF 1 4区工程技术 Q3 ENGINEERING, AEROSPACE Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering Pub Date : 2024-02-29 DOI:10.1177/09544100241235824

Run-Min Ji, Xiang-Hua Huang, Xing-Long Zhang, Ling-Wei Li

{"title":"基于涡轮螺旋桨发动机动力管理系统和深度强化学习的单杆控制方法设计","authors":"Run-Min Ji, Xiang-Hua Huang, Xing-Long Zhang, Ling-Wei Li","doi":"10.1177/09544100241235824","DOIUrl":null,"url":null,"abstract":"This paper presents a single-lever control method based on Power Management System (PMS) and improved Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm for turboprop engines. In this approach, power level angle command, which is the single-lever command, is decoupled into controlled variable commands by PMS, and the controller based on improved TD3 algorithm can ensure that controlled variables track their commands rapidly and accurately. To achieve the optimal conversion relationship between different commands, an offline optimization process is used to design PMS. By optimization, specific fuel consumption and propeller efficiency are both improved after conversion. To deal with strong interactions between different control loops of a turboprop engine, TD3 algorithm which is a deep reinforcement learning algorithm is adopted. Two improvements which are the design method of observation state and prioritized experience replay are made to enhance the tracking accuracy. Simulation results show that improved TD3 algorithm can learn an optimal control policy to guarantee good control effect with fast response and small overshoot. The maximum settling time is less than 0.25s and the maximum overshoot is less than 0.1%. It also has a good robustness performance when the plant exists model uncertainties. The maximum fluctuations are less than 0.05%.","PeriodicalId":54566,"journal":{"name":"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering","volume":"45 1","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Single-lever control method design based on power management system and deep reinforcement learning for turboprop engines\",\"authors\":\"Run-Min Ji, Xiang-Hua Huang, Xing-Long Zhang, Ling-Wei Li\",\"doi\":\"10.1177/09544100241235824\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a single-lever control method based on Power Management System (PMS) and improved Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm for turboprop engines. In this approach, power level angle command, which is the single-lever command, is decoupled into controlled variable commands by PMS, and the controller based on improved TD3 algorithm can ensure that controlled variables track their commands rapidly and accurately. To achieve the optimal conversion relationship between different commands, an offline optimization process is used to design PMS. By optimization, specific fuel consumption and propeller efficiency are both improved after conversion. To deal with strong interactions between different control loops of a turboprop engine, TD3 algorithm which is a deep reinforcement learning algorithm is adopted. Two improvements which are the design method of observation state and prioritized experience replay are made to enhance the tracking accuracy. Simulation results show that improved TD3 algorithm can learn an optimal control policy to guarantee good control effect with fast response and small overshoot. The maximum settling time is less than 0.25s and the maximum overshoot is less than 0.1%. It also has a good robustness performance when the plant exists model uncertainties. The maximum fluctuations are less than 0.05%.\",\"PeriodicalId\":54566,\"journal\":{\"name\":\"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering\",\"volume\":\"45 1\",\"pages\":\"\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2024-02-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1177/09544100241235824\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, AEROSPACE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1177/09544100241235824","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}

引用次数: 0

摘要

本文提出了一种基于动力管理系统（PMS）和改进型双延迟深度确定性策略梯度（TD3）算法的涡轮螺旋桨发动机单杠杆控制方法。在该方法中，功率水平角指令（即单杠杆指令）被 PMS 解耦为受控变量指令，基于改进的 TD3 算法的控制器可确保受控变量快速、准确地跟踪其指令。为了实现不同指令之间的最佳转换关系，设计 PMS 时采用了离线优化流程。通过优化，转换后的比油耗和螺旋桨效率都得到了提高。为了处理涡轮螺旋桨发动机不同控制回路之间的强烈相互作用，采用了深度强化学习算法 TD3。通过观察状态设计方法和优先经验重放这两项改进来提高跟踪精度。仿真结果表明，改进后的 TD3 算法可以学习最优控制策略，保证良好的控制效果，响应速度快，过冲小。最大平稳时间小于 0.25s，最大过冲小于 0.1%。当工厂存在模型不确定性时，它也具有良好的鲁棒性能。最大波动小于 0.05%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Single-lever control method design based on power management system and deep reinforcement learning for turboprop engines

This paper presents a single-lever control method based on Power Management System (PMS) and improved Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm for turboprop engines. In this approach, power level angle command, which is the single-lever command, is decoupled into controlled variable commands by PMS, and the controller based on improved TD3 algorithm can ensure that controlled variables track their commands rapidly and accurately. To achieve the optimal conversion relationship between different commands, an offline optimization process is used to design PMS. By optimization, specific fuel consumption and propeller efficiency are both improved after conversion. To deal with strong interactions between different control loops of a turboprop engine, TD3 algorithm which is a deep reinforcement learning algorithm is adopted. Two improvements which are the design method of observation state and prioritized experience replay are made to enhance the tracking accuracy. Simulation results show that improved TD3 algorithm can learn an optimal control policy to guarantee good control effect with fast response and small overshoot. The maximum settling time is less than 0.25s and the maximum overshoot is less than 0.1%. It also has a good robustness performance when the plant exists model uncertainties. The maximum fluctuations are less than 0.05%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering 工程技术-工程：机械

CiteScore

2.40

自引率

18.20%

发文量

212

审稿时长

5.7 months

期刊介绍： The Journal of Aerospace Engineering is dedicated to the publication of high quality research in all branches of applied sciences and technology dealing with aircraft and spacecraft, and their support systems. "Our authorship is truly international and all efforts are made to ensure that each paper is presented in the best possible way and reaches a wide audience. "The Editorial Board is composed of recognized experts representing the technical communities of fifteen countries. The Board Members work in close cooperation with the editors, reviewers, and authors to achieve a consistent standard of well written and presented papers."Professor Rodrigo Martinez-Val, Universidad Politécnica de Madrid, Spain This journal is a member of the Committee on Publication Ethics (COPE).