基于涡轮螺旋桨发动机动力管理系统和深度强化学习的单杆控制方法设计

Run-Min Ji, Xiang-Hua Huang, Xing-Long Zhang, Ling-Wei Li
{"title":"基于涡轮螺旋桨发动机动力管理系统和深度强化学习的单杆控制方法设计","authors":"Run-Min Ji, Xiang-Hua Huang, Xing-Long Zhang, Ling-Wei Li","doi":"10.1177/09544100241235824","DOIUrl":null,"url":null,"abstract":"This paper presents a single-lever control method based on Power Management System (PMS) and improved Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm for turboprop engines. In this approach, power level angle command, which is the single-lever command, is decoupled into controlled variable commands by PMS, and the controller based on improved TD3 algorithm can ensure that controlled variables track their commands rapidly and accurately. To achieve the optimal conversion relationship between different commands, an offline optimization process is used to design PMS. By optimization, specific fuel consumption and propeller efficiency are both improved after conversion. To deal with strong interactions between different control loops of a turboprop engine, TD3 algorithm which is a deep reinforcement learning algorithm is adopted. Two improvements which are the design method of observation state and prioritized experience replay are made to enhance the tracking accuracy. Simulation results show that improved TD3 algorithm can learn an optimal control policy to guarantee good control effect with fast response and small overshoot. The maximum settling time is less than 0.25s and the maximum overshoot is less than 0.1%. It also has a good robustness performance when the plant exists model uncertainties. The maximum fluctuations are less than 0.05%.","PeriodicalId":54566,"journal":{"name":"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering","volume":"45 1","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Single-lever control method design based on power management system and deep reinforcement learning for turboprop engines\",\"authors\":\"Run-Min Ji, Xiang-Hua Huang, Xing-Long Zhang, Ling-Wei Li\",\"doi\":\"10.1177/09544100241235824\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a single-lever control method based on Power Management System (PMS) and improved Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm for turboprop engines. In this approach, power level angle command, which is the single-lever command, is decoupled into controlled variable commands by PMS, and the controller based on improved TD3 algorithm can ensure that controlled variables track their commands rapidly and accurately. To achieve the optimal conversion relationship between different commands, an offline optimization process is used to design PMS. By optimization, specific fuel consumption and propeller efficiency are both improved after conversion. To deal with strong interactions between different control loops of a turboprop engine, TD3 algorithm which is a deep reinforcement learning algorithm is adopted. Two improvements which are the design method of observation state and prioritized experience replay are made to enhance the tracking accuracy. Simulation results show that improved TD3 algorithm can learn an optimal control policy to guarantee good control effect with fast response and small overshoot. The maximum settling time is less than 0.25s and the maximum overshoot is less than 0.1%. It also has a good robustness performance when the plant exists model uncertainties. The maximum fluctuations are less than 0.05%.\",\"PeriodicalId\":54566,\"journal\":{\"name\":\"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering\",\"volume\":\"45 1\",\"pages\":\"\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2024-02-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1177/09544100241235824\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, AEROSPACE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1177/09544100241235824","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种基于动力管理系统(PMS)和改进型双延迟深度确定性策略梯度(TD3)算法的涡轮螺旋桨发动机单杠杆控制方法。在该方法中,功率水平角指令(即单杠杆指令)被 PMS 解耦为受控变量指令,基于改进的 TD3 算法的控制器可确保受控变量快速、准确地跟踪其指令。为了实现不同指令之间的最佳转换关系,设计 PMS 时采用了离线优化流程。通过优化,转换后的比油耗和螺旋桨效率都得到了提高。为了处理涡轮螺旋桨发动机不同控制回路之间的强烈相互作用,采用了深度强化学习算法 TD3。通过观察状态设计方法和优先经验重放这两项改进来提高跟踪精度。仿真结果表明,改进后的 TD3 算法可以学习最优控制策略,保证良好的控制效果,响应速度快,过冲小。最大平稳时间小于 0.25s,最大过冲小于 0.1%。当工厂存在模型不确定性时,它也具有良好的鲁棒性能。最大波动小于 0.05%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Single-lever control method design based on power management system and deep reinforcement learning for turboprop engines
This paper presents a single-lever control method based on Power Management System (PMS) and improved Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm for turboprop engines. In this approach, power level angle command, which is the single-lever command, is decoupled into controlled variable commands by PMS, and the controller based on improved TD3 algorithm can ensure that controlled variables track their commands rapidly and accurately. To achieve the optimal conversion relationship between different commands, an offline optimization process is used to design PMS. By optimization, specific fuel consumption and propeller efficiency are both improved after conversion. To deal with strong interactions between different control loops of a turboprop engine, TD3 algorithm which is a deep reinforcement learning algorithm is adopted. Two improvements which are the design method of observation state and prioritized experience replay are made to enhance the tracking accuracy. Simulation results show that improved TD3 algorithm can learn an optimal control policy to guarantee good control effect with fast response and small overshoot. The maximum settling time is less than 0.25s and the maximum overshoot is less than 0.1%. It also has a good robustness performance when the plant exists model uncertainties. The maximum fluctuations are less than 0.05%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.40
自引率
18.20%
发文量
212
审稿时长
5.7 months
期刊介绍: The Journal of Aerospace Engineering is dedicated to the publication of high quality research in all branches of applied sciences and technology dealing with aircraft and spacecraft, and their support systems. "Our authorship is truly international and all efforts are made to ensure that each paper is presented in the best possible way and reaches a wide audience. "The Editorial Board is composed of recognized experts representing the technical communities of fifteen countries. The Board Members work in close cooperation with the editors, reviewers, and authors to achieve a consistent standard of well written and presented papers."Professor Rodrigo Martinez-Val, Universidad Politécnica de Madrid, Spain This journal is a member of the Committee on Publication Ethics (COPE).
期刊最新文献
Fatigue life analysis of a composite materials structure using allowable strain criteria Feasibility study of carbon-fiber reinforced polymer linerless pressure vessel tank Testability modeling of aeroengine and analysis optimization method based on improved correlation matrix Research on a backstepping flight control method improved by STFT in atmospheric disturbance applications Evaluating the effect of frigate hangar shape modifications on helicopter recovery using piloted flight simulation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1