Two-dimensional reinforcement learning model-free fault-tolerant control for batch processes against multi- faults

IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computers & Chemical Engineering Pub Date : 2024-09-26 DOI:10.1016/j.compchemeng.2024.108883
{"title":"Two-dimensional reinforcement learning model-free fault-tolerant control for batch processes against multi- faults","authors":"","doi":"10.1016/j.compchemeng.2024.108883","DOIUrl":null,"url":null,"abstract":"<div><div>Aiming at the characteristics of batch process changing along with time and batch directions, the existence of unmodeled dynamics, and the partial failure of actuators or/and sensors, we propose a novel 2D reinforcement learning (RL) fault tolerant control strategy without considering model parameters. Firstly, a two-Dimensional (2D) augmented state space model and 2D Q function-based fault tolerant control (FTC) framework is established. The 2D Bellman equation can be acquired by analyzing the relationship between the 2D value function and the 2D Q function. Based on the extended model and Q-learning concept of RL, a data-driven FTTC independent of model parameters is designed, and a 2D data-driven Q-learning algorithm is proposed. Finally, taking the pressure holding phase in the injection process as the experimental object, the control effect is compared with that of the traditional model-based FTC, and better tracking performance and unbiasedness to the probing noise can be proved.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":null,"pages":null},"PeriodicalIF":3.9000,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135424003016","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Aiming at the characteristics of batch process changing along with time and batch directions, the existence of unmodeled dynamics, and the partial failure of actuators or/and sensors, we propose a novel 2D reinforcement learning (RL) fault tolerant control strategy without considering model parameters. Firstly, a two-Dimensional (2D) augmented state space model and 2D Q function-based fault tolerant control (FTC) framework is established. The 2D Bellman equation can be acquired by analyzing the relationship between the 2D value function and the 2D Q function. Based on the extended model and Q-learning concept of RL, a data-driven FTTC independent of model parameters is designed, and a 2D data-driven Q-learning algorithm is proposed. Finally, taking the pressure holding phase in the injection process as the experimental object, the control effect is compared with that of the traditional model-based FTC, and better tracking performance and unbiasedness to the probing noise can be proved.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
二维强化学习模型无故障控制批处理过程,防止多重故障
针对批处理过程随时间和批处理方向变化、存在未建模动态以及执行器或/和传感器部分失效的特点,我们提出了一种无需考虑模型参数的新型二维强化学习(RL)容错控制策略。首先,我们建立了一个二维(2D)增强状态空间模型和基于二维 Q 函数的容错控制(FTC)框架。通过分析二维值函数和二维 Q 函数之间的关系,可以获得二维 Bellman 方程。基于 RL 的扩展模型和 Q-learning 概念,设计了独立于模型参数的数据驱动 FTTC,并提出了二维数据驱动 Q-learning 算法。最后,以注塑过程中的保压阶段为实验对象,对比了与传统基于模型的 FTC 的控制效果,证明了更好的跟踪性能和对探测噪声的无偏性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computers & Chemical Engineering
Computers & Chemical Engineering 工程技术-工程:化工
CiteScore
8.70
自引率
14.00%
发文量
374
审稿时长
70 days
期刊介绍: Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.
期刊最新文献
Accurate key parameters estimation of PEM fuel cells using self-adaptive bonobo optimizer Industrial Process Fault Detection Based on Siamese Recurrent Autoencoder Spacially affordable decarbonization of coal-fired power plants via membrane-based on-site CO2 absorption: A techno-economic analysis Strategic investments and portfolio management in interdependent low-carbon electricity and natural gas markets Two-dimensional reinforcement learning model-free fault-tolerant control for batch processes against multi- faults
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1