Development of algorithms for augmenting and replacing conventional process control using reinforcement learning

IF 3.9 2区工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computers & Chemical Engineering Pub Date : 2024-11-01 Epub Date: 2024-08-08 DOI:10.1016/j.compchemeng.2024.108826

Daniel Beahr , Debangsu Bhattacharyya , Douglas A. Allan , Stephen E. Zitney

{"title":"Development of algorithms for augmenting and replacing conventional process control using reinforcement learning","authors":"Daniel Beahr , Debangsu Bhattacharyya , Douglas A. Allan , Stephen E. Zitney","doi":"10.1016/j.compchemeng.2024.108826","DOIUrl":null,"url":null,"abstract":"<div><p>This work seeks to allow for the online operation and training of model-free reinforcement learning (RL) agents but limit the risk to system equipment and personnel. The parallel implementation of RL alongside more conventional process control (CPC) allows for the RL algorithm to learn from CPC. The past performance of both methods are assessed on a continuous basis allowing for a transition from CPC to RL and, if needed, transitioning back to CPC from RL. This allows for the RL algorithm to slowly and safely assume control of the process without significant degradation in control performance. It is shown that the RL can derive a near optimal policy even when coupled with a suboptimal CPC. It is also demonstrated that the coupled RL-CPC algorithm learns at a faster rate than traditional RL methods of exploration while the algorithm’s performance does not deteriorate below CPC, even when exposed to an unknown operating condition.</p></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"190 ","pages":"Article 108826"},"PeriodicalIF":3.9000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135424002448","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/8 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

This work seeks to allow for the online operation and training of model-free reinforcement learning (RL) agents but limit the risk to system equipment and personnel. The parallel implementation of RL alongside more conventional process control (CPC) allows for the RL algorithm to learn from CPC. The past performance of both methods are assessed on a continuous basis allowing for a transition from CPC to RL and, if needed, transitioning back to CPC from RL. This allows for the RL algorithm to slowly and safely assume control of the process without significant degradation in control performance. It is shown that the RL can derive a near optimal policy even when coupled with a suboptimal CPC. It is also demonstrated that the coupled RL-CPC algorithm learns at a faster rate than traditional RL methods of exploration while the algorithm’s performance does not deteriorate below CPC, even when exposed to an unknown operating condition.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用强化学习开发增强和替代传统过程控制的算法

这项工作旨在实现无模型强化学习（RL）代理的在线操作和训练，同时限制对系统设备和人员造成的风险。RL 与更传统的过程控制（CPC）并行实施，允许 RL 算法从 CPC 中学习。对这两种方法过去的性能进行持续评估，以便从 CPC 过渡到 RL，并在必要时从 RL 过渡回 CPC。这样，RL 算法就能缓慢而安全地控制流程，而不会明显降低控制性能。研究表明，即使与次优的 CPC 相结合，RL 也能推导出接近最优的策略。研究还表明，与传统的 RL 探索方法相比，耦合 RL-CPC 算法的学习速度更快，同时，即使在未知的运行条件下，该算法的性能也不会低于 CPC。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Computers & Chemical Engineering 工程技术-工程：化工

CiteScore

8.70

自引率

14.00%

发文量

374

审稿时长

70 days

期刊介绍： Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.