首页 > 最新文献

Computers & Chemical Engineering最新文献

英文 中文
Explainable Neural Network meets Graph Neural Network: Recent advances in process fault detection and diagnosis 可解释神经网络与图神经网络:过程故障检测与诊断的最新进展
IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-03-01 Epub Date: 2025-12-13 DOI: 10.1016/j.compchemeng.2025.109528
Yi Liu , Bingbing Shen , David Shan-Hill Wong , Mingwei Jia , Yuan Yao
Modern processes rely on thousands of sensors, yet operators still lack trustworthy tools for measurement-driven fault detection and diagnosis. Deep learning excels at capturing nonlinearity and dynamics, but its opaque decision-making limits use in safety-critical processes. This survey fills that gap by introducing a measurement-oriented taxonomy of explainable neural networks (XNNs) and explainable graph neural networks (XGNNs) from the view of instrumentation and measurement. XNNs are posited as variable-centered instruments that assign calibrated importance scores to individual sensors for different faults. XGNNs are framed as topology-centric instruments, allowing direct measurement of interaction strength and causal propagation among units and control loops. This review delivers step-by-step guidelines that convert historical data into explainable detectors, trackers, and diagnostic meters. A comparison highlights when an XNN suffices and when an XGNN is mandatory, giving instrumentation engineers a decision chart. Different from prior surveys, we show that graphs are the faithful way to integrate P&IDs, material balances, and causal knowledge into deep learning measurements: XGNN explanations map directly onto process diagrams, creating on-screen instruments that display both the alarm and its physical trail. Finally, it concludes by identifying open challenges and recommending future directions for industrial deployment.
现代流程依赖于数千个传感器,但操作人员仍然缺乏可靠的工具来进行测量驱动的故障检测和诊断。深度学习擅长捕捉非线性和动态,但其不透明的决策限制了在安全关键过程中的应用。本研究从仪器和测量的角度引入了面向测量的可解释神经网络(xnn)和可解释图神经网络(xgnn)分类法,填补了这一空白。xnn被设定为可变中心的工具,为不同故障的单个传感器分配校准的重要性分数。xgnn被构建为以拓扑为中心的工具,允许直接测量单元和控制回路之间的相互作用强度和因果传播。本综述提供了将历史数据转换为可解释的检测器、跟踪器和诊断仪表的逐步指南。一个比较突出了XGNN在什么情况下是足够的,什么情况下是必须的,给仪器工程师一个决策图。与之前的调查不同,我们表明图形是将P&; id、物料平衡和因果知识集成到深度学习测量中的忠实方法:XGNN解释直接映射到流程图上,创建显示警报及其物理轨迹的屏幕仪器。最后,它通过确定开放的挑战和建议未来的工业部署方向来结束。
{"title":"Explainable Neural Network meets Graph Neural Network: Recent advances in process fault detection and diagnosis","authors":"Yi Liu ,&nbsp;Bingbing Shen ,&nbsp;David Shan-Hill Wong ,&nbsp;Mingwei Jia ,&nbsp;Yuan Yao","doi":"10.1016/j.compchemeng.2025.109528","DOIUrl":"10.1016/j.compchemeng.2025.109528","url":null,"abstract":"<div><div>Modern processes rely on thousands of sensors, yet operators still lack trustworthy tools for measurement-driven fault detection and diagnosis. Deep learning excels at capturing nonlinearity and dynamics, but its opaque decision-making limits use in safety-critical processes. This survey fills that gap by introducing a measurement-oriented taxonomy of explainable neural networks (XNNs) and explainable graph neural networks (XGNNs) from the view of instrumentation and measurement. XNNs are posited as variable-centered instruments that assign calibrated importance scores to individual sensors for different faults. XGNNs are framed as topology-centric instruments, allowing direct measurement of interaction strength and causal propagation among units and control loops. This review delivers step-by-step guidelines that convert historical data into explainable detectors, trackers, and diagnostic meters. A comparison highlights when an XNN suffices and when an XGNN is mandatory, giving instrumentation engineers a decision chart. Different from prior surveys, we show that graphs are the faithful way to integrate P&amp;IDs, material balances, and causal knowledge into deep learning measurements: XGNN explanations map directly onto process diagrams, creating on-screen instruments that display both the alarm and its physical trail. Finally, it concludes by identifying open challenges and recommending future directions for industrial deployment.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"206 ","pages":"Article 109528"},"PeriodicalIF":3.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145797289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust control by applying reinforcement learning to adapt explicit model predictive control policies 通过应用强化学习来适应显式模型预测控制策略的鲁棒控制
IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-03-01 Epub Date: 2025-12-10 DOI: 10.1016/j.compchemeng.2025.109519
Edward Hendrik Bras, Tobias Muller Louw, Steven Martin Bradshaw
Reinforcement learning (RL) is a data-driven optimal control technique that has seen limited adoption in the process industries owing to high operational data requirements and the need to balance safe control with exploration. In contrast, Model Predictive Control (MPC) has been established as the benchmark method for optimal control in industrial applications but relies on a dynamic model of the controlled process. In this work, MPC is used as a starting point for online, continuing actor-critic RL applied to the simulated quadruple tank benchmark. Optimal control actions were precomputed as a function of the state space using the plant model, and a neural network was fitted to these data to generate an explicit MPC policy. Subsequently, this policy was adapted during closed-loop interaction of the RL agent with the (simulated) true plant, which exhibited different dynamics to the nominal plant model. The RL controller resolved the effects of plant-model mismatch on closed-loop control performance in the minimum-phase operating region by finding the optimal policy incrementally. In the non-minimum phase operating region, inverse response prevented the RL agent from operating effectively. From multivariable zero analysis, it was shown that process zeros very close to the origin or in the right half plane introduce a significant risk of closed-loop instability under RL control. This work’s findings show both the potential and limitations of RL by adapting precomputed optimal control policies and optimal cost functions (value functions) developed using non-linear MPC.
强化学习(RL)是一种数据驱动的最优控制技术,由于操作数据要求高,需要平衡安全控制与探索,因此在过程工业中应用有限。相比之下,模型预测控制(MPC)已被确立为工业应用中最优控制的基准方法,但它依赖于被控过程的动态模型。在这项工作中,MPC被用作在线的起点,持续的演员评论RL应用于模拟的四缸坦克基准。使用植物模型作为状态空间的函数预先计算最优控制动作,并对这些数据进行神经网络拟合以生成显式MPC策略。随后,该策略在RL代理与(模拟的)真实植物的闭环交互过程中被适应,其表现出与名义植物模型不同的动态。RL控制器通过逐步寻找最优策略的方法,解决了最小相位运行区域内厂模失配对闭环控制性能的影响。在非最小相位工作区域,逆响应影响了RL agent的有效工作。多变量零点分析表明,在RL控制下,非常接近原点或在右半平面的过程零点会引入明显的闭环不稳定风险。这项工作的发现显示了RL的潜力和局限性,它采用了预先计算的最优控制策略和使用非线性MPC开发的最优成本函数(价值函数)。
{"title":"Robust control by applying reinforcement learning to adapt explicit model predictive control policies","authors":"Edward Hendrik Bras,&nbsp;Tobias Muller Louw,&nbsp;Steven Martin Bradshaw","doi":"10.1016/j.compchemeng.2025.109519","DOIUrl":"10.1016/j.compchemeng.2025.109519","url":null,"abstract":"<div><div>Reinforcement learning (RL) is a data-driven optimal control technique that has seen limited adoption in the process industries owing to high operational data requirements and the need to balance safe control with exploration. In contrast, Model Predictive Control (MPC) has been established as the benchmark method for optimal control in industrial applications but relies on a dynamic model of the controlled process. In this work, MPC is used as a starting point for online, continuing actor-critic RL applied to the simulated quadruple tank benchmark. Optimal control actions were precomputed as a function of the state space using the plant model, and a neural network was fitted to these data to generate an explicit MPC policy. Subsequently, this policy was adapted during closed-loop interaction of the RL agent with the (simulated) true plant, which exhibited different dynamics to the nominal plant model. The RL controller resolved the effects of plant-model mismatch on closed-loop control performance in the minimum-phase operating region by finding the optimal policy incrementally. In the non-minimum phase operating region, inverse response prevented the RL agent from operating effectively. From multivariable zero analysis, it was shown that process zeros very close to the origin or in the right half plane introduce a significant risk of closed-loop instability under RL control. This work’s findings show both the potential and limitations of RL by adapting precomputed optimal control policies and optimal cost functions (value functions) developed using non-linear MPC.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"206 ","pages":"Article 109519"},"PeriodicalIF":3.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145797287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A risk-aware LNG terminal scheduling digital twin based on deep reinforcement learning 基于深度强化学习的风险感知LNG终端调度数字孪生
IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-03-01 Epub Date: 2025-12-10 DOI: 10.1016/j.compchemeng.2025.109520
Yee Hung Hong , Jinglin Wang , Chao Peng , Jinsong Zhao
At LNG receiving terminals, the daily send-out target fluctuates with seawater temperature, the accumulated runtime of parallel units, and planned start-up or shutdown sequences. Operators must still decide which pumps or vaporizers to activate under time pressure and incomplete information. In practice, these choices often rely on experience and short-term intuition rather than systematic evaluation, which can lead to uneven runtime distribution, maintenance bottlenecks, and unnecessary energy consumption. This study asks whether such human decision gaps can be reduced within the actual physical and organizational constraints of a working terminal. We develop RALT-DT, a risk-aware learning and control digital twin that integrates deep reinforcement learning with process-level physical models. The “risk-aware” feature is embodied in three aspects: (1) all policy decisions are constrained by explicit mass and heat-balance equations and safety interlocks, ensuring that the control actions remain within certified operating envelopes; (2) the learning reward explicitly penalizes excessive switching, uneven runtime dispersion, and deviations from preventive-maintenance requirements, treating long-term mechanical wear and operational stability as quantifiable risks; and (3) the system continuously monitors plant–model mismatch and adapts its confidence weighting, so that recommendations are moderated when uncertainty grows. To make the solution practical, the plant’s many operating devices are grouped into four core classes—low-pressure (LP) pumps, high-pressure (HP) pumps, open-rack vaporizers (ORV), and submerged-combustion vaporizers (SCV). A two-time-scale roster generator translates continuous policy outputs into binary start–stop schedules that meet maintenance lock-out and switch-inertia constraints. The resulting framework forms a closed learning loop that is both deterministic and interpretable. A one-month on-site shadow test was carried out, in which the algorithm’s decisions were compared with real operator schedules under live conditions. The digital twin achieved an average electrical energy reduction of 9.4 %, equivalent to a saving of about 754 MWh, without violating throughput or switching limits. When calibrated against plant telemetry and vendor performance curves, the model maintained consistent accuracy across varying load and temperature conditions. These results indicate that a physics-grounded and risk-aware learning framework can systematically enhance human decision quality in large-scale terminal operations. It converts intuition-driven scheduling into a reproducible and auditable policy that improves both electrical energy efficiency and asset reliability.
在LNG接收终端,每日发送目标随着海水温度、并联机组累计运行时间以及计划启动或关闭顺序而波动。操作人员仍然必须在时间压力和信息不完整的情况下决定启动哪些泵或汽化器。在实践中,这些选择通常依赖于经验和短期的直觉,而不是系统的评估,这可能导致不均匀的运行时分布、维护瓶颈和不必要的能源消耗。这项研究询问是否可以在工作终端的实际物理和组织约束下减少这种人类决策差距。我们开发了RALT-DT,一种风险感知学习和控制数字双胞胎,将深度强化学习与过程级物理模型集成在一起。“风险意识”特征体现在三个方面:(1)所有政策决策都受到明确的质量和热平衡方程以及安全联锁的约束,确保控制行动保持在认证的操作范围内;(2)学习奖励明确惩罚过度切换、运行时间不均匀和偏离预防性维护要求,将长期机械磨损和运行稳定性视为可量化的风险;(3)系统持续监测工厂模型不匹配并自适应其置信度权重,从而在不确定性增加时调节推荐。为了使该解决方案切实可行,该工厂的许多操作设备被分为四个核心类别:低压(LP)泵、高压(HP)泵、开式机架汽化器(ORV)和浸入式燃烧汽化器(SCV)。双时间尺度花名册生成器将连续的策略输出转换为满足维护锁定和开关惯性约束的二进制启停时间表。由此产生的框架形成了一个封闭的学习循环,它既确定又可解释。进行了为期一个月的现场阴影测试,将该算法的决策与现场条件下的实际操作计划进行了比较。在不违反吞吐量或开关限制的情况下,数字孪生实现了平均9.4%的电能减少,相当于节省了约754兆瓦时。当根据工厂遥测数据和供应商性能曲线进行校准时,该模型在不同的负载和温度条件下保持一致的准确性。这些结果表明,基于物理和风险意识的学习框架可以系统地提高大规模码头运营中的人类决策质量。它将直觉驱动的调度转换为可重复和可审计的策略,从而提高电力能源效率和资产可靠性。
{"title":"A risk-aware LNG terminal scheduling digital twin based on deep reinforcement learning","authors":"Yee Hung Hong ,&nbsp;Jinglin Wang ,&nbsp;Chao Peng ,&nbsp;Jinsong Zhao","doi":"10.1016/j.compchemeng.2025.109520","DOIUrl":"10.1016/j.compchemeng.2025.109520","url":null,"abstract":"<div><div>At LNG receiving terminals, the daily send-out target fluctuates with seawater temperature, the accumulated runtime of parallel units, and planned start-up or shutdown sequences. Operators must still decide which pumps or vaporizers to activate under time pressure and incomplete information. In practice, these choices often rely on experience and short-term intuition rather than systematic evaluation, which can lead to uneven runtime distribution, maintenance bottlenecks, and unnecessary energy consumption. This study asks whether such human decision gaps can be reduced within the actual physical and organizational constraints of a working terminal. We develop RALT-DT, a risk-aware learning and control digital twin that integrates deep reinforcement learning with process-level physical models. The “risk-aware” feature is embodied in three aspects: (1) all policy decisions are constrained by explicit mass and heat-balance equations and safety interlocks, ensuring that the control actions remain within certified operating envelopes; (2) the learning reward explicitly penalizes excessive switching, uneven runtime dispersion, and deviations from preventive-maintenance requirements, treating long-term mechanical wear and operational stability as quantifiable risks; and (3) the system continuously monitors plant–model mismatch and adapts its confidence weighting, so that recommendations are moderated when uncertainty grows. To make the solution practical, the plant’s many operating devices are grouped into four core classes—low-pressure (LP) pumps, high-pressure (HP) pumps, open-rack vaporizers (ORV), and submerged-combustion vaporizers (SCV). A two-time-scale roster generator translates continuous policy outputs into binary start–stop schedules that meet maintenance lock-out and switch-inertia constraints. The resulting framework forms a closed learning loop that is both deterministic and interpretable. A one-month on-site shadow test was carried out, in which the algorithm’s decisions were compared with real operator schedules under live conditions. The digital twin achieved an average electrical energy reduction of 9.4 %, equivalent to a saving of about 754 MWh, without violating throughput or switching limits. When calibrated against plant telemetry and vendor performance curves, the model maintained consistent accuracy across varying load and temperature conditions. These results indicate that a physics-grounded and risk-aware learning framework can systematically enhance human decision quality in large-scale terminal operations. It converts intuition-driven scheduling into a reproducible and auditable policy that improves both electrical energy efficiency and asset reliability.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"206 ","pages":"Article 109520"},"PeriodicalIF":3.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145797285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Petri net-based approach for modeling and analyzing the vulnerability of storage tanks under simultaneous fire and explosion hazards 基于Petri网的储罐火灾爆炸脆弱性建模分析方法
IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-03-01 Epub Date: 2025-11-24 DOI: 10.1016/j.compchemeng.2025.109501
Zahra Khodabakhsh , Khadijeh Mostafaee Dolatabad , Matin Aleahmad , Leila Omidi
Fire and explosion accidents pose significant risks in the chemical and process industries. This study develops an integrated modeling framework to assess the vulnerability of oil storage tanks exposed to simultaneous fire and explosion hazards. The methodology integrates fault tree analysis for identifying contributing factors to simultaneous fire and explosion hazards, a hybrid Fuzzy Cognitive Maps-Bayesian Networks (FCM-BN) approach to quantify probabilistic relationships, graph theory metrics for criticality analysis, and Petri net simulations to model domino effect propagation. The FCM-BN model identifies three key contributors to domino effects, including domino effects from adjacent equipment, unprotected electrical equipment, and hot work and maintenance activities near leaks or flammable vapors. Graph theory analysis identifies Tank No. 4 as the most critical unit based on centrality metrics ("betweenness" and "closeness"), while Petri net simulations show that adjacent tanks, particularly Tanks No. 8 and No. 5, are highly vulnerable to explosion impacts. The framework provides both theoretical insights into domino effect mechanisms and practical tools for risk management, enabling targeted safety interventions and optimal resource allocation in storage facilities. These findings establish a foundation for preventing domino accidents through evidence-based vulnerability assessment and dynamic propagation modeling.
在化学和加工工业中,火灾和爆炸事故构成了重大风险。本研究开发了一个综合建模框架来评估同时暴露于火灾和爆炸危险的储油罐的脆弱性。该方法集成了故障树分析,用于识别同时引起火灾和爆炸危险的因素,模糊认知图-贝叶斯网络(FCM-BN)混合方法,用于量化概率关系,图论度量,用于临界分析,Petri网模拟,用于模拟多米诺效应的传播。FCM-BN模型确定了多米诺骨牌效应的三个关键因素,包括相邻设备的多米诺骨牌效应,未受保护的电气设备,以及泄漏或易燃蒸气附近的热工和维护活动。图论分析认为,基于中心性指标(“中间性”和“接近性”),4号罐是最关键的单元,而Petri网模拟显示,相邻的罐,特别是8号罐和5号罐,非常容易受到爆炸的影响。该框架提供了对多米诺骨牌效应机制的理论见解和风险管理的实用工具,实现了有针对性的安全干预和存储设施的最佳资源分配。这些发现为通过基于证据的脆弱性评估和动态传播建模来预防多米诺骨牌事故奠定了基础。
{"title":"A Petri net-based approach for modeling and analyzing the vulnerability of storage tanks under simultaneous fire and explosion hazards","authors":"Zahra Khodabakhsh ,&nbsp;Khadijeh Mostafaee Dolatabad ,&nbsp;Matin Aleahmad ,&nbsp;Leila Omidi","doi":"10.1016/j.compchemeng.2025.109501","DOIUrl":"10.1016/j.compchemeng.2025.109501","url":null,"abstract":"<div><div>Fire and explosion accidents pose significant risks in the chemical and process industries. This study develops an integrated modeling framework to assess the vulnerability of oil storage tanks exposed to simultaneous fire and explosion hazards. The methodology integrates fault tree analysis for identifying contributing factors to simultaneous fire and explosion hazards, a hybrid Fuzzy Cognitive Maps-Bayesian Networks (FCM-BN) approach to quantify probabilistic relationships, graph theory metrics for criticality analysis, and Petri net simulations to model domino effect propagation. The FCM-BN model identifies three key contributors to domino effects, including domino effects from adjacent equipment, unprotected electrical equipment, and hot work and maintenance activities near leaks or flammable vapors. Graph theory analysis identifies Tank No. 4 as the most critical unit based on centrality metrics (\"betweenness\" and \"closeness\"), while Petri net simulations show that adjacent tanks, particularly Tanks No. 8 and No. 5, are highly vulnerable to explosion impacts. The framework provides both theoretical insights into domino effect mechanisms and practical tools for risk management, enabling targeted safety interventions and optimal resource allocation in storage facilities. These findings establish a foundation for preventing domino accidents through evidence-based vulnerability assessment and dynamic propagation modeling.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"206 ","pages":"Article 109501"},"PeriodicalIF":3.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145691664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncertainty-aware design optimization of hybrid membrane–cryogenic nitrogen rejection units (NRU): decoupling energy and product quality 复合式膜-低温脱氮装置的不确定性优化设计:解耦能量与产品质量
IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-03-01 Epub Date: 2025-12-19 DOI: 10.1016/j.compchemeng.2025.109533
Norfamila Che Mat , Edelin Emirra Empir , Kasih Syazwina Qistina Syaheezam , Muhammad Abdul Qyyum
Hybrid membrane–cryogenic nitrogen rejection units (NRUs) are increasingly proposed for upgrading sub-quality natural gas (NG), yet the mechanistic basis for their performance benefits remains insufficiently understood. This work develops a surrogate-assisted multi-objective optimization framework that integrates perturbation-expansion membrane modelling with Super Learner ensemble surrogates and analytical Bayesian uncertainty quantification to enable computationally efficient hybrid system design under data-sparse conditions. The optimization explores a 14-dimensional design space, yielding Pareto-optimal trade-offs with CH₄ recovery spanning 0.80–0.97 and specific power consumption (SPC) of 0.32–0.52 kWh/kg CH₄. Permutation-based sensitivity analysis identifies functional decoupling between system domains: membrane parameters predominantly govern SPC, while cryogenic column variables control CH₄ recovery, with minimal cross-influence. These results challenge the common assumption that membrane pre-concentration directly reduces energy demand. Instead, optimal hybrid configurations achieve >93% recovery at 0.49–0.51 kWh/kg CH₄—comparable to standalone cryogenic performance—demonstrating that hybrid value arises primarily from operational flexibility, enabling independent manipulation of product quality and energy consumption. Analytical Bayesian inference achieves 24–30% reduction in predictive uncertainty with cross-validation consistency <0.06. The framework produces performance–confidence trade-off maps across the design space, supporting systematic selection of operating strategies based on specified confidence thresholds. By reducing optimization time from 23 to 5 h while quantifying prediction reliability, the proposed approach offers an alternative to empirical tuning practices and provides clearer visibility into performance–flexibility interactions for hybrid NRU design.
混合膜-低温脱氮装置(nru)越来越多地被提出用于改善亚质量天然气(NG),但其性能优势的机制基础仍未得到充分了解。这项工作开发了一个代理辅助的多目标优化框架,该框架将微扰扩展膜建模与超级学习者集成代理和分析贝叶斯不确定性量化相结合,以实现数据稀疏条件下计算效率高的混合系统设计。优化探索了一个14维的设计空间,得到了帕累托最优权衡,硫酸铵回收率为0.80-0.97,比功耗(SPC)为0.32-0.52 kWh/kg氯化铵。基于置换的敏感性分析确定了系统域之间的功能解耦:膜参数主要控制SPC,而低温柱变量控制CH₄回收率,交叉影响最小。这些结果挑战了膜预浓缩直接降低能量需求的普遍假设。相反,最佳混合配置在0.49-0.51 kWh/kg CH₄下可实现>;93%的回收率,与单独的低温性能相当,这表明混合配置的价值主要来自操作灵活性,能够独立操纵产品质量和能耗。分析贝叶斯推理预测不确定性降低24-30%,交叉验证一致性<;0.06。该框架生成跨设计空间的性能-置信度权衡图,支持基于指定置信度阈值的操作策略的系统选择。通过将优化时间从23小时减少到5小时,同时量化预测可靠性,该方法为经验调优实践提供了替代方案,并为混合NRU设计提供了更清晰的性能-灵活性交互可见性。
{"title":"Uncertainty-aware design optimization of hybrid membrane–cryogenic nitrogen rejection units (NRU): decoupling energy and product quality","authors":"Norfamila Che Mat ,&nbsp;Edelin Emirra Empir ,&nbsp;Kasih Syazwina Qistina Syaheezam ,&nbsp;Muhammad Abdul Qyyum","doi":"10.1016/j.compchemeng.2025.109533","DOIUrl":"10.1016/j.compchemeng.2025.109533","url":null,"abstract":"<div><div>Hybrid membrane–cryogenic nitrogen rejection units (NRUs) are increasingly proposed for upgrading sub-quality natural gas (NG), yet the mechanistic basis for their performance benefits remains insufficiently understood. This work develops a surrogate-assisted multi-objective optimization framework that integrates perturbation-expansion membrane modelling with Super Learner ensemble surrogates and analytical Bayesian uncertainty quantification to enable computationally efficient hybrid system design under data-sparse conditions. The optimization explores a 14-dimensional design space, yielding Pareto-optimal trade-offs with CH₄ recovery spanning 0.80–0.97 and specific power consumption (SPC) of 0.32–0.52 kWh/kg CH₄. Permutation-based sensitivity analysis identifies functional decoupling between system domains: membrane parameters predominantly govern SPC, while cryogenic column variables control CH₄ recovery, with minimal cross-influence. These results challenge the common assumption that membrane pre-concentration directly reduces energy demand. Instead, optimal hybrid configurations achieve &gt;93% recovery at 0.49–0.51 kWh/kg CH₄—comparable to standalone cryogenic performance—demonstrating that hybrid value arises primarily from operational flexibility, enabling independent manipulation of product quality and energy consumption. Analytical Bayesian inference achieves 24–30% reduction in predictive uncertainty with cross-validation consistency &lt;0.06. The framework produces performance–confidence trade-off maps across the design space, supporting systematic selection of operating strategies based on specified confidence thresholds. By reducing optimization time from 23 to 5 h while quantifying prediction reliability, the proposed approach offers an alternative to empirical tuning practices and provides clearer visibility into performance–flexibility interactions for hybrid NRU design.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"206 ","pages":"Article 109533"},"PeriodicalIF":3.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145836368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reusable surrogate models for distillation columns 蒸馏塔的可重用代理模型
IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-03-01 Epub Date: 2025-12-12 DOI: 10.1016/j.compchemeng.2025.109523
Martin Bubel, Tobias Seidel, Michael Bortz
Surrogate modeling is a powerful methodology in chemical process engineering, frequently employed to accelerate optimization tasks. Despite their popularity, most surrogate models are trained for a narrow range of fixed chemical systems and operating conditions, which limits their reusability. This work introduces a paradigm shift towards reusable surrogates by developing a single model for distillation columns that generalizes across a vast design space. The key enabler is a novel ML-fueled modelfluid representation which allows for the generation of datasets of more than 1000000 samples. This allows the surrogate to generalize not only over column specifications but also over the entire chemical space of homogeneous ternary vapor–liquid mixtures. We validate the model’s accuracy and demonstrate its practical utility in a case study on entrainer distillation, where it successfully screens and ranks candidate entrainers, significantly reducing the computational effort compared to rigorous optimization.
代理建模是化工过程工程中一种强大的方法,经常用于加速优化任务。尽管它们很受欢迎,但大多数替代模型都是针对固定化学系统和操作条件的狭窄范围进行训练的,这限制了它们的可重用性。这项工作通过为蒸馏塔开发一个单一的模型,在广阔的设计空间中推广,引入了向可重用替代品的范式转变。关键的推动因素是一种新的ml驱动的模型流体表示,它允许生成超过1000000个样本的数据集。这使得代理不仅可以在色谱柱规格上进行推广,而且可以在均匀三元气液混合物的整个化学空间上进行推广。我们验证了模型的准确性,并在夹带剂蒸馏的案例研究中展示了它的实用性,在该案例中,它成功地筛选和排列了候选夹带剂,与严格的优化相比,大大减少了计算量。
{"title":"Reusable surrogate models for distillation columns","authors":"Martin Bubel,&nbsp;Tobias Seidel,&nbsp;Michael Bortz","doi":"10.1016/j.compchemeng.2025.109523","DOIUrl":"10.1016/j.compchemeng.2025.109523","url":null,"abstract":"<div><div>Surrogate modeling is a powerful methodology in chemical process engineering, frequently employed to accelerate optimization tasks. Despite their popularity, most surrogate models are trained for a narrow range of fixed chemical systems and operating conditions, which limits their reusability. This work introduces a paradigm shift towards reusable surrogates by developing a single model for distillation columns that generalizes across a vast design space. The key enabler is a novel ML-fueled modelfluid representation which allows for the generation of datasets of more than <span><math><mrow><mn>1000000</mn></mrow></math></span> samples. This allows the surrogate to generalize not only over column specifications but also over the entire chemical space of homogeneous ternary vapor–liquid mixtures. We validate the model’s accuracy and demonstrate its practical utility in a case study on entrainer distillation, where it successfully screens and ranks candidate entrainers, significantly reducing the computational effort compared to rigorous optimization.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"206 ","pages":"Article 109523"},"PeriodicalIF":3.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145797283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Safe deployment of offline reinforcement learning via input convex action correction 通过输入凸动作校正的离线强化学习的安全部署
IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-03-01 Epub Date: 2025-12-24 DOI: 10.1016/j.compchemeng.2025.109535
Alex Durkin , Jasper Stolte , Matthew Jones , Raghuraman Pitchumani , Bei Li , Christian Michler , Mehmet Mercangöz
Offline reinforcement learning (offline RL) offers a promising framework for developing control strategies in chemical process systems using historical data, without the risks or costs of online experimentation. This work investigates the application of offline RL to the safe and efficient control of an exothermic polymerisation continuous stirred-tank reactor. We introduce a Gymnasium-compatible simulation environment that captures the reactor’s nonlinear dynamics, including reaction kinetics, energy balances, and operational constraints. The environment supports three industrially relevant scenarios: startup, grade change down, and grade change up. It also includes reproducible offline datasets generated from proportional–integral controllers with randomised tunings, providing a benchmark for evaluating offline RL algorithms in realistic process control tasks.
We assess behaviour cloning and implicit Q-learning as baseline algorithms, highlighting the challenges offline agents face, including steady-state offsets and degraded performance near setpoints. To address these issues, we propose a novel deployment-time safety layer that performs gradient-based action correction using partially input convex neural networks (PICNNs) as learned cost models. The PICNN enables real-time, differentiable correction of policy actions by descending a convex, state-conditioned cost surface, without requiring retraining or environment interaction.
Experimental results show that offline RL, particularly when combined with convex action correction, can outperform traditional control approaches and maintain stability across all scenarios. These findings demonstrate the feasibility of integrating offline RL with interpretable and safety-aware corrections for high-stakes chemical process control, and lay the groundwork for more reliable data-driven automation in industrial systems.
离线强化学习(Offline reinforcement learning,离线RL)提供了一个很有前途的框架,可以利用历史数据开发化学过程系统的控制策略,而无需在线实验的风险或成本。本文研究了离线RL在放热聚合连续搅拌槽反应器安全有效控制中的应用。我们介绍了一个体育馆兼容的模拟环境,捕捉反应堆的非线性动力学,包括反应动力学,能量平衡和操作约束。该环境支持三种与工业相关的场景:启动、等级降级和等级升级。它还包括由随机调谐的比例积分控制器生成的可重复离线数据集,为在实际过程控制任务中评估离线RL算法提供基准。我们评估了行为克隆和隐式q -学习作为基线算法,突出了离线代理面临的挑战,包括稳态偏移和在设定值附近的性能下降。为了解决这些问题,我们提出了一种新的部署时间安全层,该层使用部分输入凸神经网络(PICNNs)作为学习成本模型执行基于梯度的动作校正。PICNN通过下降一个凸的、状态条件的成本面来实现实时的、可微分的策略行为校正,而不需要再训练或环境交互。实验结果表明,离线强化学习,特别是与凸动作校正相结合时,可以优于传统的控制方法,并在所有场景下保持稳定性。这些发现证明了将离线强化学习与可解释和安全意识校正集成在高风险化学过程控制中的可行性,并为工业系统中更可靠的数据驱动自动化奠定了基础。
{"title":"Safe deployment of offline reinforcement learning via input convex action correction","authors":"Alex Durkin ,&nbsp;Jasper Stolte ,&nbsp;Matthew Jones ,&nbsp;Raghuraman Pitchumani ,&nbsp;Bei Li ,&nbsp;Christian Michler ,&nbsp;Mehmet Mercangöz","doi":"10.1016/j.compchemeng.2025.109535","DOIUrl":"10.1016/j.compchemeng.2025.109535","url":null,"abstract":"<div><div>Offline reinforcement learning (offline RL) offers a promising framework for developing control strategies in chemical process systems using historical data, without the risks or costs of online experimentation. This work investigates the application of offline RL to the safe and efficient control of an exothermic polymerisation continuous stirred-tank reactor. We introduce a Gymnasium-compatible simulation environment that captures the reactor’s nonlinear dynamics, including reaction kinetics, energy balances, and operational constraints. The environment supports three industrially relevant scenarios: startup, grade change down, and grade change up. It also includes reproducible offline datasets generated from proportional–integral controllers with randomised tunings, providing a benchmark for evaluating offline RL algorithms in realistic process control tasks.</div><div>We assess behaviour cloning and implicit Q-learning as baseline algorithms, highlighting the challenges offline agents face, including steady-state offsets and degraded performance near setpoints. To address these issues, we propose a novel deployment-time safety layer that performs gradient-based action correction using partially input convex neural networks (PICNNs) as learned cost models. The PICNN enables real-time, differentiable correction of policy actions by descending a convex, state-conditioned cost surface, without requiring retraining or environment interaction.</div><div>Experimental results show that offline RL, particularly when combined with convex action correction, can outperform traditional control approaches and maintain stability across all scenarios. These findings demonstrate the feasibility of integrating offline RL with interpretable and safety-aware corrections for high-stakes chemical process control, and lay the groundwork for more reliable data-driven automation in industrial systems.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"206 ","pages":"Article 109535"},"PeriodicalIF":3.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145880235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiperiod strategic planning of CO2 capture, transport, utilization and storage with multimodal transportation 采用多式联运方式进行二氧化碳捕集、运输、利用和封存的多期战略规划
IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-03-01 Epub Date: 2025-12-11 DOI: 10.1016/j.compchemeng.2025.109525
Etienne Ayotte-Sauvé , Robert Yandon , Philippe Navarri , Robert Symonds , Robin Hughes , Marzieh Shokrollahi , Rebecca Modler
To reach decarbonization objectives across the globe, scenario modelling studies indicate that carbon capture, utilization and storage (CCUS) will be essential. Planning CCUS deployment at the regional and national levels requires balancing complex trade-offs – such as when to phase-in infrastructure, where and how much CO2 to capture, transport, utilize and store, as well as which policy measures to adopt.
We introduce a new multiperiod mixed-integer linear programming (MILP) model for the strategic planning of CCUS, with CO2 transportation via pipelines, ships, trains and trucks. This model features the selection of capture units and their rates, geographically explicit pipeline network design (including reuse), vehicle fleet estimations, auxiliary transshipment processes, buffer storage, inland reservoirs as well as injection via ships in addition to offshore pipelines. To handle large scale case studies, a simple heuristic algorithm is presented.
The features of the proposed approach are demonstrated on an illustrative Eastern Canada case study involving pipelines, trains and ships. The influence of the temporal resolution chosen by the modeller on the quality of cost estimates and on calculation times is quantified. Compared to leading commercial algorithms, for large model instances the proposed heuristic is shown to produce better results (e.g. 10 % lower cost) with much less computation time (minutes instead of days). This widens the scope of potential use cases, including broader geographical regions and larger sensitivity studies. The annual evolution of an Eastern Canada CCUS value chain is analyzed, paying special attention to the interplay between CO2 storage reservoir and transport constraints.
为了在全球范围内实现脱碳目标,情景模拟研究表明,碳捕获、利用和封存(CCUS)将是必不可少的。规划CCUS在区域和国家层面的部署需要平衡复杂的权衡——例如,何时分阶段使用基础设施,在何处以及捕获、运输、利用和储存多少二氧化碳,以及采取何种政策措施。本文提出了一种新的多周期混合整数线性规划(MILP)模型,用于二氧化碳通过管道、船舶、火车和卡车运输的CCUS战略规划。该模型的特点是捕获单元的选择及其速率,地理上明确的管网设计(包括重用),车队估计,辅助转运过程,缓冲储存,内陆水库以及通过船舶注入以及海上管道。为了处理大规模的案例研究,提出了一种简单的启发式算法。在加拿大东部一个涉及管道、火车和船舶的说明性案例研究中,论证了拟议方法的特点。建模者选择的时间分辨率对成本估算质量和计算次数的影响是量化的。与领先的商业算法相比,对于大型模型实例,所提出的启发式算法显示出更好的结果(例如,成本降低10%),计算时间(几分钟而不是几天)要少得多。这扩大了潜在用例的范围,包括更广泛的地理区域和更大的敏感性研究。分析了加拿大东部CCUS价值链的年度演变,特别关注二氧化碳储存库和运输约束之间的相互作用。
{"title":"Multiperiod strategic planning of CO2 capture, transport, utilization and storage with multimodal transportation","authors":"Etienne Ayotte-Sauvé ,&nbsp;Robert Yandon ,&nbsp;Philippe Navarri ,&nbsp;Robert Symonds ,&nbsp;Robin Hughes ,&nbsp;Marzieh Shokrollahi ,&nbsp;Rebecca Modler","doi":"10.1016/j.compchemeng.2025.109525","DOIUrl":"10.1016/j.compchemeng.2025.109525","url":null,"abstract":"<div><div>To reach decarbonization objectives across the globe, scenario modelling studies indicate that carbon capture, utilization and storage (CCUS) will be essential. Planning CCUS deployment at the regional and national levels requires balancing complex trade-offs – such as when to phase-in infrastructure, where and how much CO<sub>2</sub> to capture, transport, utilize and store, as well as which policy measures to adopt.</div><div>We introduce a new multiperiod mixed-integer linear programming (MILP) model for the strategic planning of CCUS, with CO<sub>2</sub> transportation via pipelines, ships, trains and trucks. This model features the selection of capture units and their rates, geographically explicit pipeline network design (including reuse), vehicle fleet estimations, auxiliary transshipment processes, buffer storage, inland reservoirs as well as injection via ships in addition to offshore pipelines. To handle large scale case studies, a simple heuristic algorithm is presented.</div><div>The features of the proposed approach are demonstrated on an illustrative Eastern Canada case study involving pipelines, trains and ships. The influence of the temporal resolution chosen by the modeller on the quality of cost estimates and on calculation times is quantified. Compared to leading commercial algorithms, for large model instances the proposed heuristic is shown to produce better results (e.g. 10 % lower cost) with much less computation time (minutes instead of days). This widens the scope of potential use cases, including broader geographical regions and larger sensitivity studies. The annual evolution of an Eastern Canada CCUS value chain is analyzed, paying special attention to the interplay between CO<sub>2</sub> storage reservoir and transport constraints.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"206 ","pages":"Article 109525"},"PeriodicalIF":3.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145797286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating the potential of plastic waste upcycling using thermochemical technologies: A case study in Spain 利用热化学技术评估塑料废物升级回收的潜力:以西班牙为例研究
IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-03-01 Epub Date: 2025-12-11 DOI: 10.1016/j.compchemeng.2025.109524
Roberto Cifuentes García , Evan D. Erickson , Mariano Martín , Víctor M. Zavala
A plastic waste upcycling value chain model has been applied to assess the potential of processing packaging waste in Spain using thermo-chemical technologies to produce low-density polyethylene (LDPE) and polypropylene (PP), which are highly valuable materials. The model projects an annual profit of 120.6 M$/yr, with a capital investment of 789.3 M$, generating 3285 jobs and contributing 65.5 M$/yr to Spain’s economy. The achieved circularity rate of the waste processing infrastructure exceeds 40 %, incorporating recycled HDPE and PET. Despite these advantages, regulatory gaps and market hesitancy toward recycled materials due to quality concerns hinder adoption. Additionally, economies of scale remain underutilized in Spain due to lower plastic waste collection levels compared to countries such as the United States. This network, while less profitable, is environmentally superior, yielding upcycled products with a Global Warming Potential 20–35 % lower than their virgin, fossil-fuel counterparts, confirming this as a viable and sustainable alternative.
西班牙利用热化学技术生产低密度聚乙烯(LDPE)和聚丙烯(PP)这两种高价值材料,应用塑料废物升级回收价值链模型来评估处理包装废物的潜力。该模型预计年利润为1.206亿美元/年,资本投资为7.893亿美元,创造3285个就业岗位,每年为西班牙经济贡献6550万美元。废物处理基础设施的实现循环率超过40%,其中包括回收的HDPE和PET。尽管有这些优势,由于质量问题,监管缺口和市场对回收材料的犹豫阻碍了采用。此外,由于与美国等国家相比,西班牙的塑料废物收集水平较低,因此规模经济仍未得到充分利用。该网络虽然利润较低,但对环境有利,生产的升级回收产品的全球变暖潜能值比原始化石燃料产品低20 - 35%,证实了这是一种可行的可持续替代方案。
{"title":"Evaluating the potential of plastic waste upcycling using thermochemical technologies: A case study in Spain","authors":"Roberto Cifuentes García ,&nbsp;Evan D. Erickson ,&nbsp;Mariano Martín ,&nbsp;Víctor M. Zavala","doi":"10.1016/j.compchemeng.2025.109524","DOIUrl":"10.1016/j.compchemeng.2025.109524","url":null,"abstract":"<div><div>A plastic waste upcycling value chain model has been applied to assess the potential of processing packaging waste in Spain using thermo-chemical technologies to produce low-density polyethylene (LDPE) and polypropylene (PP), which are highly valuable materials. The model projects an annual profit of 120.6 M$/yr, with a capital investment of 789.3 M$, generating 3285 jobs and contributing 65.5 M$/yr to Spain’s economy. The achieved circularity rate of the waste processing infrastructure exceeds 40 %, incorporating recycled HDPE and PET. Despite these advantages, regulatory gaps and market hesitancy toward recycled materials due to quality concerns hinder adoption. Additionally, economies of scale remain underutilized in Spain due to lower plastic waste collection levels compared to countries such as the United States. This network, while less profitable, is environmentally superior, yielding upcycled products with a Global Warming Potential 20–35 % lower than their virgin, fossil-fuel counterparts, confirming this as a viable and sustainable alternative.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"206 ","pages":"Article 109524"},"PeriodicalIF":3.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145797284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online redesign of dynamic experiments for high-throughput bioprocess development using deep reinforcement learning 利用深度强化学习在线重新设计高通量生物工艺开发的动态实验
IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-03-01 Epub Date: 2025-11-27 DOI: 10.1016/j.compchemeng.2025.109500
Martin F. Luna , Federico M. Mione , Ernesto C. Martinez , M. Nicolas Cruz Bournazou
For efficiency and reproducibility, modern biotech laboratories increasingly rely on robotic platforms to perform complex, dynamic experiments to generate informative data for bioprocess development and knowledge discovery. Furthering the goal of self-driving biolabs imposes the need of automating cognitive demanding tasks such as redesigning online an experiment to maximize information gain in the face of different sources of uncertainty including microorganism behavior, hardware failure, and noisy measurements. In this work, a reinforcement learning (RL) based formulation of the online experimental redesign problem for dynamic experiments is proposed. Simulation-based learning of a redesign policy for parallel cultivations in a high-throughput platform is discussed to provide implementation details regarding the RL agent design (perceptions and actions) and the reward function used. The simulated environment and a training workflow for sequential information control are integrated with the Proximal Policy Optimization (PPO) algorithm to learn how to modify ’on the fly’ an offline design based solely on previous observations and actions in a given experiment. Results obtained demonstrate the feasibility of using deep RL to guarantee the quality of the generated data and increase the level of automation that can be used on high-throughput platforms.
为了提高效率和重现性,现代生物技术实验室越来越依赖机器人平台来执行复杂的动态实验,为生物工艺开发和知识发现生成信息数据。为了进一步实现自动驾驶生物实验室的目标,需要自动化认知要求任务,例如在面对不同的不确定性来源(包括微生物行为、硬件故障和噪声测量)时,重新设计在线实验以最大化信息增益。在这项工作中,提出了一种基于强化学习(RL)的动态实验在线再设计问题的表述。本文讨论了在高吞吐量平台中并行培养的重新设计策略的基于模拟的学习,以提供有关RL代理设计(感知和行动)和所使用的奖励函数的实现细节。模拟环境和序列信息控制的训练工作流程与近端策略优化(PPO)算法相结合,以学习如何仅基于给定实验中的先前观察和行为修改“飞行”离线设计。结果表明,使用深度强化学习可以保证生成数据的质量,提高自动化水平,可用于高通量平台。
{"title":"Online redesign of dynamic experiments for high-throughput bioprocess development using deep reinforcement learning","authors":"Martin F. Luna ,&nbsp;Federico M. Mione ,&nbsp;Ernesto C. Martinez ,&nbsp;M. Nicolas Cruz Bournazou","doi":"10.1016/j.compchemeng.2025.109500","DOIUrl":"10.1016/j.compchemeng.2025.109500","url":null,"abstract":"<div><div>For efficiency and reproducibility, modern biotech laboratories increasingly rely on robotic platforms to perform complex, dynamic experiments to generate informative data for bioprocess development and knowledge discovery. Furthering the goal of self-driving biolabs imposes the need of automating cognitive demanding tasks such as redesigning online an experiment to maximize information gain in the face of different sources of uncertainty including microorganism behavior, hardware failure, and noisy measurements. In this work, a reinforcement learning (RL) based formulation of the online experimental redesign problem for dynamic experiments is proposed. Simulation-based learning of a redesign policy for parallel cultivations in a high-throughput platform is discussed to provide implementation details regarding the RL agent design (perceptions and actions) and the reward function used. The simulated environment and a training workflow for sequential information control are integrated with the Proximal Policy Optimization (PPO) algorithm to learn how to modify ’on the fly’ an offline design based solely on previous observations and actions in a given experiment. Results obtained demonstrate the feasibility of using deep RL to guarantee the quality of the generated data and increase the level of automation that can be used on high-throughput platforms.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"206 ","pages":"Article 109500"},"PeriodicalIF":3.9,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145691665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computers & Chemical Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1