A framework for enhancing industrial soft sensor learning models

IF 3 Q2 ENGINEERING, CHEMICAL Digital Chemical Engineering Pub Date : 2023-09-01 DOI:10.1016/j.dche.2023.100112

João Guilherme Mattos , Patrick Nigri Happ , William Fernandes , Helio Côrtes Vieira Lopes , Simone D J Barbosa , Marcos Kalinowski , Luisa Silveira Rosa , Cassia Novello , Leonardo Dorigo Ribeiro , Patricia Rodrigues Ventura , Marcelo Cardoso Marques , Renato Neves Pitta , Valmir Jose Camolesi , Livia Pereira Lemos Costa , Bruno Itagyba Paravidino , Cristiane Salgado Pereira

{"title":"A framework for enhancing industrial soft sensor learning models","authors":"João Guilherme Mattos , Patrick Nigri Happ , William Fernandes , Helio Côrtes Vieira Lopes , Simone D J Barbosa , Marcos Kalinowski , Luisa Silveira Rosa , Cassia Novello , Leonardo Dorigo Ribeiro , Patricia Rodrigues Ventura , Marcelo Cardoso Marques , Renato Neves Pitta , Valmir Jose Camolesi , Livia Pereira Lemos Costa , Bruno Itagyba Paravidino , Cristiane Salgado Pereira","doi":"10.1016/j.dche.2023.100112","DOIUrl":null,"url":null,"abstract":"<div><p>Refinery industrial processes are very complex with nonlinear dynamics resulting from varying feedstock characteristics and also from changes in product prioritization. Along these processes, there are key properties of intermediate compounds that must be monitored and controlled since they directly affect the quality of the end products commercialized by these manufacturers. However, most of these properties can only be measured through time-consuming and expensive laboratory analysis, which is impossible to obtain in high frequencies, as required to properly monitor them. In this sense, developing soft sensors is the most common way to obtain high-frequency estimations for these measurements, helping advanced control systems to establish the correct setpoints for temperatures, pressures, and other sensors along the refining process, controlling the quality of end products. Since the amount of labeled data is scarce, most academic research has focused on employing semi- supervised learning strategies to develop machine learning (ML) models as soft sensors. Our research, on the other hand, goes in another direction. We aim to elaborate a framework that leverages the knowledge of domain experts and employs data augmentation techniques to build an enhanced fully labeled dataset that could be fed to any supervised ML algorithm to generate a quality soft sensor. We applied our framework together with Automated ML to train a model capable of predicting a specific key property associated with the production of Naphtha compounds in a refinery: the ASTM <span>95</span><svg><path></path></svg>% distillation temperature of the Heavy Naphtha. Although our framework is model agnostic, we opted by using Automated ML for the optimization strategy, since it applies a diverse set of models to the dataset, reducing the bias of utilizing a single optimization algorithm. We evaluated the proposed framework on a case study carried out in an industrial refinery in Brazil, where the previous model in production for estimating the ASTM <span>95</span><svg><path></path></svg>% distillation temperature of the Heavy Naphtha was based entirely on the physicochemical knowledge of the process. By adopting our framework with Automated ML, we were capable of improving the R<sup>2</sup> score by 120%. The resulting ML model is currently operating in real-time inside the refinery, leading to significant economic gains.</p></div>","PeriodicalId":72815,"journal":{"name":"Digital Chemical Engineering","volume":"8 ","pages":"Article 100112"},"PeriodicalIF":3.0000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Chemical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772508123000303","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}

引用次数: 1

Abstract

Refinery industrial processes are very complex with nonlinear dynamics resulting from varying feedstock characteristics and also from changes in product prioritization. Along these processes, there are key properties of intermediate compounds that must be monitored and controlled since they directly affect the quality of the end products commercialized by these manufacturers. However, most of these properties can only be measured through time-consuming and expensive laboratory analysis, which is impossible to obtain in high frequencies, as required to properly monitor them. In this sense, developing soft sensors is the most common way to obtain high-frequency estimations for these measurements, helping advanced control systems to establish the correct setpoints for temperatures, pressures, and other sensors along the refining process, controlling the quality of end products. Since the amount of labeled data is scarce, most academic research has focused on employing semi- supervised learning strategies to develop machine learning (ML) models as soft sensors. Our research, on the other hand, goes in another direction. We aim to elaborate a framework that leverages the knowledge of domain experts and employs data augmentation techniques to build an enhanced fully labeled dataset that could be fed to any supervised ML algorithm to generate a quality soft sensor. We applied our framework together with Automated ML to train a model capable of predicting a specific key property associated with the production of Naphtha compounds in a refinery: the ASTM 95% distillation temperature of the Heavy Naphtha. Although our framework is model agnostic, we opted by using Automated ML for the optimization strategy, since it applies a diverse set of models to the dataset, reducing the bias of utilizing a single optimization algorithm. We evaluated the proposed framework on a case study carried out in an industrial refinery in Brazil, where the previous model in production for estimating the ASTM 95% distillation temperature of the Heavy Naphtha was based entirely on the physicochemical knowledge of the process. By adopting our framework with Automated ML, we were capable of improving the R² score by 120%. The resulting ML model is currently operating in real-time inside the refinery, leading to significant economic gains.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

增强工业软传感器学习模型的框架

炼油工业过程是非常复杂的，由于原料特性的变化和产品优先级的变化而产生非线性动力学。在这些过程中，中间化合物的一些关键特性必须被监测和控制，因为它们直接影响到这些制造商商业化的最终产品的质量。然而，大多数这些特性只能通过耗时和昂贵的实验室分析来测量，这是不可能在高频率下获得的，因为需要适当地监测它们。从这个意义上说，开发软传感器是获得这些测量的高频估计的最常见方法，有助于先进的控制系统在精炼过程中为温度、压力和其他传感器建立正确的设定值，控制最终产品的质量。由于标记数据的数量很少，大多数学术研究都集中在使用半监督学习策略来开发机器学习(ML)模型作为软传感器。另一方面，我们的研究则走向了另一个方向。我们的目标是制定一个框架，利用领域专家的知识，并采用数据增强技术来构建一个增强的完全标记数据集，该数据集可以馈送到任何有监督的ML算法，以生成高质量的软传感器。我们将我们的框架与Automated ML一起应用于训练一个模型，该模型能够预测炼油厂中与石脑油化合物生产相关的特定关键属性:重石脑油的ASTM 95%蒸馏温度。虽然我们的框架是模型不可知的，但我们选择使用自动化机器学习进行优化策略，因为它将不同的模型集应用于数据集，减少了使用单一优化算法的偏差。我们在巴西的一家工业精炼厂进行的一个案例研究中评估了拟议的框架，在该案例中，以前用于估计重石脑油的ASTM 95%蒸馏温度的生产模型完全基于该过程的物理化学知识。通过采用我们的自动化机器学习框架，我们能够将R2分数提高120%。由此产生的机器学习模型目前在炼油厂内实时运行，带来了显著的经济效益。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Digital Chemical Engineering

CiteScore

3.10

自引率

0.00%

发文量