Brett Metcalfe , Juan Camilo Acosta-Pavas , Carlos Eduardo Robles-Rodriguez , George K. Georgakilas , Theodore Dalamagas , Cesar Arturo Aceves-Lara , Fayza Daboussi , Jasper J Koehorst , David Camilo Corrales
{"title":"Towards a machine learning operations (MLOps) soft sensor for real-time predictions in industrial-scale fed-batch fermentation","authors":"Brett Metcalfe , Juan Camilo Acosta-Pavas , Carlos Eduardo Robles-Rodriguez , George K. Georgakilas , Theodore Dalamagas , Cesar Arturo Aceves-Lara , Fayza Daboussi , Jasper J Koehorst , David Camilo Corrales","doi":"10.1016/j.compchemeng.2024.108991","DOIUrl":null,"url":null,"abstract":"<div><div>Real-time predictions in fermentation processes are crucial because they enable continuous monitoring and control of bioprocessing. However, the availability of online measurements is limited by the availability and feasibility of sensing technology. Soft sensors - or software sensors that convert available measurements into measurements of interest (product yield, quality, etc.) - have the potential to improve efficiency and product quality. Machine learning (ML) based soft sensors have gained increased popularity over the years since they can incorporate variables that are measured in real-time, and exploit the intricate patterns embedded in such voluminous datasets. However, ML-based soft sensor requires more than just a classical ML learner with an unseen test set to evaluate the quality prediction of the model. When a ML model is deployed in production, its performance can deteriorate rapidly leading to an unanticipated decline in the quality of the output and predictions. Here a proof concept of Machine Learning Operations (MLOps) to automate the end-to-end soft sensor lifecycle in industrial scale fed-batch fermentation, from development and deployment to maintenance and monitoring is proposed. Using the industrial-scale penicillin fermentation (<em>IndPenSim)</em> dataset that includes 100 fermentation batches, to build a soft sensor based on Long Short Term Memory (LSTM) for penicillin concentration prediction. The batches containing deviations in the processes (91–100) were used to assess concept drift of the LSTM soft sensor. The evaluation of concept drift is evidenced by the soft sensor performance falling below the set threshold based on the Population Stability Index (PSI), which automatically triggers an alert to run the retraining pipeline.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"194 ","pages":"Article 108991"},"PeriodicalIF":3.9000,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135424004095","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Real-time predictions in fermentation processes are crucial because they enable continuous monitoring and control of bioprocessing. However, the availability of online measurements is limited by the availability and feasibility of sensing technology. Soft sensors - or software sensors that convert available measurements into measurements of interest (product yield, quality, etc.) - have the potential to improve efficiency and product quality. Machine learning (ML) based soft sensors have gained increased popularity over the years since they can incorporate variables that are measured in real-time, and exploit the intricate patterns embedded in such voluminous datasets. However, ML-based soft sensor requires more than just a classical ML learner with an unseen test set to evaluate the quality prediction of the model. When a ML model is deployed in production, its performance can deteriorate rapidly leading to an unanticipated decline in the quality of the output and predictions. Here a proof concept of Machine Learning Operations (MLOps) to automate the end-to-end soft sensor lifecycle in industrial scale fed-batch fermentation, from development and deployment to maintenance and monitoring is proposed. Using the industrial-scale penicillin fermentation (IndPenSim) dataset that includes 100 fermentation batches, to build a soft sensor based on Long Short Term Memory (LSTM) for penicillin concentration prediction. The batches containing deviations in the processes (91–100) were used to assess concept drift of the LSTM soft sensor. The evaluation of concept drift is evidenced by the soft sensor performance falling below the set threshold based on the Population Stability Index (PSI), which automatically triggers an alert to run the retraining pipeline.
期刊介绍:
Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.