Julles Mitoura dos Santos Junior, Ícaro Augusto Maccari Zelioli, A. Mariano
{"title":"机器学习与现象学模型混合建模用于预测超临界水中生物质气化制氢过程","authors":"Julles Mitoura dos Santos Junior, Ícaro Augusto Maccari Zelioli, A. Mariano","doi":"10.3390/eng4020086","DOIUrl":null,"url":null,"abstract":"Process monitoring and forecasting are essential to ensure the efficiency of industrial processes. Although it is possible to model processes using phenomenological approaches, these are not always easy to apply and generalize due to the complexity of the processes and the high number of unknown parameters. This work aims to present a hybrid modeling architecture that combines a phenomenological model with machine learning models. The proposal is to enable the use of simplified phenomenological models to explain the basic principles behind a phenomenon. Next, the data-oriented model corrects deviations from the simplified model predictions. The research hypothesis consists of showing the benefits of integrating prior knowledge of chemical engineering in simplifying data-based models, enhancing their generalization and improving their interpretability. The gasification process of lignin biomass with supercritical water was used as a case study for this methodology and the variable to be observed was the production of hydrogen. The real experimental data of this process were augmented using Gibbs energy minimization with the Peng–Robinson equation of state, thus generating a more voluminous database that was considered as real process data. The ideal gas model was used as a simplified model, producing significant deviations in predictions (relative deviations greater than 20%). Deviations (∆H2 = H2real−H2predict) were used as the target variable for the machine learning model. Linear regression models (LASSO and simple linear regression) were used to predict ∆H2 and this variable was added to the simplified forecast model. This consisted of the hybrid prediction of the resulting hydrogen formation (H2predict). Among the verified models, the simple linear regression adjusted better to the values of ∆H2 (R2 = 0.985) and MAE smaller than 0.1. Thus, the proposed hybrid architecture allowed for the prediction of the formation of hydrogen during the gasification process of lignin biomass, despite the thermodynamic limitations of the ideal gas model. Hybridization proved to be robust as a process monitoring tool, providing the abstraction of non-idealities of industrial processes through simple, data-oriented models, without losing predictive power. The objective of the work was fulfilled, presenting a new possibility for the monitoring of real industrial processes.","PeriodicalId":10630,"journal":{"name":"Comput. Chem. Eng.","volume":"4 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hybrid Modeling of Machine Learning and Phenomenological Model for Predicting the Biomass Gasification Process in Supercritical Water for Hydrogen Production\",\"authors\":\"Julles Mitoura dos Santos Junior, Ícaro Augusto Maccari Zelioli, A. Mariano\",\"doi\":\"10.3390/eng4020086\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Process monitoring and forecasting are essential to ensure the efficiency of industrial processes. Although it is possible to model processes using phenomenological approaches, these are not always easy to apply and generalize due to the complexity of the processes and the high number of unknown parameters. This work aims to present a hybrid modeling architecture that combines a phenomenological model with machine learning models. The proposal is to enable the use of simplified phenomenological models to explain the basic principles behind a phenomenon. Next, the data-oriented model corrects deviations from the simplified model predictions. The research hypothesis consists of showing the benefits of integrating prior knowledge of chemical engineering in simplifying data-based models, enhancing their generalization and improving their interpretability. The gasification process of lignin biomass with supercritical water was used as a case study for this methodology and the variable to be observed was the production of hydrogen. The real experimental data of this process were augmented using Gibbs energy minimization with the Peng–Robinson equation of state, thus generating a more voluminous database that was considered as real process data. The ideal gas model was used as a simplified model, producing significant deviations in predictions (relative deviations greater than 20%). Deviations (∆H2 = H2real−H2predict) were used as the target variable for the machine learning model. Linear regression models (LASSO and simple linear regression) were used to predict ∆H2 and this variable was added to the simplified forecast model. This consisted of the hybrid prediction of the resulting hydrogen formation (H2predict). Among the verified models, the simple linear regression adjusted better to the values of ∆H2 (R2 = 0.985) and MAE smaller than 0.1. Thus, the proposed hybrid architecture allowed for the prediction of the formation of hydrogen during the gasification process of lignin biomass, despite the thermodynamic limitations of the ideal gas model. Hybridization proved to be robust as a process monitoring tool, providing the abstraction of non-idealities of industrial processes through simple, data-oriented models, without losing predictive power. The objective of the work was fulfilled, presenting a new possibility for the monitoring of real industrial processes.\",\"PeriodicalId\":10630,\"journal\":{\"name\":\"Comput. Chem. Eng.\",\"volume\":\"4 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Comput. Chem. Eng.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/eng4020086\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Comput. Chem. Eng.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/eng4020086","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Hybrid Modeling of Machine Learning and Phenomenological Model for Predicting the Biomass Gasification Process in Supercritical Water for Hydrogen Production
Process monitoring and forecasting are essential to ensure the efficiency of industrial processes. Although it is possible to model processes using phenomenological approaches, these are not always easy to apply and generalize due to the complexity of the processes and the high number of unknown parameters. This work aims to present a hybrid modeling architecture that combines a phenomenological model with machine learning models. The proposal is to enable the use of simplified phenomenological models to explain the basic principles behind a phenomenon. Next, the data-oriented model corrects deviations from the simplified model predictions. The research hypothesis consists of showing the benefits of integrating prior knowledge of chemical engineering in simplifying data-based models, enhancing their generalization and improving their interpretability. The gasification process of lignin biomass with supercritical water was used as a case study for this methodology and the variable to be observed was the production of hydrogen. The real experimental data of this process were augmented using Gibbs energy minimization with the Peng–Robinson equation of state, thus generating a more voluminous database that was considered as real process data. The ideal gas model was used as a simplified model, producing significant deviations in predictions (relative deviations greater than 20%). Deviations (∆H2 = H2real−H2predict) were used as the target variable for the machine learning model. Linear regression models (LASSO and simple linear regression) were used to predict ∆H2 and this variable was added to the simplified forecast model. This consisted of the hybrid prediction of the resulting hydrogen formation (H2predict). Among the verified models, the simple linear regression adjusted better to the values of ∆H2 (R2 = 0.985) and MAE smaller than 0.1. Thus, the proposed hybrid architecture allowed for the prediction of the formation of hydrogen during the gasification process of lignin biomass, despite the thermodynamic limitations of the ideal gas model. Hybridization proved to be robust as a process monitoring tool, providing the abstraction of non-idealities of industrial processes through simple, data-oriented models, without losing predictive power. The objective of the work was fulfilled, presenting a new possibility for the monitoring of real industrial processes.