{"title":"An insight into the microorganism growth prediction by means of machine learning approaches","authors":"Amin Bemani , Alireza Kazemi , Mohammad Ahmadi","doi":"10.1016/j.petrol.2022.111162","DOIUrl":null,"url":null,"abstract":"<div><p><span><span>Microbial enhanced oil recovery (MEOR) is a well-known oil recovery method that is greatly influenced by the growth and metabolism of the </span>microorganisms<span><span><span><span>. Given the complexities and uncertainties associated with identifying the growth mechanism of microorganism, developing an approach to estimate bacterial concentration versus different factors viz. </span>Salinity<span>, temperature and time is still deemed a challenge. Hence, in this study, seven different machine learning methods namely </span></span>Artificial Neural Network, </span>Support Vector Machine, Decision Tree, K-nearest Neighbors, Ensemble Learning, Random Forest and Adaptive Boosting are utilized to predict bacterial cell concentration. A databank including 110 data points of bacterial cell concentration entailing the incubation time, salinity, temperature and yeast extract has been collected and used for preparation of these models. Graphical and statistical comparisons are used to analyze the performance and accuracy of each integrated model. The retrieved results revealed that the trained ensemble learning model is the most accurate method in estimating the bacterial growth with </span></span>correlation coefficient<span> and mean squared error of 0.9163 and 0.0542 on the tested dataset, respectively. Moreover, the KNN model with correlation coefficient and mean squared error of 0.6111 and 0.1192, respectively, is the worst model among the seven estimators. This model has great accuracy in training phase while it is not accurate in validation and testing phase. Due to this fact, it can be concluded that KNN model suffers from overfitting problem. In addition, the impacts of incubation time, yeast extract, temperature and salinity on bacterial cell concentration are also ascertained using sensitivity analysis. It is discerned that the temperature and yeast extract are the most and least effective factors on growth of microorganism, respectively.</span></p></div>","PeriodicalId":16717,"journal":{"name":"Journal of Petroleum Science and Engineering","volume":"220 ","pages":"Article 111162"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Petroleum Science and Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0920410522010142","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}
引用次数: 0
Abstract
Microbial enhanced oil recovery (MEOR) is a well-known oil recovery method that is greatly influenced by the growth and metabolism of the microorganisms. Given the complexities and uncertainties associated with identifying the growth mechanism of microorganism, developing an approach to estimate bacterial concentration versus different factors viz. Salinity, temperature and time is still deemed a challenge. Hence, in this study, seven different machine learning methods namely Artificial Neural Network, Support Vector Machine, Decision Tree, K-nearest Neighbors, Ensemble Learning, Random Forest and Adaptive Boosting are utilized to predict bacterial cell concentration. A databank including 110 data points of bacterial cell concentration entailing the incubation time, salinity, temperature and yeast extract has been collected and used for preparation of these models. Graphical and statistical comparisons are used to analyze the performance and accuracy of each integrated model. The retrieved results revealed that the trained ensemble learning model is the most accurate method in estimating the bacterial growth with correlation coefficient and mean squared error of 0.9163 and 0.0542 on the tested dataset, respectively. Moreover, the KNN model with correlation coefficient and mean squared error of 0.6111 and 0.1192, respectively, is the worst model among the seven estimators. This model has great accuracy in training phase while it is not accurate in validation and testing phase. Due to this fact, it can be concluded that KNN model suffers from overfitting problem. In addition, the impacts of incubation time, yeast extract, temperature and salinity on bacterial cell concentration are also ascertained using sensitivity analysis. It is discerned that the temperature and yeast extract are the most and least effective factors on growth of microorganism, respectively.
期刊介绍:
The objective of the Journal of Petroleum Science and Engineering is to bridge the gap between the engineering, the geology and the science of petroleum and natural gas by publishing explicitly written articles intelligible to scientists and engineers working in any field of petroleum engineering, natural gas engineering and petroleum (natural gas) geology. An attempt is made in all issues to balance the subject matter and to appeal to a broad readership.
The Journal of Petroleum Science and Engineering covers the fields of petroleum (and natural gas) exploration, production and flow in its broadest possible sense. Topics include: origin and accumulation of petroleum and natural gas; petroleum geochemistry; reservoir engineering; reservoir simulation; rock mechanics; petrophysics; pore-level phenomena; well logging, testing and evaluation; mathematical modelling; enhanced oil and gas recovery; petroleum geology; compaction/diagenesis; petroleum economics; drilling and drilling fluids; thermodynamics and phase behavior; fluid mechanics; multi-phase flow in porous media; production engineering; formation evaluation; exploration methods; CO2 Sequestration in geological formations/sub-surface; management and development of unconventional resources such as heavy oil and bitumen, tight oil and liquid rich shales.