Gamal A. Alusta, H. Algdamsi, A. Amtereg, Ammar Agnia, Ahmed Alkouh, Bacem Kcharem
{"title":"集成自组织图和数据驱动方法预测油层体积因子:以北非原油为例","authors":"Gamal A. Alusta, H. Algdamsi, A. Amtereg, Ammar Agnia, Ahmed Alkouh, Bacem Kcharem","doi":"10.2118/205782-ms","DOIUrl":null,"url":null,"abstract":"\n In this paper we introduce for the first time an innovative approach for deriving Oil Formation Volume Factor (Bo) by mean of artificial intelligence method. In a new proposed application Self-Organizing Map (SOM) technology has been merged with statistical prediction methods integrating in a single step dimensionality reduction, extraction of input data structure pattern and prediction of formation volume factor Bo. The SOM neural network method applies an unsupervised training algorithm combined with back propagation neural network BPNN to subdivide the entire set of PVT input into different patterns identifying a set of data that have something in common and run individual MLFF ANN models for each specific PVT cluster and computing Bo. PVT data for more than two hundred oil samples (total of 804 data points) were collected from the north African region representing different basin and covering a greater geographical area were used in this study. To establish clear Bound on the accuracy of Bo determination several statistical parameters and terminology included in the presentation of the result from SOM-Neural Network solution. the main outcome is the reduction of error obtained by the new proposed competitive Learning Structure integration of SOM and MLFF ANN to less than 1 % compared to other method. however also investigated in this work five independents means of model driven and data driven approach for estimating Bo theses are 1) Optimal Transformations for Multiple Regression as introduced by (McCain, 1998) using alternating conditional expectations (ACE) for selecting multiple regression transformations 2), Genetic programing and heuristic modeling using Symbolic Regression (SR) and cross validation for model automatic tuning 3) Machine learning predictive model (Nearest Neighbor Regression, Kernel Ridge regression, Gaussian Process Regression (GPR), Random Forest Regression (RF), Support Vector Regression (SVM), Decision Tree Regression (DT), Gradient Boosting Machine Regression (GBM), Group modeling data handling (GMDH). Regression Model Accuracy Metrics (Average absolute relative error, R-square), diagnostic plot was used to address the more adequate techniques and model for predicting Bo.","PeriodicalId":10970,"journal":{"name":"Day 1 Tue, October 12, 2021","volume":"2 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integration of Self Organizing Map and Date Driven Methods to Predict Oil Formation Volume Factor: North Africa Crude Oil Examples\",\"authors\":\"Gamal A. Alusta, H. Algdamsi, A. Amtereg, Ammar Agnia, Ahmed Alkouh, Bacem Kcharem\",\"doi\":\"10.2118/205782-ms\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n In this paper we introduce for the first time an innovative approach for deriving Oil Formation Volume Factor (Bo) by mean of artificial intelligence method. In a new proposed application Self-Organizing Map (SOM) technology has been merged with statistical prediction methods integrating in a single step dimensionality reduction, extraction of input data structure pattern and prediction of formation volume factor Bo. The SOM neural network method applies an unsupervised training algorithm combined with back propagation neural network BPNN to subdivide the entire set of PVT input into different patterns identifying a set of data that have something in common and run individual MLFF ANN models for each specific PVT cluster and computing Bo. PVT data for more than two hundred oil samples (total of 804 data points) were collected from the north African region representing different basin and covering a greater geographical area were used in this study. To establish clear Bound on the accuracy of Bo determination several statistical parameters and terminology included in the presentation of the result from SOM-Neural Network solution. the main outcome is the reduction of error obtained by the new proposed competitive Learning Structure integration of SOM and MLFF ANN to less than 1 % compared to other method. however also investigated in this work five independents means of model driven and data driven approach for estimating Bo theses are 1) Optimal Transformations for Multiple Regression as introduced by (McCain, 1998) using alternating conditional expectations (ACE) for selecting multiple regression transformations 2), Genetic programing and heuristic modeling using Symbolic Regression (SR) and cross validation for model automatic tuning 3) Machine learning predictive model (Nearest Neighbor Regression, Kernel Ridge regression, Gaussian Process Regression (GPR), Random Forest Regression (RF), Support Vector Regression (SVM), Decision Tree Regression (DT), Gradient Boosting Machine Regression (GBM), Group modeling data handling (GMDH). Regression Model Accuracy Metrics (Average absolute relative error, R-square), diagnostic plot was used to address the more adequate techniques and model for predicting Bo.\",\"PeriodicalId\":10970,\"journal\":{\"name\":\"Day 1 Tue, October 12, 2021\",\"volume\":\"2 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Day 1 Tue, October 12, 2021\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2118/205782-ms\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 1 Tue, October 12, 2021","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2118/205782-ms","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Integration of Self Organizing Map and Date Driven Methods to Predict Oil Formation Volume Factor: North Africa Crude Oil Examples
In this paper we introduce for the first time an innovative approach for deriving Oil Formation Volume Factor (Bo) by mean of artificial intelligence method. In a new proposed application Self-Organizing Map (SOM) technology has been merged with statistical prediction methods integrating in a single step dimensionality reduction, extraction of input data structure pattern and prediction of formation volume factor Bo. The SOM neural network method applies an unsupervised training algorithm combined with back propagation neural network BPNN to subdivide the entire set of PVT input into different patterns identifying a set of data that have something in common and run individual MLFF ANN models for each specific PVT cluster and computing Bo. PVT data for more than two hundred oil samples (total of 804 data points) were collected from the north African region representing different basin and covering a greater geographical area were used in this study. To establish clear Bound on the accuracy of Bo determination several statistical parameters and terminology included in the presentation of the result from SOM-Neural Network solution. the main outcome is the reduction of error obtained by the new proposed competitive Learning Structure integration of SOM and MLFF ANN to less than 1 % compared to other method. however also investigated in this work five independents means of model driven and data driven approach for estimating Bo theses are 1) Optimal Transformations for Multiple Regression as introduced by (McCain, 1998) using alternating conditional expectations (ACE) for selecting multiple regression transformations 2), Genetic programing and heuristic modeling using Symbolic Regression (SR) and cross validation for model automatic tuning 3) Machine learning predictive model (Nearest Neighbor Regression, Kernel Ridge regression, Gaussian Process Regression (GPR), Random Forest Regression (RF), Support Vector Regression (SVM), Decision Tree Regression (DT), Gradient Boosting Machine Regression (GBM), Group modeling data handling (GMDH). Regression Model Accuracy Metrics (Average absolute relative error, R-square), diagnostic plot was used to address the more adequate techniques and model for predicting Bo.