{"title":"利用机器学习算法从压力/体积/温度数据预测溶液气/油比","authors":"Asia Majid, Grant Charles Mwakipunda, Chaohua Guo","doi":"10.2118/217979-pa","DOIUrl":null,"url":null,"abstract":"Summary Many methods have been developed to determine the solution gas/oil ratio (Rs), starting with experiments, followed by empirical correlations establishments, and recently with machine learning applications receiving much interest due to their ability to produce precise results compared with empirical correlations. In this paper, the group method of data handling (GMDH) and the enhanced GMDH based on discrete differential evolution (GMDH-DDE) are used for the first time to estimate the Rs and to provide a correlation to the laboratory measured Rs from bubblepoint pressure (Pb), oil API gravity (API), gas-specific gravity (γg), and reservoir temperature (T) without crude oil properties. These two methods are compared with backpropagation neural networks (BPNN). The reason for using the hybrid GMDH (GMDH-DDE) is to overcome the drawbacks of the GMDH, such as the method used to calculate neuron weights (i.e., quadratic polynomial transfer function), which seems to have inaccuracies. Also, in selecting model inputs, the GMDH tends to choose the most appropriate inputs for the model; however, the selection criteria are not straightforward and may affect the final results. Furthermore, the GMDH has a multicollinearity problem, affecting model coefficient stability and overfitting problems, etc. A total of 420 data sets from the Mpyo oil field were used, with 70% used for training and 30% used for testing. According to the findings, the GMDH-DDE outperformed both the GMDH and BPNN. In comparison with the GMDH and BPNN, the GMDH-DDE has a higher correlation coefficient (R), lower root-mean-square error (RMSE), and lower mean absolute error (MAE). During training, R, RMSE, and MAE were 0.9849, 0.090, and 0.010, respectively, and during testing, R = 0.9603, RMSE = 0.290, and MAE = 0.017. The second-best technique (GMDH) produces R, RMSE, and MAE values of 0.9611, 0.122, and 0.032 in training, and R = 0.9438, RMSE = 0.349, and MAE = 0.055 in testing. Furthermore, the GMDH-DDE used less computational time (1.32 seconds) compared with the GMDH (2.01 seconds) and BPNN (4.96 seconds), proving that the GMDH-DDE has accurate and fast convergence compared with the GMDH and BPNN. These findings show that the GMDH-DDE and GMDH can be adopted as alternative methods for predicting the Rs.","PeriodicalId":22252,"journal":{"name":"SPE Journal","volume":"24 1","pages":"0"},"PeriodicalIF":3.2000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Solution Gas/Oil Ratio Prediction from Pressure/Volume/Temperature Data Using Machine Learning Algorithms\",\"authors\":\"Asia Majid, Grant Charles Mwakipunda, Chaohua Guo\",\"doi\":\"10.2118/217979-pa\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Summary Many methods have been developed to determine the solution gas/oil ratio (Rs), starting with experiments, followed by empirical correlations establishments, and recently with machine learning applications receiving much interest due to their ability to produce precise results compared with empirical correlations. In this paper, the group method of data handling (GMDH) and the enhanced GMDH based on discrete differential evolution (GMDH-DDE) are used for the first time to estimate the Rs and to provide a correlation to the laboratory measured Rs from bubblepoint pressure (Pb), oil API gravity (API), gas-specific gravity (γg), and reservoir temperature (T) without crude oil properties. These two methods are compared with backpropagation neural networks (BPNN). The reason for using the hybrid GMDH (GMDH-DDE) is to overcome the drawbacks of the GMDH, such as the method used to calculate neuron weights (i.e., quadratic polynomial transfer function), which seems to have inaccuracies. Also, in selecting model inputs, the GMDH tends to choose the most appropriate inputs for the model; however, the selection criteria are not straightforward and may affect the final results. Furthermore, the GMDH has a multicollinearity problem, affecting model coefficient stability and overfitting problems, etc. A total of 420 data sets from the Mpyo oil field were used, with 70% used for training and 30% used for testing. According to the findings, the GMDH-DDE outperformed both the GMDH and BPNN. In comparison with the GMDH and BPNN, the GMDH-DDE has a higher correlation coefficient (R), lower root-mean-square error (RMSE), and lower mean absolute error (MAE). During training, R, RMSE, and MAE were 0.9849, 0.090, and 0.010, respectively, and during testing, R = 0.9603, RMSE = 0.290, and MAE = 0.017. The second-best technique (GMDH) produces R, RMSE, and MAE values of 0.9611, 0.122, and 0.032 in training, and R = 0.9438, RMSE = 0.349, and MAE = 0.055 in testing. Furthermore, the GMDH-DDE used less computational time (1.32 seconds) compared with the GMDH (2.01 seconds) and BPNN (4.96 seconds), proving that the GMDH-DDE has accurate and fast convergence compared with the GMDH and BPNN. These findings show that the GMDH-DDE and GMDH can be adopted as alternative methods for predicting the Rs.\",\"PeriodicalId\":22252,\"journal\":{\"name\":\"SPE Journal\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2023-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"SPE Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2118/217979-pa\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, PETROLEUM\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"SPE Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2118/217979-pa","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, PETROLEUM","Score":null,"Total":0}
Solution Gas/Oil Ratio Prediction from Pressure/Volume/Temperature Data Using Machine Learning Algorithms
Summary Many methods have been developed to determine the solution gas/oil ratio (Rs), starting with experiments, followed by empirical correlations establishments, and recently with machine learning applications receiving much interest due to their ability to produce precise results compared with empirical correlations. In this paper, the group method of data handling (GMDH) and the enhanced GMDH based on discrete differential evolution (GMDH-DDE) are used for the first time to estimate the Rs and to provide a correlation to the laboratory measured Rs from bubblepoint pressure (Pb), oil API gravity (API), gas-specific gravity (γg), and reservoir temperature (T) without crude oil properties. These two methods are compared with backpropagation neural networks (BPNN). The reason for using the hybrid GMDH (GMDH-DDE) is to overcome the drawbacks of the GMDH, such as the method used to calculate neuron weights (i.e., quadratic polynomial transfer function), which seems to have inaccuracies. Also, in selecting model inputs, the GMDH tends to choose the most appropriate inputs for the model; however, the selection criteria are not straightforward and may affect the final results. Furthermore, the GMDH has a multicollinearity problem, affecting model coefficient stability and overfitting problems, etc. A total of 420 data sets from the Mpyo oil field were used, with 70% used for training and 30% used for testing. According to the findings, the GMDH-DDE outperformed both the GMDH and BPNN. In comparison with the GMDH and BPNN, the GMDH-DDE has a higher correlation coefficient (R), lower root-mean-square error (RMSE), and lower mean absolute error (MAE). During training, R, RMSE, and MAE were 0.9849, 0.090, and 0.010, respectively, and during testing, R = 0.9603, RMSE = 0.290, and MAE = 0.017. The second-best technique (GMDH) produces R, RMSE, and MAE values of 0.9611, 0.122, and 0.032 in training, and R = 0.9438, RMSE = 0.349, and MAE = 0.055 in testing. Furthermore, the GMDH-DDE used less computational time (1.32 seconds) compared with the GMDH (2.01 seconds) and BPNN (4.96 seconds), proving that the GMDH-DDE has accurate and fast convergence compared with the GMDH and BPNN. These findings show that the GMDH-DDE and GMDH can be adopted as alternative methods for predicting the Rs.
期刊介绍:
Covers theories and emerging concepts spanning all aspects of engineering for oil and gas exploration and production, including reservoir characterization, multiphase flow, drilling dynamics, well architecture, gas well deliverability, numerical simulation, enhanced oil recovery, CO2 sequestration, and benchmarking and performance indicators.