Machine learning prediction of methane, ethane, and propane solubility in pure water and electrolyte solutions: Implications for stray gas migration modeling
Ghazal Kooti, Reza Taherdangkoo, Chaofan Chen, Nikita Sergeev, Faramarz Doulati Ardejani, Tao Meng, Christoph Butscher
{"title":"Machine learning prediction of methane, ethane, and propane solubility in pure water and electrolyte solutions: Implications for stray gas migration modeling","authors":"Ghazal Kooti, Reza Taherdangkoo, Chaofan Chen, Nikita Sergeev, Faramarz Doulati Ardejani, Tao Meng, Christoph Butscher","doi":"10.1007/s11631-024-00680-8","DOIUrl":null,"url":null,"abstract":"<div><p>Hydraulic fracturing is an effective technology for hydrocarbon extraction from unconventional shale and tight gas reservoirs. A potential risk of hydraulic fracturing is the upward migration of stray gas from the deep subsurface to shallow aquifers. The stray gas can dissolve in groundwater leading to chemical and biological reactions, which could negatively affect groundwater quality and contribute to atmospheric emissions. The knowledge of light hydrocarbon solubility in the aqueous environment is essential for the numerical modelling of flow and transport in the subsurface. Herein, we compiled a database containing 2129 experimental data of methane, ethane, and propane solubility in pure water and various electrolyte solutions over wide ranges of operating temperature and pressure. Two machine learning algorithms, namely regression tree (RT) and boosted regression tree (BRT) tuned with a Bayesian optimization algorithm (BO) were employed to determine the solubility of gases. The predictions were compared with the experimental data as well as four well-established thermodynamic models. Our analysis shows that the BRT-BO is sufficiently accurate, and the predicted values agree well with those obtained from the thermodynamic models. The coefficient of determination (R<sup>2</sup>) between experimental and predicted values is 0.99 and the mean squared error (MSE) is 9.97 × 10<sup>−8</sup>. The leverage statistical approach further confirmed the validity of the model developed.</p></div>","PeriodicalId":7151,"journal":{"name":"Acta Geochimica","volume":"43 5","pages":"971 - 984"},"PeriodicalIF":1.4000,"publicationDate":"2024-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s11631-024-00680-8.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Geochimica","FirstCategoryId":"89","ListUrlMain":"https://link.springer.com/article/10.1007/s11631-024-00680-8","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
Hydraulic fracturing is an effective technology for hydrocarbon extraction from unconventional shale and tight gas reservoirs. A potential risk of hydraulic fracturing is the upward migration of stray gas from the deep subsurface to shallow aquifers. The stray gas can dissolve in groundwater leading to chemical and biological reactions, which could negatively affect groundwater quality and contribute to atmospheric emissions. The knowledge of light hydrocarbon solubility in the aqueous environment is essential for the numerical modelling of flow and transport in the subsurface. Herein, we compiled a database containing 2129 experimental data of methane, ethane, and propane solubility in pure water and various electrolyte solutions over wide ranges of operating temperature and pressure. Two machine learning algorithms, namely regression tree (RT) and boosted regression tree (BRT) tuned with a Bayesian optimization algorithm (BO) were employed to determine the solubility of gases. The predictions were compared with the experimental data as well as four well-established thermodynamic models. Our analysis shows that the BRT-BO is sufficiently accurate, and the predicted values agree well with those obtained from the thermodynamic models. The coefficient of determination (R2) between experimental and predicted values is 0.99 and the mean squared error (MSE) is 9.97 × 10−8. The leverage statistical approach further confirmed the validity of the model developed.
期刊介绍:
Acta Geochimica serves as the international forum for essential research on geochemistry, the science that uses the tools and principles of chemistry to explain the mechanisms behind major geological systems such as the Earth‘s crust, its oceans and the entire Solar System, as well as a number of processes including mantle convection, the formation of planets and the origins of granite and basalt. The journal focuses on, but is not limited to the following aspects:
• Cosmochemistry
• Mantle Geochemistry
• Ore-deposit Geochemistry
• Organic Geochemistry
• Environmental Geochemistry
• Computational Geochemistry
• Isotope Geochemistry
• NanoGeochemistry
All research articles published in this journal have undergone rigorous peer review. In addition to original research articles, Acta Geochimica publishes reviews and short communications, aiming to rapidly disseminate the research results of timely interest, and comprehensive reviews of emerging topics in all the areas of geochemistry.