Soft causal constraints in groundwater machine learning: a new way to balance accuracy and physical consistency

IF 2.8 4区 环境科学与生态学 Q3 ENVIRONMENTAL SCIENCES Environmental Earth Sciences Pub Date : 2025-01-15 DOI:10.1007/s12665-024-12063-6
Adoubi Vincent De Paul Adombi, Romain Chesnaux
{"title":"Soft causal constraints in groundwater machine learning: a new way to balance accuracy and physical consistency","authors":"Adoubi Vincent De Paul Adombi,&nbsp;Romain Chesnaux","doi":"10.1007/s12665-024-12063-6","DOIUrl":null,"url":null,"abstract":"<div><p>Physics-informed machine learning (PIML) seeks to integrate scientific knowledge into conventional machine learning models to mitigate the black-box nature of the latter and prevent them from producing physically inconsistent results. Recently, Adombi et al. (2024) [<i>a causal physics-informed deep learning formulation for groundwater flow modeling and climate change effect analysis</i>] have shown that incorporating scientific knowledge into machine learning models is not enough to make them obey certain fundamental principles of physics, such as causality. They then derived certain constraints, called causal relationship constraints (CRC), to force PIML to obey the principle of causality. However, in some situations, CRC constraints in PIML prioritize the satisfaction of the principle of causality to the detriment of performance. In this study, we propose new CRC conditions and a new architecture for PIML, with the aim of testing the hypothesis that these conditions improve the performance of PIML models without transgressing the principle of causality. The models were tasked with simulating groundwater levels in six piezometers located in Quebec, Canada. A conventional machine learning model (convolutional neural network, 1D-CNN), a PIML model based on Adombi et al. (2024) (H-Lin) and a PIML model based on the architecture proposed in this work (H-LinC) were trained and subsequently compared. The results show that 1D-CNN outperforms H-LinC, which in turn outperforms H-Lin in terms of accuracy, with median NSE and KGE of 0.76 and 0.87 for 1D-CNN, 0.68 and 0.76 fir H-LinC, and 0.53 and 0.59 fir H-Lin. However, only H-LinC and H-Lin satisfy the principle of causality.</p></div>","PeriodicalId":542,"journal":{"name":"Environmental Earth Sciences","volume":"84 2","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Earth Sciences","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s12665-024-12063-6","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Physics-informed machine learning (PIML) seeks to integrate scientific knowledge into conventional machine learning models to mitigate the black-box nature of the latter and prevent them from producing physically inconsistent results. Recently, Adombi et al. (2024) [a causal physics-informed deep learning formulation for groundwater flow modeling and climate change effect analysis] have shown that incorporating scientific knowledge into machine learning models is not enough to make them obey certain fundamental principles of physics, such as causality. They then derived certain constraints, called causal relationship constraints (CRC), to force PIML to obey the principle of causality. However, in some situations, CRC constraints in PIML prioritize the satisfaction of the principle of causality to the detriment of performance. In this study, we propose new CRC conditions and a new architecture for PIML, with the aim of testing the hypothesis that these conditions improve the performance of PIML models without transgressing the principle of causality. The models were tasked with simulating groundwater levels in six piezometers located in Quebec, Canada. A conventional machine learning model (convolutional neural network, 1D-CNN), a PIML model based on Adombi et al. (2024) (H-Lin) and a PIML model based on the architecture proposed in this work (H-LinC) were trained and subsequently compared. The results show that 1D-CNN outperforms H-LinC, which in turn outperforms H-Lin in terms of accuracy, with median NSE and KGE of 0.76 and 0.87 for 1D-CNN, 0.68 and 0.76 fir H-LinC, and 0.53 and 0.59 fir H-Lin. However, only H-LinC and H-Lin satisfy the principle of causality.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
地下水机器学习中的软因果约束:平衡准确性和物理一致性的新方法
基于物理的机器学习(PIML)旨在将科学知识整合到传统的机器学习模型中,以减轻后者的黑箱性质,并防止它们产生物理上不一致的结果。最近,Adombi等人(2024)[基于因果物理学的地下水流动建模和气候变化效应分析的深度学习公式]表明,将科学知识纳入机器学习模型不足以使它们遵守某些基本的物理原理,如因果关系。然后,他们推导出一定的约束,称为因果关系约束(CRC),以迫使PIML服从因果关系原则。然而,在某些情况下,PIML中的CRC约束优先考虑因果关系原则的满足,从而损害了绩效。在本研究中,我们提出了新的CRC条件和PIML的新架构,目的是验证这些条件在不违反因果关系原则的情况下提高PIML模型性能的假设。这些模型的任务是模拟位于加拿大魁北克省的六个气压计的地下水位。我们训练了一个传统的机器学习模型(卷积神经网络,1D-CNN)、一个基于Adombi等人(2024)的PIML模型(H-Lin)和一个基于本研究提出的架构的PIML模型(H-LinC),并进行了比较。结果表明,1D-CNN优于H-LinC, H-LinC在准确率方面优于H-Lin, 1D-CNN的中位数NSE和KGE分别为0.76和0.87,H-LinC的中位数NSE和KGE分别为0.68和0.76,H-Lin的中位数NSE和KGE分别为0.53和0.59。然而,只有H-LinC和H-Lin满足因果原则。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Environmental Earth Sciences
Environmental Earth Sciences 环境科学-地球科学综合
CiteScore
5.10
自引率
3.60%
发文量
494
审稿时长
8.3 months
期刊介绍: Environmental Earth Sciences is an international multidisciplinary journal concerned with all aspects of interaction between humans, natural resources, ecosystems, special climates or unique geographic zones, and the earth: Water and soil contamination caused by waste management and disposal practices Environmental problems associated with transportation by land, air, or water Geological processes that may impact biosystems or humans Man-made or naturally occurring geological or hydrological hazards Environmental problems associated with the recovery of materials from the earth Environmental problems caused by extraction of minerals, coal, and ores, as well as oil and gas, water and alternative energy sources Environmental impacts of exploration and recultivation – Environmental impacts of hazardous materials Management of environmental data and information in data banks and information systems Dissemination of knowledge on techniques, methods, approaches and experiences to improve and remediate the environment In pursuit of these topics, the geoscientific disciplines are invited to contribute their knowledge and experience. Major disciplines include: hydrogeology, hydrochemistry, geochemistry, geophysics, engineering geology, remediation science, natural resources management, environmental climatology and biota, environmental geography, soil science and geomicrobiology.
期刊最新文献
SMFCs-driven lead remediation: morphological transformation and targeted migration in contaminated soils Water resources modeling in Wadi Numan, Western Saudi Arabia using remote sensing and GIS-based multi-criteria Evaluation of arsenic and nitrate contamination in groundwater from alluvium aquifers: Hydrogeochemical features, water quality and health risk assessment Enhancing surface cracking and erosion resistances in lateritic soil with eco-friendly surface spraying agents Stability charts for unsaturated uniform slopes
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1