将分形维度纳入机器学习模型可提高导水性的预测精度

IF 3.9 3区 环境科学与生态学 Q1 ENGINEERING, CIVIL Stochastic Environmental Research and Risk Assessment Pub Date : 2024-08-11 DOI:10.1007/s00477-024-02793-1
Abhradip Sarkar, Pragati Pramanik Maity, Mrinmoy Ray, Aditi Kundu
{"title":"将分形维度纳入机器学习模型可提高导水性的预测精度","authors":"Abhradip Sarkar, Pragati Pramanik Maity, Mrinmoy Ray, Aditi Kundu","doi":"10.1007/s00477-024-02793-1","DOIUrl":null,"url":null,"abstract":"<p>Measurement of hydraulic conductivity (HC) in the field and laboratory is time-consuming, laborious, and expensive, pedo-transfer functions can be used to predict the soil HC using easy-to-measure soil properties like bulk density (BD), soil texture, fractal dimension (D), organic carbon (OC) and glomalin content. In this study, 121 soil samples were used to predict HC using Multi Linear Regression, and four machine learning-based models i.e., Artificial Neural Network (ANN), Support Vector Machine (SVM), Classification and Regression Trees (CART) and Random Forest (RF). Two sets of input data were used i.e., dataset 1: texture data, BD, OC, and glomalin content and dataset 2: D, BD, OC, and glomalin content (Dataset 2). The models were evaluated based on Mean Absolute Error, Mean Absolute Percentage Error, Nash–Sutcliffe model efficiency, Root Mean Square Error (RMSE), and correlation coefficient. ANN with three hidden layers performed significantly for both input sets. The RMSE value was decreased by 17% in the training dataset and by 5.55% in the testing dataset when D was added to the input set for ANN. For both datasets, RF performed better and outperformed CART in predicting HC. According to the results, SVM with dataset 2 outperformed all other models which showed the inclusion of D in the dataset could predict HC more efficiently. However, further study is required for different combinations of datasets for evaluating the prediction efficiency of machine learning models for various regions.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"12 1","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2024-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Inclusion of fractal dimension in machine learning models improves the prediction accuracy of hydraulic conductivity\",\"authors\":\"Abhradip Sarkar, Pragati Pramanik Maity, Mrinmoy Ray, Aditi Kundu\",\"doi\":\"10.1007/s00477-024-02793-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Measurement of hydraulic conductivity (HC) in the field and laboratory is time-consuming, laborious, and expensive, pedo-transfer functions can be used to predict the soil HC using easy-to-measure soil properties like bulk density (BD), soil texture, fractal dimension (D), organic carbon (OC) and glomalin content. In this study, 121 soil samples were used to predict HC using Multi Linear Regression, and four machine learning-based models i.e., Artificial Neural Network (ANN), Support Vector Machine (SVM), Classification and Regression Trees (CART) and Random Forest (RF). Two sets of input data were used i.e., dataset 1: texture data, BD, OC, and glomalin content and dataset 2: D, BD, OC, and glomalin content (Dataset 2). The models were evaluated based on Mean Absolute Error, Mean Absolute Percentage Error, Nash–Sutcliffe model efficiency, Root Mean Square Error (RMSE), and correlation coefficient. ANN with three hidden layers performed significantly for both input sets. The RMSE value was decreased by 17% in the training dataset and by 5.55% in the testing dataset when D was added to the input set for ANN. For both datasets, RF performed better and outperformed CART in predicting HC. According to the results, SVM with dataset 2 outperformed all other models which showed the inclusion of D in the dataset could predict HC more efficiently. However, further study is required for different combinations of datasets for evaluating the prediction efficiency of machine learning models for various regions.</p>\",\"PeriodicalId\":21987,\"journal\":{\"name\":\"Stochastic Environmental Research and Risk Assessment\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-08-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Stochastic Environmental Research and Risk Assessment\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1007/s00477-024-02793-1\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stochastic Environmental Research and Risk Assessment","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s00477-024-02793-1","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

摘要

在野外和实验室测量水力传导性(HC)费时、费力且成本高昂,而利用体积密度(BD)、土壤质地、分形维度(D)、有机碳(OC)和胶褐素含量等易于测量的土壤特性,可以使用脚踏转移函数来预测土壤的水力传导性。本研究使用多元线性回归和四种基于机器学习的模型(即人工神经网络 (ANN)、支持向量机 (SVM)、分类回归树 (CART) 和随机森林 (RF))来预测 121 个土壤样本的碳氢化合物含量。使用了两组输入数据,即数据集 1:纹理数据、BD、OC 和胶霉素含量;数据集 2:D、BD、OC 和胶霉素含量(数据集 2)。根据平均绝对误差、平均绝对百分比误差、纳什-苏特克利夫模型效率、均方根误差(RMSE)和相关系数对模型进行了评估。具有三个隐藏层的 ANN 在两个输入集上都有显著表现。在训练数据集和测试数据集中,当将 D 加入到 ANN 的输入集时,RMSE 值分别降低了 17%和 5.55%。对于这两个数据集,RF 在预测 HC 方面表现更好,优于 CART。结果显示,使用数据集 2 的 SVM 的表现优于所有其他模型,这表明在数据集中加入 D 可以更有效地预测 HC。不过,还需要进一步研究不同的数据集组合,以评估机器学习模型对不同地区的预测效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Inclusion of fractal dimension in machine learning models improves the prediction accuracy of hydraulic conductivity

Measurement of hydraulic conductivity (HC) in the field and laboratory is time-consuming, laborious, and expensive, pedo-transfer functions can be used to predict the soil HC using easy-to-measure soil properties like bulk density (BD), soil texture, fractal dimension (D), organic carbon (OC) and glomalin content. In this study, 121 soil samples were used to predict HC using Multi Linear Regression, and four machine learning-based models i.e., Artificial Neural Network (ANN), Support Vector Machine (SVM), Classification and Regression Trees (CART) and Random Forest (RF). Two sets of input data were used i.e., dataset 1: texture data, BD, OC, and glomalin content and dataset 2: D, BD, OC, and glomalin content (Dataset 2). The models were evaluated based on Mean Absolute Error, Mean Absolute Percentage Error, Nash–Sutcliffe model efficiency, Root Mean Square Error (RMSE), and correlation coefficient. ANN with three hidden layers performed significantly for both input sets. The RMSE value was decreased by 17% in the training dataset and by 5.55% in the testing dataset when D was added to the input set for ANN. For both datasets, RF performed better and outperformed CART in predicting HC. According to the results, SVM with dataset 2 outperformed all other models which showed the inclusion of D in the dataset could predict HC more efficiently. However, further study is required for different combinations of datasets for evaluating the prediction efficiency of machine learning models for various regions.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.10
自引率
9.50%
发文量
189
审稿时长
3.8 months
期刊介绍: Stochastic Environmental Research and Risk Assessment (SERRA) will publish research papers, reviews and technical notes on stochastic and probabilistic approaches to environmental sciences and engineering, including interactions of earth and atmospheric environments with people and ecosystems. The basic idea is to bring together research papers on stochastic modelling in various fields of environmental sciences and to provide an interdisciplinary forum for the exchange of ideas, for communicating on issues that cut across disciplinary barriers, and for the dissemination of stochastic techniques used in different fields to the community of interested researchers. Original contributions will be considered dealing with modelling (theoretical and computational), measurements and instrumentation in one or more of the following topical areas: - Spatiotemporal analysis and mapping of natural processes. - Enviroinformatics. - Environmental risk assessment, reliability analysis and decision making. - Surface and subsurface hydrology and hydraulics. - Multiphase porous media domains and contaminant transport modelling. - Hazardous waste site characterization. - Stochastic turbulence and random hydrodynamic fields. - Chaotic and fractal systems. - Random waves and seafloor morphology. - Stochastic atmospheric and climate processes. - Air pollution and quality assessment research. - Modern geostatistics. - Mechanisms of pollutant formation, emission, exposure and absorption. - Physical, chemical and biological analysis of human exposure from single and multiple media and routes; control and protection. - Bioinformatics. - Probabilistic methods in ecology and population biology. - Epidemiological investigations. - Models using stochastic differential equations stochastic or partial differential equations. - Hazardous waste site characterization.
期刊最新文献
Hybrid method for rainfall-induced regional landslide susceptibility mapping Prediction of urban flood inundation using Bayesian convolutional neural networks Unravelling complexities: a study on geopolitical dynamics, economic complexity, R&D impact on green innovation in China AHP and FAHP-based multi-criteria analysis for suitable dam location analysis: a case study of the Bagmati Basin, Nepal Risk and retraction: asymmetric nexus between monetary policy uncertainty and eco-friendly investment
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1