Mapping surface soil organic carbon density of cultivated land using machine learning in Zhengzhou.

IF 3.2 3区 环境科学与生态学 Q3 ENGINEERING, ENVIRONMENTAL Environmental Geochemistry and Health Pub Date : 2024-11-28 DOI:10.1007/s10653-024-02313-8
Hengliang Guo, Jinyang Wang, Dujuan Zhang, Jian Cui, Yonghao Yuan, Haoming Bao, Mengjiao Yang, Jiahui Guo, Feng Chen, Wenge Zhou, Gang Wu, Yang Guo, Haitao Wei, Baojin Qiao, Shan Zhao
{"title":"Mapping surface soil organic carbon density of cultivated land using machine learning in Zhengzhou.","authors":"Hengliang Guo, Jinyang Wang, Dujuan Zhang, Jian Cui, Yonghao Yuan, Haoming Bao, Mengjiao Yang, Jiahui Guo, Feng Chen, Wenge Zhou, Gang Wu, Yang Guo, Haitao Wei, Baojin Qiao, Shan Zhao","doi":"10.1007/s10653-024-02313-8","DOIUrl":null,"url":null,"abstract":"<p><p>Research on soil organic carbon (SOC) is crucial for improving soil carbon sinks and achieving the \"double-carbon\" goal. This study introduces ten auxiliary variables based on the data from a 2021 land quality survey in Zhengzhou and a multi-objective regional geochemical survey. It uses geostatistical ordinary kriging (OK) interpolation, as well as classical machine learning (ML) models, including random forest (RF) and support vector machine (SVM), to map soil organic carbon density (SOCD) in the topsoil layer (0 - 20 cm) of cultivated land. It partitions the sampling data to assess the generalization capability of the machine learning models, with Zhongmu County designated as an independent test set (dataset2) and the remaining data as the training set (dataset1). The three models are trained using dataset1, and the trained machine learning models are directly applied to dataset2 to evaluate and compare their generalization performance. The distribution of SOCD and SOCS in soils of various types and textures is analyzed using the optimal interpolation method. The results indicated that: (1) The average SOC densities predicted by OK interpolation, RF, and SVM are 3.70, 3.74, and 3.63 kg/m<sup>2</sup>, with test set precisions (R<sup>2</sup>) of 0.34, 0.60, and 0.81, respectively. (2) ML achieves a significantly higher predictive precision than traditional OK interpolation. The RF model's precision is 0.21 higher than the SVM model and more precise in estimating carbon stock. (3) When applied to the dataset2, the RF model exhibited superior generalization capabilities (R<sup>2</sup> = 0.52, MSE = 0.32) over the SVM model (R<sup>2</sup> = 0.32, MSE = 0.45). (4) The spatial distribution of surface SOCD in the study area exhibits a decreasing gradient from west to east and from south to north. The total carbon stock in the study area is estimated at approximately 10.76 × 10<sup>6</sup>t. (5) The integration of soil attribute variables, climatic variables, remote sensing data, and machine learning techniques holds significant promise for the high-precision and high-quality mapping of soil organic carbon density (SOCD) in agricultural soils.</p>","PeriodicalId":11759,"journal":{"name":"Environmental Geochemistry and Health","volume":"47 1","pages":"1"},"PeriodicalIF":3.2000,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11604695/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Geochemistry and Health","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s10653-024-02313-8","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Research on soil organic carbon (SOC) is crucial for improving soil carbon sinks and achieving the "double-carbon" goal. This study introduces ten auxiliary variables based on the data from a 2021 land quality survey in Zhengzhou and a multi-objective regional geochemical survey. It uses geostatistical ordinary kriging (OK) interpolation, as well as classical machine learning (ML) models, including random forest (RF) and support vector machine (SVM), to map soil organic carbon density (SOCD) in the topsoil layer (0 - 20 cm) of cultivated land. It partitions the sampling data to assess the generalization capability of the machine learning models, with Zhongmu County designated as an independent test set (dataset2) and the remaining data as the training set (dataset1). The three models are trained using dataset1, and the trained machine learning models are directly applied to dataset2 to evaluate and compare their generalization performance. The distribution of SOCD and SOCS in soils of various types and textures is analyzed using the optimal interpolation method. The results indicated that: (1) The average SOC densities predicted by OK interpolation, RF, and SVM are 3.70, 3.74, and 3.63 kg/m2, with test set precisions (R2) of 0.34, 0.60, and 0.81, respectively. (2) ML achieves a significantly higher predictive precision than traditional OK interpolation. The RF model's precision is 0.21 higher than the SVM model and more precise in estimating carbon stock. (3) When applied to the dataset2, the RF model exhibited superior generalization capabilities (R2 = 0.52, MSE = 0.32) over the SVM model (R2 = 0.32, MSE = 0.45). (4) The spatial distribution of surface SOCD in the study area exhibits a decreasing gradient from west to east and from south to north. The total carbon stock in the study area is estimated at approximately 10.76 × 106t. (5) The integration of soil attribute variables, climatic variables, remote sensing data, and machine learning techniques holds significant promise for the high-precision and high-quality mapping of soil organic carbon density (SOCD) in agricultural soils.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用机器学习绘制郑州耕地表层土壤有机碳密度图。
土壤有机碳(SOC)研究对于提高土壤碳汇、实现 "双碳 "目标至关重要。本研究以郑州市 2021 年土地质量调查和多目标区域地球化学调查数据为基础,引入了 10 个辅助变量。该研究利用地质统计普通克里金(OK)插值法以及经典的机器学习(ML)模型,包括随机森林(RF)和支持向量机(SVM),绘制了耕地表土层(0 - 20 cm)的土壤有机碳密度(SOCD)图。为了评估机器学习模型的泛化能力,该研究对采样数据进行了分区,将中牟县作为独立测试集(数据集 2),其余数据作为训练集(数据集 1)。使用数据集 1 对三个模型进行训练,然后将训练好的机器学习模型直接应用于数据集 2,以评估和比较它们的泛化性能。利用最优插值法分析了 SOCD 和 SOCS 在不同类型和质地土壤中的分布。结果表明(1) OK 插值法、RF 和 SVM 预测的平均 SOC 密度分别为 3.70、3.74 和 3.63 kg/m2,测试集精度(R2)分别为 0.34、0.60 和 0.81。(2) ML 的预测精度明显高于传统的 OK 插值法。RF 模型的精度比 SVM 模型高 0.21,在估算碳储量方面更为精确。(3) 应用于数据集2 时,RF 模型的泛化能力(R2 = 0.52,MSE = 0.32)优于 SVM 模型(R2 = 0.32,MSE = 0.45)。(4) 研究区地表 SOCD 的空间分布呈现由西向东、由南向北递减的梯度。(5) 将土壤属性变量、气候变量、遥感数据和机器学习技术相结合,有望实现高精度、高质量的农田土壤有机碳密度(SOCD)测绘。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Environmental Geochemistry and Health
Environmental Geochemistry and Health 环境科学-工程:环境
CiteScore
8.00
自引率
4.80%
发文量
279
审稿时长
4.2 months
期刊介绍: Environmental Geochemistry and Health publishes original research papers and review papers across the broad field of environmental geochemistry. Environmental geochemistry and health establishes and explains links between the natural or disturbed chemical composition of the earth’s surface and the health of plants, animals and people. Beneficial elements regulate or promote enzymatic and hormonal activity whereas other elements may be toxic. Bedrock geochemistry controls the composition of soil and hence that of water and vegetation. Environmental issues, such as pollution, arising from the extraction and use of mineral resources, are discussed. The effects of contaminants introduced into the earth’s geochemical systems are examined. Geochemical surveys of soil, water and plants show how major and trace elements are distributed geographically. Associated epidemiological studies reveal the possibility of causal links between the natural or disturbed geochemical environment and disease. Experimental research illuminates the nature or consequences of natural or disturbed geochemical processes. The journal particularly welcomes novel research linking environmental geochemistry and health issues on such topics as: heavy metals (including mercury), persistent organic pollutants (POPs), and mixed chemicals emitted through human activities, such as uncontrolled recycling of electronic-waste; waste recycling; surface-atmospheric interaction processes (natural and anthropogenic emissions, vertical transport, deposition, and physical-chemical interaction) of gases and aerosols; phytoremediation/restoration of contaminated sites; food contamination and safety; environmental effects of medicines; effects and toxicity of mixed pollutants; speciation of heavy metals/metalloids; effects of mining; disturbed geochemistry from human behavior, natural or man-made hazards; particle and nanoparticle toxicology; risk and the vulnerability of populations, etc.
期刊最新文献
Hydrogeochemical characterization of shallow and deep groundwater for drinking and irrigation water quality index of Kathmandu Valley, Nepal. The content and associated health risk assessment of toxic elements, micro-, and macrominerals in common carp, Wels catfish, and silver carp from the Danube River in Serbia. Fuzzy logic modelling of the pollution pattern of potentially toxic elements and naturally occurring radionuclide materials in quarry sites in Ogun State, Nigeria. Application of modified solidified soil in in-situ backfilling of coal gangue: evaluation of arsenic stabilization effect and mechanism study. Polychlorinated biphenyls induced toxicities upon cell lines and stem cells: a review.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1