Deep learning of the particulate and mineral-associated organic carbon fractions using a compositional transform and mid-infrared spectroscopy

IF 6.6 1区 农林科学 Q1 SOIL SCIENCE Geoderma Pub Date : 2025-03-01 Epub Date: 2025-02-16 DOI:10.1016/j.geoderma.2025.117207
Mingxi Zhang , Zefang Shen , Lewis Walden , Farid Sepanta , Zhongkui Luo , Lei Gao , Oscar Serrano , Raphael A. Viscarra Rossel
{"title":"Deep learning of the particulate and mineral-associated organic carbon fractions using a compositional transform and mid-infrared spectroscopy","authors":"Mingxi Zhang ,&nbsp;Zefang Shen ,&nbsp;Lewis Walden ,&nbsp;Farid Sepanta ,&nbsp;Zhongkui Luo ,&nbsp;Lei Gao ,&nbsp;Oscar Serrano ,&nbsp;Raphael A. Viscarra Rossel","doi":"10.1016/j.geoderma.2025.117207","DOIUrl":null,"url":null,"abstract":"<div><div>We need soil organic carbon (SOC) and the SOC fractions, the particulate and mineral-associated organic carbon (POC, MAOC), to understand SOC dynamics. They have implications for soil management, carbon sequestration and climate change mitigation. However, conventional laboratory measurements of the SOC fractions, which involve physical or chemical separations, are elaborate, time-consuming and expensive. Mid-infrared (MIR) spectroscopy combined with multivariate modelling can alleviate these limitations because the method can estimate SOC and its fractions rapidly, cost-effectively and accurately. Previous spectroscopic modelling has mostly ignored the compositional nature of the SOC fractions (i.e. SOC = <span><math><mo>∑</mo></math></span>fractions), causing discrepancies in the estimation such that the sum of the fractions does not equal the total SOC. We recorded the MIR spectra (4000–450 cm<sup>−1</sup>) of 397 soil samples from across Australia and then performed a granulometric fractionation to derive three SOC fractions, the POC in the macroaggregates (250–<span><math><mrow><mn>2000</mn><mspace></mspace><mi>μ</mi><mi>m</mi></mrow></math></span>, POC<span><math><msub><mrow></mrow><mrow><mi>m</mi><mi>a</mi><mi>c</mi></mrow></msub></math></span>), POC in the micro-aggregates (50–<span><math><mrow><mn>250</mn><mspace></mspace><mi>μ</mi><mi>m</mi></mrow></math></span>, POC<span><math><msub><mrow></mrow><mrow><mi>m</mi><mi>i</mi><mi>c</mi></mrow></msub></math></span>), and MAOC (<span><math><mrow><mo>&lt;</mo><mn>50</mn><mspace></mspace><mi>μ</mi><mi>m</mi></mrow></math></span>). We used the centred log ratio (CLR) method to transform the data compositionally and then modelled POC<span><math><msub><mrow></mrow><mrow><mi>m</mi><mi>a</mi><mi>c</mi></mrow></msub></math></span>, POC<span><math><msub><mrow></mrow><mrow><mi>m</mi><mi>i</mi><mi>c</mi></mrow></msub></math></span>, POC (POC<span><math><msub><mrow></mrow><mrow><mi>m</mi><mi>a</mi><mi>c</mi></mrow></msub></math></span> + POC<span><math><msub><mrow></mrow><mrow><mi>m</mi><mi>i</mi><mi>c</mi></mrow></msub></math></span>), and MAOC with the spectra, using convolutional neural networks (CNN) and <span>cubist</span> for benchmarking. We interpreted the models using the SHapley Additive exPlanations (SHAP) values and a land use classification of the data. Modelling the CLR-transformed SOC fractions with CNN maintained the composition of the fractions and improved the accuracy of the estimates (Lin’s concordance correlation coefficient (<span><math><msub><mrow><mi>ρ</mi></mrow><mrow><mi>c</mi></mrow></msub></math></span>) of 0.58, 0.86, and 0.94 for the POC<span><math><msub><mrow></mrow><mrow><mi>m</mi><mi>a</mi><mi>c</mi></mrow></msub></math></span>, POC<span><math><msub><mrow></mrow><mrow><mi>m</mi><mi>i</mi><mi>c</mi></mrow></msub></math></span>, and MAOC), compared to CLR with <span>cubist</span> (<span><math><msub><mrow><mi>ρ</mi></mrow><mrow><mi>c</mi></mrow></msub></math></span> of 0.49, 0.84, and 0.87 for the POC<span><math><msub><mrow></mrow><mrow><mi>m</mi><mi>a</mi><mi>c</mi></mrow></msub></math></span>, POC<span><math><msub><mrow></mrow><mrow><mi>m</mi><mi>i</mi><mi>c</mi></mrow></msub></math></span>, and MAOC) and <span>cubist</span> with no compositional transformation (<span><math><msub><mrow><mi>ρ</mi></mrow><mrow><mi>c</mi></mrow></msub></math></span> of 0.53, 0.85, and 0.88 for the POC<span><math><msub><mrow></mrow><mrow><mi>m</mi><mi>a</mi><mi>c</mi></mrow></msub></math></span>, POC<span><math><msub><mrow></mrow><mrow><mi>m</mi><mi>i</mi><mi>c</mi></mrow></msub></math></span>, and MAOC). The SHAP values reflected the compositional modelling and identified important organic and inorganic functional groups that differed by fraction and land use. Our approach can complement conventional physical SOC fractionations and improve the cost-effectiveness of the measurements, especially when there are many samples to measure, thus enhancing our understanding of SOC dynamics.</div></div>","PeriodicalId":12511,"journal":{"name":"Geoderma","volume":"455 ","pages":"Article 117207"},"PeriodicalIF":6.6000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoderma","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S001670612500045X","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/16 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"SOIL SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

We need soil organic carbon (SOC) and the SOC fractions, the particulate and mineral-associated organic carbon (POC, MAOC), to understand SOC dynamics. They have implications for soil management, carbon sequestration and climate change mitigation. However, conventional laboratory measurements of the SOC fractions, which involve physical or chemical separations, are elaborate, time-consuming and expensive. Mid-infrared (MIR) spectroscopy combined with multivariate modelling can alleviate these limitations because the method can estimate SOC and its fractions rapidly, cost-effectively and accurately. Previous spectroscopic modelling has mostly ignored the compositional nature of the SOC fractions (i.e. SOC = fractions), causing discrepancies in the estimation such that the sum of the fractions does not equal the total SOC. We recorded the MIR spectra (4000–450 cm−1) of 397 soil samples from across Australia and then performed a granulometric fractionation to derive three SOC fractions, the POC in the macroaggregates (250–2000μm, POCmac), POC in the micro-aggregates (50–250μm, POCmic), and MAOC (<50μm). We used the centred log ratio (CLR) method to transform the data compositionally and then modelled POCmac, POCmic, POC (POCmac + POCmic), and MAOC with the spectra, using convolutional neural networks (CNN) and cubist for benchmarking. We interpreted the models using the SHapley Additive exPlanations (SHAP) values and a land use classification of the data. Modelling the CLR-transformed SOC fractions with CNN maintained the composition of the fractions and improved the accuracy of the estimates (Lin’s concordance correlation coefficient (ρc) of 0.58, 0.86, and 0.94 for the POCmac, POCmic, and MAOC), compared to CLR with cubist (ρc of 0.49, 0.84, and 0.87 for the POCmac, POCmic, and MAOC) and cubist with no compositional transformation (ρc of 0.53, 0.85, and 0.88 for the POCmac, POCmic, and MAOC). The SHAP values reflected the compositional modelling and identified important organic and inorganic functional groups that differed by fraction and land use. Our approach can complement conventional physical SOC fractionations and improve the cost-effectiveness of the measurements, especially when there are many samples to measure, thus enhancing our understanding of SOC dynamics.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用成分变换和中红外光谱对颗粒和矿物相关有机碳组分进行深度学习
我们需要土壤有机碳(SOC)及其组分、颗粒有机碳和矿物相关有机碳(POC, MAOC)来了解土壤有机碳动态。它们对土壤管理、碳固存和减缓气候变化都有影响。然而,传统的实验室SOC组分测量,包括物理或化学分离,是复杂的,耗时和昂贵的。中红外光谱与多变量建模相结合,可以快速、经济、准确地估算出有机碳及其组分。以前的光谱模拟大多忽略了有机碳组分的组成性质(即有机碳=∑分数),导致估算中的差异,使得分数的总和不等于总有机碳。我们记录了来自澳大利亚各地的397个土壤样品的MIR光谱(4000-450 cm−1),然后进行了颗粒分选,得到了三个有机碳组分,即大团聚体(250-2000μm, POCmac)中的POC,微团聚体(50-250μm, POCmic)中的POC和MAOC (<50μm)。我们使用中心对数比(CLR)方法对数据进行组合变换,然后用光谱对POCmac、POCmic、POC (POCmac + POCmic)和MAOC进行建模,使用卷积神经网络(CNN)和cubist进行基准测试。我们使用SHapley加性解释(SHAP)值和数据的土地利用分类来解释这些模型。用CNN对经过CLR变换的有机碳馏分进行建模,与经过CLR变换的立体派(POCmac、POCmic和MAOC的ρc分别为0.49、0.84和0.87)和未经CLR变换的立体派(POCmac、POCmic和MAOC的ρc分别为0.53、0.85和0.88)相比,保持了馏分的组成,提高了估计的准确性(POCmac、POCmic和MAOC的Lin’s一致性相关系数(ρc)分别为0.58、0.86和0.94)。SHAP值反映了组成模型,并确定了重要的有机和无机官能团,这些官能团因比例和土地利用而不同。我们的方法可以补充传统的物理有机碳分馏,提高测量的成本效益,特别是当有许多样品需要测量时,从而增强了我们对有机碳动力学的理解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Geoderma
Geoderma 农林科学-土壤科学
CiteScore
11.80
自引率
6.60%
发文量
597
审稿时长
58 days
期刊介绍: Geoderma - the global journal of soil science - welcomes authors, readers and soil research from all parts of the world, encourages worldwide soil studies, and embraces all aspects of soil science and its associated pedagogy. The journal particularly welcomes interdisciplinary work focusing on dynamic soil processes and functions across space and time.
期刊最新文献
The influence of tropical grass species on soil nitrification and denitrification across savanna ecosystems Comparison of automated chemical-guided segmentation and human annotation of soil organic matter in X-ray microcomputed tomography imaging in contrasted soil types Calcium concentrations as a proxy for inorganic carbon content in salt marsh soils Multi-year green manuring increases wheat yield and nitrogen utilization efficiency: Insights into soil microorganisms and metabolites mediated available nitrogen supply Biochar aging diminishes soil N2O mitigation by reducing electron-donating capacity and shifting denitrification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1