Franck Albinet , Yi Peng , Tetsuya Eguchi , Erik Smolders , Gerd Dercon
{"title":"利用中红外光谱和深度学习预测土壤交换性钾:从预测到可解释性","authors":"Franck Albinet , Yi Peng , Tetsuya Eguchi , Erik Smolders , Gerd Dercon","doi":"10.1016/j.aiia.2022.10.001","DOIUrl":null,"url":null,"abstract":"<div><p>The ability to characterize rapidly and repeatedly exchangeable potassium (K<sub>ex</sub>) content in the soil is essential for optimizing remediation of radiocaesium contamination in agriculture. In this paper, we show how this can be now achieved using a Convolutional Neural Network (CNN) model trained on a large Mid-Infrared (MIR) soil spectral library (40,000 samples with K<sub>ex</sub> determined with 1 M NH<sub>4</sub>OAc, pH 7), compiled by the National Soil Survey Center of the United States Department of Agriculture. Using Partial Least Squares Regression as a baseline, we found that our implemented CNN leads to a significantly higher prediction performance of K<sub>ex</sub> when a large amount of data is available (10000), increasing the coefficient of determination from 0.64 to 0.79, and reducing the Mean Absolute Percentage Error from 135% to 31%. Furthermore, in order to provide end-users with required interpretive keys, we implemented the GradientShap algorithm to identify the spectral regions considered important by the model for predicting K<sub>ex</sub>. Used in the context of the implemented CNN on various Soil Taxonomy Orders, it allowed (i) to relate the important spectral features to domain knowledge and (ii) to demonstrate that including all Soil Taxonomy Orders in CNN-based modeling is beneficial as spectral features learned can be reused across different, sometimes underrepresented orders.</p></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"6 ","pages":"Pages 230-241"},"PeriodicalIF":8.2000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2589721722000186/pdfft?md5=8126426530126bf7ca26081e52cbb6d7&pid=1-s2.0-S2589721722000186-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Prediction of exchangeable potassium in soil through mid-infrared spectroscopy and deep learning: From prediction to explainability\",\"authors\":\"Franck Albinet , Yi Peng , Tetsuya Eguchi , Erik Smolders , Gerd Dercon\",\"doi\":\"10.1016/j.aiia.2022.10.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The ability to characterize rapidly and repeatedly exchangeable potassium (K<sub>ex</sub>) content in the soil is essential for optimizing remediation of radiocaesium contamination in agriculture. In this paper, we show how this can be now achieved using a Convolutional Neural Network (CNN) model trained on a large Mid-Infrared (MIR) soil spectral library (40,000 samples with K<sub>ex</sub> determined with 1 M NH<sub>4</sub>OAc, pH 7), compiled by the National Soil Survey Center of the United States Department of Agriculture. Using Partial Least Squares Regression as a baseline, we found that our implemented CNN leads to a significantly higher prediction performance of K<sub>ex</sub> when a large amount of data is available (10000), increasing the coefficient of determination from 0.64 to 0.79, and reducing the Mean Absolute Percentage Error from 135% to 31%. Furthermore, in order to provide end-users with required interpretive keys, we implemented the GradientShap algorithm to identify the spectral regions considered important by the model for predicting K<sub>ex</sub>. Used in the context of the implemented CNN on various Soil Taxonomy Orders, it allowed (i) to relate the important spectral features to domain knowledge and (ii) to demonstrate that including all Soil Taxonomy Orders in CNN-based modeling is beneficial as spectral features learned can be reused across different, sometimes underrepresented orders.</p></div>\",\"PeriodicalId\":52814,\"journal\":{\"name\":\"Artificial Intelligence in Agriculture\",\"volume\":\"6 \",\"pages\":\"Pages 230-241\"},\"PeriodicalIF\":8.2000,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2589721722000186/pdfft?md5=8126426530126bf7ca26081e52cbb6d7&pid=1-s2.0-S2589721722000186-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial Intelligence in Agriculture\",\"FirstCategoryId\":\"1087\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2589721722000186\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Agriculture","FirstCategoryId":"1087","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589721722000186","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
表征土壤中快速和反复交换性钾(Kex)含量的能力对于优化农业放射性铯污染的修复至关重要。在本文中,我们展示了如何使用卷积神经网络(CNN)模型在美国农业部国家土壤调查中心编制的大型中红外(MIR)土壤光谱库(40000个样品,用1 M NH4OAc测定Kex, pH为7)上进行训练来实现这一目标。使用偏最小二乘回归作为基线,我们发现我们实现的CNN在大量可用数据(10000)时显著提高了Kex的预测性能,将决定系数从0.64提高到0.79,并将平均绝对百分比误差从135%降低到31%。此外,为了向最终用户提供所需的解释键,我们实现了GradientShap算法来识别模型认为重要的光谱区域,以预测键值。在各种土壤分类阶的实现CNN的背景下使用,它允许(i)将重要的光谱特征与领域知识联系起来,(ii)证明在基于CNN的建模中包括所有土壤分类阶是有益的,因为学习到的光谱特征可以在不同的,有时是代表性不足的阶之间重用。
Prediction of exchangeable potassium in soil through mid-infrared spectroscopy and deep learning: From prediction to explainability
The ability to characterize rapidly and repeatedly exchangeable potassium (Kex) content in the soil is essential for optimizing remediation of radiocaesium contamination in agriculture. In this paper, we show how this can be now achieved using a Convolutional Neural Network (CNN) model trained on a large Mid-Infrared (MIR) soil spectral library (40,000 samples with Kex determined with 1 M NH4OAc, pH 7), compiled by the National Soil Survey Center of the United States Department of Agriculture. Using Partial Least Squares Regression as a baseline, we found that our implemented CNN leads to a significantly higher prediction performance of Kex when a large amount of data is available (10000), increasing the coefficient of determination from 0.64 to 0.79, and reducing the Mean Absolute Percentage Error from 135% to 31%. Furthermore, in order to provide end-users with required interpretive keys, we implemented the GradientShap algorithm to identify the spectral regions considered important by the model for predicting Kex. Used in the context of the implemented CNN on various Soil Taxonomy Orders, it allowed (i) to relate the important spectral features to domain knowledge and (ii) to demonstrate that including all Soil Taxonomy Orders in CNN-based modeling is beneficial as spectral features learned can be reused across different, sometimes underrepresented orders.