使用数据增强的多门混合专家从可见-近红外光谱中同时估计多种土壤特性

IF 5.6 1区 农林科学 Q1 SOIL SCIENCE Geoderma Pub Date : 2024-12-12 DOI:10.1016/j.geoderma.2024.117127
Xiaoqing Wang, Mei-Wei Zhang, Ya-Nan Zhou, Lingli Wang, Ling-Tao Zeng, Yu-Pei Cui, Xiao-Lin Sun
{"title":"使用数据增强的多门混合专家从可见-近红外光谱中同时估计多种土壤特性","authors":"Xiaoqing Wang, Mei-Wei Zhang, Ya-Nan Zhou, Lingli Wang, Ling-Tao Zeng, Yu-Pei Cui, Xiao-Lin Sun","doi":"10.1016/j.geoderma.2024.117127","DOIUrl":null,"url":null,"abstract":"Simultaneous estimation of multiple soil properties from vis-NIR hyperspectra presents a cost-effective and time-efficient approach. Previous studies have utilized multi-task convolutional neural network (multi-CNN) with share-bottom structures based on the hard parameter sharing. However, multi-CNN often ignores the differential characteristics of correlations between soil properties, limiting the accuracy of soil property estimation. The multi-gate mixture-of-experts network (MMoE) offers a solution by extracting both common features across all soil properties and unique features specific to each soil property, which probably could provide better estimation outcomes than the conventional shared-bottom multi-CNN. In the present study, a MMoE was built based on a total of 17,272 mineral soil samples from the Land Use/Cover Area Frame Survey (LUCAS) topsoil database that includes vis-NIR spectra with ten physicochemical properties, i.e., clay, silt, sand, pH (in water), organic content (OC), calcium carbonate (CaCO<ce:inf loc=\"post\">3</ce:inf>), nitrogen (N), phosphorous (P), potassium (K), and cation exchange capacity (CEC). To evaluate the performance of MMoE, a series of other models were also built, i.e., partial least square regression (PLSR), single-task convolutional neural network (single-CNN), multi-task convolutional neural network (multi-CNN) and multi-task long short-term memory (multi-LSTM). Furthermore, performance of feature-spectrum selected by competitive adaptive reweighted sampling (CARS) on the accuracy of the MMoE was also explored, as well as a data augmentation method of stacking raw spectra with five preprocessed spectra data. The results demonstrated that MMoE had higher accuracy than PLSR, single-CNN, and multi-LSTM models, with RMSE reduction of 5 %–48 %, R<ce:sup loc=\"post\">2</ce:sup> improvement of 1 %–119 %, and CCC improvement of 0 %–74 %. Compared with multi-CNN, MMoE showed better accuracy for all properties except pH, with RMSE reduction of 3 %–8 %, R<ce:sup loc=\"post\">2</ce:sup> improvement of 1 %–12 %, and CCC improvement of 0 %–5 %. However, the feature-spectrum selected by CARS did not improve the accuracy of MMoE compared to full-band spectrum, whereas the data augmentation method was effective in improving the estimation accuracy of MMoE compared to raw spectra, with RMSE reduction of 14 %–28 %, R<ce:sup loc=\"post\">2</ce:sup> improvement of 3 %–88 %, and CCC improvement of 1 %–63 %. Consequently, this study proves that MMoE based on data augmentation is an efficient and accurate method for the simultaneous estimation of multiple soil properties from vis-NIR spectra.","PeriodicalId":12511,"journal":{"name":"Geoderma","volume":"21 1","pages":""},"PeriodicalIF":5.6000,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Simultaneous estimation of multiple soil properties from vis-NIR spectra using a multi-gate mixture-of-experts with data augmentation\",\"authors\":\"Xiaoqing Wang, Mei-Wei Zhang, Ya-Nan Zhou, Lingli Wang, Ling-Tao Zeng, Yu-Pei Cui, Xiao-Lin Sun\",\"doi\":\"10.1016/j.geoderma.2024.117127\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Simultaneous estimation of multiple soil properties from vis-NIR hyperspectra presents a cost-effective and time-efficient approach. Previous studies have utilized multi-task convolutional neural network (multi-CNN) with share-bottom structures based on the hard parameter sharing. However, multi-CNN often ignores the differential characteristics of correlations between soil properties, limiting the accuracy of soil property estimation. The multi-gate mixture-of-experts network (MMoE) offers a solution by extracting both common features across all soil properties and unique features specific to each soil property, which probably could provide better estimation outcomes than the conventional shared-bottom multi-CNN. In the present study, a MMoE was built based on a total of 17,272 mineral soil samples from the Land Use/Cover Area Frame Survey (LUCAS) topsoil database that includes vis-NIR spectra with ten physicochemical properties, i.e., clay, silt, sand, pH (in water), organic content (OC), calcium carbonate (CaCO<ce:inf loc=\\\"post\\\">3</ce:inf>), nitrogen (N), phosphorous (P), potassium (K), and cation exchange capacity (CEC). To evaluate the performance of MMoE, a series of other models were also built, i.e., partial least square regression (PLSR), single-task convolutional neural network (single-CNN), multi-task convolutional neural network (multi-CNN) and multi-task long short-term memory (multi-LSTM). Furthermore, performance of feature-spectrum selected by competitive adaptive reweighted sampling (CARS) on the accuracy of the MMoE was also explored, as well as a data augmentation method of stacking raw spectra with five preprocessed spectra data. The results demonstrated that MMoE had higher accuracy than PLSR, single-CNN, and multi-LSTM models, with RMSE reduction of 5 %–48 %, R<ce:sup loc=\\\"post\\\">2</ce:sup> improvement of 1 %–119 %, and CCC improvement of 0 %–74 %. Compared with multi-CNN, MMoE showed better accuracy for all properties except pH, with RMSE reduction of 3 %–8 %, R<ce:sup loc=\\\"post\\\">2</ce:sup> improvement of 1 %–12 %, and CCC improvement of 0 %–5 %. However, the feature-spectrum selected by CARS did not improve the accuracy of MMoE compared to full-band spectrum, whereas the data augmentation method was effective in improving the estimation accuracy of MMoE compared to raw spectra, with RMSE reduction of 14 %–28 %, R<ce:sup loc=\\\"post\\\">2</ce:sup> improvement of 3 %–88 %, and CCC improvement of 1 %–63 %. Consequently, this study proves that MMoE based on data augmentation is an efficient and accurate method for the simultaneous estimation of multiple soil properties from vis-NIR spectra.\",\"PeriodicalId\":12511,\"journal\":{\"name\":\"Geoderma\",\"volume\":\"21 1\",\"pages\":\"\"},\"PeriodicalIF\":5.6000,\"publicationDate\":\"2024-12-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Geoderma\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1016/j.geoderma.2024.117127\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SOIL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoderma","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1016/j.geoderma.2024.117127","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOIL SCIENCE","Score":null,"Total":0}
引用次数: 0

摘要

利用可见光-近红外高光谱同时估计多种土壤性质是一种经济高效的方法。以往的研究采用基于硬参数共享的多任务卷积神经网络(multi-CNN)。然而,multi-CNN往往忽略了土壤性质之间相关性的差异性特征,限制了土壤性质估计的准确性。多门混合专家网络(MMoE)提供了一种解决方案,通过提取所有土壤属性的共同特征和特定于每种土壤属性的独特特征,这可能比传统的共享底部多cnn提供更好的估计结果。基于LUCAS表层土壤数据库中17272份矿质土壤样品的可见光-近红外光谱,包括粘土、粉土、砂、pH(水中)、有机含量(OC)、碳酸钙(CaCO3)、氮(N)、磷(P)、钾(K)和阳离子交换容量(CEC)等10种理化性质,构建了MMoE模型。为了评估MMoE的性能,我们还建立了一系列其他模型,即偏最小二乘回归(PLSR)、单任务卷积神经网络(single-CNN)、多任务卷积神经网络(multi-CNN)和多任务长短期记忆(multi-LSTM)。此外,还探讨了竞争自适应重加权采样(CARS)选择的特征谱对MMoE精度的影响,以及将原始光谱与5个预处理光谱数据叠加的数据增强方法。结果表明,MMoE比PLSR、单cnn和多lstm模型具有更高的准确率,RMSE降低5% - 48%,R2提高1% - 119%,CCC提高0% - 74%。与multi-CNN相比,MMoE对除pH值外的所有属性都具有更好的准确性,RMSE降低3% - 8%,R2提高1% - 12%,CCC提高0% - 5%。然而,与全波段光谱相比,CARS选择的特征谱并没有提高MMoE的估计精度,而数据增强方法与原始光谱相比,有效提高了MMoE的估计精度,RMSE降低了14% - 28%,R2提高了3% - 88%,CCC提高了1% - 63%。因此,本研究证明了基于数据增强的MMoE是一种有效而准确的方法,可以同时估计可见光-近红外光谱中的多种土壤性质。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Simultaneous estimation of multiple soil properties from vis-NIR spectra using a multi-gate mixture-of-experts with data augmentation
Simultaneous estimation of multiple soil properties from vis-NIR hyperspectra presents a cost-effective and time-efficient approach. Previous studies have utilized multi-task convolutional neural network (multi-CNN) with share-bottom structures based on the hard parameter sharing. However, multi-CNN often ignores the differential characteristics of correlations between soil properties, limiting the accuracy of soil property estimation. The multi-gate mixture-of-experts network (MMoE) offers a solution by extracting both common features across all soil properties and unique features specific to each soil property, which probably could provide better estimation outcomes than the conventional shared-bottom multi-CNN. In the present study, a MMoE was built based on a total of 17,272 mineral soil samples from the Land Use/Cover Area Frame Survey (LUCAS) topsoil database that includes vis-NIR spectra with ten physicochemical properties, i.e., clay, silt, sand, pH (in water), organic content (OC), calcium carbonate (CaCO3), nitrogen (N), phosphorous (P), potassium (K), and cation exchange capacity (CEC). To evaluate the performance of MMoE, a series of other models were also built, i.e., partial least square regression (PLSR), single-task convolutional neural network (single-CNN), multi-task convolutional neural network (multi-CNN) and multi-task long short-term memory (multi-LSTM). Furthermore, performance of feature-spectrum selected by competitive adaptive reweighted sampling (CARS) on the accuracy of the MMoE was also explored, as well as a data augmentation method of stacking raw spectra with five preprocessed spectra data. The results demonstrated that MMoE had higher accuracy than PLSR, single-CNN, and multi-LSTM models, with RMSE reduction of 5 %–48 %, R2 improvement of 1 %–119 %, and CCC improvement of 0 %–74 %. Compared with multi-CNN, MMoE showed better accuracy for all properties except pH, with RMSE reduction of 3 %–8 %, R2 improvement of 1 %–12 %, and CCC improvement of 0 %–5 %. However, the feature-spectrum selected by CARS did not improve the accuracy of MMoE compared to full-band spectrum, whereas the data augmentation method was effective in improving the estimation accuracy of MMoE compared to raw spectra, with RMSE reduction of 14 %–28 %, R2 improvement of 3 %–88 %, and CCC improvement of 1 %–63 %. Consequently, this study proves that MMoE based on data augmentation is an efficient and accurate method for the simultaneous estimation of multiple soil properties from vis-NIR spectra.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Geoderma
Geoderma 农林科学-土壤科学
CiteScore
11.80
自引率
6.60%
发文量
597
审稿时长
58 days
期刊介绍: Geoderma - the global journal of soil science - welcomes authors, readers and soil research from all parts of the world, encourages worldwide soil studies, and embraces all aspects of soil science and its associated pedagogy. The journal particularly welcomes interdisciplinary work focusing on dynamic soil processes and functions across space and time.
期刊最新文献
Seasonal soil water origins and determinants in an alpine hillslope on the northeastern Qinghai-Tibet Plateau Low-severity wildfire prevents catastrophic impacts on fungal communities and soil carbon stability in a fire-affected Douglas-fir ecosystem Thermogravimetric data suggest synergy between different organic fractions and clay in soil structure formation Rhizodeposition stimulates soil carbon decomposition and promotes formation of mineral-associated carbon with increased clay content Mycorrhizal and nutrient controls of carbon sequestration in tropical rainforest soil
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1