A Spectral Hierarchical Machine Learning for Predicting Arsenic Concentration in Farmland Soil Using Sentinel-2 Imagery

IF 8.6 1区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2025-01-22 DOI:10.1109/TGRS.2025.3532678
Li Wang;Yong Zhou;Zehan Zhou;Shangrong Wu;Lang Xia;Yan Zha;Peng Yang
{"title":"A Spectral Hierarchical Machine Learning for Predicting Arsenic Concentration in Farmland Soil Using Sentinel-2 Imagery","authors":"Li Wang;Yong Zhou;Zehan Zhou;Shangrong Wu;Lang Xia;Yan Zha;Peng Yang","doi":"10.1109/TGRS.2025.3532678","DOIUrl":null,"url":null,"abstract":"Accurately predicting arsenic (As) concentration in farmland soil on a large scale is essential for effectively preventing and managing soil pollution in agricultural areas, thereby safeguarding food security. Multispectral imaging presents a cost-effective and efficient method for monitoring As concentration across extensive farmland regions. Nevertheless, the underlying process and mechanisms determining the relationship between As concentration in farmland soil and spectral data remain uncertain. The primary aim of this study was to evaluate whether employing a hierarchical strategy (based on soil organic matter (SOM) and pH) results in more accurate prediction of As concentration in farmland soil than those employing nonhierarchical (global) models. Our results show that with respect to global models, the best prediction of As concentration was achieved using the convolutional neural network (CNN) model (validated ratio of the model performance to the interquartile distance (RPIQ) =2.50), followed by the Cubist model (validated RPIQ =2.19) and the extreme learning machine (ELM) model (validated RPIQ =2.15). After SOM-based hierarchization, the Cubist model exhibited the highest prediction accuracy (validated coefficient of determination (<inline-formula> <tex-math>$R^{2})=0.73$ </tex-math></inline-formula>), representing a 0.02 improvement in the <inline-formula> <tex-math>$R^{2}$ </tex-math></inline-formula> compared with the that of global CNN model. The clay mineral ratio (CMR) was identified as the most important variable for predicting As concentration. Notably, the identification of high As concentration in the central old town areas underscores the importance of early soil contamination risk warnings on arable lands. These findings indicate that SOM-hierarchical machine learning models could serve as an effective approach to address the influence of soil environmental complications on spectral prediction of As concentration in farmland soil. By implementing this proposed method, soil environmental monitoring efforts can be significantly improved.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-14"},"PeriodicalIF":8.6000,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10849777/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Accurately predicting arsenic (As) concentration in farmland soil on a large scale is essential for effectively preventing and managing soil pollution in agricultural areas, thereby safeguarding food security. Multispectral imaging presents a cost-effective and efficient method for monitoring As concentration across extensive farmland regions. Nevertheless, the underlying process and mechanisms determining the relationship between As concentration in farmland soil and spectral data remain uncertain. The primary aim of this study was to evaluate whether employing a hierarchical strategy (based on soil organic matter (SOM) and pH) results in more accurate prediction of As concentration in farmland soil than those employing nonhierarchical (global) models. Our results show that with respect to global models, the best prediction of As concentration was achieved using the convolutional neural network (CNN) model (validated ratio of the model performance to the interquartile distance (RPIQ) =2.50), followed by the Cubist model (validated RPIQ =2.19) and the extreme learning machine (ELM) model (validated RPIQ =2.15). After SOM-based hierarchization, the Cubist model exhibited the highest prediction accuracy (validated coefficient of determination ( $R^{2})=0.73$ ), representing a 0.02 improvement in the $R^{2}$ compared with the that of global CNN model. The clay mineral ratio (CMR) was identified as the most important variable for predicting As concentration. Notably, the identification of high As concentration in the central old town areas underscores the importance of early soil contamination risk warnings on arable lands. These findings indicate that SOM-hierarchical machine learning models could serve as an effective approach to address the influence of soil environmental complications on spectral prediction of As concentration in farmland soil. By implementing this proposed method, soil environmental monitoring efforts can be significantly improved.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于Sentinel-2图像的光谱层次机器学习预测农田土壤砷浓度
大规模准确预测农田土壤砷(As)浓度,对于有效预防和治理农区土壤污染,保障粮食安全至关重要。多光谱成像是一种经济有效的监测农田砷浓度的方法。然而,决定农田土壤砷浓度与光谱数据之间关系的潜在过程和机制仍不确定。本研究的主要目的是评估采用分层策略(基于土壤有机质(SOM)和pH)是否比采用非分层(全局)模型更准确地预测农田土壤As浓度。研究结果表明,在全局模型中,卷积神经网络(CNN)模型(模型性能与四分位数距离的验证比(RPIQ) =2.50)对As浓度的预测效果最好,其次是Cubist模型(验证RPIQ =2.19)和极限学习机(ELM)模型(验证RPIQ =2.15)。基于som的分层后,Cubist模型的预测精度最高(验证的决定系数($R^{2})=0.73$),与全局CNN模型相比,$R^{2}$提高了0.02。黏土矿物比(CMR)是预测砷浓度最重要的变量。值得一提的是,中心老城区高砷浓度的发现凸显了耕地土壤污染早期风险预警的重要性。这些结果表明,som分层机器学习模型可以作为解决土壤环境复杂性对农田土壤as浓度光谱预测影响的有效方法。通过实施该方法,可以显著提高土壤环境监测工作的效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Geoscience and Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理
CiteScore
11.50
自引率
28.00%
发文量
1912
审稿时长
4.0 months
期刊介绍: IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.
期刊最新文献
Phase Error Suppression for Swaying Antennas in Base Station-Based Bridge Monitoring Multiscale Spiking Graph Convolution Aggregation Network for Hyperspectral Image Classification AD-GRT Deep Learning Waveform Inversion CroBIM-V: Memory-Quality Controlled Remote Sensing Referring Video Object Segmentation MapSAM2: Adapting SAM2 for Automatic Segmentation of Historical Map Images and Time Series
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1