An optimal sample size index for updating spatial soil models

IF 6.6 1区 农林科学 Q1 SOIL SCIENCE Geoderma Pub Date : 2025-03-01 Epub Date: 2025-02-12 DOI:10.1016/j.geoderma.2025.117208
Caner Ferhatoglu , Wei Chen , Marshall D. McDaniel , Bradley A. Miller
{"title":"An optimal sample size index for updating spatial soil models","authors":"Caner Ferhatoglu ,&nbsp;Wei Chen ,&nbsp;Marshall D. McDaniel ,&nbsp;Bradley A. Miller","doi":"10.1016/j.geoderma.2025.117208","DOIUrl":null,"url":null,"abstract":"<div><div>Soil map updates can be expensive due to soil sampling and analysis costs. This study introduces the Optimal Sample Size Index (OSSI), a flexible framework for digital soil mapping that balances model accuracy and sampling costs to update soil spatial models. OSSI determines optimal sample size from subsets of initially available training sets, accounting for different physiographies and model validation metrics with adjustable weights, including root-mean-square-error (RMSE) for cross-validation (CV) and RMSE for independent validation, the standard deviation of RMSE values from 10-fold CV, and relative sampling cost. Relative sampling cost represents the proportion of the number of samples used in modeling to the initially available sample size. We applied two OSSI scenarios to address limitations of the original approach, which prioritized cost reduction but occasionally resulted in unreliable models due to very small training sizes. By adjusting metrics and weights, the second scenario accounted for model uncertainty, producing more reliable models with sample sizes considerably lower than full training sets. Four soil properties (pH, clay, silt, and sand %) were spatially modeled for surface soils in three study areas in Iowa, USA, using random forest regressors. Both scenarios reduced relative sampling costs by up to 92 % compared to using all samples while maintaining similar or improved model performance. The second scenario further ensured model reliability, as shown by lower standard deviations of CV-RMSE values. Our results demonstrate OSSI’s flexibility to balance cost, accuracy, and reliability, offering a practical solution for optimizing soil sample sizes and updating soil survey maps.</div></div>","PeriodicalId":12511,"journal":{"name":"Geoderma","volume":"455 ","pages":"Article 117208"},"PeriodicalIF":6.6000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoderma","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016706125000461","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/12 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"SOIL SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Soil map updates can be expensive due to soil sampling and analysis costs. This study introduces the Optimal Sample Size Index (OSSI), a flexible framework for digital soil mapping that balances model accuracy and sampling costs to update soil spatial models. OSSI determines optimal sample size from subsets of initially available training sets, accounting for different physiographies and model validation metrics with adjustable weights, including root-mean-square-error (RMSE) for cross-validation (CV) and RMSE for independent validation, the standard deviation of RMSE values from 10-fold CV, and relative sampling cost. Relative sampling cost represents the proportion of the number of samples used in modeling to the initially available sample size. We applied two OSSI scenarios to address limitations of the original approach, which prioritized cost reduction but occasionally resulted in unreliable models due to very small training sizes. By adjusting metrics and weights, the second scenario accounted for model uncertainty, producing more reliable models with sample sizes considerably lower than full training sets. Four soil properties (pH, clay, silt, and sand %) were spatially modeled for surface soils in three study areas in Iowa, USA, using random forest regressors. Both scenarios reduced relative sampling costs by up to 92 % compared to using all samples while maintaining similar or improved model performance. The second scenario further ensured model reliability, as shown by lower standard deviations of CV-RMSE values. Our results demonstrate OSSI’s flexibility to balance cost, accuracy, and reliability, offering a practical solution for optimizing soil sample sizes and updating soil survey maps.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
空间土壤模型更新的最优样本量指标
由于土壤采样和分析的成本,土壤地图更新可能很昂贵。本研究引入了最优样本量指数(OSSI),这是一种灵活的数字土壤制图框架,可以平衡模型精度和采样成本,以更新土壤空间模型。OSSI从最初可用的训练集的子集中确定最佳样本量,考虑不同的地形和模型验证指标,具有可调整的权重,包括交叉验证(CV)的均方根误差(RMSE)和独立验证的RMSE, 10倍CV的RMSE值的标准偏差,以及相对采样成本。相对采样成本表示建模中使用的样本数量与初始可用样本量的比例。我们应用了两个OSSI场景来解决原始方法的局限性,原始方法优先考虑降低成本,但由于训练规模很小,偶尔会导致模型不可靠。通过调整度量和权重,第二种方案考虑了模型的不确定性,产生了更可靠的模型,其样本量大大低于完整的训练集。采用随机森林回归模型对美国爱荷华州3个研究区表层土壤的4种土壤性质(pH、粘土、粉土和砂%)进行了空间模拟。与使用所有样本同时保持相似或改进的模型性能相比,这两种方案都减少了高达92%的相对采样成本。第二种情况进一步保证了模型的可靠性,CV-RMSE值的标准差较低。我们的研究结果证明了OSSI在平衡成本、准确性和可靠性方面的灵活性,为优化土壤样本大小和更新土壤调查地图提供了实用的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Geoderma
Geoderma 农林科学-土壤科学
CiteScore
11.80
自引率
6.60%
发文量
597
审稿时长
58 days
期刊介绍: Geoderma - the global journal of soil science - welcomes authors, readers and soil research from all parts of the world, encourages worldwide soil studies, and embraces all aspects of soil science and its associated pedagogy. The journal particularly welcomes interdisciplinary work focusing on dynamic soil processes and functions across space and time.
期刊最新文献
Spatially explicit impact of enhanced leaf litter leaching on the forest soil microbiome Organic amendments reshaped the chemical composition of soil organic matter: A meta-analysis Slope-dependent erosion control by rock fragments: experiments and model improvements Anecic earthworms and residue management synergistically enhance soil carbon sequestration via macroaggregate formation Mineral associated organic matter fractions have an outsized role in nitrogen mineralization and denitrification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1