Digital mapping of soil salinity with time-windows features optimization and ensemble learning model

IF 7.3 2区 环境科学与生态学 Q1 ECOLOGY Ecological Informatics Pub Date : 2025-03-01 Epub Date: 2024-12-27 DOI:10.1016/j.ecoinf.2024.102982
Shuaishuai Shi , Nan Wang , Songchao Chen , Bifeng Hu , Jie Peng , Zhou Shi
{"title":"Digital mapping of soil salinity with time-windows features optimization and ensemble learning model","authors":"Shuaishuai Shi ,&nbsp;Nan Wang ,&nbsp;Songchao Chen ,&nbsp;Bifeng Hu ,&nbsp;Jie Peng ,&nbsp;Zhou Shi","doi":"10.1016/j.ecoinf.2024.102982","DOIUrl":null,"url":null,"abstract":"<div><div>Soil salinization poses considerable global environmental and ecological risks. Remote-sensing time-series data enable more accurate monitoring and prediction of soil salinity levels, offering a refined approach to soil salinization assessment. However, the current limitations of time-series data analysis—particularly in terms of timely and effective information extraction—hinder high-precision soil salinity assessments. This study proposes a data mining approach using Sentinel-1 time-series data, integrating time-series decomposition and feature selection to capture seasonal and trend components correlated with soil salinity, and determine optimal time windows and effective time spans. An advanced feature-selection algorithm was then applied to refine the model-relevant features, and the transferability of the method across different regions was validated through empirical testing. The results revealed a 12 month periodicity in the correlation between Sentinel-1 time-series features and soil salinity, with an annual decay rate of 0.0019. In the study area, the optimal time window was from July to September, with the maximum effective years ranging from 19 to 21. Recursive feature elimination has shown a gradually increasing trend in the importance of SAR features from single-temporal to multi-temporal to time-series data. The time-series analysis combined with feature selection not only significantly reduced data volumes, but also improved the prediction accuracy of the model—increased R<sup>2</sup> of the prediction set was improved by 0.11, and a reduced root mean square error of 3.08 g kg<sup>−1</sup>, compared to single-temporal data. Furthermore, the results demonstrate that the ensemble model outperforms the individual models in terms of inversion accuracy, whereas the time-series mining method exhibits generalizability across diverse study areas and metrics. The combination of the time-series mining method with the ensemble model helps achieve a higher accuracy in digital soil mapping.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"85 ","pages":"Article 102982"},"PeriodicalIF":7.3000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954124005247","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/27 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Soil salinization poses considerable global environmental and ecological risks. Remote-sensing time-series data enable more accurate monitoring and prediction of soil salinity levels, offering a refined approach to soil salinization assessment. However, the current limitations of time-series data analysis—particularly in terms of timely and effective information extraction—hinder high-precision soil salinity assessments. This study proposes a data mining approach using Sentinel-1 time-series data, integrating time-series decomposition and feature selection to capture seasonal and trend components correlated with soil salinity, and determine optimal time windows and effective time spans. An advanced feature-selection algorithm was then applied to refine the model-relevant features, and the transferability of the method across different regions was validated through empirical testing. The results revealed a 12 month periodicity in the correlation between Sentinel-1 time-series features and soil salinity, with an annual decay rate of 0.0019. In the study area, the optimal time window was from July to September, with the maximum effective years ranging from 19 to 21. Recursive feature elimination has shown a gradually increasing trend in the importance of SAR features from single-temporal to multi-temporal to time-series data. The time-series analysis combined with feature selection not only significantly reduced data volumes, but also improved the prediction accuracy of the model—increased R2 of the prediction set was improved by 0.11, and a reduced root mean square error of 3.08 g kg−1, compared to single-temporal data. Furthermore, the results demonstrate that the ensemble model outperforms the individual models in terms of inversion accuracy, whereas the time-series mining method exhibits generalizability across diverse study areas and metrics. The combination of the time-series mining method with the ensemble model helps achieve a higher accuracy in digital soil mapping.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于时间窗特征优化和集成学习模型的土壤盐度数字制图
土壤盐渍化对全球环境和生态构成重大风险。遥感时间序列数据能够更准确地监测和预测土壤盐度水平,为土壤盐渍化评估提供了一种改进的方法。然而,目前时间序列数据分析的局限性——特别是在及时有效的信息提取方面——阻碍了高精度的土壤盐度评估。本研究提出了一种基于Sentinel-1时间序列数据的数据挖掘方法,将时间序列分解和特征选择相结合,捕捉与土壤盐分相关的季节和趋势分量,确定最佳时间窗和有效时间跨度。采用先进的特征选择算法对模型相关特征进行细化,并通过实证检验验证了该方法在不同区域间的可移植性。结果表明,Sentinel-1时间序列特征与土壤盐度的相关性具有12个月的周期性,年衰减率为0.0019。研究区最佳时间窗为7 ~ 9月,最大有效年限为19 ~ 21年。从单时相到多时相再到时间序列数据,SAR特征的重要性呈现出逐渐增加的趋势。时间序列分析结合特征选择不仅显著减少了数据量,而且提高了模型的预测精度——与单时间数据相比,预测集的R2提高了0.11,均方根误差降低了3.08 g kg−1。此外,结果表明,在反演精度方面,集成模型优于单个模型,而时间序列挖掘方法在不同的研究领域和指标上表现出通用性。将时间序列挖掘方法与集成模型相结合,可以提高数字土壤制图的精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Ecological Informatics
Ecological Informatics 环境科学-生态学
CiteScore
8.30
自引率
11.80%
发文量
346
审稿时长
46 days
期刊介绍: The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change. The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.
期刊最新文献
Can Google trends be used to estimate the geographic distribution of alien plants in the United States? WQEye (v1): A Python-based software for machine learning-based retrieval of water quality parameters from Sentinel-2 and Landsat-8/9 remote sensing data aided by Google Earth Engine LEPY: A Python pipeline for automated trait extraction from standardised Lepidoptera images A Hapke-based bi-temporal UAV hyperspectral index for reducing temporal spectral variations in individual-tree biomass estimation Are remote sensing-based crop type classifications suitable for calculating a landscape heterogeneity metric? A data-fitness-for-purpose assessment
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1