{"title":"Synthesizing regional irrigation data using machine learning – Towards global upscaling via metamodeling","authors":"Søren Julsgaard Kragh , Raphael Schneider , Rasmus Fensholt , Simon Stisen , Julian Koch","doi":"10.1016/j.agwat.2025.109404","DOIUrl":null,"url":null,"abstract":"<div><div>Knowledge on irrigation is key to sustainable water resource management, but spatio-temporal irrigation data are rarely available. Recent advances are based upon satellite remote sensing data to quantify irrigation at high spatial resolution, and this study utilizes published irrigation datasets at regional scale to develop a metamodel approach to synthesize the available irrigation knowledge. We investigate the potentials and limitations of a Random Forest-based metamodeling approach that predicts irrigation at monthly timescale using only globally available and easily accessible features related to hydroclimatic and vegetation variables. The training dataset consists of three irrigation water use datasets derived from the soil moisture-based inversion framework and covers a variety of climatic conditions and irrigation practices in Spain, Italy, and Australia. Further, the study includes irrigation predictions from three test sites representing major global hot spots for unsustainable irrigation management: the North China Plain, Indus, and Ganges Basins. Our study aims to test the model transferability in space and time based on a series of split-sample experiments. We quantify and outline model transferability based on the area of applicability analysis, showing that although the feature space was mostly well represented, the magnitude of the target variable was equally important for assessing model transferability. A comprehensive feature importance analysis reveals that ranking of the most important input features depends on geographical extent of the training dataset. We find that model transferability was more robust across space than time within the small study areas, mainly because of the small geographical extents of the training datasets. The developed metamodel demonstrates satisfying performance on irrigation water use with mean error of 3 mm/month (10% bias) for a successful model transferability outside the training study areas. The spatial pattern performance of irrigation was lower but spatial patterns of irrigation were nevertheless closely linked to climate and remote sensing features. Given the increase in published regional irrigation datasets, we see great potential for further developing metamodel approaches for synthesizing existing knowledge and work towards global upscaling opportunities.</div></div>","PeriodicalId":7634,"journal":{"name":"Agricultural Water Management","volume":"311 ","pages":"Article 109404"},"PeriodicalIF":5.9000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Agricultural Water Management","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378377425001180","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
Abstract
Knowledge on irrigation is key to sustainable water resource management, but spatio-temporal irrigation data are rarely available. Recent advances are based upon satellite remote sensing data to quantify irrigation at high spatial resolution, and this study utilizes published irrigation datasets at regional scale to develop a metamodel approach to synthesize the available irrigation knowledge. We investigate the potentials and limitations of a Random Forest-based metamodeling approach that predicts irrigation at monthly timescale using only globally available and easily accessible features related to hydroclimatic and vegetation variables. The training dataset consists of three irrigation water use datasets derived from the soil moisture-based inversion framework and covers a variety of climatic conditions and irrigation practices in Spain, Italy, and Australia. Further, the study includes irrigation predictions from three test sites representing major global hot spots for unsustainable irrigation management: the North China Plain, Indus, and Ganges Basins. Our study aims to test the model transferability in space and time based on a series of split-sample experiments. We quantify and outline model transferability based on the area of applicability analysis, showing that although the feature space was mostly well represented, the magnitude of the target variable was equally important for assessing model transferability. A comprehensive feature importance analysis reveals that ranking of the most important input features depends on geographical extent of the training dataset. We find that model transferability was more robust across space than time within the small study areas, mainly because of the small geographical extents of the training datasets. The developed metamodel demonstrates satisfying performance on irrigation water use with mean error of 3 mm/month (10% bias) for a successful model transferability outside the training study areas. The spatial pattern performance of irrigation was lower but spatial patterns of irrigation were nevertheless closely linked to climate and remote sensing features. Given the increase in published regional irrigation datasets, we see great potential for further developing metamodel approaches for synthesizing existing knowledge and work towards global upscaling opportunities.
期刊介绍:
Agricultural Water Management publishes papers of international significance relating to the science, economics, and policy of agricultural water management. In all cases, manuscripts must address implications and provide insight regarding agricultural water management.