Proloy Deb, Virender Kumar, Anton Urfels, Jonathan Lautze, Baldev Raj Kamboj, Jeet Ram Sharma, Sudhir Yadav
{"title":"Which Machine Learning Algorithm Is Best Suited for Estimating Reference Evapotranspiration in Humid Subtropical Climate?","authors":"Proloy Deb, Virender Kumar, Anton Urfels, Jonathan Lautze, Baldev Raj Kamboj, Jeet Ram Sharma, Sudhir Yadav","doi":"10.1002/clen.202300441","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Timely and reliable estimates of reference evapotranspiration (ET<sub>0</sub>) are imperative for robust water resources planning and management. Applying machine learning (ML) algorithms for estimating ET<sub>0</sub> has been evolving, and their applicability in different sectors is still a compelling field of research. In this study, four Gaussian process regression (GPR) algorithms—polynomial kernel (PK), polynomial universal function kernel (PUK), normalized poly kernel (NPK), and radial basis function (RBF)—were compared against widely used random forest (RF) and a simpler locally weighted linear regression (LWLR) algorithm at a humid subtropical region in India. The sensitivity analysis of the input variables was followed by application of the best combination of variables in algorithm testing and training for generating ET<sub>0</sub>. The results were then compared against the Penman–Monteith method at both daily and monthly time steps. The results indicated that ET<sub>0</sub> is least sensitive to wind speed at 2 m height. Additionally, at a daily time step, RF, followed by PUK, generated the best results during both training and testing phases. In contrast, at a monthly time step, using multiple model evaluation matrices, PUK followed by RF performed best. These results demonstrate the application of the ML algorithms is subjected to user-required time steps. Although this study focused on Northwest India, the findings are relevant to all humid subtropical regions across the world.</p>\n </div>","PeriodicalId":10306,"journal":{"name":"Clean-soil Air Water","volume":"53 1","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clean-soil Air Water","FirstCategoryId":"93","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/clen.202300441","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Timely and reliable estimates of reference evapotranspiration (ET0) are imperative for robust water resources planning and management. Applying machine learning (ML) algorithms for estimating ET0 has been evolving, and their applicability in different sectors is still a compelling field of research. In this study, four Gaussian process regression (GPR) algorithms—polynomial kernel (PK), polynomial universal function kernel (PUK), normalized poly kernel (NPK), and radial basis function (RBF)—were compared against widely used random forest (RF) and a simpler locally weighted linear regression (LWLR) algorithm at a humid subtropical region in India. The sensitivity analysis of the input variables was followed by application of the best combination of variables in algorithm testing and training for generating ET0. The results were then compared against the Penman–Monteith method at both daily and monthly time steps. The results indicated that ET0 is least sensitive to wind speed at 2 m height. Additionally, at a daily time step, RF, followed by PUK, generated the best results during both training and testing phases. In contrast, at a monthly time step, using multiple model evaluation matrices, PUK followed by RF performed best. These results demonstrate the application of the ML algorithms is subjected to user-required time steps. Although this study focused on Northwest India, the findings are relevant to all humid subtropical regions across the world.
期刊介绍:
CLEAN covers all aspects of Sustainability and Environmental Safety. The journal focuses on organ/human--environment interactions giving interdisciplinary insights on a broad range of topics including air pollution, waste management, the water cycle, and environmental conservation. With a 2019 Journal Impact Factor of 1.603 (Journal Citation Reports (Clarivate Analytics, 2020), the journal publishes an attractive mixture of peer-reviewed scientific reviews, research papers, and short communications.
Papers dealing with environmental sustainability issues from such fields as agriculture, biological sciences, energy, food sciences, geography, geology, meteorology, nutrition, soil and water sciences, etc., are welcome.