Stream Nitrate Dynamics Driven Primarily by Discharge and Watershed Physical and Soil Characteristics at Intensively Monitored Sites: Insights From Deep Learning
G. Gorski, L. Larsen, J. Wingenroth, L. Zhang, D. Bellugi, A. P. Appling
{"title":"Stream Nitrate Dynamics Driven Primarily by Discharge and Watershed Physical and Soil Characteristics at Intensively Monitored Sites: Insights From Deep Learning","authors":"G. Gorski, L. Larsen, J. Wingenroth, L. Zhang, D. Bellugi, A. P. Appling","doi":"10.1029/2023wr036591","DOIUrl":null,"url":null,"abstract":"We developed a suite of models using deep learning to make hindcast predictions of the 7-day average backward-looking nitrate concentration at 46 predominantly agricultural sites across the midwestern and eastern United States. The models used daily observations of discharge and meteorological variables and watershed attributes describing anthropogenic modification to hydrology, nitrogen application, climate, groundwater, land use, watershed physiographic attributes, and soils. Across all sites, discharge and watershed soil and physiographic attributes showed a strong influence on model performance. Analysis of drivers across sites revealed considerable regional differences related to controlling processes such as groundwater contributions. We tested several ways to pool data across sites to develop accurate models and make the most effective use of available data. Single-site models, in which models are trained and tested at a single location, showed generally strong predictive performance (median Kling-Gupta Efficiency = 0.66), and accuracy at poorly performing sites could be improved by grouping sites with similar characteristics. Developing a single model for all sites reduced performance at several locations with distinct characteristics, suggesting that there is a threshold of dissimilarity beyond which more data does not improve the model. While many deep learning studies have shown that national or even global models can outperform local models, it is not clear that this is true for water quality constituents. This study demonstrates how data can be combined effectively, using deep learning to develop accurate and interpretable models of instream nitrate at sites where varying processes are responsible for changes in nitrate concentration.","PeriodicalId":23799,"journal":{"name":"Water Resources Research","volume":"38 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Resources Research","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1029/2023wr036591","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
We developed a suite of models using deep learning to make hindcast predictions of the 7-day average backward-looking nitrate concentration at 46 predominantly agricultural sites across the midwestern and eastern United States. The models used daily observations of discharge and meteorological variables and watershed attributes describing anthropogenic modification to hydrology, nitrogen application, climate, groundwater, land use, watershed physiographic attributes, and soils. Across all sites, discharge and watershed soil and physiographic attributes showed a strong influence on model performance. Analysis of drivers across sites revealed considerable regional differences related to controlling processes such as groundwater contributions. We tested several ways to pool data across sites to develop accurate models and make the most effective use of available data. Single-site models, in which models are trained and tested at a single location, showed generally strong predictive performance (median Kling-Gupta Efficiency = 0.66), and accuracy at poorly performing sites could be improved by grouping sites with similar characteristics. Developing a single model for all sites reduced performance at several locations with distinct characteristics, suggesting that there is a threshold of dissimilarity beyond which more data does not improve the model. While many deep learning studies have shown that national or even global models can outperform local models, it is not clear that this is true for water quality constituents. This study demonstrates how data can be combined effectively, using deep learning to develop accurate and interpretable models of instream nitrate at sites where varying processes are responsible for changes in nitrate concentration.
期刊介绍:
Water Resources Research (WRR) is an interdisciplinary journal that focuses on hydrology and water resources. It publishes original research in the natural and social sciences of water. It emphasizes the role of water in the Earth system, including physical, chemical, biological, and ecological processes in water resources research and management, including social, policy, and public health implications. It encompasses observational, experimental, theoretical, analytical, numerical, and data-driven approaches that advance the science of water and its management. Submissions are evaluated for their novelty, accuracy, significance, and broader implications of the findings.