{"title":"GPC/m: Global Precipitation Climatology by Machine Learning; Quasi-global, Daily, and One Degree Spatial Resolution","authors":"Hiroshi G. Takahashi","doi":"arxiv-2409.09639","DOIUrl":null,"url":null,"abstract":"This paper presents a new precipitation dataset that is daily, has a spatial\nresolution of one degree on a quasi-global scale, and spans more than 42 years,\nusing machine learning techniques. The ultimate goal of this dataset is to\nprovide a homogeneous daily precipitation dataset for several decades without\ngaps, which is suitable for climate analysis. As a first step, 42 years of\ndaily precipitation data was generated using machine learning techniques. The\nmachine learning methods are supervised learning, and the reference data are\nestimated precipitation datasets from 2001 to 2020. The three machine learning\nmethods are random forest, gradient-boosted decision trees, and convolutional\nneural networks. The input data are satellite observations and atmospheric\ncirculations from reanalysis, which are somewhat modified based on knowledge of\nthe climatological background. Using the trained statistical models, we predict\nback to 1979, when daily precipitation data was almost unavailable globally.\nThe detailed procedures are described in this paper. The produced data have\nbeen partially evaluated. However, additional evaluations from different\nperspectives are needed. The advantages and disadvantages of this precipitation\ndataset are also discussed. Currently, this GPC/m precipitation dataset version\nis GPC/m-v1-2024.","PeriodicalId":501166,"journal":{"name":"arXiv - PHYS - Atmospheric and Oceanic Physics","volume":"21 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Atmospheric and Oceanic Physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09639","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper presents a new precipitation dataset that is daily, has a spatial
resolution of one degree on a quasi-global scale, and spans more than 42 years,
using machine learning techniques. The ultimate goal of this dataset is to
provide a homogeneous daily precipitation dataset for several decades without
gaps, which is suitable for climate analysis. As a first step, 42 years of
daily precipitation data was generated using machine learning techniques. The
machine learning methods are supervised learning, and the reference data are
estimated precipitation datasets from 2001 to 2020. The three machine learning
methods are random forest, gradient-boosted decision trees, and convolutional
neural networks. The input data are satellite observations and atmospheric
circulations from reanalysis, which are somewhat modified based on knowledge of
the climatological background. Using the trained statistical models, we predict
back to 1979, when daily precipitation data was almost unavailable globally.
The detailed procedures are described in this paper. The produced data have
been partially evaluated. However, additional evaluations from different
perspectives are needed. The advantages and disadvantages of this precipitation
dataset are also discussed. Currently, this GPC/m precipitation dataset version
is GPC/m-v1-2024.