{"title":"Forecasting paleoclimatic data with time series models","authors":"Gordon Reikard","doi":"10.1016/j.ringps.2021.100015","DOIUrl":null,"url":null,"abstract":"<div><p>Until recently, one obstacle to forecasting paleoclimatic data with time series models was gaps in the record. In 2020, highly resolved data sets became available. Time series models make it possible to determine how accurately climate can be predicted using techniques such as regressions and artificial intelligence. This paper runs forecasting tests for δ<sup>18</sup>O and δ<sup>13</sup>C using data spanning 34 Ma. The data exhibit several features: long-memory, irregular trending, and nonlinear variability. The probability distribution has heavy tails, and there are intermittent outliers. Because of the repeated changes in state, a simple train-validate-forecast method is inappropriate for these data sets. Instead, the testing methodology is iterative forecasting over moving windows: only recent observations are used to predict the future. There are several findings. First, at horizons of 2–4 kyr, all the methods perform well. Regressions and neural networks including the orbital parameters achieve the most accurate predictions for δ<sup>18</sup>O. In the tests for δ<sup>13</sup>C, the contest between the models is much closer. Second, as the horizon extends, accuracy deteriorates. At 10 kyr, the models track the central tendency of the data but miss the fluctuations. Third, forecast accuracy is found to vary substantially over time. There is a marked deterioration in accuracy starting around 2.58 Ma, coinciding with a period of steeper trending and greater amplitude in the cyclical fluctuations. The main limitation of time series models is that they do not incorporate the underlying physics. A more effective approach may be to combine statistical and physics-based models.</p></div>","PeriodicalId":101086,"journal":{"name":"Results in Geophysical Sciences","volume":"6 ","pages":"Article 100015"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.ringps.2021.100015","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Results in Geophysical Sciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666828921000067","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Until recently, one obstacle to forecasting paleoclimatic data with time series models was gaps in the record. In 2020, highly resolved data sets became available. Time series models make it possible to determine how accurately climate can be predicted using techniques such as regressions and artificial intelligence. This paper runs forecasting tests for δ18O and δ13C using data spanning 34 Ma. The data exhibit several features: long-memory, irregular trending, and nonlinear variability. The probability distribution has heavy tails, and there are intermittent outliers. Because of the repeated changes in state, a simple train-validate-forecast method is inappropriate for these data sets. Instead, the testing methodology is iterative forecasting over moving windows: only recent observations are used to predict the future. There are several findings. First, at horizons of 2–4 kyr, all the methods perform well. Regressions and neural networks including the orbital parameters achieve the most accurate predictions for δ18O. In the tests for δ13C, the contest between the models is much closer. Second, as the horizon extends, accuracy deteriorates. At 10 kyr, the models track the central tendency of the data but miss the fluctuations. Third, forecast accuracy is found to vary substantially over time. There is a marked deterioration in accuracy starting around 2.58 Ma, coinciding with a period of steeper trending and greater amplitude in the cyclical fluctuations. The main limitation of time series models is that they do not incorporate the underlying physics. A more effective approach may be to combine statistical and physics-based models.