{"title":"Emulated order identification for models of big time series data","authors":"Brian Wu, Dorin Drignei","doi":"10.1002/sam.11504","DOIUrl":null,"url":null,"abstract":"This interdisciplinary research includes elements of computing, optimization, and statistics for big data. Specifically, it addresses model order identification aspects of big time series data. Computing and minimizing information criteria, such as BIC, on a grid of integer orders becomes prohibitive for time series recorded at a large number of time points. We propose to compute information criteria only for a sample of integer orders and use kriging‐based methods to emulate the information criteria on the rest of the grid. Then we use an efficient global optimization (EGO) algorithm to identify the orders. The method is applied to both ARMA and ARMA‐GARCH models. We simulated times series from each type of model of prespecified orders and applied the method to identify the orders. We also used real big time series with tens of thousands of time points to illustrate the method. In particular, we used sentiment scores for news headlines on the economy for ARMA models, and the NASDAQ daily returns for ARMA‐GARCH models, from the beginning in 1971 to mid‐April 2020 in the early stages of the COVID‐19 pandemic. The proposed method identifies efficiently and accurately the orders of models for big time series data.","PeriodicalId":342679,"journal":{"name":"Statistical Analysis and Data Mining: The ASA Data Science Journal","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical Analysis and Data Mining: The ASA Data Science Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/sam.11504","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This interdisciplinary research includes elements of computing, optimization, and statistics for big data. Specifically, it addresses model order identification aspects of big time series data. Computing and minimizing information criteria, such as BIC, on a grid of integer orders becomes prohibitive for time series recorded at a large number of time points. We propose to compute information criteria only for a sample of integer orders and use kriging‐based methods to emulate the information criteria on the rest of the grid. Then we use an efficient global optimization (EGO) algorithm to identify the orders. The method is applied to both ARMA and ARMA‐GARCH models. We simulated times series from each type of model of prespecified orders and applied the method to identify the orders. We also used real big time series with tens of thousands of time points to illustrate the method. In particular, we used sentiment scores for news headlines on the economy for ARMA models, and the NASDAQ daily returns for ARMA‐GARCH models, from the beginning in 1971 to mid‐April 2020 in the early stages of the COVID‐19 pandemic. The proposed method identifies efficiently and accurately the orders of models for big time series data.