首页 > 最新文献

Econometrics and Statistics最新文献

英文 中文
Empirical best predictors under multivariate Fay-Herriot models and their numerical approximation 多变量费-赫里奥特模型下的经验最佳预测值及其数值近似值
IF 1.9 Q2 ECONOMICS Pub Date : 2024-09-10 DOI: 10.1016/j.ecosta.2024.09.001
Jan Pablo Burgard, Joscha Krause, Domingo Morales, Anna-Lena Wölwer
Small area estimation of multivariable non-linear domain indicators using aggregated data is addressed. By assuming that the target vector follows a multivariate Fay-Herriot model, empirical best predictors of domain parameters that are arbitrary Lebesgue-measurable functions of multiple target variables are derived. In this context, Monte Carlo and Gauss-Hermite quadrature methods for integral approximation are discussed. A parametric bootstrap algorithm for mean squared error estimation is presented. Simulation experiments are conducted to study the behaviour of the introduced methodology. Moreover, an illustrative application to real data from the Spanish labour force survey is provided. In this example, province-level unemployment rates, crossed by age classes and sex, are estimated.
本文探讨了利用汇总数据对多变量非线性领域指标进行小范围估算的问题。通过假设目标向量遵循多变量 Fay-Herriot 模型,得出了作为多个目标变量的任意 Lebesgue 可量函数的领域参数的经验最佳预测值。在此背景下,讨论了用于积分近似的蒙特卡罗和高斯-赫米特正交方法。还介绍了用于均方误差估计的参数引导算法。通过模拟实验研究了所引入方法的性能。此外,还提供了西班牙劳动力调查真实数据的示例应用。在这个例子中,估算了按年龄和性别分类的省一级失业率。
{"title":"Empirical best predictors under multivariate Fay-Herriot models and their numerical approximation","authors":"Jan Pablo Burgard, Joscha Krause, Domingo Morales, Anna-Lena Wölwer","doi":"10.1016/j.ecosta.2024.09.001","DOIUrl":"https://doi.org/10.1016/j.ecosta.2024.09.001","url":null,"abstract":"Small area estimation of multivariable non-linear domain indicators using aggregated data is addressed. By assuming that the target vector follows a multivariate Fay-Herriot model, empirical best predictors of domain parameters that are arbitrary Lebesgue-measurable functions of multiple target variables are derived. In this context, Monte Carlo and Gauss-Hermite quadrature methods for integral approximation are discussed. A parametric bootstrap algorithm for mean squared error estimation is presented. Simulation experiments are conducted to study the behaviour of the introduced methodology. Moreover, an illustrative application to real data from the Spanish labour force survey is provided. In this example, province-level unemployment rates, crossed by age classes and sex, are estimated.","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142250941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting with Machine Learning methods and multiple large datasets[formula omitted] 使用机器学习方法和多个大型数据集进行预测[公式省略]
IF 1.9 Q2 ECONOMICS Pub Date : 2024-09-06 DOI: 10.1016/j.ecosta.2024.08.003
Nikoleta Anesti, Eleni Kalamara, George Kapetanios
The usefulness of machine learning techniques for forecasting macroeconomic variables using multiple large datasets is considered. The predictive content of surveys is compared with text-based indicators from newspaper articles and a standard macroeconomic dataset, extending the evidence on the contribution of each dataset in predicting economic activity. Among the linear models, the Ridge regression and the Partial Least Squares models report the largest gains consistently for most of the forecasting horizons, and among the non linear machine learning models, Support Vector Regression performs better at shorter horizons compared to the Neural Networks and Random Forest that yield more accurate forecasts up to two years ahead. Text based indicators have similar informational content to surveys, albeit combining the two datasets provides with more accurate forecasts for most of the forecast horizons. The largest forecasting gains are overwhelmingly concentrated at the shorter horizons for the majority of models and datasets and they decrease significantly after one year. Non-linear machine learning models appear to be mostly useful during the Great Financial Crisis and perform similarly to their linear counterparts in more normal periods.
本文探讨了机器学习技术在利用多个大型数据集预测宏观经济变量方面的实用性。将调查的预测内容与来自报纸文章和标准宏观经济数据集的基于文本的指标进行了比较,从而扩展了每个数据集在预测经济活动方面的贡献。在线性模型中,岭回归和偏最小二乘法模型在大多数预测期限内的收益最大,而在非线性机器学习模型中,支持向量回归在较短期限内的表现要好于神经网络和随机森林,前者可在两年内做出更准确的预测。基于文本的指标与调查具有相似的信息内容,尽管将这两个数据集结合起来,在大多数预测范围内都能提供更准确的预测。对于大多数模型和数据集来说,最大的预测收益绝大多数集中在较短的时间跨度上,并且在一年后会显著下降。非线性机器学习模型在大金融危机期间似乎最有用,而在较正常时期的表现与线性模型类似。
{"title":"Forecasting with Machine Learning methods and multiple large datasets[formula omitted]","authors":"Nikoleta Anesti, Eleni Kalamara, George Kapetanios","doi":"10.1016/j.ecosta.2024.08.003","DOIUrl":"https://doi.org/10.1016/j.ecosta.2024.08.003","url":null,"abstract":"The usefulness of machine learning techniques for forecasting macroeconomic variables using multiple large datasets is considered. The predictive content of surveys is compared with text-based indicators from newspaper articles and a standard macroeconomic dataset, extending the evidence on the contribution of each dataset in predicting economic activity. Among the linear models, the Ridge regression and the Partial Least Squares models report the largest gains consistently for most of the forecasting horizons, and among the non linear machine learning models, Support Vector Regression performs better at shorter horizons compared to the Neural Networks and Random Forest that yield more accurate forecasts up to two years ahead. Text based indicators have similar informational content to surveys, albeit combining the two datasets provides with more accurate forecasts for most of the forecast horizons. The largest forecasting gains are overwhelmingly concentrated at the shorter horizons for the majority of models and datasets and they decrease significantly after one year. Non-linear machine learning models appear to be mostly useful during the Great Financial Crisis and perform similarly to their linear counterparts in more normal periods.","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142218394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Specification tests for normal/gamma and stable/gamma stochastic frontier models based on empirical transforms 基于经验变换的正态/伽马和稳定/伽马随机前沿模型的规格检验
IF 1.9 Q2 ECONOMICS Pub Date : 2024-08-28 DOI: 10.1016/j.ecosta.2024.08.002
Christos K. Papadimitriou, Simos G. Meintanis, Bernardo B. Andrade, Mike G. Tsionas
Goodness–of–fit tests for the distribution of the composed error term in a Stochastic Frontier Model (SFM) are suggested. The focus is on the case of a normal/gamma SFM and the heavy–tailed stable/gamma SFM. In the first case the moment generating function is used as tool while in the latter case the characteristic function of the error term is employed. In both cases our test statistics are formulated as weighted integrals of properly standardized data. The new normal/gamma test is consistent, and is shown to have an intrinsic relation to moment–based tests. The finite–sample behavior of resampling versions of both tests is investigated by Monte Carlo simulation, while several real–data applications are also included.
提出了随机前沿模型(SFM)中组成误差项分布的拟合优度检验。重点是正态/伽马 SFM 和重尾稳定/伽马 SFM 的情况。在前一种情况下,使用矩生成函数作为工具,而在后一种情况下,则使用误差项的特征函数。在这两种情况下,我们的检验统计量都被表述为适当标准化数据的加权积分。新的正态/伽马检验是一致的,而且与基于矩的检验有内在联系。蒙特卡洛模拟研究了这两种检验的重采样版本的有限样本行为,同时还包括几个实际数据应用。
{"title":"Specification tests for normal/gamma and stable/gamma stochastic frontier models based on empirical transforms","authors":"Christos K. Papadimitriou, Simos G. Meintanis, Bernardo B. Andrade, Mike G. Tsionas","doi":"10.1016/j.ecosta.2024.08.002","DOIUrl":"https://doi.org/10.1016/j.ecosta.2024.08.002","url":null,"abstract":"Goodness–of–fit tests for the distribution of the composed error term in a Stochastic Frontier Model (SFM) are suggested. The focus is on the case of a normal/gamma SFM and the heavy–tailed stable/gamma SFM. In the first case the moment generating function is used as tool while in the latter case the characteristic function of the error term is employed. In both cases our test statistics are formulated as weighted integrals of properly standardized data. The new normal/gamma test is consistent, and is shown to have an intrinsic relation to moment–based tests. The finite–sample behavior of resampling versions of both tests is investigated by Monte Carlo simulation, while several real–data applications are also included.","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142250987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian flexible model for testing Granger causality 检验格兰杰因果关系的贝叶斯灵活模型
IF 1.9 Q2 ECONOMICS Pub Date : 2024-08-03 DOI: 10.1016/j.ecosta.2024.08.001
Iván Gutiérrez, Danilo Alvares, Luis Gutiérrez
A new Bayesian hypothesis testing procedure for evaluating the Granger causality between two or more time series is proposed. The test is based on a flexible model for the joint evolution of multiple series, where a latent binary matrix indicates whether there is a Granger-causal relationship between such time series. The model is specified through a dependent Geometric stick-breaking process that generalizes the standard parametric Gaussian vector autoregression model, whereas the prior distribution of the latent matrix ensures a multiple testing correction. A Monte Carlo simulation study is provided for comparing the robustness of the proposed hypothesis test with state-of-the-art alternatives. The results show that this proposal performs better than competing approaches. Finally, the new test is applied to real economic data.
本文提出了一种新的贝叶斯假设检验程序,用于评估两个或多个时间序列之间的格兰杰因果关系。该检验基于一个灵活的多序列联合演化模型,其中一个潜在的二元矩阵表示这些时间序列之间是否存在格兰杰因果关系。该模型是通过一个从属的几何破棒过程来指定的,它概括了标准参数高斯向量自回归模型,而潜矩阵的先验分布则确保了多重检验校正。本文提供了蒙特卡罗模拟研究,以比较建议的假设检验与最先进的替代方法的稳健性。结果表明,该建议的性能优于其他竞争方法。最后,新的检验方法被应用于真实的经济数据。
{"title":"A Bayesian flexible model for testing Granger causality","authors":"Iván Gutiérrez, Danilo Alvares, Luis Gutiérrez","doi":"10.1016/j.ecosta.2024.08.001","DOIUrl":"https://doi.org/10.1016/j.ecosta.2024.08.001","url":null,"abstract":"A new Bayesian hypothesis testing procedure for evaluating the Granger causality between two or more time series is proposed. The test is based on a flexible model for the joint evolution of multiple series, where a latent binary matrix indicates whether there is a Granger-causal relationship between such time series. The model is specified through a dependent Geometric stick-breaking process that generalizes the standard parametric Gaussian vector autoregression model, whereas the prior distribution of the latent matrix ensures a multiple testing correction. A Monte Carlo simulation study is provided for comparing the robustness of the proposed hypothesis test with state-of-the-art alternatives. The results show that this proposal performs better than competing approaches. Finally, the new test is applied to real economic data.","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142218395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Differentially Private Goodness-of-Fit Tests for Continuous Variables 连续变量的差分私有拟合优度测试
IF 2 Q2 ECONOMICS Pub Date : 2024-07-01 DOI: 10.1016/j.ecosta.2021.09.007
Seung Woo Kwak , Jeongyoun Ahn , Jaewoo Lee , Cheolwoo Park

Data privacy is a growing concern in modern data analyses as more and more types of information about individuals are collected and shared. Statistical analysis in consideration of privacy is thus becoming an exciting area of research. Differential privacy can provide a means by which one can measure the stochastic risk of violating the privacy of individuals that can result from conducting an analysis, such as a simple query from a database and a hypothesis test. The main interest of the work is a goodness-of-fit test that compares the sampled data to a known distribution. Many differentially private goodness-of-fit tests have been proposed for discrete random variables, but little work has been done for continuous variables. The objective is to review some existing tests that guarantee differential privacy for discrete random variables, and to propose an extension to continuous cases via a discretization process. The proposed test procedures are demonstrated through simulated examples and applied to the Household Financial Welfare Survey of South Korea in 2018.

随着越来越多类型的个人信息被收集和共享,数据隐私在现代数据分析中日益受到关注。因此,考虑隐私的统计分析正成为一个令人兴奋的研究领域。差分隐私可以提供一种方法,用来衡量进行分析(如从数据库中进行简单查询和假设检验)时可能产生的侵犯个人隐私的随机风险。这项工作的主要关注点是拟合优度测试,它将采样数据与已知分布进行比较。针对离散型随机变量,已经提出了许多不同的私有拟合优度检验,但针对连续型变量的研究还很少。本论文旨在回顾一些保证离散随机变量差分隐私性的现有检验方法,并提出通过离散化过程扩展到连续变量的方法。通过模拟示例演示了所提出的测试程序,并将其应用于 2018 年韩国家庭金融福利调查。
{"title":"Differentially Private Goodness-of-Fit Tests for Continuous Variables","authors":"Seung Woo Kwak ,&nbsp;Jeongyoun Ahn ,&nbsp;Jaewoo Lee ,&nbsp;Cheolwoo Park","doi":"10.1016/j.ecosta.2021.09.007","DOIUrl":"10.1016/j.ecosta.2021.09.007","url":null,"abstract":"<div><p>Data privacy is a growing concern in modern data analyses as more and more types of information about individuals are collected and shared. Statistical analysis in consideration of privacy is thus becoming an exciting area of research. Differential privacy can provide a means by which one can measure the stochastic risk of violating the privacy of individuals that can result from conducting an analysis, such as a simple query from a database and a hypothesis test. The main interest of the work is a goodness-of-fit test that compares the sampled data to a known distribution. Many differentially private goodness-of-fit tests have been proposed for discrete random variables, but little work has been done for continuous variables. The objective is to review some existing tests that guarantee differential privacy for discrete random variables, and to propose an extension to continuous cases via a discretization process. The proposed test procedures are demonstrated through simulated examples and applied to the Household Financial Welfare Survey of South Korea in 2018.</p></div>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74095071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semiparametric Averaging of Nonlinear Marginal Logistic Regressions and Forecasting for Time Series Classification 非线性边际 Logistic 回归的半参数平均化和时间序列分类预测
IF 2 Q2 ECONOMICS Pub Date : 2024-07-01 DOI: 10.1016/j.ecosta.2021.11.001
Rong Peng , Zudi Lu

Binary classification is an important issue in many applications but mostly studied for independent data in the literature. A binary time series classification is investigated by proposing a semiparametric procedure named “Model Averaging nonlinear MArginal LOgistic Regressions” (MAMaLoR) for binary time series data based on the time series information of predictor variables. The procedure involves approximating the logistic multivariate conditional regression function by combining low-dimensional non-parametric nonlinear marginal logistic regressions, in the sense of Kullback-Leibler distance. A time series conditional likelihood method is suggested for estimating the optimal averaging weights together with local maximum likelihood estimations of the nonparametric marginal time series logistic (auto)regressions. The asymptotic properties of the procedure are established under mild conditions on the time series observations that are of β-mixing property. The procedure is less computationally demanding and can avoid the “curse of dimensionality” for, and be easily applied to, high dimensional lagged information based nonlinear time series classification forecasting. The performances of the procedure are further confirmed both by Monte-Carlo simulation and an empirical study for market moving direction forecasting of the financial FTSE 100 index data.

二元分类是许多应用中的一个重要问题,但文献中大多是针对独立数据进行研究的。本文根据预测变量的时间序列信息,针对二元时间序列数据提出了一种名为 "模型平均化非线性边际逻辑回归"(MAMaLoR)的半参数程序,从而对二元时间序列分类进行了研究。该程序包括通过结合低维非参数非线性边际逻辑回归(Kullback-Leibler 距离)来近似逻辑多元条件回归函数。提出了一种时间序列条件似然法,用于估计最优平均权重以及非参数边际时间序列逻辑(自动)回归的局部最大似然估计。在具有 β 混合特性的时间序列观测数据的温和条件下,建立了该程序的渐近特性。该程序对计算的要求较低,可以避免基于滞后信息的高维非线性时间序列分类预测的 "维度诅咒",并易于应用。该程序的性能通过蒙特卡洛模拟和对金融时报 100 指数数据的市场移动方向预测的实证研究得到了进一步证实。
{"title":"Semiparametric Averaging of Nonlinear Marginal Logistic Regressions and Forecasting for Time Series Classification","authors":"Rong Peng ,&nbsp;Zudi Lu","doi":"10.1016/j.ecosta.2021.11.001","DOIUrl":"10.1016/j.ecosta.2021.11.001","url":null,"abstract":"<div><p>Binary classification is an important issue in many applications but mostly studied for independent data in the literature. A binary time series classification is investigated by proposing a semiparametric procedure named “Model Averaging nonlinear MArginal LOgistic Regressions” (MAMaLoR) for binary time series data based on the time series information of predictor variables. The procedure involves approximating the logistic multivariate conditional regression function by combining low-dimensional non-parametric nonlinear marginal logistic regressions, in the sense of Kullback-Leibler distance. A time series conditional likelihood method is suggested for estimating the optimal averaging weights together with local maximum likelihood estimations of the nonparametric marginal time series logistic (auto)regressions. The asymptotic properties of the procedure are established under mild conditions on the time series observations that are of <span><math><mi>β</mi></math></span>-mixing property. The procedure is less computationally demanding and can avoid the “curse of dimensionality” for, and be easily applied to, high dimensional lagged information based nonlinear time series classification forecasting. The performances of the procedure are further confirmed both by Monte-Carlo simulation and an empirical study for market moving direction forecasting of the financial FTSE 100 index data.</p></div>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86385087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conditional Quantile Functions for Zero-Inflated Longitudinal Count Data 零膨胀纵向计数数据的条件量子函数
IF 2 Q2 ECONOMICS Pub Date : 2024-07-01 DOI: 10.1016/j.ecosta.2021.09.003
Carlos Lamarche , Xuan Shi , Derek S. Young

The identification and estimation of conditional quantile functions for count responses using longitudinal data are considered. The approach is based on a continuous approximation to distribution functions for count responses within a class of parametric models that are commonly employed. It is first shown that conditional quantile functions for count responses are identified in zero-inflated models with subject heterogeneity. Then, a simple three-step approach is developed to estimate the effects of covariates on the quantiles of the response variable. A simulation study is presented to show the small sample performance of the estimator. Finally, the advantages of the proposed estimator in relation to some existing methods is illustrated by estimating a model of annual visits to physicians using data from a health insurance experiment.

研究考虑了利用纵向数据识别和估计计数响应的条件量子函数。该方法基于一类常用参数模型中计数响应分布函数的连续近似值。研究首先表明,在具有受试者异质性的零膨胀模型中,可以确定计数响应的条件量分函数。然后,提出了一种简单的三步法来估算协变量对响应变量量值的影响。模拟研究显示了估计器的小样本性能。最后,通过使用医疗保险实验数据对年度就诊模型进行估计,说明了所提出的估计方法相对于一些现有方法的优势。
{"title":"Conditional Quantile Functions for Zero-Inflated Longitudinal Count Data","authors":"Carlos Lamarche ,&nbsp;Xuan Shi ,&nbsp;Derek S. Young","doi":"10.1016/j.ecosta.2021.09.003","DOIUrl":"10.1016/j.ecosta.2021.09.003","url":null,"abstract":"<div><p>The identification and estimation of conditional quantile functions for count responses using longitudinal data are considered. The approach is based on a continuous approximation to distribution functions for count responses within a class of parametric models that are commonly employed. It is first shown that conditional quantile functions for count responses are identified in zero-inflated models with subject heterogeneity. Then, a simple three-step approach is developed to estimate the effects of covariates on the quantiles of the response variable. A simulation study is presented to show the small sample performance of the estimator. Finally, the advantages of the proposed estimator in relation to some existing methods is illustrated by estimating a model of annual visits to physicians using data from a health insurance experiment.</p></div>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86565670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting Near-equivalence of Linear Dimension Reduction Methods in Large Panels of Macro-variables 大型宏观变量面板中线性降维方法的预测近似性
IF 2 Q2 ECONOMICS Pub Date : 2024-07-01 DOI: 10.1016/j.ecosta.2021.10.007
Alessandro Barbarino , Efstathia Bura

In an extensive pseudo out-of-sample horserace, classical estimators (dynamic factor models, RIDGE and partial least squares regression) and the novel to forecasting, Regularized Sliced Inverse Regression, exhibit almost near-equivalent forecasting accuracy in a large panel of macroeconomic variables across targets, horizons and subsamples. This finding motivates the theoretical contributions in this paper. Most widely used linear dimension reduction methods are shown to solve closely related maximization problems with solutions that can be decomposed in signal and scaling components. They are organized under a common scheme that sheds light on their commonalities and differences as well as on their functionality. Regularized Sliced Inverse Regression delivers the most parsimonious forecast model and obtains the greatest reduction of the complexity of the forecasting problem. Nevertheless, the study’s findings are that (a) the intrinsic relationship between forecast target and the other macroseries in the panel is linear and (b) targeting contributes in reducing the complexity of modeling yet does not induce significant gains in macroeconomic forecasting accuracy.

在一场广泛的伪样本外赛马中,经典估计方法(动态因子模型、RIDGE 和偏最小二乘回归)和新的预测方法--正则化切片反回归--在跨目标、跨期和跨子样本的大型宏观经济变量面板中表现出几乎相等的预测准确性。这一发现激发了本文的理论贡献。大多数广泛使用的线性维度缩减方法都能解决密切相关的最大化问题,其解决方案可以分解为信号和缩放两个部分。本文根据一个共同的方案对这些方法进行了整理,从而揭示了它们之间的共性和差异,以及它们的功能。正则化切分反回归提供了最简洁的预测模型,并最大程度地降低了预测问题的复杂性。不过,本研究的结论是:(a) 预测目标与面板中其他宏观序列之间的内在关系是线性的;(b) 目标定位有助于降低建模的复杂性,但不会显著提高宏观经济预测的准确性。
{"title":"Forecasting Near-equivalence of Linear Dimension Reduction Methods in Large Panels of Macro-variables","authors":"Alessandro Barbarino ,&nbsp;Efstathia Bura","doi":"10.1016/j.ecosta.2021.10.007","DOIUrl":"10.1016/j.ecosta.2021.10.007","url":null,"abstract":"<div><p>In an extensive pseudo out-of-sample horserace, classical estimators (dynamic factor models, RIDGE and partial least squares regression) and the novel to forecasting, Regularized Sliced Inverse Regression, exhibit almost near-equivalent forecasting accuracy in a large panel of macroeconomic variables across targets, horizons and subsamples. This finding motivates the theoretical contributions in this paper. Most widely used linear dimension reduction methods are shown to solve closely related maximization problems with solutions that can be decomposed in <em>signal</em> and <em>scaling</em> components. They are organized under a common scheme that sheds light on their commonalities and differences as well as on their functionality. Regularized Sliced Inverse Regression delivers the most parsimonious forecast model and obtains the greatest reduction of the complexity of the forecasting problem. Nevertheless, the study’s findings are that (a) the intrinsic relationship between forecast target and the other macroseries in the panel is linear and (b) targeting contributes in reducing the complexity of modeling yet does not induce significant gains in macroeconomic forecasting accuracy.</p></div>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80948953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multivariate Count Time Series Modelling 多变量计数时间序列建模
IF 2 Q2 ECONOMICS Pub Date : 2024-07-01 DOI: 10.1016/j.ecosta.2021.11.006
Konstantinos Fokianos

Autoregressive models are reviewed for the analysis of multivariate count time series. A particular topic of interest which is discussed in detail is that of the choice of a suitable distribution for a vectors of count random variables. The focus is on three main approaches taken for multivariate count time series analysis: (a) integer autoregressive processes, (b) parameter-driven models and (c) observation-driven models. The aim is to highlight some recent methodological developments and propose some potentially useful research topics.

本论文评述了用于分析多元计数时间序列的自回归模型。详细讨论了一个特别感兴趣的话题,即如何为计数随机变量向量选择合适的分布。重点是多元计数时间序列分析的三种主要方法:(a) 整数自回归过程,(b) 参数驱动模型和 (c) 观察驱动模型。目的是强调一些最新的方法论发展,并提出一些可能有用的研究课题。
{"title":"Multivariate Count Time Series Modelling","authors":"Konstantinos Fokianos","doi":"10.1016/j.ecosta.2021.11.006","DOIUrl":"10.1016/j.ecosta.2021.11.006","url":null,"abstract":"<div><p>Autoregressive models are reviewed for the analysis of multivariate count time series. A particular topic of interest which is discussed in detail is that of the choice of a suitable distribution for a vectors of count random variables. The focus is on three main approaches taken for multivariate count time series analysis: (a) integer autoregressive processes, (b) parameter-driven models and (c) observation-driven models. The aim is to highlight some recent methodological developments and propose some potentially useful research topics.</p></div>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89113611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Edgeworth expansions for multivariate random sums 多元随机和的埃奇沃斯展开式
IF 2 Q2 ECONOMICS Pub Date : 2024-07-01 DOI: 10.1016/j.ecosta.2021.04.005
Farrukh Javed , Nicola Loperfido , Stepan Mazur

The sum of a random number of independent and identically distributed random vectors has a distribution which is not analytically tractable, in the general case. The problem has been addressed by means of asymptotic approximations embedding the number of summands in a stochastically increasing sequence. Another approach relies on fitting flexible and tractable parametric, multivariate distributions, as for example finite mixtures. Both approaches are investigated within the framework of Edgeworth expansions. A general formula for the fourth-order cumulants of the random sum of independent and identically distributed random vectors is derived and it is shown that the above mentioned asymptotic approach does not necessarily lead to valid asymptotic normal approximations. The problem is addressed by means of Edgeworth expansions. Both theoretical and empirical results suggest that mixtures of two multivariate normal distributions with proportional covariance matrices satisfactorily fit data generated from random sums where the counting random variable and the random summands are Poisson and multivariate skew-normal, respectively.

在一般情况下,随机数个独立且同分布的随机向量之和的分布是无法分析的。解决这一问题的方法是将和的数量嵌入随机递增序列中的渐近近似值。另一种方法则依赖于拟合灵活可控的参数多元分布,例如有限混合物。这两种方法都是在埃奇沃斯展开的框架内进行研究的。推导出独立且同分布随机向量的随机和的四阶累积量的一般公式,并证明上述渐近方法并不一定导致有效的渐近正态近似。这个问题是通过埃奇沃斯展开来解决的。理论和实证结果都表明,具有比例协方差矩阵的两种多元正态分布的混合物能令人满意地拟合由随机和生成的数据,其中计数随机变量和随机和分别是泊松和多元偏斜正态分布。
{"title":"Edgeworth expansions for multivariate random sums","authors":"Farrukh Javed ,&nbsp;Nicola Loperfido ,&nbsp;Stepan Mazur","doi":"10.1016/j.ecosta.2021.04.005","DOIUrl":"10.1016/j.ecosta.2021.04.005","url":null,"abstract":"<div><p>The sum of a random number of independent and identically distributed random vectors has a distribution which is not analytically tractable, in the general case. The problem has been addressed by means of asymptotic approximations embedding the number of summands in a stochastically increasing sequence. Another approach relies on fitting flexible and tractable parametric, multivariate distributions, as for example finite mixtures. Both approaches are investigated within the framework of Edgeworth expansions. A general formula for the fourth-order cumulants of the random sum of independent and identically distributed random vectors is derived and it is shown that the above mentioned asymptotic approach does not necessarily lead to valid asymptotic normal approximations. The problem is addressed by means of Edgeworth expansions. Both theoretical and empirical results suggest that mixtures of two multivariate normal distributions with proportional covariance matrices satisfactorily fit data generated from random sums where the counting random variable and the random summands are Poisson and multivariate skew-normal, respectively.</p></div>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86545405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Econometrics and Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1