首页 > 最新文献

Journal of Forecasting最新文献

英文 中文
Default Prediction Framework With Optimal Feature Set and Matching Ratio 具有最优特征集和匹配率的默认预测框架
IF 2.7 3区 经济学 Q1 ECONOMICS Pub Date : 2025-05-26 DOI: 10.1002/for.3284
Guotai Chi, Fengshan Bai, Hongping Tan, Ying Zhou

We propose a default prediction framework that incorporates imbalance handling and feature selection. For imbalance handling, we determine the optimal ratio of non-default to default firms by minimizing the Type-II error of the majority voting deep fully connected network (MV-DFCN) model. For feature selection, we design a two-stage process that first eliminates highly correlated and redundant features, and then refines the feature set using backward selection. Experimental results show that the DFCN model within the proposed framework outperforms baseline models in terms of G-Mean and AUC and achieves the lowest Type-II error rate. Furthermore, the framework outperforms eight baseline combinations of imbalance handling and feature selection strategies. Additionally, SHAP values are used to assess feature contributions, and nine features with statistically significant impacts are identified.

我们提出了一个包含不平衡处理和特征选择的默认预测框架。对于不平衡处理,我们通过最小化多数投票深度全连接网络(MV-DFCN)模型的ii型误差来确定非违约公司与违约公司的最佳比例。对于特征选择,我们设计了一个两阶段的过程,首先消除高度相关和冗余的特征,然后使用反向选择来细化特征集。实验结果表明,该框架下的DFCN模型在G-Mean和AUC方面优于基线模型,并且实现了最低的Type-II错误率。此外,该框架优于8种不平衡处理和特征选择策略的基线组合。此外,SHAP值用于评估特征贡献,并确定了具有统计显著影响的9个特征。
{"title":"Default Prediction Framework With Optimal Feature Set and Matching Ratio","authors":"Guotai Chi,&nbsp;Fengshan Bai,&nbsp;Hongping Tan,&nbsp;Ying Zhou","doi":"10.1002/for.3284","DOIUrl":"https://doi.org/10.1002/for.3284","url":null,"abstract":"<div>\u0000 \u0000 <p>We propose a default prediction framework that incorporates imbalance handling and feature selection. For imbalance handling, we determine the optimal ratio of non-default to default firms by minimizing the Type-II error of the majority voting deep fully connected network (MV-DFCN) model. For feature selection, we design a two-stage process that first eliminates highly correlated and redundant features, and then refines the feature set using backward selection. Experimental results show that the DFCN model within the proposed framework outperforms baseline models in terms of G-Mean and AUC and achieves the lowest Type-II error rate. Furthermore, the framework outperforms eight baseline combinations of imbalance handling and feature selection strategies. Additionally, SHAP values are used to assess feature contributions, and nine features with statistically significant impacts are identified.</p>\u0000 </div>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 7","pages":"2067-2088"},"PeriodicalIF":2.7,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145197103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Real-Time Forecasting Using Mixed-Frequency VARs With Time-Varying Parameters 带时变参数的混频var实时预测
IF 2.7 3区 经济学 Q1 ECONOMICS Pub Date : 2025-05-07 DOI: 10.1002/for.3276
Markus Heinrich, Magnus Reif

This paper provides a detailed assessment of the real-time forecast accuracy of a wide range of vector autoregressive models that allow for both structural change and indicators sampled at different frequencies. We extend the literature by evaluating a mixed-frequency time-varying parameter vector autoregressive model with stochastic volatility. Monte Carlo simulation shows that the novel model is well-suited to estimate missing monthly observations in an environment that is subject to parameter instability. In a real-time forecast exercise, the model delivers accurate now- and forecasts and, on average, outperforms its competitors. Particularly, inflation and unemployment rate forecasts are more precise.

本文提供了广泛的矢量自回归模型的实时预测精度的详细评估,这些模型允许结构变化和以不同频率采样的指标。我们扩展了文献,通过评估一个随机波动的混合频率时变参数向量自回归模型。蒙特卡罗模拟结果表明,该模型能很好地估计参数不稳定环境下缺失的月观测值。在实时预测练习中,该模型提供准确的现在和预测,平均而言,优于其竞争对手。特别是,通货膨胀和失业率的预测更加精确。
{"title":"Real-Time Forecasting Using Mixed-Frequency VARs With Time-Varying Parameters","authors":"Markus Heinrich,&nbsp;Magnus Reif","doi":"10.1002/for.3276","DOIUrl":"https://doi.org/10.1002/for.3276","url":null,"abstract":"<p>This paper provides a detailed assessment of the real-time forecast accuracy of a wide range of vector autoregressive models that allow for both structural change and indicators sampled at different frequencies. We extend the literature by evaluating a mixed-frequency time-varying parameter vector autoregressive model with stochastic volatility. Monte Carlo simulation shows that the novel model is well-suited to estimate missing monthly observations in an environment that is subject to parameter instability. In a real-time forecast exercise, the model delivers accurate now- and forecasts and, on average, outperforms its competitors. Particularly, inflation and unemployment rate forecasts are more precise.</p>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 7","pages":"2055-2066"},"PeriodicalIF":2.7,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/for.3276","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145196592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple Seasonal Autoregressive Integrated Moving Average Models 多季节自回归综合移动平均模型
IF 2.7 3区 经济学 Q1 ECONOMICS Pub Date : 2025-05-01 DOI: 10.1002/for.3283
Francesco Lisi, Matteo Grigoletto

Many empirical time series show periodic patterns. SARIMA models and exponential smoothing methods are classical approaches to account for seasonal dynamics. However, they allow to model just one periodic component, while several time series have multiple seasonality, with periodic components possibly tangled among them. To face this case, some seasonal-trend decomposition methods have been proposed in the literature, for example, the TBATS model, the MSTL model, the ADAM model, and the Prophet model, while SARIMA models have been quite neglected. To fill this gap, in this work, we suggest a suitable generalization of the SARIMA model, called mSARIMA, able to account for multiple seasonality. First, we define the model, describe its characteristics, and propose a test for residual multiperiodic correlation. Then, we analyze the predictive performance by comparing the mSARIMA model with other approaches, namely, the TBATS, MSTL, ADAM, and Prophet models, under different kinds of seasonality. The results suggest that when seasonality has a stochastic nature, mSARIMA models are more effective in predicting the series. However, if seasonality is basically deterministic, then the model decomposition approach is more suitable. Finally, we provide two comparative forecasting applications for the 5-min series of the number of calls handled by a large North American commercial bank and for the 10-min traffic data on the eastbound lanes of the Ventura Highway in Los Angeles.

许多经验时间序列显示周期性模式。SARIMA模型和指数平滑方法是解释季节动态的经典方法。然而,它们只允许建模一个周期成分,而几个时间序列具有多个季节性,周期成分可能在它们之间纠缠在一起。针对这种情况,文献中提出了一些季节趋势分解方法,如TBATS模型、MSTL模型、ADAM模型和Prophet模型,而SARIMA模型却被忽视了。为了填补这一空白,在这项工作中,我们建议对SARIMA模型进行适当的推广,称为mSARIMA,能够解释多重季节性。首先,我们定义了模型,描述了其特征,并提出了残差多周期相关的检验方法。然后,通过将mSARIMA模型与TBATS、MSTL、ADAM和Prophet模型在不同季节条件下的预测性能进行对比分析。结果表明,当季节性具有随机性质时,mSARIMA模型对季节序列的预测更有效。然而,如果季节性基本是确定的,那么模型分解方法更合适。最后,我们为一家大型北美商业银行处理的5分钟电话数量系列和洛杉矶文图拉高速公路东行10分钟交通数据提供了两个比较预测应用程序。
{"title":"Multiple Seasonal Autoregressive Integrated Moving Average Models","authors":"Francesco Lisi,&nbsp;Matteo Grigoletto","doi":"10.1002/for.3283","DOIUrl":"https://doi.org/10.1002/for.3283","url":null,"abstract":"<p>Many empirical time series show periodic patterns. SARIMA models and exponential smoothing methods are classical approaches to account for seasonal dynamics. However, they allow to model just one periodic component, while several time series have multiple seasonality, with periodic components possibly tangled among them. To face this case, some seasonal-trend decomposition methods have been proposed in the literature, for example, the TBATS model, the MSTL model, the ADAM model, and the Prophet model, while SARIMA models have been quite neglected. To fill this gap, in this work, we suggest a suitable generalization of the SARIMA model, called mSARIMA, able to account for multiple seasonality. First, we define the model, describe its characteristics, and propose a test for residual multiperiodic correlation. Then, we analyze the predictive performance by comparing the mSARIMA model with other approaches, namely, the TBATS, MSTL, ADAM, and Prophet models, under different kinds of seasonality. The results suggest that when seasonality has a stochastic nature, mSARIMA models are more effective in predicting the series. However, if seasonality is basically deterministic, then the model decomposition approach is more suitable. Finally, we provide two comparative forecasting applications for the 5-min series of the number of calls handled by a large North American commercial bank and for the 10-min traffic data on the eastbound lanes of the Ventura Highway in Los Angeles.</p>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 6","pages":"2037-2052"},"PeriodicalIF":2.7,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/for.3283","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144767431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stock Return Prediction Based on a Functional Capital Asset Pricing Model 基于功能资本资产定价模型的股票收益预测
IF 2.7 3区 经济学 Q1 ECONOMICS Pub Date : 2025-04-21 DOI: 10.1002/for.3282
Ufuk Beyaztas, Kaiying Ji, Han Lin Shang, Eliza Wu

The capital asset pricing model (CAPM) is readily used to capture a linear relationship between the daily returns of an asset and a market index. We extend this model to an intraday high-frequency setting by proposing a functional CAPM estimation approach. The functional CAPM is a stylized example of a function-on-function linear regression with a bivariate functional regression coefficient. The two-dimensional regression coefficient measures the cross-covariance between cumulative intraday asset returns and market returns. We apply it to the Standard and Poor's 500 index and its constituent stocks to demonstrate its practicality. We investigate the functional CAPM's in-sample goodness of fit and out-of-sample prediction for an asset's cumulative intraday return. The findings suggest that the proposed functional CAPM methods have superior model goodness of fit and forecast accuracy compared to the traditional CAPM empirical estimation. In particular, the functional methods produce better model goodness of fit and prediction accuracy for stocks traditionally considered less price efficient or more information opaque.

资本资产定价模型(CAPM)很容易用于捕捉资产的日收益与市场指数之间的线性关系。我们通过提出一种功能CAPM估计方法,将该模型扩展到日内高频设置。函数CAPM是一个具有二元函数回归系数的函数对函数线性回归的程式化示例。二维回归系数衡量累积日内资产收益与市场收益之间的交叉协方差。我们将其应用于标准普尔500指数及其成分股,以证明其实用性。我们研究了函数CAPM对资产累积日内收益的样本内拟合优度和样本外预测。结果表明,与传统的CAPM经验估计相比,所提出的功能CAPM方法具有更好的模型拟合优度和预测精度。特别是,对于传统上被认为价格效率较低或信息不透明的股票,函数方法产生了更好的模型拟合优度和预测精度。
{"title":"Stock Return Prediction Based on a Functional Capital Asset Pricing Model","authors":"Ufuk Beyaztas,&nbsp;Kaiying Ji,&nbsp;Han Lin Shang,&nbsp;Eliza Wu","doi":"10.1002/for.3282","DOIUrl":"https://doi.org/10.1002/for.3282","url":null,"abstract":"<p>The capital asset pricing model (CAPM) is readily used to capture a linear relationship between the daily returns of an asset and a market index. We extend this model to an intraday high-frequency setting by proposing a functional CAPM estimation approach. The functional CAPM is a stylized example of a function-on-function linear regression with a bivariate functional regression coefficient. The two-dimensional regression coefficient measures the cross-covariance between cumulative intraday asset returns and market returns. We apply it to the Standard and Poor's 500 index and its constituent stocks to demonstrate its practicality. We investigate the functional CAPM's in-sample goodness of fit and out-of-sample prediction for an asset's cumulative intraday return. The findings suggest that the proposed functional CAPM methods have superior model goodness of fit and forecast accuracy compared to the traditional CAPM empirical estimation. In particular, the functional methods produce better model goodness of fit and prediction accuracy for stocks traditionally considered less price efficient or more information opaque.</p>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 6","pages":"2017-2036"},"PeriodicalIF":2.7,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/for.3282","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144767963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse Ensemble Matters: Evidence From Unemployment Rate Forecasting 稀疏集合问题:来自失业率预测的证据
IF 2.7 3区 经济学 Q1 ECONOMICS Pub Date : 2025-04-19 DOI: 10.1002/for.3281
Sheng Cheng, Han Feng, Jue Wang

Sparse ensemble forecasting has become an increasingly competitive technique for forecasting research and practice in recent years. This paper examines the role of sparse ensemble in unemployment rates forecasting using expert forecasters. First, we show how the effectiveness of sparse ensembles is influenced by the complexity and accuracy of the base models. Second, we extend sparse regularization techniques to settings with unknown bias and variance employing Monte Carlo simulations. Third, we highlight the critical role of the regularization coefficient λ, which serves as a key shrinkage factor and necessitates a balance between model sparsity and forecasting accuracy. Experimental results on unemployment rate data demonstrate the superiority of sparse ensemble learning over equal-weight strategies. This framework provides novel insights into predictive modeling within the fields of economics and labor markets.

近年来,稀疏集合预测已成为预测研究和实践中日益具有竞争力的技术。本文利用专家预测方法研究了稀疏集合在失业率预测中的作用。首先,我们展示了稀疏集成的有效性如何受到基本模型的复杂性和准确性的影响。其次,我们使用蒙特卡罗模拟将稀疏正则化技术扩展到具有未知偏差和方差的设置。第三,我们强调正则化系数λ的关键作用,它是一个关键的收缩因子,需要在模型稀疏性和预测精度之间取得平衡。在失业率数据上的实验结果表明,稀疏集成学习优于等权策略。该框架为经济学和劳动力市场领域的预测建模提供了新颖的见解。
{"title":"Sparse Ensemble Matters: Evidence From Unemployment Rate Forecasting","authors":"Sheng Cheng,&nbsp;Han Feng,&nbsp;Jue Wang","doi":"10.1002/for.3281","DOIUrl":"https://doi.org/10.1002/for.3281","url":null,"abstract":"<div>\u0000 \u0000 <p>Sparse ensemble forecasting has become an increasingly competitive technique for forecasting research and practice in recent years. This paper examines the role of sparse ensemble in unemployment rates forecasting using expert forecasters. First, we show how the effectiveness of sparse ensembles is influenced by the complexity and accuracy of the base models. Second, we extend sparse regularization techniques to settings with unknown bias and variance employing Monte Carlo simulations. Third, we highlight the critical role of the regularization coefficient \u0000<span></span><math>\u0000 <mi>λ</mi></math>, which serves as a key shrinkage factor and necessitates a balance between model sparsity and forecasting accuracy. Experimental results on unemployment rate data demonstrate the superiority of sparse ensemble learning over equal-weight strategies. This framework provides novel insights into predictive modeling within the fields of economics and labor markets.</p>\u0000 </div>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 6","pages":"2002-2016"},"PeriodicalIF":2.7,"publicationDate":"2025-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144767992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Deep Learning Test of the Martingale Difference Hypothesis 鞅差分假设的深度学习检验
IF 2.7 3区 经济学 Q1 ECONOMICS Pub Date : 2025-04-14 DOI: 10.1002/for.3280
João A. Bastos

A deep learning binary classifier is proposed to test if asset returns follow martingale difference sequences. The Neyman–Pearson classification paradigm is applied to control the type I error of the test. In Monte Carlo simulations, I find that this approach has better power properties than variance ratio and portmanteau tests against several alternative processes. I apply this procedure to a large set of exchange rate returns and find that it detects several potential deviations from the martingale difference hypothesis that the conventional statistical tests fail to capture.

提出了一种深度学习二值分类器来检验资产收益是否遵循鞅差分序列。内曼-皮尔逊分类范式用于控制测试的I型误差。在蒙特卡罗模拟中,我发现这种方法比方差比和组合测试对几个替代过程具有更好的功率特性。我将这个程序应用于一组大的汇率回报,发现它检测到传统统计检验无法捕获的鞅差异假设的几个潜在偏差。
{"title":"A Deep Learning Test of the Martingale Difference Hypothesis","authors":"João A. Bastos","doi":"10.1002/for.3280","DOIUrl":"https://doi.org/10.1002/for.3280","url":null,"abstract":"<div>\u0000 \u0000 <p>A deep learning binary classifier is proposed to test if asset returns follow martingale difference sequences. The Neyman–Pearson classification paradigm is applied to control the type I error of the test. In Monte Carlo simulations, I find that this approach has better power properties than variance ratio and portmanteau tests against several alternative processes. I apply this procedure to a large set of exchange rate returns and find that it detects several potential deviations from the martingale difference hypothesis that the conventional statistical tests fail to capture.</p>\u0000 </div>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 6","pages":"1993-2001"},"PeriodicalIF":2.7,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144767904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical Regularizers for Reverse Unrestricted Mixed Data Sampling Regressions 反向无限制混合数据抽样回归的层次正则化
IF 2.7 3区 经济学 Q1 ECONOMICS Pub Date : 2025-04-11 DOI: 10.1002/for.3277
Alain Hecq, Marie Ternes, Ines Wilms

Reverse Unrestricted MIxed DAta Sampling (RU-MIDAS) regressions are used to model high-frequency responses by means of low-frequency variables. However, due to the periodic structure of RU-MIDAS regressions, the dimensionality grows quickly if the frequency mismatch between the high- and low-frequency variables is large. Additionally, the number of high-frequency observations available for estimation decreases. We propose to counteract this reduction in sample size by pooling the high-frequency coefficients and further reducing the dimensionality through a sparsity-inducing convex regularizer that accounts for the temporal ordering among the different lags. To this end, the regularizer prioritizes the inclusion of lagged coefficients according to the recency of the information they contain. We demonstrate the proposed method on two empirical applications, one on realized volatility forecasting with macroeconomic data and another on demand forecasting for a bicycle-sharing system with ridership data on other transportation types.

反向无限制混合数据抽样(RU-MIDAS)回归被用于通过低频变量来模拟高频响应。然而,由于RU-MIDAS回归的周期性结构,当高、低频变量之间的频率不匹配较大时,其维数增长很快。此外,可用于估计的高频观测值的数量减少。我们建议通过池化高频系数来抵消这种样本量的减少,并通过稀疏性诱导凸正则化器进一步降低维数,该正则化器考虑了不同滞后之间的时间顺序。为此,正则化器根据滞后系数包含的信息的近代性来优先考虑滞后系数的包含。我们在两个实证应用中展示了所提出的方法,一个是基于宏观经济数据的已实现波动率预测,另一个是基于其他交通类型的乘客数据的共享单车系统的需求预测。
{"title":"Hierarchical Regularizers for Reverse Unrestricted Mixed Data Sampling Regressions","authors":"Alain Hecq,&nbsp;Marie Ternes,&nbsp;Ines Wilms","doi":"10.1002/for.3277","DOIUrl":"https://doi.org/10.1002/for.3277","url":null,"abstract":"<p>Reverse Unrestricted MIxed DAta Sampling (RU-MIDAS) regressions are used to model high-frequency responses by means of low-frequency variables. However, due to the periodic structure of RU-MIDAS regressions, the dimensionality grows quickly if the frequency mismatch between the high- and low-frequency variables is large. Additionally, the number of high-frequency observations available for estimation decreases. We propose to counteract this reduction in sample size by pooling the high-frequency coefficients and further reducing the dimensionality through a sparsity-inducing convex regularizer that accounts for the temporal ordering among the different lags. To this end, the regularizer prioritizes the inclusion of lagged coefficients according to the recency of the information they contain. We demonstrate the proposed method on two empirical applications, one on realized volatility forecasting with macroeconomic data and another on demand forecasting for a bicycle-sharing system with ridership data on other transportation types.</p>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 6","pages":"1946-1968"},"PeriodicalIF":2.7,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/for.3277","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144767382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Framework for Agricultural Futures Price Prediction With BERT-Based Topic Identification and Sentiment Analysis 基于bert主题识别和情感分析的农产品期货价格预测新框架
IF 2.7 3区 经济学 Q1 ECONOMICS Pub Date : 2025-04-11 DOI: 10.1002/for.3278
Wensheng Wang, Yuxi Liu

In China's financial and economic system, the agricultural futures market plays an important role in guiding the market to self regulate and providing efficient information transmission for regulators. The effective prediction of futures prices can assist in guiding agricultural production, monitoring operational risks arising from significant price fluctuations, and enhancing the predictability and pertinence of the country's macroeconomic regulation policies. This study investigates the main variety of grain futures—soybean futures, taking into account complex market and non-market influencing factors. Using historical market data and related news headlines of soybean futures as source data and integrating topic identification and sentiment analysis techniques, a novel framework for predicting agricultural futures prices that integrates topic sentiment is constructed. This model uses BERTopic to extract topic information from agricultural news texts, then integrates FinBERT to construct topic-based sentiment features, fuses them with structured market features, and constructs LSTM price prediction model with multi-feature inputs. In order to better model the short-term features and state transfer patterns of the time series, hidden Markov model (HMM) is further used to extract the hidden states, which are deeply fused with the LSTM model. The empirical results show that the model fusing topic and sentiment features significantly improves the forecasting accuracy in all lags, LSTM works best in short-term forecasting, and the combination of HMM and LSTM exhibits significant performance advantages in medium- and long-term forecasting. Compared with the baseline model that relies only on market features, topic sentiment features provide important incremental information for price forecasting, and the contribution of each topic sentiment feature calculated based on the PI metric is close to 50%. In addition, deep learning–based prediction model performs better than baseline machine learning models in dealing with extreme external shocks such as climate disasters, the COVID-19 pandemic, and the Russia–Ukraine conflict.

在中国的金融经济体系中,农产品期货市场在引导市场自我调节和为监管机构提供有效的信息传递方面发挥着重要作用。有效预测期货价格,有利于指导农业生产,监测价格大幅波动带来的经营风险,增强国家宏观调控政策的可预见性和针对性。本研究考察了粮食期货的主要品种——大豆期货,考虑了复杂的市场和非市场影响因素。以大豆期货历史市场数据和相关新闻标题为源数据,结合主题识别和情感分析技术,构建了一个融合主题情感的农产品期货价格预测框架。该模型利用BERTopic从农业新闻文本中提取主题信息,然后结合FinBERT构建基于主题的情绪特征,并将其与结构化市场特征融合,构建多特征输入的LSTM价格预测模型。为了更好地建模时间序列的短期特征和状态转移模式,进一步使用隐马尔可夫模型(HMM)提取隐藏状态,并将其与LSTM模型深度融合。实证结果表明,融合主题和情感特征的模型在所有滞后时间内都显著提高了预测精度,LSTM在短期预测中效果最好,HMM和LSTM结合在中长期预测中表现出显著的性能优势。与仅依赖市场特征的基线模型相比,主题情绪特征为价格预测提供了重要的增量信息,基于PI指标计算的每个主题情绪特征的贡献接近50%。此外,基于深度学习的预测模型在应对气候灾害、COVID-19大流行、俄乌冲突等极端外部冲击方面的表现优于基线机器学习模型。
{"title":"A Novel Framework for Agricultural Futures Price Prediction With BERT-Based Topic Identification and Sentiment Analysis","authors":"Wensheng Wang,&nbsp;Yuxi Liu","doi":"10.1002/for.3278","DOIUrl":"https://doi.org/10.1002/for.3278","url":null,"abstract":"<div>\u0000 \u0000 <p>In China's financial and economic system, the agricultural futures market plays an important role in guiding the market to self regulate and providing efficient information transmission for regulators. The effective prediction of futures prices can assist in guiding agricultural production, monitoring operational risks arising from significant price fluctuations, and enhancing the predictability and pertinence of the country's macroeconomic regulation policies. This study investigates the main variety of grain futures—soybean futures, taking into account complex market and non-market influencing factors. Using historical market data and related news headlines of soybean futures as source data and integrating topic identification and sentiment analysis techniques, a novel framework for predicting agricultural futures prices that integrates topic sentiment is constructed. This model uses BERTopic to extract topic information from agricultural news texts, then integrates FinBERT to construct topic-based sentiment features, fuses them with structured market features, and constructs LSTM price prediction model with multi-feature inputs. In order to better model the short-term features and state transfer patterns of the time series, hidden Markov model (HMM) is further used to extract the hidden states, which are deeply fused with the LSTM model. The empirical results show that the model fusing topic and sentiment features significantly improves the forecasting accuracy in all lags, LSTM works best in short-term forecasting, and the combination of HMM and LSTM exhibits significant performance advantages in medium- and long-term forecasting. Compared with the baseline model that relies only on market features, topic sentiment features provide important incremental information for price forecasting, and the contribution of each topic sentiment feature calculated based on the PI metric is close to 50%. In addition, deep learning–based prediction model performs better than baseline machine learning models in dealing with extreme external shocks such as climate disasters, the COVID-19 pandemic, and the Russia–Ukraine conflict.</p>\u0000 </div>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 6","pages":"1969-1992"},"PeriodicalIF":2.7,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144767381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measuring the Impact of Transition Risk on Financial Markets: A Joint VaR-ES Approach 衡量转型风险对金融市场的影响:一个联合VaR-ES方法
IF 2.7 3区 经济学 Q1 ECONOMICS Pub Date : 2025-04-09 DOI: 10.1002/for.3274
Laura Garcia-Jorcano, Lidia Sanchis-Marco
<p>Based on a joint quantile and expected shortfall semiparametric methodology, we propose a novel approach to forecasting market risk conditioned to transition risk exposure. This method allows us to forecast two climate-related financial risk measures called <span></span><math> <mi>C</mi> <mi>o</mi> <mi>C</mi> <mi>l</mi> <mi>i</mi> <mi>m</mi> <mi>a</mi> <mi>t</mi> <mi>e</mi> <mi>V</mi> <mi>a</mi> <mi>R</mi></math> and <span></span><math> <mi>C</mi> <mi>o</mi> <mi>C</mi> <mi>l</mi> <mi>i</mi> <mi>m</mi> <mi>a</mi> <mi>t</mi> <mi>e</mi> <mi>E</mi> <mi>S</mi></math>, being jointly elicitable, that capture the dependence of the European extreme bank returns on changes in carbon returns at extreme quantiles representing green and brown states. We evaluate our approach using a novel backtesting procedure and introduce related measures (<span></span><math> <mi>Δ</mi> <mi>C</mi> <mi>o</mi> <mi>C</mi> <mi>l</mi> <mi>i</mi> <mi>m</mi> <mi>a</mi> <mi>t</mi> <mi>e</mi></math> and <span></span><math> <mi>E</mi> <mi>x</mi> <mi>p</mi> <mi>o</mi> <mi>s</mi> <mi>u</mi> <mi>r</mi> <mi>e</mi> <mi>C</mi> <mi>l</mi> <mi>i</mi> <mi>m</mi> <mi>a</mi> <mi>t</mi> <mi>e</mi></math>). The main evidence states that the <span></span><math> <mi>C</mi> <mi>o</mi> <mi>C</mi> <mi>l</mi> <mi>i</mi> <mi>m</mi> <mi>a</mi> <mi>t</mi> <mi>e</mi> <mi>E</mi> <mi>S</mi></math> measure presents the highest risk for the brown (green) state due to the presence of carbon cost (carbon risk premium) in Ph.II (Ph.III) of the EU Emissions Trading System. Furthermore, we found the highest (lowest) financial risk forecasts for <span></span><math> <mi>C</mi> <mi>o</mi> <mi>C</mi> <mi>l</mi> <mi>i</mi> <mi>m</mi> <mi>a</mi> <mi>t</mi> <mi>e</mi> <mi>E</mi> <mi>S</mi></math> in green (brown) states during COVID-19. These results offer important implications for investors and policymakers regarding the effects of transition risk on the European financial system.<
基于联合分位数和预期不足半参数方法,提出了一种以过渡风险暴露为条件的市场风险预测方法。这种方法使我们能够预测两种与气候相关的金融风险指标,即C / C / C / C / R和C / C / C / C / C / S,它们是共同可获得的。捕捉了欧洲极端银行回报对代表绿色和棕色州的极端分位数的碳回报变化的依赖。我们评估我们的方法使用一个小说,val过程和采取相关措施 ( Δ C o C l 我 米 一个 t e和 E x p o 年代 u r e C l 我 米 一个 t e)。​主要证据表明,由于欧盟排放交易体系Ph.II (Ph.III)中的碳成本(碳风险溢价)的存在,碳/碳/碳排放在e / S测量中对棕色(绿色)州的风险最高。此外,我们发现,在2019冠状病毒病期间,绿色(棕色)州的最高(最低)金融风险预测为美国和美国。这些结果为投资者和政策制定者提供了关于转型风险对欧洲金融体系影响的重要启示。
{"title":"Measuring the Impact of Transition Risk on Financial Markets: A Joint VaR-ES Approach","authors":"Laura Garcia-Jorcano,&nbsp;Lidia Sanchis-Marco","doi":"10.1002/for.3274","DOIUrl":"https://doi.org/10.1002/for.3274","url":null,"abstract":"&lt;p&gt;Based on a joint quantile and expected shortfall semiparametric methodology, we propose a novel approach to forecasting market risk conditioned to transition risk exposure. This method allows us to forecast two climate-related financial risk measures called \u0000&lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;mi&gt;C&lt;/mi&gt;\u0000 &lt;mi&gt;o&lt;/mi&gt;\u0000 &lt;mi&gt;C&lt;/mi&gt;\u0000 &lt;mi&gt;l&lt;/mi&gt;\u0000 &lt;mi&gt;i&lt;/mi&gt;\u0000 &lt;mi&gt;m&lt;/mi&gt;\u0000 &lt;mi&gt;a&lt;/mi&gt;\u0000 &lt;mi&gt;t&lt;/mi&gt;\u0000 &lt;mi&gt;e&lt;/mi&gt;\u0000 &lt;mi&gt;V&lt;/mi&gt;\u0000 &lt;mi&gt;a&lt;/mi&gt;\u0000 &lt;mi&gt;R&lt;/mi&gt;&lt;/math&gt; and \u0000&lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;mi&gt;C&lt;/mi&gt;\u0000 &lt;mi&gt;o&lt;/mi&gt;\u0000 &lt;mi&gt;C&lt;/mi&gt;\u0000 &lt;mi&gt;l&lt;/mi&gt;\u0000 &lt;mi&gt;i&lt;/mi&gt;\u0000 &lt;mi&gt;m&lt;/mi&gt;\u0000 &lt;mi&gt;a&lt;/mi&gt;\u0000 &lt;mi&gt;t&lt;/mi&gt;\u0000 &lt;mi&gt;e&lt;/mi&gt;\u0000 &lt;mi&gt;E&lt;/mi&gt;\u0000 &lt;mi&gt;S&lt;/mi&gt;&lt;/math&gt;, being jointly elicitable, that capture the dependence of the European extreme bank returns on changes in carbon returns at extreme quantiles representing green and brown states. We evaluate our approach using a novel backtesting procedure and introduce related measures (\u0000&lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;mi&gt;Δ&lt;/mi&gt;\u0000 &lt;mi&gt;C&lt;/mi&gt;\u0000 &lt;mi&gt;o&lt;/mi&gt;\u0000 &lt;mi&gt;C&lt;/mi&gt;\u0000 &lt;mi&gt;l&lt;/mi&gt;\u0000 &lt;mi&gt;i&lt;/mi&gt;\u0000 &lt;mi&gt;m&lt;/mi&gt;\u0000 &lt;mi&gt;a&lt;/mi&gt;\u0000 &lt;mi&gt;t&lt;/mi&gt;\u0000 &lt;mi&gt;e&lt;/mi&gt;&lt;/math&gt; and \u0000&lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;mi&gt;E&lt;/mi&gt;\u0000 &lt;mi&gt;x&lt;/mi&gt;\u0000 &lt;mi&gt;p&lt;/mi&gt;\u0000 &lt;mi&gt;o&lt;/mi&gt;\u0000 &lt;mi&gt;s&lt;/mi&gt;\u0000 &lt;mi&gt;u&lt;/mi&gt;\u0000 &lt;mi&gt;r&lt;/mi&gt;\u0000 &lt;mi&gt;e&lt;/mi&gt;\u0000 &lt;mi&gt;C&lt;/mi&gt;\u0000 &lt;mi&gt;l&lt;/mi&gt;\u0000 &lt;mi&gt;i&lt;/mi&gt;\u0000 &lt;mi&gt;m&lt;/mi&gt;\u0000 &lt;mi&gt;a&lt;/mi&gt;\u0000 &lt;mi&gt;t&lt;/mi&gt;\u0000 &lt;mi&gt;e&lt;/mi&gt;&lt;/math&gt;). The main evidence states that the \u0000&lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;mi&gt;C&lt;/mi&gt;\u0000 &lt;mi&gt;o&lt;/mi&gt;\u0000 &lt;mi&gt;C&lt;/mi&gt;\u0000 &lt;mi&gt;l&lt;/mi&gt;\u0000 &lt;mi&gt;i&lt;/mi&gt;\u0000 &lt;mi&gt;m&lt;/mi&gt;\u0000 &lt;mi&gt;a&lt;/mi&gt;\u0000 &lt;mi&gt;t&lt;/mi&gt;\u0000 &lt;mi&gt;e&lt;/mi&gt;\u0000 &lt;mi&gt;E&lt;/mi&gt;\u0000 &lt;mi&gt;S&lt;/mi&gt;&lt;/math&gt; measure presents the highest risk for the brown (green) state due to the presence of carbon cost (carbon risk premium) in Ph.II (Ph.III) of the EU Emissions Trading System. Furthermore, we found the highest (lowest) financial risk forecasts for \u0000&lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;mi&gt;C&lt;/mi&gt;\u0000 &lt;mi&gt;o&lt;/mi&gt;\u0000 &lt;mi&gt;C&lt;/mi&gt;\u0000 &lt;mi&gt;l&lt;/mi&gt;\u0000 &lt;mi&gt;i&lt;/mi&gt;\u0000 &lt;mi&gt;m&lt;/mi&gt;\u0000 &lt;mi&gt;a&lt;/mi&gt;\u0000 &lt;mi&gt;t&lt;/mi&gt;\u0000 &lt;mi&gt;e&lt;/mi&gt;\u0000 &lt;mi&gt;E&lt;/mi&gt;\u0000 &lt;mi&gt;S&lt;/mi&gt;&lt;/math&gt; in green (brown) states during COVID-19. These results offer important implications for investors and policymakers regarding the effects of transition risk on the European financial system.&lt;","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 6","pages":"1907-1945"},"PeriodicalIF":2.7,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/for.3274","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144767378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fundamentals Models Versus Random Walk: Evidence From an Emerging Economy 基本面模型与随机漫步:来自新兴经济体的证据
IF 2.7 3区 经济学 Q1 ECONOMICS Pub Date : 2025-04-07 DOI: 10.1002/for.3279
Helder Ferreira de Mendonça, Luciano Vereda, Luan Mateus Matos de Araújo

We analyze the predictive power of fundamentals versus random walk models for horizons from 1 to 24 months in an emerging market. Specifically, we investigate what fundamentals models outperform random walk during periods of appreciation and depreciation of the exchange rate. Furthermore, we analyze whether the fundamentals models that beat random walk contain information not considered by market expectations. Based on data from the Brazilian economy, the findings point out that some fundamentals models are useful for forecasting the exchange rate. The predictive power of fundamentals models increases in periods marked by a trend of currency appreciation or depreciation. In particular, the PPP-type fundamentals models have greater predictive power than the random walk and add information to market expectations for different time horizons and periods of exchange rate appreciation and depreciation.

我们分析了基本面与随机游走模型在新兴市场1至24个月期间的预测能力。具体来说,我们研究了哪些基本面模型在汇率升值和贬值期间优于随机漫步。此外,我们分析战胜随机漫步的基本面模型是否包含市场预期未考虑的信息。基于巴西经济的数据,研究结果指出,一些基本面模型对预测汇率是有用的。在货币出现升值或贬值趋势的时期,基本面模型的预测能力会增强。特别是,ppp类型的基本面模型比随机漫步具有更大的预测能力,并为不同时间范围和汇率升值和贬值时期的市场预期增加了信息。
{"title":"Fundamentals Models Versus Random Walk: Evidence From an Emerging Economy","authors":"Helder Ferreira de Mendonça,&nbsp;Luciano Vereda,&nbsp;Luan Mateus Matos de Araújo","doi":"10.1002/for.3279","DOIUrl":"https://doi.org/10.1002/for.3279","url":null,"abstract":"<p>We analyze the predictive power of fundamentals versus random walk models for horizons from 1 to 24 months in an emerging market. Specifically, we investigate what fundamentals models outperform random walk during periods of appreciation and depreciation of the exchange rate. Furthermore, we analyze whether the fundamentals models that beat random walk contain information not considered by market expectations. Based on data from the Brazilian economy, the findings point out that some fundamentals models are useful for forecasting the exchange rate. The predictive power of fundamentals models increases in periods marked by a trend of currency appreciation or depreciation. In particular, the PPP-type fundamentals models have greater predictive power than the random walk and add information to market expectations for different time horizons and periods of exchange rate appreciation and depreciation.</p>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 6","pages":"1884-1906"},"PeriodicalIF":2.7,"publicationDate":"2025-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/for.3279","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144767496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Forecasting
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1