首页 > 最新文献

Journal of Forecasting最新文献

英文 中文
Research on occupant injury severity prediction of autonomous vehicles based on transfer learning 基于迁移学习的自动驾驶汽车乘员伤害严重程度预测研究
IF 3.4 3区 经济学 Q1 ECONOMICS Pub Date : 2024-08-12 DOI: 10.1002/for.3186
Na Yang, Dongwei Liu, Qi Liu, Zhiwei Li, Tao Liu, Jianfeng Wang, Ze Xu
The focus of the future of autonomous vehicles has shifted from feasibility to safety and comfort. The seat of an autonomous vehicle may be equipped with a rotational function, and the occupant's sitting position would be diverse. This poses a higher challenge to occupant injury protection during vehicle collisions. The main objective of the current study is to develop occupant injury prediction models for autonomous vehicles that can be used to predict the injury severity of occupants in different seat orientations and sitting positions. The first step is to establish an occupant crash model database with different seat orientations. It is used to simulate the occupant crash injury database of an autonomous vehicle, considering seat rotation and the back inclination angle. The second step is to establish a pre‐training occupant injury prediction model based on the existing database and then train the autonomous vehicle occupant injury prediction model using an in‐house database based on the transfer learning method. Occupant injury prediction models achieve good accuracy (82.8% on the numerical database and 62.9% on the real verification database) and shorter computational time (4.86 ± 0.33 ms) on the prediction tasks. Finally, the influence of the model input variables is analyzed. This study demonstrates the feasibility of using a small‐sample database based on transfer learning for occupant injury prediction in autonomous vehicles.
未来自动驾驶汽车的重点已从可行性转向安全性和舒适性。自动驾驶汽车的座椅可能具有旋转功能,乘员的坐姿也将多种多样。这对车辆碰撞时的乘员伤害保护提出了更高的挑战。本研究的主要目的是为自动驾驶汽车开发乘员伤害预测模型,用于预测不同座椅方向和坐姿下乘员的伤害严重程度。第一步是建立不同座椅方向的乘员碰撞模型数据库。它用于模拟自动驾驶汽车的乘员碰撞伤害数据库,同时考虑座椅旋转和背部倾斜角度。第二步是基于现有数据库建立预训练乘员伤害预测模型,然后基于迁移学习方法使用内部数据库训练自主车辆乘员伤害预测模型。乘员伤害预测模型在预测任务上实现了良好的准确率(在数值数据库上为 82.8%,在真实验证数据库上为 62.9%)和较短的计算时间(4.86 ± 0.33 ms)。最后,分析了模型输入变量的影响。这项研究证明了基于迁移学习的小样本数据库用于自动驾驶汽车乘员伤害预测的可行性。
{"title":"Research on occupant injury severity prediction of autonomous vehicles based on transfer learning","authors":"Na Yang, Dongwei Liu, Qi Liu, Zhiwei Li, Tao Liu, Jianfeng Wang, Ze Xu","doi":"10.1002/for.3186","DOIUrl":"https://doi.org/10.1002/for.3186","url":null,"abstract":"The focus of the future of autonomous vehicles has shifted from feasibility to safety and comfort. The seat of an autonomous vehicle may be equipped with a rotational function, and the occupant's sitting position would be diverse. This poses a higher challenge to occupant injury protection during vehicle collisions. The main objective of the current study is to develop occupant injury prediction models for autonomous vehicles that can be used to predict the injury severity of occupants in different seat orientations and sitting positions. The first step is to establish an occupant crash model database with different seat orientations. It is used to simulate the occupant crash injury database of an autonomous vehicle, considering seat rotation and the back inclination angle. The second step is to establish a pre‐training occupant injury prediction model based on the existing database and then train the autonomous vehicle occupant injury prediction model using an in‐house database based on the transfer learning method. Occupant injury prediction models achieve good accuracy (82.8% on the numerical database and 62.9% on the real verification database) and shorter computational time (4.86 ± 0.33 ms) on the prediction tasks. Finally, the influence of the model input variables is analyzed. This study demonstrates the feasibility of using a small‐sample database based on transfer learning for occupant injury prediction in autonomous vehicles.","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Macroeconomic real‐time forecasts of univariate models with flexible error structures 具有灵活误差结构的单变量模型的宏观经济实时预测
IF 3.4 3区 经济学 Q1 ECONOMICS Pub Date : 2024-08-10 DOI: 10.1002/for.3182
Kelly Trinh, Bo Zhang, Chenghan Hou
This paper investigates the importance of flexible error structure specifications in two widely used univariate models, namely, autoregressive and unobserved component models, in fitting and forecasting 20 significant US macroeconomic variables. The in‐sample estimation reveals that the models with flexible error structures provide better in‐sample fit than the univariate models with homoscedastic errors. Furthermore, the density forecast analysis suggests that accommodating heavy tail, stochastic volatility, and serial correlation in error structures leads to significant improvements in short‐term forecasts. For most macroeconomic variables, the univariate models tend to yield more accurate one‐step‐ahead forecasts than the multivariate (vector autoregressive) models in terms of both point and density forecasts.
本文研究了自回归模型和无观测成分模型这两种广泛使用的单变量模型中灵活误差结构规格在拟合和预测 20 个重要美国宏观经济变量中的重要性。样本内估计结果显示,具有灵活误差结构的模型比具有同方差误差的单变量模型具有更好的样本内拟合效果。此外,密度预测分析表明,在误差结构中考虑重尾、随机波动性和序列相关性可显著改善短期预测。对于大多数宏观经济变量,单变量模型往往比多变量(向量自回归)模型的点预测和密度预测更准确。
{"title":"Macroeconomic real‐time forecasts of univariate models with flexible error structures","authors":"Kelly Trinh, Bo Zhang, Chenghan Hou","doi":"10.1002/for.3182","DOIUrl":"https://doi.org/10.1002/for.3182","url":null,"abstract":"This paper investigates the importance of flexible error structure specifications in two widely used univariate models, namely, autoregressive and unobserved component models, in fitting and forecasting 20 significant US macroeconomic variables. The in‐sample estimation reveals that the models with flexible error structures provide better in‐sample fit than the univariate models with homoscedastic errors. Furthermore, the density forecast analysis suggests that accommodating heavy tail, stochastic volatility, and serial correlation in error structures leads to significant improvements in short‐term forecasts. For most macroeconomic variables, the univariate models tend to yield more accurate one‐step‐ahead forecasts than the multivariate (vector autoregressive) models in terms of both point and density forecasts.","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141920406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Short‐term multivariate airworthiness forecasting based on decomposition and deep prediction models 基于分解和深度预测模型的短期多变量适航性预测
IF 3.4 3区 经济学 Q1 ECONOMICS Pub Date : 2024-08-07 DOI: 10.1002/for.3179
Ali Tatli, Tansu Filik, Erdogan Bocu, Hikmet Tahir Karakoc
This study introduces a model for predicting airworthiness in terms of meteorology information within the viewpoint of not only formal regulations but also informal rules based on acquired indicators from flight training organization experience (AIs‐FTOE). The case study is carried out in the Hasan Polatkan Airport which is used by the Department of Flight Training of Eskişehir Technical University (ESTU‐P), which is also recognized as a flight training organization. Within the study, the constraints (derived from regulations and AIs‐FTOE) and the data set used in models are explained. Also, the models are introduced based on the gated recurrent unit (GRU) and long short‐term memory (LSTM) with the use of empirical mode decomposition (EMD) and variational mode decomposition (VMD). Finally, a model‐selective mechanism (MSM) is proposed to use the models in common. The findings show that the models presented in the study produce successful results that can be used in flight training organization's (FTO) planning studies. The MSM uses GRU and LSTM together with decomposition techniques to provide more advanced prediction capabilities. When the literature is examined, it is observed that although meteorological conditions are of vital importance in the efficiency of FTOs, there are not enough studies on airworthiness based on meteorology. So, a model that will assist in scheduling plans is presented for FTOs. Airworthiness analysis of forecasting can provide a comprehensive reference to support planning efficiency in FTOs. To the authors' knowledge, this study will be the first in the literature on airworthiness that presents the MSM using a hybrid deep learning algorithm and decomposition of time series models in concurrent.
本研究从气象信息的角度引入了一个适航性预测模型,该模型不仅基于正式法规,还基于基于飞行训练组织经验(AIs-FTOE)所获得指标的非正式规则。案例研究在埃斯基谢希尔技术大学(ESTU-P)飞行培训部使用的哈桑-波拉特坎机场进行,该机场也是公认的飞行培训机构。本研究对模型中使用的约束条件(源自法规和 AIs-FTOE)和数据集进行了解释。此外,还介绍了基于门控循环单元(GRU)和长短期记忆(LSTM)的模型,并使用了经验模式分解(EMD)和变异模式分解(VMD)。最后,还提出了一种模型选择机制(MSM),以使用共同的模型。研究结果表明,研究中提出的模型能产生成功的结果,可用于飞行训练组织(FTO)的规划研究。MSM 使用 GRU 和 LSTM 以及分解技术来提供更先进的预测能力。在研究文献时,我们发现虽然气象条件对 FTO 的效率至关重要,但基于气象学的适航性研究还不够多。因此,我们提出了一个有助于为 FTO 制定排班计划的模型。适航性预报分析可为支持 FTO 计划效率提供全面参考。据作者所知,这项研究将是适航性文献中首次使用混合深度学习算法和并发时间序列模型分解提出 MSM。
{"title":"Short‐term multivariate airworthiness forecasting based on decomposition and deep prediction models","authors":"Ali Tatli, Tansu Filik, Erdogan Bocu, Hikmet Tahir Karakoc","doi":"10.1002/for.3179","DOIUrl":"https://doi.org/10.1002/for.3179","url":null,"abstract":"This study introduces a model for predicting airworthiness in terms of meteorology information within the viewpoint of not only formal regulations but also informal rules based on acquired indicators from flight training organization experience (AIs‐FTOE). The case study is carried out in the Hasan Polatkan Airport which is used by the Department of Flight Training of Eskişehir Technical University (ESTU‐P), which is also recognized as a flight training organization. Within the study, the constraints (derived from regulations and AIs‐FTOE) and the data set used in models are explained. Also, the models are introduced based on the gated recurrent unit (GRU) and long short‐term memory (LSTM) with the use of empirical mode decomposition (EMD) and variational mode decomposition (VMD). Finally, a model‐selective mechanism (MSM) is proposed to use the models in common. The findings show that the models presented in the study produce successful results that can be used in flight training organization's (FTO) planning studies. The MSM uses GRU and LSTM together with decomposition techniques to provide more advanced prediction capabilities. When the literature is examined, it is observed that although meteorological conditions are of vital importance in the efficiency of FTOs, there are not enough studies on airworthiness based on meteorology. So, a model that will assist in scheduling plans is presented for FTOs. Airworthiness analysis of forecasting can provide a comprehensive reference to support planning efficiency in FTOs. To the authors' knowledge, this study will be the first in the literature on airworthiness that presents the MSM using a hybrid deep learning algorithm and decomposition of time series models in concurrent.","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Shapley-value-based forecast combination 基于形状值的预测组合
IF 3.4 3区 经济学 Q1 ECONOMICS Pub Date : 2024-08-07 DOI: 10.1002/for.3178
Philip Hans Franses, Jiahui Zou, Wendun Wang

This paper puts forward a new and simple method to combine forecasts, which is particularly useful when the forecasts are strongly correlated. It is based on the Mincer Zarnowitz regression, and a subsequent determination using Shapley values of the weights of the forecasts in a new combination. For a stylized case, it is proved that such a Shapley-value-based combination improves upon an equal-weight combination. Simulation experiments and a detailed illustration show the merits of the Shapley-value-based forecast combination.

本文提出了一种新的、简单的预测组合方法,这种方法在预测高度相关时特别有用。该方法基于 Mincer Zarnowitz 回归,随后使用 Shapley 值确定新组合中预测的权重。在一个典型的案例中,证明了这种基于夏普利值的组合比等权重组合更好。模拟实验和详细说明显示了基于夏普利值的预测组合的优点。
{"title":"Shapley-value-based forecast combination","authors":"Philip Hans Franses,&nbsp;Jiahui Zou,&nbsp;Wendun Wang","doi":"10.1002/for.3178","DOIUrl":"10.1002/for.3178","url":null,"abstract":"<p>This paper puts forward a new and simple method to combine forecasts, which is particularly useful when the forecasts are strongly correlated. It is based on the Mincer Zarnowitz regression, and a subsequent determination using Shapley values of the weights of the forecasts in a new combination. For a stylized case, it is proved that such a Shapley-value-based combination improves upon an equal-weight combination. Simulation experiments and a detailed illustration show the merits of the Shapley-value-based forecast combination.</p>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/for.3178","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
New forecasting methods for an old problem: Predicting 147 years of systemic financial crises 老问题的新预测方法:预测 147 年的系统性金融危机
IF 3.4 3区 经济学 Q1 ECONOMICS Pub Date : 2024-08-07 DOI: 10.1002/for.3184
Emile du Plessis, Ulrich Fritsche
This paper develops new forecasting methods for an old and ongoing problem by employing 13 machine learning algorithms to study 147 years of systemic financial crises across 17 countries. Findings suggest that fixed capital formation is the most important variable. GDP per capita and consumer inflation have increased in prominence whereas debt‐to‐GDP, stock market, and consumption were dominant at the turn of the 20th century. A lag structure and rolling window both improve on optimized contemporaneous and individual country formats. Through a lag structure, banking sector predictors on average describe 28% of the variation in crisis prevalence, the real sector 64%, and the external sector 8%. Nearly half of all algorithms reach peak performance through a lag structure. As measured through AUC, and Brier scores, top‐performing machine learning methods consistently produce high accuracy rates, with both random forests and gradient boosting in front with 77% correct forecasts, and consistently outperform traditional regression algorithms. Learning from other countries improves predictive strength, and non‐linear models generally deliver higher accuracy rates than linear models. Algorithms retaining all variables perform better than those minimizing the influence of variables.
本文采用 13 种机器学习算法,对 17 个国家 147 年的系统性金融危机进行了研究,为这一持续存在的老问题开发了新的预测方法。研究结果表明,固定资本形成是最重要的变量。人均 GDP 和消费通胀的重要性日益突出,而在 20 世纪之交,债务与 GDP 的比率、股票市场和消费则占主导地位。滞后结构和滚动窗口都改进了同期和单个国家的优化格式。通过滞后结构,银行业预测指标平均描述了危机发生率变化的 28%,实体经济预测指标的 64%,对外经济预测指标的 8%。近一半的算法通过滞后结构达到了峰值性能。通过 AUC 和 Brier 分数衡量,表现最佳的机器学习方法始终保持较高的准确率,其中随机森林和梯度提升以 77% 的正确预测率遥遥领先,并始终优于传统回归算法。向其他国家学习可提高预测能力,非线性模型的准确率通常高于线性模型。保留所有变量的算法比最小化变量影响的算法表现更好。
{"title":"New forecasting methods for an old problem: Predicting 147 years of systemic financial crises","authors":"Emile du Plessis, Ulrich Fritsche","doi":"10.1002/for.3184","DOIUrl":"https://doi.org/10.1002/for.3184","url":null,"abstract":"This paper develops new forecasting methods for an old and ongoing problem by employing 13 machine learning algorithms to study 147 years of systemic financial crises across 17 countries. Findings suggest that fixed capital formation is the most important variable. GDP per capita and consumer inflation have increased in prominence whereas debt‐to‐GDP, stock market, and consumption were dominant at the turn of the 20th century. A lag structure and rolling window both improve on optimized contemporaneous and individual country formats. Through a lag structure, banking sector predictors on average describe 28% of the variation in crisis prevalence, the real sector 64%, and the external sector 8%. Nearly half of all algorithms reach peak performance through a lag structure. As measured through AUC, and Brier scores, top‐performing machine learning methods consistently produce high accuracy rates, with both random forests and gradient boosting in front with 77% correct forecasts, and consistently outperform traditional regression algorithms. Learning from other countries improves predictive strength, and non‐linear models generally deliver higher accuracy rates than linear models. Algorithms retaining all variables perform better than those minimizing the influence of variables.","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A GARCH model selection and estimation method based on neural network with the loss function of mean square error and model confidence set 基于均方误差损失函数和模型置信集的神经网络 GARCH 模型选择和估计方法
IF 3.4 3区 经济学 Q1 ECONOMICS Pub Date : 2024-08-07 DOI: 10.1002/for.3175
Yanhao Huang, Ruibin Ren

This paper proposes a method that uses mean square error (MSE) and model confidence set (MCS) as the loss function of back-propagation neural network (BPNN), aiming to train and find a generalized autoregressive conditional heteroskedastic (GARCH) model that has the best forecasting performance of a time series. Combining MSE and the p-value of MCS can not only estimate better parameters for the GARCH models but also find the best GARCH model to forecast the volatility of a time series. Meanwhile, we divide a time series into several parts and use each part as the input of the BPNN. Through the BPNN, each part of the time series will be turned into several forecasting values. These values will be used to calculate the MSE and the p-value of MCS, which will then be used to update the parameters of the BPNN. In the end, we use MCS to choose the best GARCH model among the trained GARCH models and compare this method with maximum likelihood estimation (MLE) and the generalized least squares estimation (GLS). The result shows that the p-value of MCS of the best model estimated by this method is higher than the p-value of MCS of the best model estimated by MLE and GLS. According to the theory of MCS, a model that has a larger p-value does have a better forecasting performance. The method proposed by this paper can provide a potential application of neural network in GARCH model forecasting and estimation.

本文提出了一种将均方误差(MSE)和模型置信集(MCS)作为反向传播神经网络(BPNN)损失函数的方法,旨在训练并找到对时间序列具有最佳预测性能的广义自回归条件异方差(GARCH)模型。结合 MSE 和 MCS 的 p 值,不仅可以估计出更好的 GARCH 模型参数,还能找到预测时间序列波动性的最佳 GARCH 模型。同时,我们将时间序列分为几个部分,并将每个部分作为 BPNN 的输入。通过 BPNN,时间序列的每一部分都将转化为多个预测值。这些值将用于计算 MCS 的 MSE 和 P 值,然后用于更新 BPNN 的参数。最后,我们使用 MCS 从训练好的 GARCH 模型中选择最佳 GARCH 模型,并将此方法与最大似然估计(MLE)和广义最小二乘估计(GLS)进行比较。结果表明,该方法估计的最佳模型的 MCS 的 p 值高于 MLE 和 GLS 估计的最佳模型的 MCS 的 p 值。根据 MCS 理论,p 值越大的模型预测效果越好。本文提出的方法为神经网络在 GARCH 模型预测和估计中的应用提供了可能。
{"title":"A GARCH model selection and estimation method based on neural network with the loss function of mean square error and model confidence set","authors":"Yanhao Huang,&nbsp;Ruibin Ren","doi":"10.1002/for.3175","DOIUrl":"10.1002/for.3175","url":null,"abstract":"<p>This paper proposes a method that uses mean square error (MSE) and model confidence set (MCS) as the loss function of back-propagation neural network (BPNN), aiming to train and find a generalized autoregressive conditional heteroskedastic (GARCH) model that has the best forecasting performance of a time series. Combining MSE and the <i>p</i>-value of MCS can not only estimate better parameters for the GARCH models but also find the best GARCH model to forecast the volatility of a time series. Meanwhile, we divide a time series into several parts and use each part as the input of the BPNN. Through the BPNN, each part of the time series will be turned into several forecasting values. These values will be used to calculate the MSE and the <i>p</i>-value of MCS, which will then be used to update the parameters of the BPNN. In the end, we use MCS to choose the best GARCH model among the trained GARCH models and compare this method with maximum likelihood estimation (MLE) and the generalized least squares estimation (GLS). The result shows that the <i>p</i>-value of MCS of the best model estimated by this method is higher than the <i>p</i>-value of MCS of the best model estimated by MLE and GLS. According to the theory of MCS, a model that has a larger <i>p</i>-value does have a better forecasting performance. The method proposed by this paper can provide a potential application of neural network in GARCH model forecasting and estimation.</p>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of a precise ensemble expert system for crop yield prediction using machine learning analytics 利用机器学习分析设计用于作物产量预测的精确集合专家系统
IF 3.4 3区 经济学 Q1 ECONOMICS Pub Date : 2024-08-05 DOI: 10.1002/for.3183
Deeksha Tripathi, Saroj K. Biswas

Agriculture is facing significant challenges in the development of crop yield forecasts, which are important aspects of decision-making at the international, regional, and local levels. The area of agriculture is attracting growing attention because of increasing the demand for food supplies. To ensure future food supplies, crop yield prediction (CYP) provides the best decision-making to assist farmers in agricultural yield forecasting efficiently. Nevertheless, CYP is a difficult endeavor because of the intricacy of the underlying mechanisms and the effect of numerous factors, including weather patterns, soil characteristics, and crop management techniques. In today's era, ensemble learning (EL) approaches have recently demonstrated significant promise for enhancing the reliability and accuracy of CYP. The success of the EL techniques depends on several facts, including how the base learner models are trained and how these are combined. This study provides important insights into the EL techniques for CYP. This paper proposes an expert system model named precise ensemble expert system for crop yield prediction (PEESCYP) to predict the best crop for agricultural land. The proposed PEESCYP model employs multiple imputation by chained equation (MICE) data imputation technique to treat the missing values of the collected dataset, the isolation forest (IF) technique for outlier detection, the ant colony optimization (ACO) technique to perform feature selection, robust scaling (RS) technique to perform data normalization, and the extra tree (ET) is used for classification to overcome the variance and overfitting problem of the single classifiers. The measurements of the proposed PEESCYP model have been collected by means of accuracy, precision, recall, and F-1 score using a prepared dataset, which is collected from International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), and the proposed model is compared with different single-classifier based ML models, EL models, and various existing models available in the literature. The results of this experiment underline that the proposed PEESCYP model outperforms the others.

作物产量预测是国际、地区和地方各级决策的重要方面,而农业在发展作物产量预测方面正面临着重大挑战。由于对粮食供应的需求日益增长,农业领域正受到越来越多的关注。为确保未来的粮食供应,农作物产量预测(CYP)提供了最佳决策,可帮助农民有效地进行农业产量预测。然而,作物产量预测是一项艰巨的任务,因为其背后的机制错综复杂,并受到天气模式、土壤特性和作物管理技术等众多因素的影响。在当今时代,集合学习(EL)方法在提高作物产量预测的可靠性和准确性方面大有可为。集合学习技术的成功取决于多个因素,包括如何训练基础学习模型以及如何将这些模型组合在一起。本研究为 CYP 的 EL 技术提供了重要的启示。本文提出了一种专家系统模型,名为作物产量预测精确集合专家系统(PEESCYP),用于预测农田的最佳作物。所提出的 PEESCYP 模型采用链式方程多重估算(MICE)数据估算技术处理所收集数据集的缺失值,采用隔离森林(IF)技术检测离群值,采用蚁群优化(ACO)技术进行特征选择,采用稳健缩放(RS)技术进行数据归一化,并采用额外树(ET)进行分类,以克服单一分类器的方差和过拟合问题。利用从国际半干旱热带作物研究所(ICRISAT)收集的数据集,通过准确度、精确度、召回率和 F-1 分数对所提出的 PEESCYP 模型进行了测量,并将所提出的模型与不同的基于单分类器的 ML 模型、EL 模型以及文献中现有的各种模型进行了比较。实验结果表明,所提出的 PEESCYP 模型优于其他模型。
{"title":"Design of a precise ensemble expert system for crop yield prediction using machine learning analytics","authors":"Deeksha Tripathi,&nbsp;Saroj K. Biswas","doi":"10.1002/for.3183","DOIUrl":"10.1002/for.3183","url":null,"abstract":"<p>Agriculture is facing significant challenges in the development of crop yield forecasts, which are important aspects of decision-making at the international, regional, and local levels. The area of agriculture is attracting growing attention because of increasing the demand for food supplies. To ensure future food supplies, crop yield prediction (CYP) provides the best decision-making to assist farmers in agricultural yield forecasting efficiently. Nevertheless, CYP is a difficult endeavor because of the intricacy of the underlying mechanisms and the effect of numerous factors, including weather patterns, soil characteristics, and crop management techniques. In today's era, ensemble learning (EL) approaches have recently demonstrated significant promise for enhancing the reliability and accuracy of CYP. The success of the EL techniques depends on several facts, including how the base learner models are trained and how these are combined. This study provides important insights into the EL techniques for CYP. This paper proposes an expert system model named precise ensemble expert system for crop yield prediction (PEESCYP) to predict the best crop for agricultural land. The proposed PEESCYP model employs multiple imputation by chained equation (MICE) data imputation technique to treat the missing values of the collected dataset, the isolation forest (IF) technique for outlier detection, the ant colony optimization (ACO) technique to perform feature selection, robust scaling (RS) technique to perform data normalization, and the extra tree (ET) is used for classification to overcome the variance and overfitting problem of the single classifiers. The measurements of the proposed PEESCYP model have been collected by means of accuracy, precision, recall, and F-1 score using a prepared dataset, which is collected from International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), and the proposed model is compared with different single-classifier based ML models, EL models, and various existing models available in the literature. The results of this experiment underline that the proposed PEESCYP model outperforms the others.</p>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corporate financial distress prediction in a transition economy 转型经济中的公司财务危机预测
IF 3.4 3区 经济学 Q1 ECONOMICS Pub Date : 2024-08-05 DOI: 10.1002/for.3177
Minh Nguyen, Bang Nguyen, Minh-Lý Liêu

Forecasting financial distress of corporations is a difficult task in economies undergoing transition, as data are scarce and are highly imbalanced. This research tackles these difficulties by gathering reliable financial distress data in the context of a transition economy and employing the synthetic minority oversampling technique (SMOTE). The study employs seven different models, including linear discriminant analysis (LDA), logistic regression (LR), support vector machines (SVMs), neural networks (NNs), decision trees (DTs), random forests (RFs), and the Merton model, to predict financial distress among publicly traded companies in Vietnam between 2011 and 2021. The first six models use accounting-based variables, while the Merton model utilizes market-based variables. The findings indicate that while all models perform fairly well in predicting results for nondelisted firms, they perform somewhat poorly in predicting results for delisted firms in terms of various measures including balanced accuracy, Matthews correlation coefficient (MCC), precision, recall, and F1 score. The study shows that the models that incorporate both Altman's and Ohlson's variables consistently outperform those that only use Altman's or Ohlson's variables in terms of balanced accuracy. Additionally, the study finds that NNs are generally the most effective models in terms of both balanced accuracy and MCC. The most important variable in Altman's variables as well as the combination of Altman's and Ohlson's variables is “reat” (retained earnings on total assets), whereas “ltat” (total liabilities on total assets) and “wcapat” (working capital on total assets) are the most important variables in Ohlson's variables. The study also reveals that in most cases, the models perform better in predicting results for big firms than for small firms and typically better than in good years than for bad years.

在转型期经济体中,由于数据稀缺且高度不平衡,预测企业的财务困境是一项艰巨的任务。本研究通过收集转型经济体中可靠的财务困境数据,并采用合成少数群体过度取样技术(SMOTE)来解决这些难题。研究采用了七种不同的模型,包括线性判别分析(LDA)、逻辑回归(LR)、支持向量机(SVM)、神经网络(NN)、决策树(DT)、随机森林(RF)和默顿模型,以预测 2011 年至 2021 年越南上市公司的财务困境。前六个模型使用基于会计的变量,而默顿模型使用基于市场的变量。研究结果表明,虽然所有模型在预测非上市公司的结果方面都表现得相当好,但在预测退市公司的结果方面,它们在平衡准确度、马太相关系数(MCC)、精确度、召回率和得分等各种指标上的表现都略逊一筹。研究表明,同时包含 Altman 变量和 Ohlson 变量的模型在均衡准确性方面始终优于仅使用 Altman 或 Ohlson 变量的模型。此外,研究还发现,就平衡精度和 MCC 而言,NN 通常是最有效的模型。Altman 变量以及 Altman 和 Ohlson 变量组合中最重要的变量是 "reat"(总资产留存收益),而 "ltat"(总资产负债)和 "wcapat"(总资产营运资本)是 Ohlson 变量中最重要的变量。研究还表明,在大多数情况下,模型对大公司业绩的预测效果要好于对小公司业绩的预测效果,通常在好的年份要好于坏的年份。
{"title":"Corporate financial distress prediction in a transition economy","authors":"Minh Nguyen,&nbsp;Bang Nguyen,&nbsp;Minh-Lý Liêu","doi":"10.1002/for.3177","DOIUrl":"10.1002/for.3177","url":null,"abstract":"<p>Forecasting financial distress of corporations is a difficult task in economies undergoing transition, as data are scarce and are highly imbalanced. This research tackles these difficulties by gathering reliable financial distress data in the context of a transition economy and employing the synthetic minority oversampling technique (SMOTE). The study employs seven different models, including linear discriminant analysis (LDA), logistic regression (LR), support vector machines (SVMs), neural networks (NNs), decision trees (DTs), random forests (RFs), and the Merton model, to predict financial distress among publicly traded companies in Vietnam between 2011 and 2021. The first six models use accounting-based variables, while the Merton model utilizes market-based variables. The findings indicate that while all models perform fairly well in predicting results for nondelisted firms, they perform somewhat poorly in predicting results for delisted firms in terms of various measures including balanced accuracy, Matthews correlation coefficient (MCC), precision, recall, and \u0000<span></span><math>\u0000 <msub>\u0000 <mrow>\u0000 <mi>F</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mn>1</mn>\u0000 </mrow>\u0000 </msub></math> score. The study shows that the models that incorporate both Altman's and Ohlson's variables consistently outperform those that only use Altman's or Ohlson's variables in terms of balanced accuracy. Additionally, the study finds that NNs are generally the most effective models in terms of both balanced accuracy and MCC. The most important variable in Altman's variables as well as the combination of Altman's and Ohlson's variables is “reat” (retained earnings on total assets), whereas “ltat” (total liabilities on total assets) and “wcapat” (working capital on total assets) are the most important variables in Ohlson's variables. The study also reveals that in most cases, the models perform better in predicting results for big firms than for small firms and typically better than in good years than for bad years.</p>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High frequency volatility of oil futures in China: Components, modeling, and prediction 中国石油期货的高频波动:成分、建模和预测
IF 3.4 3区 经济学 Q1 ECONOMICS Pub Date : 2024-08-02 DOI: 10.1002/for.3173
Yi Hong, Xiaofan Xu, Chen Yang

This paper investigates the high-frequency volatility modeling and prediction for crude oil futures in China, a new asset class emerging in recent years. Two volatility measures, the realized variance (RV) and realized bi-power variations (RBV) are constructed at various frequencies by virtue of 1-minute crude oil futures prices. The distinctive components of these volatility estimators are further identified to exploit the information contents in the in-sample explanatory power of the realized variance dynamics and the out-of-sample prediction of realized variance across different horizons, leading to four new HAR-RV-type models. First, the empirical results show that the continuous component of the weekly realized variance, representing investors' trading behavior in the medium-term, is the dominant factor driving up volatility trends in China's crude oil futures market over a range of market conditions. Second, the monthly jump component in realized variance presents the significant in-sample explanatory power, and yet marginally improves prediction performance in realized variance during the two out-of-sample periods. Finally, these results are robust toward various market/model setups, over day- and night-trading hours, and across a range of prediction horizons and relative to prediction benchmarks.

中国原油期货是近年来新兴的资产类别,本文研究了中国原油期货的高频波动率建模与预测。根据 1 分钟原油期货价格,构建了不同频率的两种波动率指标,即已实现方差(RV)和已实现双幂变率(RBV)。进一步确定了这些波动率估计器的独特成分,以利用已实现方差动态的样本内解释力和已实现方差在不同期限的样本外预测中的信息含量,从而得出四个新的 HAR-RV 型模型。首先,实证结果表明,在不同市场条件下,代表投资者中期交易行为的周已实现方差连续分量是推动中国原油期货市场波动趋势上升的主导因素。其次,已实现方差中的月度跳跃成分具有显著的样本内解释力,但在两个样本外期间却略微提高了已实现方差的预测性能。最后,这些结果对各种市场/模型设置、日间和夜间交易时间、各种预测范围以及相对于预测基准都是稳健的。
{"title":"High frequency volatility of oil futures in China: Components, modeling, and prediction","authors":"Yi Hong,&nbsp;Xiaofan Xu,&nbsp;Chen Yang","doi":"10.1002/for.3173","DOIUrl":"https://doi.org/10.1002/for.3173","url":null,"abstract":"<p>This paper investigates the high-frequency volatility modeling and prediction for crude oil futures in China, a new asset class emerging in recent years. Two volatility measures, the realized variance (\u0000<span></span><math>\u0000 <mi>RV</mi></math>) and realized bi-power variations (\u0000<span></span><math>\u0000 <mi>RBV</mi></math>) are constructed at various frequencies by virtue of 1-minute crude oil futures prices. The distinctive components of these volatility estimators are further identified to exploit the information contents in the in-sample explanatory power of the realized variance dynamics and the out-of-sample prediction of realized variance across different horizons, leading to four new HAR-RV-type models. First, the empirical results show that the continuous component of the weekly realized variance, representing investors' trading behavior in the medium-term, is the dominant factor driving up volatility trends in China's crude oil futures market over a range of market conditions. Second, the monthly jump component in realized variance presents the significant in-sample explanatory power, and yet marginally improves prediction performance in realized variance during the two out-of-sample periods. Finally, these results are robust toward various market/model setups, over day- and night-trading hours, and across a range of prediction horizons and relative to prediction benchmarks.</p>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142579654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecast performance of noncausal autoregressions and the importance of unit root pretesting 非因果自回归的预测性能和单位根预检验的重要性
IF 3.4 3区 经济学 Q1 ECONOMICS Pub Date : 2024-07-31 DOI: 10.1002/for.3172
Frédérique Bec, Heino Bohn Nielsen

Based on a large simulation study, this paper investigates which strategy to adopt in order to choose the most accurate forecasting model for mixed causal-noncausal autoregressions (MAR) data generating processes: always differencing (D), never differencing (L), or unit root pretesting (P). Relying on recent econometric developments regarding forecasting and unit root testing in the MAR framework, the main results suggest that from a practitioner's point of view, the P strategy at the 10% level is a good compromise. In fact, it never departs too much from the best model in terms of forecast accuracy, unlike the L (respectively, D) strategy when the DGP becomes very persistent (respectively, less persistent). This approach is illustrated using recent monthly Brent crude oil price data.

本文基于一项大型模拟研究,探讨了在因果-非因果混合自回归(MAR)数据生成过程中,应采用哪种策略来选择最准确的预测模型:始终差分(D)、从不差分(L)或单位根预测(P)。根据 MAR 框架中有关预测和单位根检验的最新计量经济学发展,主要结果表明,从实践者的角度来看,10% 水平的 P 策略是一个很好的折衷方案。事实上,就预测准确性而言,它从未偏离最佳模型太多,不像 L(分别为 D)策略那样在 DGP 变得非常持久(分别为较不持久)时那样。我们用最近的布伦特原油月度价格数据来说明这种方法。
{"title":"Forecast performance of noncausal autoregressions and the importance of unit root pretesting","authors":"Frédérique Bec,&nbsp;Heino Bohn Nielsen","doi":"10.1002/for.3172","DOIUrl":"10.1002/for.3172","url":null,"abstract":"<p>Based on a large simulation study, this paper investigates which strategy to adopt in order to choose the most accurate forecasting model for mixed causal-noncausal autoregressions (MAR) data generating processes: always differencing (D), never differencing (L), or unit root pretesting (P). Relying on recent econometric developments regarding forecasting and unit root testing in the MAR framework, the main results suggest that from a practitioner's point of view, the P strategy at the 10% level is a good compromise. In fact, it never departs too much from the best model in terms of forecast accuracy, unlike the L (respectively, D) strategy when the DGP becomes very persistent (respectively, less persistent). This approach is illustrated using recent monthly Brent crude oil price data.</p>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/for.3172","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141869804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Forecasting
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1