Eko Putra Wahyuddin , Rezzy Eko Caraka , Robert Kurniawan , Wahyu Caesarendra , Prana Ugiana Gio , Bens Pardamean
{"title":"Improved LSTM hyperparameters alongside sentiment walk-forward validation for time series prediction","authors":"Eko Putra Wahyuddin , Rezzy Eko Caraka , Robert Kurniawan , Wahyu Caesarendra , Prana Ugiana Gio , Bens Pardamean","doi":"10.1016/j.joitmc.2024.100458","DOIUrl":null,"url":null,"abstract":"<div><div>This study aims to address the common issue of biased estimation errors in time series modeling by analyzing the error in locating ideal hyperparameters and defining appropriate validation methods. Specifically, it focuses on predicting the stock price of Bank Rakyat Indonesia using a combination of historical stock prices, technical indicators, exchange rates, and news sentiment data, while determining the optimal variables for deep learning models. Employing a deep learning-based Long-Short Term Memory (LSTM) model, the study optimizes hyperparameters alongside walk-forward validation for time series prediction. It explores different combinations of variables and adapts the sliding window approach to the context of the data. The results highlight the importance of optimizing hyperparameters and utilizing walk-forward validation for accurate time series prediction. The model incorporating historical stock prices and sentiment scores outperforms others, achieving an RMSE of 96.61 and MAE of 86.97. Incorporating sentiment scores reduces RMSE by 39.55 % compared to models using only historical stock prices, while adding technical indicators does not yield improvement. This study contributes to the field by addressing the issue of biased estimation errors in time series modeling, offering insights into the optimization of hyperparameters and validation methods for accurate predictions. It also underscores the significance of incorporating sentiment analysis alongside historical stock prices for improved forecasting accuracy.</div></div>","PeriodicalId":16678,"journal":{"name":"Journal of Open Innovation: Technology, Market, and Complexity","volume":"11 1","pages":"Article 100458"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Open Innovation: Technology, Market, and Complexity","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S219985312400252X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Economics, Econometrics and Finance","Score":null,"Total":0}
引用次数: 0
Abstract
This study aims to address the common issue of biased estimation errors in time series modeling by analyzing the error in locating ideal hyperparameters and defining appropriate validation methods. Specifically, it focuses on predicting the stock price of Bank Rakyat Indonesia using a combination of historical stock prices, technical indicators, exchange rates, and news sentiment data, while determining the optimal variables for deep learning models. Employing a deep learning-based Long-Short Term Memory (LSTM) model, the study optimizes hyperparameters alongside walk-forward validation for time series prediction. It explores different combinations of variables and adapts the sliding window approach to the context of the data. The results highlight the importance of optimizing hyperparameters and utilizing walk-forward validation for accurate time series prediction. The model incorporating historical stock prices and sentiment scores outperforms others, achieving an RMSE of 96.61 and MAE of 86.97. Incorporating sentiment scores reduces RMSE by 39.55 % compared to models using only historical stock prices, while adding technical indicators does not yield improvement. This study contributes to the field by addressing the issue of biased estimation errors in time series modeling, offering insights into the optimization of hyperparameters and validation methods for accurate predictions. It also underscores the significance of incorporating sentiment analysis alongside historical stock prices for improved forecasting accuracy.