On the Impact of News for Reliable Stock Market Predictions: An LSTM-based Ensemble using FinBERT Word-Embeddings

Mohsen A. Hassan, Aliaa Youssif, Osama Imam, A. Ghoneim
{"title":"On the Impact of News for Reliable Stock Market Predictions: An LSTM-based Ensemble using FinBERT Word-Embeddings","authors":"Mohsen A. Hassan, Aliaa Youssif, Osama Imam, A. Ghoneim","doi":"10.37394/23205.2022.21.36","DOIUrl":null,"url":null,"abstract":"Stock market (SM) prediction methods can be divided into two categories based on the number of information sources used: single-source methods and dual-source approaches. To estimate the price of a stock, single-source approaches rely solely on numerical data. The Efficient Market Hypothesis (EMH), [1]. States that the stock price will represent all important information. Different sources of information might complement one another and influence the stock price. Machine learning and deep learning techniques have long been used to anticipate stock market movements, [2], [3]. The researcher gathered the dataset, [4], [5], [6], [7]. The dataset contains the date of the reading, the opening price, the high and low value of the stock, news about the stock, and the volume. The researcher uses a variety of machine Learning and deep learning approaches to compare performance and prediction error rates, in addition, the researcher also compared the effect of adding the news text as a feature and as a label model. and using a dedicated model for news sentiment analysis by applying the FinBERT word embedding and using them to construct a Long Short-Term Memory (LSTM). From our observation, it is evident that Deep learning-based models performed better than their Machine learning counterparts. The author shows that information extracted from news sources is better at predicting rather than its direction of price movement. And the best-performing model without news is the LSTM with an RMSE of 0.0259 while the best-performing model with news is the LSTM with a stand-alone and LSTM model for news yields RMSE of 0.0220.","PeriodicalId":332148,"journal":{"name":"WSEAS TRANSACTIONS ON COMPUTERS","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"WSEAS TRANSACTIONS ON COMPUTERS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37394/23205.2022.21.36","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Stock market (SM) prediction methods can be divided into two categories based on the number of information sources used: single-source methods and dual-source approaches. To estimate the price of a stock, single-source approaches rely solely on numerical data. The Efficient Market Hypothesis (EMH), [1]. States that the stock price will represent all important information. Different sources of information might complement one another and influence the stock price. Machine learning and deep learning techniques have long been used to anticipate stock market movements, [2], [3]. The researcher gathered the dataset, [4], [5], [6], [7]. The dataset contains the date of the reading, the opening price, the high and low value of the stock, news about the stock, and the volume. The researcher uses a variety of machine Learning and deep learning approaches to compare performance and prediction error rates, in addition, the researcher also compared the effect of adding the news text as a feature and as a label model. and using a dedicated model for news sentiment analysis by applying the FinBERT word embedding and using them to construct a Long Short-Term Memory (LSTM). From our observation, it is evident that Deep learning-based models performed better than their Machine learning counterparts. The author shows that information extracted from news sources is better at predicting rather than its direction of price movement. And the best-performing model without news is the LSTM with an RMSE of 0.0259 while the best-performing model with news is the LSTM with a stand-alone and LSTM model for news yields RMSE of 0.0220.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
新闻对可靠股市预测的影响:使用FinBERT词嵌入的基于lstm的集成
股票市场预测方法根据使用的信息源的数量可以分为两类:单源方法和双源方法。为了估计股票的价格,单一来源方法完全依赖于数字数据。有效市场假说(EMH), 2010。声明股票价格将代表所有重要信息。不同的信息来源可能相互补充,从而影响股票价格。机器学习和深度学习技术早就被用来预测股市走势,b[3], b[3]。研究人员收集了数据集[4][5][6][7]。数据集包含读数的日期、开盘价、股票的高价和低价、有关股票的新闻和成交量。研究人员使用多种机器学习和深度学习方法来比较性能和预测错误率,此外,研究人员还比较了添加新闻文本作为特征和作为标签模型的效果。并利用FinBERT词嵌入构建长短期记忆(LSTM),建立了新闻情感分析的专用模型。从我们的观察来看,很明显,基于深度学习的模型比机器学习的模型表现得更好。作者表明,从新闻来源中提取的信息更善于预测价格走势,而不是预测价格走势的方向。无新闻时表现最好的模型是RMSE为0.0259的LSTM,而有新闻时表现最好的模型是带有独立的LSTM, LSTM模型的RMSE为0.0220。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Medical Image Classification using a Many to Many Relation, Multilayered Fuzzy Systems and AI Aspects of Symmetry in Petri Nets Chaos in Order: Applying ML, NLP, and Chaos Theory in Open Source Intelligence for Counter-Terrorism Combinatorial Optimization of Engineering Systems based on Diagrammatic Design Federated Learning: Attacks and Defenses, Rewards, Energy Efficiency: Past, Present and Future
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1