A Deep Learning Method for the Detection and Compensation of Outlier Events in Stock Data

IF 2.6 3区 工程技术 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Electronics Pub Date : 2022-10-26 DOI:10.3390/electronics11213465
Vashalen Naidoo, Shengzhi Du
{"title":"A Deep Learning Method for the Detection and Compensation of Outlier Events in Stock Data","authors":"Vashalen Naidoo, Shengzhi Du","doi":"10.3390/electronics11213465","DOIUrl":null,"url":null,"abstract":"The stock price is a culmination of numerous factors that are not necessarily quantifiable and significantly affected by anomalies. The forecasting accuracy of stock prices is negatively affected by these anomalies. However, very few methods are available for detecting, modelling, and compensating for anomalies in financial time series given the critical roles of better management of funds and accurate forecasting of anomalies. Time series data are a data type that changes over a defined time interval. Each value in the data set has some relation to the previous values in the series. This attribute of time series data allows us to predict the values that will follow in the series. Typical prediction models are limited to following the patterns in the data set without being able to compensate for anomalous periods. This research will attempt to find a machine learning method to detect outliers and then compensate for these detections in the prediction made. This concept was previously unimplemented, and therefore, it will make use of theoretical work on market forecasting, outliers and their effects, and machine learning methods. The ideas implemented in the paper are based upon the efficient market hypothesis (EMH), in which the stock price reflects knowledge about the market. The EMH hypothesis cannot account for consumer sentiment towards a stock. This sentiment could produce anomalies in stock data that have a significant influence on the movement of the stock market. Therefore, the detection and compensation of outliers may improve the predictions made on stock movements. This paper proposes a deep learning method that consists of two sequential stages. The first stage is an outlier detection model based on a long short-term memory (LSTM) network auto-encoder that can determine if an outlier event has occurred and then create an associated value of this occurrence for the next stage. The second stage of the proposed method uses a higher-order neural network (HONN) model to make a prediction based on the output of the first stage and the stock time series data. Real stock data and standalone prediction models are used to validate this method. This method is superior at predicting stock time series data by compensating for outlier events. The improvement is quantifiable if the data set contains an adequate amount of anomalous periods. We may further apply the proposed method of compensating for outliers in combination with other financial time series prediction methods to offer further improvements and stability.","PeriodicalId":11646,"journal":{"name":"Electronics","volume":"39 1","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Electronics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3390/electronics11213465","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

The stock price is a culmination of numerous factors that are not necessarily quantifiable and significantly affected by anomalies. The forecasting accuracy of stock prices is negatively affected by these anomalies. However, very few methods are available for detecting, modelling, and compensating for anomalies in financial time series given the critical roles of better management of funds and accurate forecasting of anomalies. Time series data are a data type that changes over a defined time interval. Each value in the data set has some relation to the previous values in the series. This attribute of time series data allows us to predict the values that will follow in the series. Typical prediction models are limited to following the patterns in the data set without being able to compensate for anomalous periods. This research will attempt to find a machine learning method to detect outliers and then compensate for these detections in the prediction made. This concept was previously unimplemented, and therefore, it will make use of theoretical work on market forecasting, outliers and their effects, and machine learning methods. The ideas implemented in the paper are based upon the efficient market hypothesis (EMH), in which the stock price reflects knowledge about the market. The EMH hypothesis cannot account for consumer sentiment towards a stock. This sentiment could produce anomalies in stock data that have a significant influence on the movement of the stock market. Therefore, the detection and compensation of outliers may improve the predictions made on stock movements. This paper proposes a deep learning method that consists of two sequential stages. The first stage is an outlier detection model based on a long short-term memory (LSTM) network auto-encoder that can determine if an outlier event has occurred and then create an associated value of this occurrence for the next stage. The second stage of the proposed method uses a higher-order neural network (HONN) model to make a prediction based on the output of the first stage and the stock time series data. Real stock data and standalone prediction models are used to validate this method. This method is superior at predicting stock time series data by compensating for outlier events. The improvement is quantifiable if the data set contains an adequate amount of anomalous periods. We may further apply the proposed method of compensating for outliers in combination with other financial time series prediction methods to offer further improvements and stability.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
股票数据异常事件检测与补偿的深度学习方法
股票价格是众多因素的顶点,这些因素不一定是可量化的,而且受到异常现象的显著影响。这些异常对股价预测的准确性有负向影响。然而,考虑到更好的资金管理和准确预测异常的关键作用,很少有方法可用于检测、建模和补偿金融时间序列中的异常。时间序列数据是在定义的时间间隔内发生变化的数据类型。数据集中的每个值都与序列中的前一个值有一定的关系。时间序列数据的这个属性使我们能够预测序列中接下来的值。典型的预测模型仅限于遵循数据集中的模式,而不能补偿异常周期。本研究将尝试找到一种机器学习方法来检测异常值,然后在预测中补偿这些检测。这个概念以前没有实现过,因此,它将利用市场预测、异常值及其影响和机器学习方法的理论工作。本文实施的思想是基于有效市场假设(EMH),其中股票价格反映了对市场的了解。有效市场假说不能解释消费者对股票的情绪。这种情绪可能导致股票数据出现异常,对股市走势产生重大影响。因此,异常值的检测和补偿可以改善对股票走势的预测。本文提出了一种由两个连续阶段组成的深度学习方法。第一阶段是基于长短期记忆(LSTM)网络自动编码器的离群点检测模型,该模型可以确定是否发生了离群点事件,然后为下一阶段创建该事件的关联值。该方法的第二阶段使用高阶神经网络(HONN)模型根据第一阶段的输出和库存时间序列数据进行预测。用实际股票数据和独立预测模型对该方法进行了验证。该方法通过对异常值事件的补偿,在预测股票时间序列数据方面具有优越性。如果数据集包含足够数量的异常周期,则改进是可量化的。我们可以进一步将所提出的异常值补偿方法与其他金融时间序列预测方法相结合,以提供进一步的改进和稳定性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Electronics
Electronics Computer Science-Computer Networks and Communications
CiteScore
1.10
自引率
10.30%
发文量
3515
审稿时长
16.71 days
期刊介绍: Electronics (ISSN 2079-9292; CODEN: ELECGJ) is an international, open access journal on the science of electronics and its applications published quarterly online by MDPI.
期刊最新文献
A Markov Chain Replacement Strategy for Surrogate Identifiers: Minimizing Re-Identification Risk While Preserving Text Reuse. Data-Driven Maturity Level Evaluation for Cardiomyocytes Derived from Human Pluripotent Stem Cells (Invited Paper). Accurate Medical Vial Identification Through Mixed Reality: A HoloLens 2 Implementation. A Design Review for Biomedical Wireless Power Transfer Systems with a Three-Coil Inductive Link through a Case Study for NICU Applications. A Deep Reinforcement Learning Method Based on a Transformer Model for the Flexible Job Shop Scheduling Problem
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1