Zi Qi, Zhan Bu, Xi Xiong, Hongliang Sun, Jie Cao, Chengcui Zhang
{"title":"A Stock Index Prediction Framework: Integrating Technical and Topological Mesoscale Indicators","authors":"Zi Qi, Zhan Bu, Xi Xiong, Hongliang Sun, Jie Cao, Chengcui Zhang","doi":"10.1109/IRI.2019.00018","DOIUrl":null,"url":null,"abstract":"With its growing importance in predicting future stock trends, nearly everyone watches the Chinese financial market. Traditional approaches typically employ a variety of statistical techniques or machine learning methods for stock index predicting, and often rely on analysis of technical indicators. In the existing literature, researchers rarely attempt to predict the stock index by using the topological features of temporal stock correlation networks. Keeping this in mind, we first calculate the correlation coefficient of any two stocks using the classic Visibility Graph Model (VGM). Then, by using the Planar Maximally Filtered Graph (PMFG) method, we generate temporal stock correlation networks from historical stock quantitative data. Next, we choose fourteen frequently adopted Technical Indicators (TIs) and five Topological Mesoscale Indicators (TMIs, extracted from the temporal stock correlation networks) as predictive variables of six machine learning classifiers. To improve forecast accuracy and to address potential overfitting problems, we modify the classic Sequential Backward Selection (SBS) algorithm to learn the most significant predictive variables for each classifier. We then conduct a series of comprehensive experiments on three Chinese stock indices to validate our prediction framework's performance. Experimental results show that using a combination of TIs and TMIs significantly improves forecast accuracy over conventional methods that use either TIs or TMIs exclusively.","PeriodicalId":295028,"journal":{"name":"2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI.2019.00018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
With its growing importance in predicting future stock trends, nearly everyone watches the Chinese financial market. Traditional approaches typically employ a variety of statistical techniques or machine learning methods for stock index predicting, and often rely on analysis of technical indicators. In the existing literature, researchers rarely attempt to predict the stock index by using the topological features of temporal stock correlation networks. Keeping this in mind, we first calculate the correlation coefficient of any two stocks using the classic Visibility Graph Model (VGM). Then, by using the Planar Maximally Filtered Graph (PMFG) method, we generate temporal stock correlation networks from historical stock quantitative data. Next, we choose fourteen frequently adopted Technical Indicators (TIs) and five Topological Mesoscale Indicators (TMIs, extracted from the temporal stock correlation networks) as predictive variables of six machine learning classifiers. To improve forecast accuracy and to address potential overfitting problems, we modify the classic Sequential Backward Selection (SBS) algorithm to learn the most significant predictive variables for each classifier. We then conduct a series of comprehensive experiments on three Chinese stock indices to validate our prediction framework's performance. Experimental results show that using a combination of TIs and TMIs significantly improves forecast accuracy over conventional methods that use either TIs or TMIs exclusively.