Gabriel Rodrigues Palma, Mariusz Skoczeń, Phil Maguire
{"title":"Combining supervised and unsupervised learning methods to predict financial market movements","authors":"Gabriel Rodrigues Palma, Mariusz Skoczeń, Phil Maguire","doi":"arxiv-2409.03762","DOIUrl":null,"url":null,"abstract":"The decisions traders make to buy or sell an asset depend on various\nanalyses, with expertise required to identify patterns that can be exploited\nfor profit. In this paper we identify novel features extracted from emergent\nand well-established financial markets using linear models and Gaussian Mixture\nModels (GMM) with the aim of finding profitable opportunities. We used\napproximately six months of data consisting of minute candles from the Bitcoin,\nPepecoin, and Nasdaq markets to derive and compare the proposed novel features\nwith commonly used ones. These features were extracted based on the previous 59\nminutes for each market and used to identify predictions for the hour ahead. We\nexplored the performance of various machine learning strategies, such as Random\nForests (RF) and K-Nearest Neighbours (KNN) to classify market movements. A\nnaive random approach to selecting trading decisions was used as a benchmark,\nwith outcomes assumed to be equally likely. We used a temporal cross-validation\napproach using test sets of 40%, 30% and 20% of total hours to evaluate the\nlearning algorithms' performances. Our results showed that filtering the time\nseries facilitates algorithms' generalisation. The GMM filtering approach\nrevealed that the KNN and RF algorithms produced higher average returns than\nthe random algorithm.","PeriodicalId":501139,"journal":{"name":"arXiv - QuantFin - Statistical Finance","volume":"9 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuantFin - Statistical Finance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.03762","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The decisions traders make to buy or sell an asset depend on various
analyses, with expertise required to identify patterns that can be exploited
for profit. In this paper we identify novel features extracted from emergent
and well-established financial markets using linear models and Gaussian Mixture
Models (GMM) with the aim of finding profitable opportunities. We used
approximately six months of data consisting of minute candles from the Bitcoin,
Pepecoin, and Nasdaq markets to derive and compare the proposed novel features
with commonly used ones. These features were extracted based on the previous 59
minutes for each market and used to identify predictions for the hour ahead. We
explored the performance of various machine learning strategies, such as Random
Forests (RF) and K-Nearest Neighbours (KNN) to classify market movements. A
naive random approach to selecting trading decisions was used as a benchmark,
with outcomes assumed to be equally likely. We used a temporal cross-validation
approach using test sets of 40%, 30% and 20% of total hours to evaluate the
learning algorithms' performances. Our results showed that filtering the time
series facilitates algorithms' generalisation. The GMM filtering approach
revealed that the KNN and RF algorithms produced higher average returns than
the random algorithm.