Jordan Truman Paul Noel, Vinicius Prado da Fonseca, Amilcar Soares
{"title":"A Comprehensive Data Pipeline for Comparing the Effects of Momentum on Sports Leagues","authors":"Jordan Truman Paul Noel, Vinicius Prado da Fonseca, Amilcar Soares","doi":"10.3390/data9020029","DOIUrl":null,"url":null,"abstract":"Momentum has been a consistently studied aspect of sports science for decades. Among the established literature, there has, at times, been a discrepancy between conclusions. However, if momentum is indeed an actual phenomenon, it would affect all aspects of sports, from player evaluation to pre-game prediction and betting. Therefore, using momentum-based features that quantify a team’s linear trend of play, we develop a data pipeline that uses a small sample of recent games to assess teams’ quality of play and measure the predictive power of momentum-based features versus the predictive power of more traditional frequency-based features across several leagues using several machine learning techniques. More precisely, we use our pipeline to determine the differences in the predictive power of momentum-based features and standard statistical features for the National Hockey League (NHL), National Basketball Association (NBA), and five major first-division European football leagues. Our findings show little evidence that momentum has superior predictive power in the NBA. Still, we found some instances of the effects of momentum on the NHL that produced better pre-game predictors, whereas we view a similar trend in European football/soccer. Our results indicate that momentum-based features combined with frequency-based features could improve pre-game prediction models and that, in the future, momentum should be studied more from a feature/performance indicator point-of-view and less from the view of the dependence of sequential outcomes, thus attempting to distance momentum from the binary view of winning and losing.","PeriodicalId":502371,"journal":{"name":"Data","volume":"26 20","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/data9020029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Momentum has been a consistently studied aspect of sports science for decades. Among the established literature, there has, at times, been a discrepancy between conclusions. However, if momentum is indeed an actual phenomenon, it would affect all aspects of sports, from player evaluation to pre-game prediction and betting. Therefore, using momentum-based features that quantify a team’s linear trend of play, we develop a data pipeline that uses a small sample of recent games to assess teams’ quality of play and measure the predictive power of momentum-based features versus the predictive power of more traditional frequency-based features across several leagues using several machine learning techniques. More precisely, we use our pipeline to determine the differences in the predictive power of momentum-based features and standard statistical features for the National Hockey League (NHL), National Basketball Association (NBA), and five major first-division European football leagues. Our findings show little evidence that momentum has superior predictive power in the NBA. Still, we found some instances of the effects of momentum on the NHL that produced better pre-game predictors, whereas we view a similar trend in European football/soccer. Our results indicate that momentum-based features combined with frequency-based features could improve pre-game prediction models and that, in the future, momentum should be studied more from a feature/performance indicator point-of-view and less from the view of the dependence of sequential outcomes, thus attempting to distance momentum from the binary view of winning and losing.
几十年来,动量一直是体育科学研究的一个方面。在已有的文献中,有时会出现结论不一致的情况。然而,如果动量确实是一种实际现象,那么它将影响体育的方方面面,从球员评估到赛前预测和投注。因此,我们使用基于动量的特征来量化一支球队的线性比赛趋势,开发了一个数据管道,使用近期比赛的小样本来评估球队的比赛质量,并使用几种机器学习技术来衡量基于动量的特征的预测能力与几个联赛中更传统的基于频率的特征的预测能力。更准确地说,我们使用我们的管道来确定基于动量特征的预测能力与标准统计特征在美国国家冰球联盟(NHL)、美国国家篮球协会(NBA)和欧洲五大甲级足球联赛中的差异。我们的研究结果表明,几乎没有证据表明动量在 NBA 中具有更强的预测能力。尽管如此,我们还是发现了一些动量对国家曲棍球协会的影响,这些影响产生了更好的赛前预测结果,而我们在欧洲足球/橄榄球中也发现了类似的趋势。我们的研究结果表明,基于动量的特征与基于频率的特征相结合,可以改进赛前预测模型,而且未来应更多地从特征/性能指标的角度研究动量,而不是从连续结果的依赖性角度研究动量,从而尝试将动量与二元胜负观拉开距离。