{"title":"基于Elo评级和基于机器学习的网球比赛结果预测方法的比较评估","authors":"Rory Bunker, Calvin Yeung, Teo Susnjak, Chester Espie, Keisuke Fujii","doi":"10.1177/17543371231212235","DOIUrl":null,"url":null,"abstract":"Elo ratings-based methods, including the recently proposed Weighted Elo method, have been found to perform well when forecasting tennis match results, however, whether they can outperform machine learning (ML) has not been established. In this study, a comparative evaluation of the two types of methods is conducted using the Sports Result Prediction CRISP-DM experimental framework. The first full year of mens ATP tennis data (2006), in a dataset containing matches from 2005 to 2020, was set to be the initial training set and 1 year of data was incrementally added to this set to predict 14 test years, from 2007 to 2020. Features were ranked based on their average rank across five feature selection techniques. It was found that, of the five ML models, Alternating Decision Trees (ADTrees) and Logistic Regression achieved higher accuracies than Elo ratings and similar accuracies to predictions derived from betting odds. Furthermore, ADTrees show potential in this domain, with solid performance achieved with an interpretable decision tree that allows for variation in the average betting odds difference threshold.","PeriodicalId":20674,"journal":{"name":"Proceedings of the Institution of Mechanical Engineers, Part P: Journal of Sports Engineering and Technology","volume":" 638","pages":"0"},"PeriodicalIF":1.1000,"publicationDate":"2023-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A comparative evaluation of Elo ratings- and machine learning-based methods for tennis match result prediction\",\"authors\":\"Rory Bunker, Calvin Yeung, Teo Susnjak, Chester Espie, Keisuke Fujii\",\"doi\":\"10.1177/17543371231212235\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Elo ratings-based methods, including the recently proposed Weighted Elo method, have been found to perform well when forecasting tennis match results, however, whether they can outperform machine learning (ML) has not been established. In this study, a comparative evaluation of the two types of methods is conducted using the Sports Result Prediction CRISP-DM experimental framework. The first full year of mens ATP tennis data (2006), in a dataset containing matches from 2005 to 2020, was set to be the initial training set and 1 year of data was incrementally added to this set to predict 14 test years, from 2007 to 2020. Features were ranked based on their average rank across five feature selection techniques. It was found that, of the five ML models, Alternating Decision Trees (ADTrees) and Logistic Regression achieved higher accuracies than Elo ratings and similar accuracies to predictions derived from betting odds. Furthermore, ADTrees show potential in this domain, with solid performance achieved with an interpretable decision tree that allows for variation in the average betting odds difference threshold.\",\"PeriodicalId\":20674,\"journal\":{\"name\":\"Proceedings of the Institution of Mechanical Engineers, Part P: Journal of Sports Engineering and Technology\",\"volume\":\" 638\",\"pages\":\"0\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2023-11-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Institution of Mechanical Engineers, Part P: Journal of Sports Engineering and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/17543371231212235\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENGINEERING, MECHANICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Institution of Mechanical Engineers, Part P: Journal of Sports Engineering and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/17543371231212235","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, MECHANICAL","Score":null,"Total":0}
A comparative evaluation of Elo ratings- and machine learning-based methods for tennis match result prediction
Elo ratings-based methods, including the recently proposed Weighted Elo method, have been found to perform well when forecasting tennis match results, however, whether they can outperform machine learning (ML) has not been established. In this study, a comparative evaluation of the two types of methods is conducted using the Sports Result Prediction CRISP-DM experimental framework. The first full year of mens ATP tennis data (2006), in a dataset containing matches from 2005 to 2020, was set to be the initial training set and 1 year of data was incrementally added to this set to predict 14 test years, from 2007 to 2020. Features were ranked based on their average rank across five feature selection techniques. It was found that, of the five ML models, Alternating Decision Trees (ADTrees) and Logistic Regression achieved higher accuracies than Elo ratings and similar accuracies to predictions derived from betting odds. Furthermore, ADTrees show potential in this domain, with solid performance achieved with an interpretable decision tree that allows for variation in the average betting odds difference threshold.
期刊介绍:
The Journal of Sports Engineering and Technology covers the development of novel sports apparel, footwear, and equipment; and the materials, instrumentation, and processes that make advances in sports possible.