{"title":"Ordinal versus nominal regression models and the problem of correctly predicting draws in soccer","authors":"L. M. Hvattum","doi":"10.1515/ijcss-2017-0004","DOIUrl":null,"url":null,"abstract":"Abstract Ordinal regression models are frequently used in academic literature to model outcomes of soccer matches, and seem to be preferred over nominal models. One reason is that, obviously, there is a natural hierarchy of outcomes, with victory being preferred to a draw and a draw being preferred to a loss. However, the often used ordinal models have an assumption of proportional odds: the influence of an independent variable on the log odds is the same for each outcome. This paper illustrates how ordinal regression models therefore fail to fully utilize independent variables that contain information about the likelihood of matches ending in a draw. However, in practice, this flaw does not seem to have a substantial effect on the predictive accuracy of an ordered logit regression model when compared to a multinomial logistic regression model.","PeriodicalId":38466,"journal":{"name":"International Journal of Computer Science in Sport","volume":"16 1","pages":"50 - 64"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/ijcss-2017-0004","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computer Science in Sport","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/ijcss-2017-0004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 4
Abstract
Abstract Ordinal regression models are frequently used in academic literature to model outcomes of soccer matches, and seem to be preferred over nominal models. One reason is that, obviously, there is a natural hierarchy of outcomes, with victory being preferred to a draw and a draw being preferred to a loss. However, the often used ordinal models have an assumption of proportional odds: the influence of an independent variable on the log odds is the same for each outcome. This paper illustrates how ordinal regression models therefore fail to fully utilize independent variables that contain information about the likelihood of matches ending in a draw. However, in practice, this flaw does not seem to have a substantial effect on the predictive accuracy of an ordered logit regression model when compared to a multinomial logistic regression model.