Judging the judges: evaluating the accuracy and national bias of international gymnastics judges
Authors: Sandro Heiniger, Hugues Mercier
Journal: Journal of Quantitative Analysis in Sports, volume 6 1, pages 289-305 (impact factor 1.1; JCR Q3, Social Sciences, Mathematical Methods)
DOI: 10.1515/jqas-2019-0113 (https://doi.org/10.1515/jqas-2019-0113)
Publication date: 2021-10-19
Publication type: Journal Article
Abstract: We design, describe and implement a statistical engine to analyze the performance of gymnastics judges with three objectives: (1) provide constructive feedback to judges, executive committees and national federations; (2) assign the best judges to the most important competitions; (3) detect bias and persistent misjudging. Judging a gymnastics routine is a random process, and we model this process using heteroscedastic random variables. The developed marking score scales the difference between the mark of a judge and the true performance level of a gymnast as a function of the intrinsic judging error variability estimated from historical data for each apparatus. This dependence between judging variability and performance quality has never been properly studied. We leverage the intrinsic judging error variability and the marking score to detect outlier marks and study the national bias of judges favoring athletes of the same nationality. We also study ranking scores assessing to what extent judges rate gymnasts in the correct order. Our main observation is that there are significant differences between the best and worst judges, both in terms of accuracy and national bias. The insights from this work have led to recommendations and rule changes at the Fédération Internationale de Gymnastique.
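The idea of scaling a judge's error by a performance-dependent variability can be illustrated with a minimal Python sketch. It is not the authors' implementation: the exponential form of `sigma`, its parameters `a` and `b`, the use of the panel median as a stand-in for the true performance level, and the outlier threshold of 2 are all assumptions made here for illustration only.

```python
import math
from statistics import median


def sigma(control_score, a=2.0, b=-0.3):
    """Assumed judging-error standard deviation at a given performance level.

    The exponential form and the values of a and b are illustrative
    placeholders, not estimates from the paper or from real data.
    """
    return a * math.exp(b * control_score)


def control_scores(panel_marks):
    """Approximate each routine's true level by the panel median (an assumption)."""
    return [median(marks) for marks in panel_marks]


def marking_score(judge_marks, controls, sigma_fn=sigma):
    """Root mean square of the judge's errors, each divided by sigma at that level."""
    scaled = [(m - c) / sigma_fn(c) for m, c in zip(judge_marks, controls)]
    return math.sqrt(sum(z * z for z in scaled) / len(scaled))


def outlier_indices(judge_marks, controls, threshold=2.0, sigma_fn=sigma):
    """Indices of marks whose absolute error exceeds threshold * sigma."""
    return [
        i
        for i, (m, c) in enumerate(zip(judge_marks, controls))
        if abs(m - c) > threshold * sigma_fn(c)
    ]


# Hypothetical panel: two routines, five judges each.
panel = [[9.0, 9.1, 9.0, 8.9, 9.2], [7.0, 7.3, 6.8, 7.1, 7.0]]
controls = control_scores(panel)
fifth_judge = [marks[4] for marks in panel]
print(marking_score(fifth_judge, controls))
print(outlier_indices(fifth_judge, controls))
```

Because `sigma` shrinks as the assumed performance level rises, the same absolute error counts more heavily against a judge on a strong routine than on a weak one, which is the heteroscedastic scaling the abstract describes.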
About the journal:
The Journal of Quantitative Analysis in Sports (JQAS), an official journal of the American Statistical Association, publishes timely, high-quality peer-reviewed research on the quantitative aspects of professional and amateur sports, including collegiate and Olympic competition. The scope of application reflects the increasing demand for novel methods to analyze and understand data in the growing field of sports analytics. Articles come from a wide variety of sports and diverse perspectives, and address topics such as game outcome models, measurement and evaluation of player performance, tournament structure, analysis of rules and adjudication, within-game strategy, analysis of sporting technologies, and player and team ranking methods. JQAS seeks to publish manuscripts that demonstrate original ways of approaching problems, develop cutting edge methods, and apply innovative thinking to solve difficult challenges in sports contexts. JQAS brings together researchers from various disciplines, including statistics, operations research, machine learning, scientific computing, econometrics, and sports management.