首页 > 最新文献

Journal of Quantitative Analysis in Sports最新文献

英文 中文
Clustering of football players based on performance data and aggregated clustering validity indexes 基于成绩数据和聚类效度指标的足球运动员聚类
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2022-04-20 DOI: 10.1515/jqas-2022-0037
Serhat Emre Akhanli, C. Hennig
Abstract We analyse football (soccer) player performance data with mixed type variables from the 2014-15 season of eight European major leagues. We cluster these data based on a tailor-made dissimilarity measure. In order to decide between the many available clustering methods and to choose an appropriate number of clusters, we use the approach by Akhanli and Hennig (2020. “Comparing Clusterings and Numbers of Clusters by Aggregation of Calibrated Clustering Validity Indexes.” Statistics and Computing 30 (5): 1523–44). This is based on several validation criteria that refer to different desirable characteristics of a clustering. These characteristics are chosen based on the aim of clustering, and this allows to define a suitable validation index as weighted average of calibrated individual indexes measuring the desirable features. We derive two different clusterings. The first one is a partition of the data set into major groups of essentially different players, which can be used for the analysis of a team’s composition. The second one divides the data set into many small clusters (with 10 players on average), which can be used for finding players with a very similar profile to a given player. It is discussed in depth what characteristics are desirable for these clusterings. Weighting the criteria for the second clustering is informed by a survey of football experts.
摘要本文采用混合类型变量分析了2014-15赛季欧洲八大联赛足球运动员的表现数据。我们根据量身定制的不相似性度量对这些数据进行聚类。为了在许多可用的聚类方法之间做出决定并选择适当数量的聚类,我们使用了Akhanli和Hennig(2020)的方法。通过校准聚类有效性指标的聚合来比较聚类和聚类数量。统计与计算30(5):1523-44。这是基于几个验证标准,这些标准涉及聚类的不同期望特征。这些特征是根据聚类的目的选择的,这允许定义一个合适的验证指数作为衡量理想特征的校准单个指数的加权平均值。我们得到两种不同的聚类。第一种方法是将数据集划分为本质上不同的球员的主要组,这可以用于分析球队的组成。第二种方法将数据集分成许多小集群(平均10个玩家),这可以用于寻找与给定玩家非常相似的玩家。深入讨论了这些聚类所需的特征。对第二次聚类的标准进行加权是通过对足球专家的调查得出的。
{"title":"Clustering of football players based on performance data and aggregated clustering validity indexes","authors":"Serhat Emre Akhanli, C. Hennig","doi":"10.1515/jqas-2022-0037","DOIUrl":"https://doi.org/10.1515/jqas-2022-0037","url":null,"abstract":"Abstract We analyse football (soccer) player performance data with mixed type variables from the 2014-15 season of eight European major leagues. We cluster these data based on a tailor-made dissimilarity measure. In order to decide between the many available clustering methods and to choose an appropriate number of clusters, we use the approach by Akhanli and Hennig (2020. “Comparing Clusterings and Numbers of Clusters by Aggregation of Calibrated Clustering Validity Indexes.” Statistics and Computing 30 (5): 1523–44). This is based on several validation criteria that refer to different desirable characteristics of a clustering. These characteristics are chosen based on the aim of clustering, and this allows to define a suitable validation index as weighted average of calibrated individual indexes measuring the desirable features. We derive two different clusterings. The first one is a partition of the data set into major groups of essentially different players, which can be used for the analysis of a team’s composition. The second one divides the data set into many small clusters (with 10 players on average), which can be used for finding players with a very similar profile to a given player. It is discussed in depth what characteristics are desirable for these clusterings. Weighting the criteria for the second clustering is informed by a survey of football experts.","PeriodicalId":16925,"journal":{"name":"Journal of Quantitative Analysis in Sports","volume":"12 1","pages":"103 - 123"},"PeriodicalIF":0.8,"publicationDate":"2022-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87941458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Optical tracking in team sports 团队运动中的光学跟踪
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2022-03-01 DOI: 10.1515/jqas-2020-0088
Pegah Rahimian, László Toka
Abstract Sports analysis has gained paramount importance for coaches, scouts, and fans. Recently, computer vision researchers have taken on the challenge of collecting the necessary data by proposing several methods of automatic player and ball tracking. Building on the gathered tracking data, data miners are able to perform quantitative analysis on the performance of players and teams. With this survey, our goal is to provide a basic understanding for quantitative data analysts about the process of creating the input data and the characteristics thereof. Thus, we summarize the recent methods of optical tracking by providing a comprehensive taxonomy of conventional and deep learning methods, separately. Moreover, we discuss the preprocessing steps of tracking, the most common challenges in this domain, and the application of tracking data to sports teams. Finally, we compare the methods by their cost and limitations, and conclude the work by highlighting potential future research directions.
体育分析对于教练、球探和球迷来说已经变得至关重要。最近,计算机视觉研究人员通过提出几种自动跟踪球员和球的方法,接受了收集必要数据的挑战。基于收集到的跟踪数据,数据挖掘者能够对球员和球队的表现进行定量分析。通过这项调查,我们的目标是为定量数据分析师提供关于创建输入数据的过程及其特征的基本了解。因此,我们通过提供传统和深度学习方法的综合分类,分别总结了最近的光学跟踪方法。此外,我们还讨论了跟踪的预处理步骤,该领域最常见的挑战,以及跟踪数据在运动队中的应用。最后,对各种方法的成本和局限性进行了比较,并对工作进行了总结,指出了未来可能的研究方向。
{"title":"Optical tracking in team sports","authors":"Pegah Rahimian, László Toka","doi":"10.1515/jqas-2020-0088","DOIUrl":"https://doi.org/10.1515/jqas-2020-0088","url":null,"abstract":"Abstract Sports analysis has gained paramount importance for coaches, scouts, and fans. Recently, computer vision researchers have taken on the challenge of collecting the necessary data by proposing several methods of automatic player and ball tracking. Building on the gathered tracking data, data miners are able to perform quantitative analysis on the performance of players and teams. With this survey, our goal is to provide a basic understanding for quantitative data analysts about the process of creating the input data and the characteristics thereof. Thus, we summarize the recent methods of optical tracking by providing a comprehensive taxonomy of conventional and deep learning methods, separately. Moreover, we discuss the preprocessing steps of tracking, the most common challenges in this domain, and the application of tracking data to sports teams. Finally, we compare the methods by their cost and limitations, and conclude the work by highlighting potential future research directions.","PeriodicalId":16925,"journal":{"name":"Journal of Quantitative Analysis in Sports","volume":"16 1","pages":"35 - 57"},"PeriodicalIF":0.8,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84821423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
MSE-optimal K-factor of the Elo rating system for round-robin tournament 循环赛Elo评分系统的mse最优k因子
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2022-03-01 DOI: 10.1515/jqas-2021-0079
Victor S. Chan
Abstract The Elo rating system contains a coefficient called the K-factor which governs the amount of change to the updated ratings and is often determined by empirical or heuristic means. Theoretical studies on the K-factor have been sparse and not much is known about the pertinent factors that impact its appropriate values in applications. This paper has two main goals: to present a new formulation of the K-factor that is optimal with respect to the mean-squared-error (MSE) criterion in a round-robin tournament setting and to investigate the effects of the relevant variables, including the number of tournament participants n, on the optimal K-factor (based on the model-averaged MSE). It is found that n and the variability of the deviation between the true rating and the pre-tournament rating have a strong influence on the optimal K-factor. Comparisons between the MSE-optimal K-factor and the K-factors from Elo and from the US Chess Federation as a function of n are also provided. Although the results are applicable to other sports in similar settings, the study focuses on chess and makes use of the rating data and the K-factor values from the chess world.
Elo评级系统包含一个称为k因子的系数,该系数控制更新评级的变化量,通常由经验或启发式方法确定。关于k因子的理论研究很少,对于影响其在应用中的适当值的相关因素所知不多。本文有两个主要目标:在循环赛设置中,提出一种关于均方误差(MSE)标准的最佳k因子的新公式,并研究包括比赛参与者数量n在内的相关变量对最优k因子(基于模型平均MSE)的影响。研究发现,n和真实评分与赛前评分偏差的可变性对最优k因子有很强的影响。还提供了mse最优k因子与来自Elo和美国国际象棋联合会的k因子作为n的函数的比较。虽然研究结果也适用于类似环境下的其他体育项目,但本研究的重点是国际象棋,并利用了国际象棋世界的评级数据和k因子值。
{"title":"MSE-optimal K-factor of the Elo rating system for round-robin tournament","authors":"Victor S. Chan","doi":"10.1515/jqas-2021-0079","DOIUrl":"https://doi.org/10.1515/jqas-2021-0079","url":null,"abstract":"Abstract The Elo rating system contains a coefficient called the K-factor which governs the amount of change to the updated ratings and is often determined by empirical or heuristic means. Theoretical studies on the K-factor have been sparse and not much is known about the pertinent factors that impact its appropriate values in applications. This paper has two main goals: to present a new formulation of the K-factor that is optimal with respect to the mean-squared-error (MSE) criterion in a round-robin tournament setting and to investigate the effects of the relevant variables, including the number of tournament participants n, on the optimal K-factor (based on the model-averaged MSE). It is found that n and the variability of the deviation between the true rating and the pre-tournament rating have a strong influence on the optimal K-factor. Comparisons between the MSE-optimal K-factor and the K-factors from Elo and from the US Chess Federation as a function of n are also provided. Although the results are applicable to other sports in similar settings, the study focuses on chess and makes use of the rating data and the K-factor values from the chess world.","PeriodicalId":16925,"journal":{"name":"Journal of Quantitative Analysis in Sports","volume":"53 1","pages":"59 - 72"},"PeriodicalIF":0.8,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76581251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating the performance of elite level volleyball players 优秀排球运动员竞技水平的评价
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2022-02-23 DOI: 10.1515/jqas-2021-0056
G. Fellingham
Abstract Evaluation of individuals in a team sport setting is inherently difficult. The level of play of one individual is fundamentally tied to the level of play of the teammates. One way to think about evaluation of individuals is to ‘insert’ the posterior distribution of the parameter that measures individual play into an ‘average’ team, and see how the probability of success (or failure) changes. Using a Bayesian hierarchical logistic model, we can estimate both the average contribution to success of various positions, and the individual contribution of all the players in that position. In this paper, we use data from the 2018 World Championships in Volleyball to model both the position played and the players within each position. Using both the posterior distributions for the mean performance of the different positions, and the posterior distributions for the individual players, we can then estimate the change in the number of points scored for a team with a change from an average player to the individual under consideration. We compute both the points scored above average per set (PAAPS) and the points scored above average per 100 touches (PP100) for 168 men and 168 women playing five different positions. Contributions of the various position groups and of individual players within each position are evaluated and compared.
在团队运动环境中对个人进行评估本身就是困难的。一个人的竞技水平与队友的竞技水平息息相关。考虑个人评估的一种方法是将衡量个人发挥的参数的后验分布“插入”到“平均”团队中,并观察成功(或失败)的概率如何变化。利用贝叶斯层次逻辑模型,我们既可以估计各个位置对成功的平均贡献,也可以估计该位置上所有参与者的个人贡献。在本文中,我们使用2018年世界排球锦标赛的数据来模拟比赛位置和每个位置上的球员。使用不同位置的平均表现的后验分布和个体球员的后验分布,我们就可以估计从平均球员到个体球员的变化对球队得分的影响。我们计算了168名男子和168名女子在5个不同位置上的每一局高于平均得分(PAAPS)和每百次触球高于平均得分(PP100)。评估和比较不同位置组和每个位置内个人球员的贡献。
{"title":"Evaluating the performance of elite level volleyball players","authors":"G. Fellingham","doi":"10.1515/jqas-2021-0056","DOIUrl":"https://doi.org/10.1515/jqas-2021-0056","url":null,"abstract":"Abstract Evaluation of individuals in a team sport setting is inherently difficult. The level of play of one individual is fundamentally tied to the level of play of the teammates. One way to think about evaluation of individuals is to ‘insert’ the posterior distribution of the parameter that measures individual play into an ‘average’ team, and see how the probability of success (or failure) changes. Using a Bayesian hierarchical logistic model, we can estimate both the average contribution to success of various positions, and the individual contribution of all the players in that position. In this paper, we use data from the 2018 World Championships in Volleyball to model both the position played and the players within each position. Using both the posterior distributions for the mean performance of the different positions, and the posterior distributions for the individual players, we can then estimate the change in the number of points scored for a team with a change from an average player to the individual under consideration. We compute both the points scored above average per set (PAAPS) and the points scored above average per 100 touches (PP100) for 168 men and 168 women playing five different positions. Contributions of the various position groups and of individual players within each position are evaluated and compared.","PeriodicalId":16925,"journal":{"name":"Journal of Quantitative Analysis in Sports","volume":"1 1","pages":"15 - 34"},"PeriodicalIF":0.8,"publicationDate":"2022-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89615955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Frontmatter
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-11-26 DOI: 10.1515/jqas-2021-frontmatter4
{"title":"Frontmatter","authors":"","doi":"10.1515/jqas-2021-frontmatter4","DOIUrl":"https://doi.org/10.1515/jqas-2021-frontmatter4","url":null,"abstract":"","PeriodicalId":16925,"journal":{"name":"Journal of Quantitative Analysis in Sports","volume":"32 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2021-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90362442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The effects of draw restrictions on knockout tournaments 抽签限制对淘汰赛的影响
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-10-26 DOI: 10.1515/jqas-2022-0061
L'aszl'o Csat'o
Abstract The paper analyses how draw constraints influence the outcome of a knockout tournament. The research question is inspired by European club football competitions, where the organiser generally imposes an association constraint in the first round of the knockout phase: teams from the same country cannot be drawn against each other. Its effects are explored in both theoretical and simulation models. An association constraint in the first round(s) is found to increase the likelihood of same nation matchups to approximately the same extent in each subsequent round. If the favourite teams are concentrated in some associations, they will have a higher probability to win the tournament under this policy but the increase is less than linear if it is used in more rounds. Our results might explain the recent introduction of the association constraint for both the knockout round play-offs with 16 teams and the Round of 16 in the UEFA Europa League and UEFA Europa Conference League.
摘要:本文分析了平局限制对淘汰赛结果的影响。研究问题的灵感来自欧洲俱乐部足球比赛,在淘汰赛阶段的第一轮,组织者通常会施加协会限制:来自同一国家的球队不能相互抽签。在理论和仿真模型中探讨了其影响。研究发现,第一轮的关联约束在随后的每一轮中增加了相同国家配对的可能性,其程度大致相同。如果最受欢迎的球队集中在一些协会,在这种政策下,他们赢得比赛的可能性会更高,但如果在更多轮中使用这种政策,这种增长就不是线性的。我们的研究结果可以解释最近在16支球队的淘汰赛附加赛和欧联杯和欧联杯16强赛中引入的协会约束。
{"title":"The effects of draw restrictions on knockout tournaments","authors":"L'aszl'o Csat'o","doi":"10.1515/jqas-2022-0061","DOIUrl":"https://doi.org/10.1515/jqas-2022-0061","url":null,"abstract":"Abstract The paper analyses how draw constraints influence the outcome of a knockout tournament. The research question is inspired by European club football competitions, where the organiser generally imposes an association constraint in the first round of the knockout phase: teams from the same country cannot be drawn against each other. Its effects are explored in both theoretical and simulation models. An association constraint in the first round(s) is found to increase the likelihood of same nation matchups to approximately the same extent in each subsequent round. If the favourite teams are concentrated in some associations, they will have a higher probability to win the tournament under this policy but the increase is less than linear if it is used in more rounds. Our results might explain the recent introduction of the association constraint for both the knockout round play-offs with 16 teams and the Round of 16 in the UEFA Europa League and UEFA Europa Conference League.","PeriodicalId":16925,"journal":{"name":"Journal of Quantitative Analysis in Sports","volume":"22 1","pages":"227 - 239"},"PeriodicalIF":0.8,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90863890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Judging the judges: evaluating the accuracy and national bias of international gymnastics judges 评判裁判:评价国际体操裁判的准确性和国家偏见
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-10-19 DOI: 10.1515/jqas-2019-0113
Sandro Heiniger, Hugues Mercier
Abstract We design, describe and implement a statistical engine to analyze the performance of gymnastics judges with three objectives: (1) provide constructive feedback to judges, executive committees and national federations; (2) assign the best judges to the most important competitions; (3) detect bias and persistent misjudging. Judging a gymnastics routine is a random process, and we model this process using heteroscedastic random variables. The developed marking score scales the difference between the mark of a judge and the true performance level of a gymnast as a function of the intrinsic judging error variability estimated from historical data for each apparatus. This dependence between judging variability and performance quality has never been properly studied. We leverage the intrinsic judging error variability and the marking score to detect outlier marks and study the national bias of judges favoring athletes of the same nationality. We also study ranking scores assessing to what extent judges rate gymnasts in the correct order. Our main observation is that there are significant differences between the best and worst judges, both in terms of accuracy and national bias. The insights from this work have led to recommendations and rule changes at the Fédération Internationale de Gymnastique.
摘要:本文设计、描述并实现了一个统计引擎来分析体操裁判的表现,其目标有三个:(1)为裁判、执行委员会和国家联合会提供建设性的反馈;(2)选出最优秀的裁判参加最重要的比赛;(3)发现偏见和持续的误判。判断一个体操动作是一个随机过程,我们使用异方差随机变量对这个过程进行建模。所开发的评分分数将裁判的评分与体操运动员的真实表现水平之间的差异作为每个器械的历史数据估计的固有判断误差变异性的函数。判断可变性和性能质量之间的这种依赖关系从未得到过适当的研究。我们利用固有判断误差变异性和评分分数来检测异常值,研究裁判偏向同一国籍运动员的民族偏见。我们还研究了排名分数,以评估裁判在多大程度上以正确的顺序评价体操运动员。我们的主要观察是,在准确性和民族偏见方面,最好和最差的法官之间存在显著差异。这项工作的见解导致了国际体操联合会的建议和规则的变化。
{"title":"Judging the judges: evaluating the accuracy and national bias of international gymnastics judges","authors":"Sandro Heiniger, Hugues Mercier","doi":"10.1515/jqas-2019-0113","DOIUrl":"https://doi.org/10.1515/jqas-2019-0113","url":null,"abstract":"Abstract We design, describe and implement a statistical engine to analyze the performance of gymnastics judges with three objectives: (1) provide constructive feedback to judges, executive committees and national federations; (2) assign the best judges to the most important competitions; (3) detect bias and persistent misjudging. Judging a gymnastics routine is a random process, and we model this process using heteroscedastic random variables. The developed marking score scales the difference between the mark of a judge and the true performance level of a gymnast as a function of the intrinsic judging error variability estimated from historical data for each apparatus. This dependence between judging variability and performance quality has never been properly studied. We leverage the intrinsic judging error variability and the marking score to detect outlier marks and study the national bias of judges favoring athletes of the same nationality. We also study ranking scores assessing to what extent judges rate gymnasts in the correct order. Our main observation is that there are significant differences between the best and worst judges, both in terms of accuracy and national bias. The insights from this work have led to recommendations and rule changes at the Fédération Internationale de Gymnastique.","PeriodicalId":16925,"journal":{"name":"Journal of Quantitative Analysis in Sports","volume":"6 1","pages":"289 - 305"},"PeriodicalIF":0.8,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74966788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A Markov process approach to untangling intention versus execution in tennis 一种马尔可夫过程方法来解开网球中意图与执行的纠缠
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-10-04 DOI: 10.1515/jqas-2021-0077
Timothy C. Y. Chan, Douglas Fearing, Craig Fernandes, S. Kovalchik
Abstract Value functions are used in sports to determine the optimal action players should employ. However, most literature implicitly assumes that players can perform the prescribed action with known and fixed probability of success. The effect of varying this probability or, equivalently, “execution error” in implementing an action (e.g., hitting a tennis ball to a specific location on the court) on the design of optimal strategies, has received limited attention. In this paper, we develop a novel modeling framework based on Markov reward processes and Markov decision processes to investigate how execution error impacts a player’s value function and strategy in tennis. We power our models with hundreds of millions of simulated tennis shots with 3D ball and 2D player tracking data. We find that optimal shot selection strategies in tennis become more conservative as execution error grows, and that having perfect execution with the empirical shot selection strategy is roughly equivalent to choosing one or two optimal shots with average execution error. We find that execution error on backhand shots is more costly than on forehand shots, and that optimal shot selection on a serve return is more valuable than on any other shot, over all values of execution error.
在体育运动中,价值函数被用来确定运动员应该采取的最佳动作。然而,大多数文献隐含地假设玩家能够以已知和固定的成功概率执行规定的动作。在执行一个动作(例如,将网球打到球场上的特定位置)时,改变这种概率或“执行错误”对最佳策略设计的影响受到了有限的关注。本文建立了一个基于马尔可夫奖励过程和马尔可夫决策过程的建模框架,研究执行错误如何影响网球运动员的价值函数和策略。我们用数亿个模拟网球击球的3D球和2D球员跟踪数据来支持我们的模型。我们发现网球运动中最优击球选择策略随着执行误差的增大而趋于保守,使用经验击球选择策略获得完美的执行大致相当于在执行误差平均的情况下选择一个或两个最优击球。我们发现反手击球的执行失误比正手击球的代价更大,而发球回球的最佳击球选择比其他任何击球的执行失误都更有价值。
{"title":"A Markov process approach to untangling intention versus execution in tennis","authors":"Timothy C. Y. Chan, Douglas Fearing, Craig Fernandes, S. Kovalchik","doi":"10.1515/jqas-2021-0077","DOIUrl":"https://doi.org/10.1515/jqas-2021-0077","url":null,"abstract":"Abstract Value functions are used in sports to determine the optimal action players should employ. However, most literature implicitly assumes that players can perform the prescribed action with known and fixed probability of success. The effect of varying this probability or, equivalently, “execution error” in implementing an action (e.g., hitting a tennis ball to a specific location on the court) on the design of optimal strategies, has received limited attention. In this paper, we develop a novel modeling framework based on Markov reward processes and Markov decision processes to investigate how execution error impacts a player’s value function and strategy in tennis. We power our models with hundreds of millions of simulated tennis shots with 3D ball and 2D player tracking data. We find that optimal shot selection strategies in tennis become more conservative as execution error grows, and that having perfect execution with the empirical shot selection strategy is roughly equivalent to choosing one or two optimal shots with average execution error. We find that execution error on backhand shots is more costly than on forehand shots, and that optimal shot selection on a serve return is more valuable than on any other shot, over all values of execution error.","PeriodicalId":16925,"journal":{"name":"Journal of Quantitative Analysis in Sports","volume":"7 1","pages":"127 - 145"},"PeriodicalIF":0.8,"publicationDate":"2021-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89110515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Estimating player value in American football using plus–minus models 用正负模型估计美式橄榄球球员的价值
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-08-27 DOI: 10.1515/jqas-2020-0033
R. Sabin
Abstract Calculating the value of football player’s on-field performance has been limited to scouting methods while data-driven methods are mostly limited to quarterbacks. A popular method to calculate player value in other sports are Adjusted Plus–Minus (APM) and Regularized Adjusted Plus–Minus (RAPM) models. These models have been used in other sports, most notably basketball (Rosenbaum, D. T. 2004. Measuring How NBA Players Help Their Teams Win. http://www.82games.com/comm30.htm#_ftn1; Kubatko, J., D. Oliver, K. Pelton, and D. T. Rosenbaum. 2007. “A Starting Point for Analyzing Basketball Statistics.” Journal of Quantitative Analysis in Sports 3 (3); Winston, W. 2009. Player and Lineup Analysis in the NBA. Cambridge, Massachusetts; Sill, J. 2010. “Improved NBA Adjusted +/− Using Regularization and Out-Of-Sample Testing.” In Proceedings of the 2010 MIT Sloan Sports Analytics Conference) to estimate each player’s value by accounting for those in the game at the same time. Football is less amenable to APM models due to its few scoring events, few lineup changes, restrictive positioning, and small quantity of games relative to the number of teams. More recent methods have found ways to incorporate plus–minus models in other sports such as Hockey (Macdonald, B. 2011. “A Regression-Based Adjusted Plus-Minus Statistic for NHL players.” Journal of Quantitative Analysis in Sports 7 (3)) and Soccer (Schultze, S. R., and C.-M. Wellbrock. 2018. “A Weighted Plus/Minus Metric for Individual Soccer Player Performance.” Journal of Sports Analytics 4 (2): 121–31 and Matano, F., L. F. Richardson, T. Pospisil, C. Eubanks, and J. Qin (2018). Augmenting Adjusted Plus-Minus in Soccer with Fifa Ratings. arXiv preprint arXiv:1810.08032). These models are useful in coming up with results-oriented estimation of each player’s value. In American football, many positions such as offensive lineman have no recorded statistics which hinders the ability to estimate a player’s value. I provide a fully hierarchical Bayesian plus–minus (HBPM) model framework that extends RAPM to include position-specific penalization that solves many of the shortcomings of APM and RAPM models in American football. Cross-validated results show the HBPM to be more predictive out of sample than RAPM or APM models. Results for the HBPM models are provided for both Collegiate and NFL football players as well as deeper insights into positional value and position-specific age curves.
摘要足球运动员场上表现价值的计算一直局限于球探方法,而数据驱动方法大多局限于四分卫。在其他运动中,计算球员价值的常用方法是调整正负(APM)和正则化调整正负(RAPM)模型。这些模型也被用于其他运动,最著名的是篮球(Rosenbaum, D. T. 2004)。衡量NBA球员如何帮助他们的球队获胜。http://www.82games.com/comm30.htm _ftn1;J. Kubatko, D. Oliver, K. Pelton和D. T. Rosenbaum. 2007。《篮球统计分析的起点》体育定量分析杂志3 (3);温斯顿,W. 2009。NBA中的球员和阵容分析。马萨诸塞州剑桥;刘志强,2010。“改进NBA调整+/−使用正则化和样本外测试。”在2010年麻省理工学院斯隆体育分析会议的论文集中),通过计算同时参加比赛的球员来估计每个球员的价值。足球由于得分事件少,阵容变化少,定位受限,比赛数量相对于球队数量较少,因此不太适合APM模型。最近的方法已经找到了将正负模型纳入其他运动(如曲棍球)的方法(Macdonald, B. 2011)。“基于回归的NHL球员调整正负统计。”体育定量分析杂志7(3))和足球(舒尔茨,S. R.和c.m。Wellbrock》2018。“足球运动员个人表现的加权正负指标。”体育分析杂志4(2):121-31和Matano, F., L. F. Richardson, T. Pospisil, C. Eubanks, J. Qin(2018)。扩大调整正负在足球与国际足联评级。arXiv:1810.08032)。这些模型有助于以结果为导向估算每个玩家的价值。在美式足球中,许多位置,如进攻线卫,没有记录的数据,这阻碍了人们对球员价值的估计。我提供了一个完全分层的贝叶斯加减(HBPM)模型框架,它将RAPM扩展到包括特定位置的惩罚,从而解决了美式橄榄球中APM和RAPM模型的许多缺点。交叉验证的结果表明,HBPM在样本外比RAPM或APM模型更具预测性。HBPM模型的结果提供了大学和NFL橄榄球运动员以及更深入的位置价值和位置特定年龄曲线的见解。
{"title":"Estimating player value in American football using plus–minus models","authors":"R. Sabin","doi":"10.1515/jqas-2020-0033","DOIUrl":"https://doi.org/10.1515/jqas-2020-0033","url":null,"abstract":"Abstract Calculating the value of football player’s on-field performance has been limited to scouting methods while data-driven methods are mostly limited to quarterbacks. A popular method to calculate player value in other sports are Adjusted Plus–Minus (APM) and Regularized Adjusted Plus–Minus (RAPM) models. These models have been used in other sports, most notably basketball (Rosenbaum, D. T. 2004. Measuring How NBA Players Help Their Teams Win. http://www.82games.com/comm30.htm#_ftn1; Kubatko, J., D. Oliver, K. Pelton, and D. T. Rosenbaum. 2007. “A Starting Point for Analyzing Basketball Statistics.” Journal of Quantitative Analysis in Sports 3 (3); Winston, W. 2009. Player and Lineup Analysis in the NBA. Cambridge, Massachusetts; Sill, J. 2010. “Improved NBA Adjusted +/− Using Regularization and Out-Of-Sample Testing.” In Proceedings of the 2010 MIT Sloan Sports Analytics Conference) to estimate each player’s value by accounting for those in the game at the same time. Football is less amenable to APM models due to its few scoring events, few lineup changes, restrictive positioning, and small quantity of games relative to the number of teams. More recent methods have found ways to incorporate plus–minus models in other sports such as Hockey (Macdonald, B. 2011. “A Regression-Based Adjusted Plus-Minus Statistic for NHL players.” Journal of Quantitative Analysis in Sports 7 (3)) and Soccer (Schultze, S. R., and C.-M. Wellbrock. 2018. “A Weighted Plus/Minus Metric for Individual Soccer Player Performance.” Journal of Sports Analytics 4 (2): 121–31 and Matano, F., L. F. Richardson, T. Pospisil, C. Eubanks, and J. Qin (2018). Augmenting Adjusted Plus-Minus in Soccer with Fifa Ratings. arXiv preprint arXiv:1810.08032). These models are useful in coming up with results-oriented estimation of each player’s value. In American football, many positions such as offensive lineman have no recorded statistics which hinders the ability to estimate a player’s value. I provide a fully hierarchical Bayesian plus–minus (HBPM) model framework that extends RAPM to include position-specific penalization that solves many of the shortcomings of APM and RAPM models in American football. Cross-validated results show the HBPM to be more predictive out of sample than RAPM or APM models. Results for the HBPM models are provided for both Collegiate and NFL football players as well as deeper insights into positional value and position-specific age curves.","PeriodicalId":16925,"journal":{"name":"Journal of Quantitative Analysis in Sports","volume":"8 1","pages":"313 - 364"},"PeriodicalIF":0.8,"publicationDate":"2021-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84747452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Towards a more objective time standard in competitive rowing 在赛艇比赛中建立更客观的时间标准
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-08-10 DOI: 10.1515/jqas-2020-0055
Kenneth M. Kimmins, M. Tsai
Abstract Rowing needs a standardized Gold Medal Standard (GMS) to clearly compare performance across boat classes in competition. Here, we report a method to factor out environmental effects, developing a fairer GMS for individual rowing events. We used results from World Rowing Championships and Olympics Games (2005–2016) to calculate the difference between the fastest winning time of the day and other event winning times on the same day. From this, we calculated a prognostic GMS time for each event via repeated k-fold cross-validation linear regression. Then, we compared these values with the 10-year average winning time and the World Best Time (WBT). We repeated this process to develop prognostic podium standard (PS) times. The prognostic GMS times (RMSE = 9.47; R 2 = 0.875) were universally slower than the WBT (current GMS) by 6.2 s on average but faster than the 10-year average by 12.3 s. The prognostic PS times (RMSE = 10.5; R 2 = 897) were also slower than the WBT but faster than the 10-year average, by 12.2 and 6.3 s respectively. Our time-difference prediction model based on historical data generates non-outlier prognostic times. With the utilization of relative time difference, this approach promises a selection standard independent of environmental conditions, easily applicable across different sports.
赛艇需要一个标准化的金牌标准(GMS),以便在比赛中清晰地比较不同级别赛艇的表现。在这里,我们报告了一种方法来排除环境影响,为个人赛艇项目开发一个更公平的GMS。我们使用世界赛艇锦标赛和奥运会(2005-2016)的结果来计算当天最快获胜时间与同一天其他项目获胜时间之间的差异。由此,我们通过重复的k倍交叉验证线性回归计算了每个事件的预后GMS时间。然后,我们将这些值与10年平均获胜时间和世界最佳时间(WBT)进行比较。我们重复这一过程来制定预测平台标准(PS)时间。预测GMS次数(RMSE = 9.47;r2 = 0.875)普遍比WBT(当前GMS)平均慢6.2秒,但比10年平均值快12.3秒。预后PS次数(RMSE = 10.5;r2 = 897)也比WBT慢,但比10年平均值快,分别快12.2秒和6.3秒。我们基于历史数据的时差预测模型产生非离群预测时间。该方法利用相对时差,提供了一个独立于环境条件的选择标准,易于适用于不同的运动项目。
{"title":"Towards a more objective time standard in competitive rowing","authors":"Kenneth M. Kimmins, M. Tsai","doi":"10.1515/jqas-2020-0055","DOIUrl":"https://doi.org/10.1515/jqas-2020-0055","url":null,"abstract":"Abstract Rowing needs a standardized Gold Medal Standard (GMS) to clearly compare performance across boat classes in competition. Here, we report a method to factor out environmental effects, developing a fairer GMS for individual rowing events. We used results from World Rowing Championships and Olympics Games (2005–2016) to calculate the difference between the fastest winning time of the day and other event winning times on the same day. From this, we calculated a prognostic GMS time for each event via repeated k-fold cross-validation linear regression. Then, we compared these values with the 10-year average winning time and the World Best Time (WBT). We repeated this process to develop prognostic podium standard (PS) times. The prognostic GMS times (RMSE = 9.47; R 2 = 0.875) were universally slower than the WBT (current GMS) by 6.2 s on average but faster than the 10-year average by 12.3 s. The prognostic PS times (RMSE = 10.5; R 2 = 897) were also slower than the WBT but faster than the 10-year average, by 12.2 and 6.3 s respectively. Our time-difference prediction model based on historical data generates non-outlier prognostic times. With the utilization of relative time difference, this approach promises a selection standard independent of environmental conditions, easily applicable across different sports.","PeriodicalId":16925,"journal":{"name":"Journal of Quantitative Analysis in Sports","volume":"1 1","pages":"307 - 311"},"PeriodicalIF":0.8,"publicationDate":"2021-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89791568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Journal of Quantitative Analysis in Sports
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1