
Journal of Quantitative Analysis in Sports: Latest Articles

Improving the aggregation and evaluation of NBA mock drafts
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2024-08-22 DOI: 10.1515/jqas-2023-0100
Jared D. Fisher, Colin Montague
If professional teams can accurately predict the order of their league’s draft, they would have a competitive advantage when using or trading their draft picks. Many experts and enthusiasts publish forecasts of the order players are drafted into professional sports leagues, known as mock drafts. Using a novel dataset of mock drafts for the National Basketball Association (NBA), we explore mock drafts’ ability to forecast the actual draft. We analyze authors’ mock draft accuracy over time and ask how we can reasonably aggregate information from multiple authors. For both tasks, mock drafts are usually analyzed as ranked lists, and in this paper, we propose ways to improve on these methods. We propose that rank-biased distance is the appropriate error metric for measuring accuracy of mock drafts as ranked lists. To best combine information from multiple mock drafts into a single consensus mock draft, we also propose a combination method based on the ideas of ranked-choice voting. We show that this method provides improved forecasts over the standard Borda count combination method used for most similar analyses in sports, and that either combination method provides a more accurate forecast across seasons than any single author.
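The Borda count baseline mentioned in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `borda_consensus`, the placeholder player labels, the scoring scheme (n points for a first pick, descending by one) and the alphabetical tie-break are all assumptions made for the example.

```python
from collections import defaultdict


def borda_consensus(mock_drafts):
    """Combine several ranked lists into one consensus list via a Borda count.

    Each list awards n points to its first pick, n - 1 to its second, and so
    on, where n is the length of the longest list. A player missing from a
    list receives no points from that list (an assumption of this sketch).
    """
    n = max(len(draft) for draft in mock_drafts)
    points = defaultdict(int)
    for draft in mock_drafts:
        for position, player in enumerate(draft):
            points[player] += n - position
    # Highest total first; ties broken alphabetically so the output is stable.
    return sorted(points, key=lambda player: (-points[player], player))


# Three hypothetical mock drafts over placeholder players A, B, C.
drafts = [["A", "B", "C"], ["B", "A", "C"], ["A", "C", "B"]]
consensus = borda_consensus(drafts)
print(consensus)
```

Because every position contributes linearly, a Borda count treats a disagreement at pick 1 the same as one at pick 30, which is one reason a top-weighted metric or a ranked-choice style method might behave differently.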
Citations: 0
A basketball paradox: exploring NBA team defensive efficiency in a positionless game
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2024-08-16 DOI: 10.1515/jqas-2024-0010
Charles South
In the last decade, the offensive and defensive philosophies employed by teams in the National Basketball Association (NBA) have changed substantially. As a result, most players can no longer be classified into only one of the five traditional positions (PG, SG, SF, PF, C) and instead spend a percentage of their playing time at multiple positions, making positional data compositional. Further, given the desirability for versatile players, an argument can be made that traditional positions themselves are archaic. Using data from the 2016–17, 2017–18, and 2018–19 seasons, I explore how Bayesian hierarchical models can be used to estimate team defensive strength in three ways. First, only considering players classified by their majority traditional position. Second, by using compositional traditional positional data. Third, using compositional data from modern positions (archetypes) defined by fuzzy k-means clustering. I find that the fuzzy k-means approach leads to a modest improvement in both the root mean squared error and median 95 % posterior predictive interval width for the test data, and, more importantly, identifies 11 modern archetypes that, when combined, are correlated with team win total and adjusted team defensive rating. The modern archetype compositions can be used by stakeholders to better understand team defensive strength.
Citations: 0
Success factors in national team football: an analysis of the UEFA EURO 2020
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2024-07-20 DOI: 10.1515/jqas-2023-0026
Vincent Renner, Konstantin Görgen, Alexander Woll, Hagen Wäsche, Melanie Schienle
Identifying success factors in football is of sporting and economic interest. However, research in this field for national teams and their competitions is rare despite the popularity of teams and events. Therefore, we analyze data for the UEFA EURO 2020 and, for comparison purposes, the previous tournament in 2016. To mitigate the challenges of perceived multicollinearity and a small sample size, and to identify the relevant variables, we apply the ‘LASSO Cross-fitted Stability-Selection’ algorithm. This approach involves iterative splitting of data, with variables chosen via a ‘least absolute shrinkage and selection operator’ (LASSO) model (Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. Roy. Stat. Soc. B 58: 267–288) on one half of the observations, while coefficients are estimated on the other half. Subsequently, we inspect the frequency of selection and stability of coefficient estimation for each variable over the repeated samples to identify factors as relevant. By that, we are able to differentiate generally valid success factors such as the market value ratio from on-field variables whose importance is tournament-dependent, e.g. the tackles attempted. As the latter is connected to a team’s tactics, we conclude that their observed relevance is correlated to the results of the linked playing style in the specific tournaments. We also show the changing effect of these playing-styles on success across tournaments.
Citations: 0
An empirical Bayes approach for estimating skill models for professional darts players
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2024-07-13 DOI: 10.1515/jqas-2023-0084
Martin B. Haugh, Chun Wang
We perform an exploratory data analysis on a data-set for the top 16 professional darts players from the 2019 season. We use this data-set to fit player skill models which can then be used in dynamic zero-sum games (ZSGs) that model real-world matches between players. We propose an empirical Bayesian approach based on the Dirichlet-Multinomial (DM) model that overcomes limitations in the data. Specifically we introduce two DM-based skill models where the first model borrows strength from other darts players and the second model borrows strength from other regions of the dartboard. We find these DM-based models outperform simpler benchmark models with respect to Brier and Spherical scores, both of which are proper scoring rules. We also show in ZSGs settings that the difference between DM-based skill models and the simpler benchmark models is practically significant. Finally, we use our DM-based model to analyze specific situations that arose in real-world darts matches during the 2019 season.
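The idea of "borrowing strength" in a Dirichlet-Multinomial model can be illustrated with the standard conjugate update, in which observed counts are blended with prior pseudo-counts. The sketch below is only an illustration under stated assumptions: the outcome categories, the counts and the prior values are invented, and in an empirical Bayes setting the prior would be estimated from pooled data rather than fixed by hand as it is here.

```python
def dm_posterior_mean(counts, prior):
    """Posterior mean of multinomial probabilities under a Dirichlet prior.

    counts: observed outcome counts for one player and target.
    prior:  Dirichlet pseudo-counts (hypothetical values in this sketch;
            an empirical Bayes fit would estimate them from other players
            or other regions of the board).
    """
    total = sum(counts) + sum(prior)
    return [(c + a) / total for c, a in zip(counts, prior)]


# Hypothetical data: 4 throws at one target with three possible outcomes.
counts = [3, 0, 1]
prior = [2.0, 1.0, 1.0]
probs = dm_posterior_mean(counts, prior)
print(probs)
```

Note that the second outcome was never observed, yet it keeps a non-zero probability because of the prior, which is the practical benefit when per-target data are sparse.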
Citations: 0
A comprehensive survey of the home advantage in American football
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2024-07-09 DOI: 10.1515/jqas-2024-0016
Luke Benz, Thompson Bliss, Michael Lopez
The existence of and justification for the home advantage – the benefit a sports team receives when playing at home – have been studied across sports. The majority of research on this topic is limited to individual leagues in short time frames, which hinders extrapolation and a deeper understanding of possible causes. Using nearly two decades of data from the National Football League (NFL), the National Collegiate Athletic Association (NCAA), and high schools from across the United States, we provide a uniform approach to understanding the home advantage in American football. Our findings suggest home advantage is declining in the NFL and the highest levels of collegiate football, but not in amateur football. This increases the possibility that characteristics of the NCAA and NFL, such as travel improvements and instant replay, have helped level the playing field.
Citations: 0
Improving NHL draft outcome predictions using scouting reports
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2024-06-26 DOI: 10.1515/jqas-2024-0047
Hubert Luo
We leverage Large Language Models (LLMs) to extract information from scouting report texts and improve predictions of National Hockey League (NHL) draft outcomes. In parallel, we derive statistical features based on a player’s on-ice performance leading up to the draft. These two datasets are then combined using ensemble machine learning models. We find that both on-ice statistics and scouting reports have predictive value; however, combining them leads to the strongest results.
Citations: 0
A generative approach to frame-level multi-competitor races
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2024-05-24 DOI: 10.1515/jqas-2023-0091
Tyrel Stokes, Gurashish Bagga, Kimberly Kroetch, Brendan Kumagai, Liam Welsh
Multi-competitor races often feature complicated within-race strategies that are difficult to capture when training on race-outcome-level data. Models which do not account for race-level strategy may suffer from confounded inferences and predictions. We develop a generative model for multi-competitor races which explicitly models race-level effects like drafting and separates strategy from competitor ability. The model allows one to simulate full races from any real or created starting position, opening new avenues for attributing value to within-race actions and performing counterfactual analyses. This methodology is sufficiently general to apply to any track-based multi-competitor race where tracking data is available and competitor movement is well described by simultaneous forward and lateral movements. We apply this methodology to one-mile horse races using frame-level tracking data provided by the New York Racing Association (NYRA) and the New York Thoroughbred Horsemen’s Association (NYTHA) for the Big Data Derby 2022 Kaggle Competition. We demonstrate how this model can yield new inferences, such as the estimation of horse-specific speed profiles and examples of posterior predictive counterfactual simulations to answer questions of interest such as starting lane impacts on race outcomes.
Citations: 0
No cheering in the background? Individual performance in professional darts during COVID-19
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2024-04-15 DOI: 10.1515/jqas-2022-0036
Finn Spilker, Marius Ötting
The COVID-19 pandemic has led to a global shutdown of sporting activities. While professional sports competitions restarted in mid-2020, spectators were usually not allowed. This paper investigates the effect of absent fans and reduced social pressure on performance in professional darts – a setting where individual player performances can be well observed. Considering almost five years of tournament data, we use Bayesian multilevel models to investigate potential heterogeneity across players concerning reduced social pressure. For our analysis, we consider the two main performance measures in darts: the three-dart average and the checkout performance. Our results indicate that the effect of reduced social pressure on performance varies substantially across players. We further find experienced players to be less affected by social pressure compared to relatively inexperienced players.
Citations: 0
Career path clustering of elite soccer players among European Big-5 nations utilizing Dynamic Time Warping
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2024-04-04 DOI: 10.1515/jqas-2023-0080
Viktor Wolf, Ralf Lanwehr, Marcel Bieschke, Daniel Leyhr
Prior approaches to clustering soccer players have employed a variety of methods based on various data categories, but none of them have focused on clustering by career paths characterized through a time series analysis of yearly performance quality. Therefore, this study aims to propose a methodology for how a career path can be represented as a time series of a player’s seasonal qualities and then be clustered with those of players that have a similar career path. The underlying data focuses on soccer players from the five largest European soccer nations (Big-5). This allows for the identification of different types of career paths of players and the investigation of significant disparities between career paths among the Big-5 nations. In line with our proposed methodological approach, we identified and interpreted 13 different clusters of player career paths. These range from the cluster with the highest player quality scores to the pattern comprising players with the weakest scores. Further, the detected clusters show significant differences regarding variables of soccer players’ early career phase in adolescence (e.g., age of debut in professional soccer, years spent in a youth academy). The presented approach might represent a first step for stakeholders in soccer to gain objective insight into players’ careers by utilizing mainly freely available data sources.
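Dynamic Time Warping, the distance named in the title, can be sketched with the textbook dynamic program below. This is an assumption-laden illustration rather than the study's pipeline: it uses absolute difference as the local cost, applies no warping-window constraint, and the yearly quality scores are invented.

```python
def dtw_distance(a, b):
    """Classic dynamic-programming DTW distance between two numeric series.

    cost[i][j] holds the cheapest cumulative cost of aligning the first i
    points of `a` with the first j points of `b`.
    """
    inf = float("inf")
    n, m = len(a), len(b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of: step in a only, step in b only, step in both.
            cost[i][j] = local + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


# Hypothetical yearly quality scores for careers of different lengths.
same_shape = dtw_distance([1, 2, 3], [1, 2, 2, 3])
shifted = dtw_distance([1, 2, 3], [2, 3, 4])
print(same_shape, shifted)
```

Because the alignment may stretch or compress the time axis, careers of different lengths can still be compared, which is presumably why a warping-based distance suits career paths; the resulting pairwise distances could then feed any standard clustering routine.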
Spatial roles in hockey special teams
IF 0.8 Q3 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2024-04-04 DOI: 10.1515/jqas-2023-0019
Jonathan Arsenault, Margaret Cunniff, Eric Tulsky, James Richard Forbes
Special teams situations (i.e., power play and penalty kill) play an outsized role in determining the outcome of ice hockey games. Yet, quantitative methods for characterizing special teams tactics are limited. This work focuses on team structure and player deployment during in-zone special teams possessions. Leveraging player and puck tracking data from the National Hockey League (NHL), a framework is developed for describing player positioning during 5-on-4 power play and 4-on-5 penalty kill possessions. More specifically, player roles are defined directly from the player tracking data using non-negative matrix factorization, and every player is allocated a unique role at every frame of tracking data by solving a linear assignment problem. Team formations naturally arise through the combination of roles occupied in a frame. Roles that vary on a per-frame basis allow for a fine-grained analysis of team structure. This property of the roles-based representation is used to group together similar power play possessions using latent Dirichlet allocation, a topic modelling technique. The concept of assignments, which remain constant over an entire possession, is also introduced. Assignments provide a more stable measure of player positioning, which may be preferable when assessing deployment over longer periods of time.
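The per-frame role allocation described above can be pictured with a small sketch. This is a minimal illustration under stated assumptions, not the authors' method: the role locations are hypothetical fixed points rather than roles learned by non-negative matrix factorization, the cost is squared Euclidean distance, and the assignment is found by brute force over permutations (adequate for a handful of skaters) instead of a dedicated solver such as the Hungarian algorithm. The name `assign_roles` and all coordinates are invented for the example.

```python
from itertools import permutations


def assign_roles(players, roles):
    """Give each player a unique role, minimizing total squared distance.

    players, roles: equal-length lists of (x, y) tuples.
    Returns a tuple `r` where player i is assigned role r[i].
    Brute force over all permutations; fine for ~5 players per frame.
    """
    if len(players) != len(roles):
        raise ValueError("need exactly one role per player")

    def cost(p, r):
        return (p[0] - r[0]) ** 2 + (p[1] - r[1]) ** 2

    def total(perm):
        return sum(cost(players[i], roles[perm[i]]) for i in range(len(players)))

    return min(permutations(range(len(roles))), key=total)


# One hypothetical frame: three players and three role locations.
frame_players = [(0, 0), (10, 0), (0, 10)]
role_locations = [(9, 1), (1, 9), (1, 1)]

frame_assignment = assign_roles(frame_players, role_locations)
```

Running the same assignment on every frame yields a sequence of role tuples per possession, and the combination of roles occupied in a frame is what the abstract refers to as the team formation.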