首页 > 最新文献

American Statistician最新文献

英文 中文
Zero-Truncated Modelling in a Meta-Analysis on Suicide Data after Bariatric Surgery 零截断模型在减肥手术后自杀数据荟萃分析中的应用
IF 1.8 4区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-05-20 DOI: 10.1080/00031305.2025.2507380
Layna Charlie Dennett, Antony Overstall, Dankmar Böhning
Meta-analysis is a well-established method for integrating results from several independent studies to estimate a common quantity of interest. However, meta-analysis is prone to selection bias, notably when particular studies are systematically excluded. This can lead to bias in estimating the quantity of interest. Motivated by a meta-analysis to estimate the rate of completed-suicide after bariatric surgery, where studies which reported no suicides were excluded, a novel zero-truncated count modeling approach was developed. This approach addresses heterogeneity, both observed and unobserved, through covariate and overdispersion modeling, respectively. Additionally, through the Horvitz-Thompson estimator, an approach is developed to estimate the number of excluded studies, a quantity of potential interest for researchers. Uncertainty quantification for both estimation of suicide rates and number of excluded studies is achieved through a parametric bootstrapping approach.
荟萃分析是一种完善的方法,用于整合几个独立研究的结果来估计一个共同的兴趣量。然而,荟萃分析容易产生选择偏差,特别是当系统地排除特定研究时。这可能导致在估计兴趣数量时产生偏差。在一项评估减肥手术后自杀率的荟萃分析的激励下,研究人员开发了一种新的零截断计数模型方法,该方法排除了没有自杀报告的研究。这种方法分别通过协变量和过分散建模来解决观察到的和未观察到的异质性。此外,通过Horvitz-Thompson估计器,开发了一种方法来估计被排除的研究的数量,这是研究人员潜在兴趣的数量。通过参数自举方法实现了自杀率估计和排除研究数量的不确定性量化。
{"title":"Zero-Truncated Modelling in a Meta-Analysis on Suicide Data after Bariatric Surgery","authors":"Layna Charlie Dennett, Antony Overstall, Dankmar Böhning","doi":"10.1080/00031305.2025.2507380","DOIUrl":"https://doi.org/10.1080/00031305.2025.2507380","url":null,"abstract":"Meta-analysis is a well-established method for integrating results from several independent studies to estimate a common quantity of interest. However, meta-analysis is prone to selection bias, notably when particular studies are systematically excluded. This can lead to bias in estimating the quantity of interest. Motivated by a meta-analysis to estimate the rate of completed-suicide after bariatric surgery, where studies which reported no suicides were excluded, a novel zero-truncated count modeling approach was developed. This approach addresses heterogeneity, both observed and unobserved, through covariate and overdispersion modeling, respectively. Additionally, through the Horvitz-Thompson estimator, an approach is developed to estimate the number of excluded studies, a quantity of potential interest for researchers. Uncertainty quantification for both estimation of suicide rates and number of excluded studies is achieved through a parametric bootstrapping approach.","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"14 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144104095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flexible distributed lag models for count data using mgcv 使用mgcv计数数据的灵活分布式滞后模型
IF 1.8 4区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-05-18 DOI: 10.1080/00031305.2025.2505514
Theo Economou, Daphne Parliari, Aurelio Tobias, Laura Dawkins, Hamish Steptoe, Christophe Sarran, Oliver Stoner, Rachel Lowe, Jos Lelieveld
In this tutorial we present the use of R package mgcv to implement Distributed Lag Non-Linear Models (DLNMs) in a flexible way. Interpretation of smoothing splines as random quantities enables approximate Bayesian inference, which in turn allows uncertainty quantification and comprehensive model checking. We illustrate various modeling situations using open-access epidemiological data in conjunction with simulation experiments. We demonstrate the inclusion of temporal structures and the use of mixture distributions to allow for extreme outliers. Moreover, we demonstrate interactions of the temporal lagged structures with other covariates with different lagged periods for different covariates. Spatial structures are also demonstrated, including smooth spatial variability and Markov random fields, in addition to hierarchical formulations to allow for non-structured dependency. Posterior predictive simulation is used to ensure models verify well against the data.
在本教程中,我们介绍了使用R包mgcv以灵活的方式实现分布式滞后非线性模型(dlnm)。将平滑样条解释为随机量可以实现近似贝叶斯推理,从而允许不确定性量化和全面的模型检查。我们使用开放获取的流行病学数据结合模拟实验说明了各种建模情况。我们展示了时间结构的包含和混合分布的使用,以允许极端异常值。此外,我们证明了时间滞后结构与其他协变量的相互作用,不同的协变量具有不同的滞后周期。还演示了空间结构,包括平滑空间变异性和马尔可夫随机场,以及允许非结构化依赖的分层公式。后验预测仿真用于确保模型与数据的良好验证。
{"title":"Flexible distributed lag models for count data using mgcv","authors":"Theo Economou, Daphne Parliari, Aurelio Tobias, Laura Dawkins, Hamish Steptoe, Christophe Sarran, Oliver Stoner, Rachel Lowe, Jos Lelieveld","doi":"10.1080/00031305.2025.2505514","DOIUrl":"https://doi.org/10.1080/00031305.2025.2505514","url":null,"abstract":"In this tutorial we present the use of R package <span>mgcv</span> to implement Distributed Lag Non-Linear Models (DLNMs) in a flexible way. Interpretation of smoothing splines as random quantities enables approximate Bayesian inference, which in turn allows uncertainty quantification and comprehensive model checking. We illustrate various modeling situations using open-access epidemiological data in conjunction with simulation experiments. We demonstrate the inclusion of temporal structures and the use of mixture distributions to allow for extreme outliers. Moreover, we demonstrate interactions of the temporal lagged structures with other covariates with different lagged periods for different covariates. Spatial structures are also demonstrated, including smooth spatial variability and Markov random fields, in addition to hierarchical formulations to allow for non-structured dependency. Posterior predictive simulation is used to ensure models verify well against the data.","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"33 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144088102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Loser’s Curse and the Critical Role of the Utility Function 失败者的诅咒与效用函数的关键作用
IF 1.8 4区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-05-16 DOI: 10.1080/00031305.2025.2505512
Ryan S. Brill, Abraham J. Wyner
A longstanding question in the judgment and decision making literature is whether experts, even in high-stakes environments, exhibit the same cognitive biases observed in controlled experiments with inexperienced participants. Massey and Thaler (2013) claim to have found an example of bias and irrationality in expert decision making: general managers’ behavior in the National Football League draft pick trade market. They argue that general managers systematically overvalue top draft picks, which generate less surplus value on average than later first-round picks, a phenomenon known as the loser’s curse. Their conclusion hinges on the assumption that general managers should use expected surplus value as their utility function for evaluating draft picks. This assumption, however, is neither explicitly justified nor necessarily aligned with the strategic complexities of constructing a National Football League roster. In this paper, we challenge their framework by considering alternative utility functions, particularly those that emphasize the acquisition of transformational players––those capable of dramatically increasing a team’s chances of winning the Super Bowl. Under a decision rule that prioritizes the probability of acquiring elite players, which we construct from a novel Bayesian hierarchical Beta regression model, general managers’ draft trade behavior appears rational rather than systematically flawed. More broadly, our findings highlight the critical role of carefully specifying a utility function when evaluating the quality of decisions.
判断和决策文献中一个长期存在的问题是,即使在高风险环境中,专家是否会表现出与没有经验的参与者在对照实验中观察到的相同的认知偏见。Massey和Thaler(2013)声称发现了专家决策中的偏见和非理性的一个例子:国家橄榄球联盟选秀权交易市场中的总经理行为。他们认为,总经理们系统性地高估了状元选秀权,这些状元选秀权产生的剩余价值平均低于后来的首轮选秀权,这种现象被称为“失败者的诅咒”。他们的结论基于这样一个假设:总经理应该使用预期剩余价值作为评估选秀权的效用函数。然而,这种假设既没有明确的理由,也不一定符合构建国家橄榄球联盟名单的战略复杂性。在本文中,我们通过考虑替代效用函数来挑战他们的框架,特别是那些强调获得变革性球员的函数——那些能够显著增加球队赢得超级碗机会的球员。我们从一个新的贝叶斯分层Beta回归模型中构建了一个优先考虑获得精英球员概率的决策规则,在这个规则下,总经理的选秀交易行为似乎是理性的,而不是系统性的缺陷。更广泛地说,我们的发现强调了在评估决策质量时仔细指定效用函数的关键作用。
{"title":"The Loser’s Curse and the Critical Role of the Utility Function","authors":"Ryan S. Brill, Abraham J. Wyner","doi":"10.1080/00031305.2025.2505512","DOIUrl":"https://doi.org/10.1080/00031305.2025.2505512","url":null,"abstract":"A longstanding question in the judgment and decision making literature is whether experts, even in high-stakes environments, exhibit the same cognitive biases observed in controlled experiments with inexperienced participants. Massey and Thaler (2013) claim to have found an example of bias and irrationality in expert decision making: general managers’ behavior in the National Football League draft pick trade market. They argue that general managers systematically overvalue top draft picks, which generate less surplus value on average than later first-round picks, a phenomenon known as the loser’s curse. Their conclusion hinges on the assumption that general managers should use expected surplus value as their utility function for evaluating draft picks. This assumption, however, is neither explicitly justified nor necessarily aligned with the strategic complexities of constructing a National Football League roster. In this paper, we challenge their framework by considering alternative utility functions, particularly those that emphasize the acquisition of transformational players––those capable of dramatically increasing a team’s chances of winning the Super Bowl. Under a decision rule that prioritizes the probability of acquiring elite players, which we construct from a novel Bayesian hierarchical Beta regression model, general managers’ draft trade behavior appears rational rather than systematically flawed. More broadly, our findings highlight the critical role of carefully specifying a utility function when evaluating the quality of decisions.","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"16 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144066829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High Dimensional Space Oddity 高维空间奇度
IF 1.8 4区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-05-15 DOI: 10.1080/00031305.2025.2505507
Haim Bar, Vladimir Pozdnyakov
In his 1996 paper, Talagrand highlighted that the Law of Large Numbers (LLN) for independent random variables can be viewed as a geometric property of multidimensional product spaces. This phenomenon is known as the concentration of measure. To illustrate this profound connection between geometry and probability theory, we consider a seemingly intractable geometric problem in multidimensional Euclidean space and solve it using standard probabilistic tools such as the LLN and the Central Limit Theorem (CLT).
Talagrand在1996年的论文中强调,独立随机变量的大数定律(LLN)可以看作是多维积空间的一个几何性质。这种现象被称为度量的集中。为了说明几何和概率论之间的这种深刻联系,我们考虑了多维欧几里德空间中一个看似棘手的几何问题,并使用标准概率工具(如LLN和中心极限定理(CLT))来解决它。
{"title":"High Dimensional Space Oddity","authors":"Haim Bar, Vladimir Pozdnyakov","doi":"10.1080/00031305.2025.2505507","DOIUrl":"https://doi.org/10.1080/00031305.2025.2505507","url":null,"abstract":"In his 1996 paper, Talagrand highlighted that the Law of Large Numbers (LLN) for independent random variables can be viewed as a geometric property of multidimensional product spaces. This phenomenon is known as the concentration of measure. To illustrate this profound connection between geometry and probability theory, we consider a seemingly intractable geometric problem in multidimensional Euclidean space and solve it using standard probabilistic tools such as the LLN and the Central Limit Theorem (CLT).","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"16 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144066641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Inference and the Principle of Maximum Entropy 贝叶斯推理与最大熵原理
IF 1.8 4区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-05-06 DOI: 10.1080/00031305.2025.2501799
Duncan K. Foley, Ellis Scharfenaker
Bayes’ theorem incorporates distinct types of information through the likelihood and prior. Direct observations of state variables enter the likelihood and modify posterior probabilities through consistent updating. Information in terms of expected values of state variables modify posterior probabilities by constraining prior probabilities to be consistent with the information. Constraints on the prior can be exact, limiting hypothetical frequency distributions to only those that satisfy the constraints, or be approximate, allowing residual deviations from the exact constraint to some degree of tolerance. When the model parameters and constraint tolerances are known, posterior probabilities follow directly from Bayes’ theorem. When parameters and tolerances are unknown a prior for them must be specified. When the system is close to statistical equilibrium the computation of posterior probabilities is simplified due to the concentration of the prior on the maximum entropy hypothesis. The relationship between maximum entropy reasoning and Bayes’ theorem from this point of view is that maximum entropy reasoning is a special case of Bayesian inference with a constrained entropy-favoring prior.
贝叶斯定理通过似然和先验结合了不同类型的信息。状态变量的直接观测进入似然,并通过一致更新修正后验概率。以状态变量期望值表示的信息通过约束先验概率使其与信息一致来修改后验概率。对先验的约束可以是精确的,将假设的频率分布限制为仅满足约束的那些,或者是近似的,允许从精确约束到某种程度的容忍的剩余偏差。当模型参数和约束容限已知时,后验概率直接遵循贝叶斯定理。当参数和公差未知时,必须指定它们的先验。当系统接近统计平衡时,由于先验集中在最大熵假设上,后验概率的计算得到简化。从这个角度看,最大熵推理与贝叶斯定理的关系是,最大熵推理是贝叶斯推理的一种特殊情况,具有约束熵偏好先验。
{"title":"Bayesian Inference and the Principle of Maximum Entropy","authors":"Duncan K. Foley, Ellis Scharfenaker","doi":"10.1080/00031305.2025.2501799","DOIUrl":"https://doi.org/10.1080/00031305.2025.2501799","url":null,"abstract":"Bayes’ theorem incorporates distinct types of information through the likelihood and prior. Direct observations of state variables enter the likelihood and modify posterior probabilities through consistent updating. Information in terms of expected values of state variables modify posterior probabilities by constraining prior probabilities to be consistent with the information. Constraints on the prior can be exact, limiting hypothetical frequency distributions to only those that satisfy the constraints, or be approximate, allowing residual deviations from the exact constraint to some degree of tolerance. When the model parameters and constraint tolerances are known, posterior probabilities follow directly from Bayes’ theorem. When parameters and tolerances are unknown a prior for them must be specified. When the system is close to statistical equilibrium the computation of posterior probabilities is simplified due to the concentration of the prior on the maximum entropy hypothesis. The relationship between maximum entropy reasoning and Bayes’ theorem from this point of view is that maximum entropy reasoning is a special case of Bayesian inference with a constrained entropy-favoring prior.","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"4 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143933216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Much Ado About Survey Tables: A Comparison of Chi-Square Tests and Software to Analyze Categorical Survey Data 调查表格:卡方检验与软件分析分类调查数据的比较
IF 1.8 4区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-05-05 DOI: 10.1080/00031305.2025.2501800
Li-Yen R. Hu, Yulei He, Katherine E. Irimata, Vladislav Beresovsky
Chi-square tests are often employed to examine the association of categorical variables, the homogeneity of proportions between two or more samples, and the goodness-of-fit for a specified distribution. To account for the complex design of survey data, variants of chi-square tests as well as software packages that implement these tests have been developed. Nevertheless, from a survey practitioner’s perspective, there is a lack of applied literature that reviews and compares alternative options of survey chi-square tests and their associated programming and output. This paper aims to fill such a gap.Many modern statistical software packages for survey analysis are capable of computing either the Wald chi-square test or the Rao-Scott chi-square test, along with other types of chi-square tests, including the Rao-Scott likelihood ratio chi-square test and the Wald log-linear chi-square test. This paper focuses on these four types of chi-square tests, and examines four statistical packages that compute them in SAS®, R, Python and SUDAAN®. While the same type of tests using different packages yield similar results, different types of chi-square tests may yield variations in p-values when conducting the same comparison. Sample programming code is included in Appendix for readers’ reference.
卡方检验通常用于检验分类变量之间的关联、两个或多个样本之间比例的均匀性以及特定分布的拟合优度。为了解释调查数据的复杂设计,卡方检验的变体以及实现这些检验的软件包已经开发出来。然而,从调查从业者的角度来看,缺乏应用文献来审查和比较调查卡方检验的备选方案及其相关的编程和输出。本文旨在填补这一空白。许多用于调查分析的现代统计软件包都能够计算Wald卡方检验或Rao-Scott卡方检验,以及其他类型的卡方检验,包括Rao-Scott似然比卡方检验和Wald对数线性卡方检验。本文重点介绍了这四种类型的卡方检验,并检查了在SAS®,R, Python和SUDAAN®中计算它们的四个统计包。虽然使用不同包的同一类型的检验产生相似的结果,但在进行相同的比较时,不同类型的卡方检验可能产生p值的变化。示例编程代码包含在附录中,供读者参考。
{"title":"Much Ado About Survey Tables: A Comparison of Chi-Square Tests and Software to Analyze Categorical Survey Data","authors":"Li-Yen R. Hu, Yulei He, Katherine E. Irimata, Vladislav Beresovsky","doi":"10.1080/00031305.2025.2501800","DOIUrl":"https://doi.org/10.1080/00031305.2025.2501800","url":null,"abstract":"Chi-square tests are often employed to examine the association of categorical variables, the homogeneity of proportions between two or more samples, and the goodness-of-fit for a specified distribution. To account for the complex design of survey data, variants of chi-square tests as well as software packages that implement these tests have been developed. Nevertheless, from a survey practitioner’s perspective, there is a lack of applied literature that reviews and compares alternative options of survey chi-square tests and their associated programming and output. This paper aims to fill such a gap.Many modern statistical software packages for survey analysis are capable of computing either the Wald chi-square test or the Rao-Scott chi-square test, along with other types of chi-square tests, including the Rao-Scott likelihood ratio chi-square test and the Wald log-linear chi-square test. This paper focuses on these four types of chi-square tests, and examines four statistical packages that compute them in SAS®, R, Python and SUDAAN<sup>®</sup>. While the same type of tests using different packages yield similar results, different types of chi-square tests may yield variations in p-values when conducting the same comparison. Sample programming code is included in Appendix for readers’ reference.","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"34 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143933218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analytics, Have Some Humility: A Statistical View of Fourth-Down Decision Making 分析,要谦虚:第四回合决策的统计观点
IF 1.8 4区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-04-18 DOI: 10.1080/00031305.2025.2475801
Ryan S. Brill, Ronald Yurko, Abraham J. Wyner
The standard mathematical approach to fourth-down decision-making in American football is to make the decision that maximizes estimated win probability. Win probability estimates arise from machine learning models fit from historical data. These models attempt to capture a nuanced relationship between a noisy binary outcome variable and game-state variables replete with interactions and non-linearities from a finite dataset of just a few thousand games. Thus, it is imperative to knit uncertainty quantification into the fourth-down decision procedure; we do so using bootstrapping. We find that uncertainty in the estimated optimal fourth-down decision is far greater than that currently expressed by sports analysts in popular sports media.
在美式橄榄球比赛中,标准的数学方法是做出使获胜概率最大化的决定。获胜概率估计来自于根据历史数据拟合的机器学习模型。这些模型试图捕捉嘈杂的二元结果变量和充满互动和非线性的游戏状态变量之间的微妙关系,这些变量来自只有几千个游戏的有限数据集。因此,必须将不确定性量化纳入第四次决策过程;我们使用自引导来做到这一点。我们发现,估计的最优第四进攻决策的不确定性远远大于目前流行体育媒体中体育分析师所表达的不确定性。
{"title":"Analytics, Have Some Humility: A Statistical View of Fourth-Down Decision Making","authors":"Ryan S. Brill, Ronald Yurko, Abraham J. Wyner","doi":"10.1080/00031305.2025.2475801","DOIUrl":"https://doi.org/10.1080/00031305.2025.2475801","url":null,"abstract":"The standard mathematical approach to fourth-down decision-making in American football is to make the decision that maximizes estimated win probability. Win probability estimates arise from machine learning models fit from historical data. These models attempt to capture a nuanced relationship between a noisy binary outcome variable and game-state variables replete with interactions and non-linearities from a finite dataset of just a few thousand games. Thus, it is imperative to knit uncertainty quantification into the fourth-down decision procedure; we do so using bootstrapping. We find that uncertainty in the estimated optimal fourth-down decision is far greater than that currently expressed by sports analysts in popular sports media.","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"51 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143933338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Play-by-Play Volleyball Win Probability Model 逐场排球获胜概率模型
IF 1.8 4区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-04-10 DOI: 10.1080/00031305.2025.2490786
Nathan Hawkins, Gilbert W. Fellingham, Garritt L. Page
This paper introduces a volleyball point-by-point win probability model that updates the probability of winning a set after each play in the set. The covariate informed product partition model (PPMx) is well suited to flexibly include in-set team performance information when making predictions. However, making predictions in real time would be too expensive computationally as it would require refitting the PPMx for each prediction. Instead, we develop a predictive procedure based on a single training of the PPMx that predicts in real-time. We deploy this procedure using data from the 2018 Men’s World Volleyball Championship. The procedure first trains a PPMx model using end-of-set team performance statistics from the round robin stage of the tournament. Then based on the PPMx predictive distribution, we predict the win probability after every play of every match in the knockout stages. Finally, we show how the prediction procedure can be enhanced by including pre-set information towards the beginning of the set and set score towards the end.
本文介绍了一个排球逐点获胜概率模型,该模型在每一局比赛后更新一局获胜的概率。协变量知情产品划分模型(PPMx)非常适合在进行预测时灵活地包括集内团队绩效信息。然而,进行实时预测在计算上过于昂贵,因为它需要为每个预测重新调整PPMx。相反,我们开发了一种基于PPMx的单一训练的预测程序,可以实时预测。我们使用2018年世界男子排球锦标赛的数据部署了这一程序。该程序首先使用比赛循环赛阶段结束的球队表现统计数据来训练PPMx模型。然后基于PPMx预测分布,预测淘汰赛阶段每场比赛的胜率。最后,我们展示了如何通过在集合的开始处包含预设信息和在结尾处包含预设分数来增强预测过程。
{"title":"Play-by-Play Volleyball Win Probability Model","authors":"Nathan Hawkins, Gilbert W. Fellingham, Garritt L. Page","doi":"10.1080/00031305.2025.2490786","DOIUrl":"https://doi.org/10.1080/00031305.2025.2490786","url":null,"abstract":"This paper introduces a volleyball point-by-point win probability model that updates the probability of winning a set after each play in the set. The covariate informed product partition model (PPMx) is well suited to flexibly include in-set team performance information when making predictions. However, making predictions in real time would be too expensive computationally as it would require refitting the PPMx for each prediction. Instead, we develop a predictive procedure based on a single training of the PPMx that predicts in real-time. We deploy this procedure using data from the 2018 Men’s World Volleyball Championship. The procedure first trains a PPMx model using end-of-set team performance statistics from the round robin stage of the tournament. Then based on the PPMx predictive distribution, we predict the win probability after every play of every match in the knockout stages. Finally, we show how the prediction procedure can be enhanced by including pre-set information towards the beginning of the set and set score towards the end.","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"1 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143940081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learn R: As a Language, 2nd ed. 《学习R:作为一门语言》第二版。
IF 1.8 4区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-04-09 DOI: 10.1080/00031305.2025.2490305
Haihan Yu
Pedro J. Aphalo. Boca Raton, FL: Chapman & Hall/CRC Press, 2024, xvii + 447 pp., $220.00(H), ISBN: 978-1-032-51843-5.R programming has become an essential tool for data analysis and statistical com...
佩德罗·j·阿波洛。博卡拉顿,佛罗里达州:查普曼和霍尔/CRC出版社,2024,17 + 447页,$220.00(H), ISBN: 978-1-032-51843-5。R编程已经成为数据分析和统计的必备工具。
{"title":"Learn R: As a Language, 2nd ed.","authors":"Haihan Yu","doi":"10.1080/00031305.2025.2490305","DOIUrl":"https://doi.org/10.1080/00031305.2025.2490305","url":null,"abstract":"Pedro J. Aphalo. Boca Raton, FL: Chapman & Hall/CRC Press, 2024, xvii + 447 pp., $220.00(H), ISBN: 978-1-032-51843-5.R programming has become an essential tool for data analysis and statistical com...","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"2 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143933335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Science in Practice. 数据科学实践。
IF 1.8 4区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-04-09 DOI: 10.1080/00031305.2025.2490304
Xiao Hui Tai
Tom Alby. Boca Raton, FL: Chapman & Hall/CRC Press, 2024, xvi + 301 pp., $200.00(H), ISBN: 978-1-032-50524-4.This book is a comprehensive introduction to data science, with a focus on how it is use...
汤姆Alby。博卡拉顿,佛罗里达州:查普曼和霍尔/CRC出版社,2024,16 + 301页,200.00美元(H), ISBN: 978-1-032-50524-4。这本书是对数据科学的全面介绍,重点是如何使用它…
{"title":"Data Science in Practice.","authors":"Xiao Hui Tai","doi":"10.1080/00031305.2025.2490304","DOIUrl":"https://doi.org/10.1080/00031305.2025.2490304","url":null,"abstract":"Tom Alby. Boca Raton, FL: Chapman & Hall/CRC Press, 2024, xvi + 301 pp., $200.00(H), ISBN: 978-1-032-50524-4.This book is a comprehensive introduction to data science, with a focus on how it is use...","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"15 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143933333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
American Statistician
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1