首页 > 最新文献

Australian & New Zealand Journal of Statistics最新文献

英文 中文
Spying on the prior of the number of data clusters and the partition distribution in Bayesian cluster analysis 监视贝叶斯聚类分析中数据簇数的先验性和分区分布
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-02-10 DOI: 10.1111/anzs.12350
Jan Greve, Bettina Grün, Gertraud Malsiner-Walli, Sylvia Frühwirth-Schnatter

Cluster analysis aims at partitioning data into groups or clusters. In applications, it is common to deal with problems where the number of clusters is unknown. Bayesian mixture models employed in such applications usually specify a flexible prior that takes into account the uncertainty with respect to the number of clusters. However, a major empirical challenge involving the use of these models is in the characterisation of the induced prior on the partitions. This work introduces an approach to compute descriptive statistics of the prior on the partitions for three selected Bayesian mixture models developed in the areas of Bayesian finite mixtures and Bayesian nonparametrics. The proposed methodology involves computationally efficient enumeration of the prior on the number of clusters in-sample (termed as ‘data clusters’) and determining the first two prior moments of symmetric additive statistics characterising the partitions. The accompanying reference implementation is made available in the R package fipp. Finally, we illustrate the proposed methodology through comparisons and also discuss the implications for prior elicitation in applications.

聚类分析的目的是将数据划分成组或簇。在应用程序中,通常会处理集群数量未知的问题。在这类应用中使用的贝叶斯混合模型通常指定了一个灵活的先验,该先验考虑了与集群数量有关的不确定性。然而,涉及使用这些模型的主要经验挑战是在分区上诱导先验的表征。这项工作介绍了一种方法来计算在贝叶斯有限混合和贝叶斯非参数领域开发的三个选定贝叶斯混合模型分区上的先验描述性统计。所提出的方法包括对样本内簇(称为“数据簇”)数量的先验进行计算效率枚举,并确定描述分区的对称加性统计的前两个先验矩。附带的参考实现可在R包fipp中获得。最后,我们通过比较说明了所提出的方法,并讨论了在应用程序中对先验启发的影响。
{"title":"Spying on the prior of the number of data clusters and the partition distribution in Bayesian cluster analysis","authors":"Jan Greve,&nbsp;Bettina Grün,&nbsp;Gertraud Malsiner-Walli,&nbsp;Sylvia Frühwirth-Schnatter","doi":"10.1111/anzs.12350","DOIUrl":"10.1111/anzs.12350","url":null,"abstract":"<p>Cluster analysis aims at partitioning data into groups or clusters. In applications, it is common to deal with problems where the number of clusters is unknown. Bayesian mixture models employed in such applications usually specify a flexible prior that takes into account the uncertainty with respect to the number of clusters. However, a major empirical challenge involving the use of these models is in the characterisation of the induced prior on the partitions. This work introduces an approach to compute descriptive statistics of the prior on the partitions for three selected Bayesian mixture models developed in the areas of Bayesian finite mixtures and Bayesian nonparametrics. The proposed methodology involves computationally efficient enumeration of the prior on the number of clusters in-sample (termed as ‘data clusters’) and determining the first two prior moments of symmetric additive statistics characterising the partitions. The accompanying reference implementation is made available in the <span>R</span> package <span>fipp</span>. Finally, we illustrate the proposed methodology through comparisons and also discuss the implications for prior elicitation in applications.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"64 2","pages":"205-229"},"PeriodicalIF":1.1,"publicationDate":"2022-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12350","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72830218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Variable selection and debiased estimation for single-index expectile model 单指标期望模型的变量选择与去偏估计
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-02-02 DOI: 10.1111/anzs.12348
Rong Jiang, Yexun Peng, Yufei Deng

This article develops a penalised asymmetric least squares estimator for single-index expectile model. The oracle property of the proposed estimator is established. Moreover, the debiasing technique is used to construct an estimator that is asymptotically normal, which enables the construction of valid confidence intervals and hypothesis testing. Simulation studies and one real data application are conducted to illustrate the finite sample performance of the proposed methods.

本文提出了单指标期望模型的惩罚非对称最小二乘估计。建立了该估计器的预言性。此外,利用消偏技术构造渐近正态的估计量,使有效置信区间的构造和假设检验成为可能。通过仿真研究和一个实际数据应用来说明所提出方法的有限样本性能。
{"title":"Variable selection and debiased estimation for single-index expectile model","authors":"Rong Jiang,&nbsp;Yexun Peng,&nbsp;Yufei Deng","doi":"10.1111/anzs.12348","DOIUrl":"10.1111/anzs.12348","url":null,"abstract":"<div>\u0000 \u0000 <p>This article develops a penalised asymmetric least squares estimator for single-index expectile model. The oracle property of the proposed estimator is established. Moreover, the debiasing technique is used to construct an estimator that is asymptotically normal, which enables the construction of valid confidence intervals and hypothesis testing. Simulation studies and one real data application are conducted to illustrate the finite sample performance of the proposed methods.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"63 4","pages":"658-673"},"PeriodicalIF":1.1,"publicationDate":"2022-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79758873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient estimation of partially linear tail index models using B-splines 部分线性尾指数模型的b样条有效估计
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-02-02 DOI: 10.1111/anzs.12357
Yaolan Ma, Bo Wei

The tail index is an important parameter in extreme value theory. In this paper, we consider a simple yet flexible spline estimation method for partially linear tail index models. We approximate the unknown function by B-splines and construct an approximate log-likelihood function to estimate the coefficients of the linear covariates and the B-spline basis functions. Consistency and asymptotic normality of the estimators are established. Subsequently, the proposed method is illustrated by using simulations and applications to the Fremantle annual maximum sea levels data and Chicago air pollution data.

尾指数是极值理论中的一个重要参数。本文考虑了部分线性尾指数模型的一种简单而灵活的样条估计方法。我们用b样条近似未知函数,构造一个近似对数似然函数来估计线性协变量和b样条基函数的系数。建立了估计量的相合性和渐近正态性。随后,通过对Fremantle年最高海平面数据和芝加哥空气污染数据的模拟和应用说明了所提出的方法。
{"title":"Efficient estimation of partially linear tail index models using B-splines","authors":"Yaolan Ma,&nbsp;Bo Wei","doi":"10.1111/anzs.12357","DOIUrl":"10.1111/anzs.12357","url":null,"abstract":"<div>\u0000 \u0000 <p>The tail index is an important parameter in extreme value theory. In this paper, we consider a simple yet flexible spline estimation method for partially linear tail index models. We approximate the unknown function by B-splines and construct an approximate log-likelihood function to estimate the coefficients of the linear covariates and the B-spline basis functions. Consistency and asymptotic normality of the estimators are established. Subsequently, the proposed method is illustrated by using simulations and applications to the Fremantle annual maximum sea levels data and Chicago air pollution data.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"64 1","pages":"27-44"},"PeriodicalIF":1.1,"publicationDate":"2022-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84190109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Properties of the affine-invariant ensemble sampler's ‘stretch move’ in high dimensions 高维仿射不变系综采样器“拉伸移动”的性质
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-02-02 DOI: 10.1111/anzs.12358
David Huijser, Jesse Goodman, Brendon J. Brewer

We present theoretical and practical properties of the affine-invariant ensemble sampler Markov Chain Monte Carlo method. In high dimensions, the sampler's ‘stretch move’ has unusual and undesirable properties. We demonstrate this with an n-dimensional correlated Gaussian toy problem with a known mean and covariance structure, and a multivariate version of the Rosenbrock problem. Visual inspection of a trace plots suggests the burn-in period is short. Upon closer inspection, we discover the mean and the variance of the target distribution do not match the known values, and the chain takes a very long time to converge. This problem becomes severe as n increases beyond 50. We also applied different diagnostics adapted to be applicable to ensemble methods to determine any lack of convergence. The diagnostics include the Gelman–Rubin method, the Heidelberger–Welch test, the integrated autocorrelation and the acceptance rate. The trace plot of individual walkers appears to be useful as well. We therefore conclude that the stretch move should be used with caution in moderate to high dimensions. We also present some heuristic results explaining this behaviour.

给出了仿射不变集合采样器马尔可夫链蒙特卡罗方法的理论和实际性质。在高维中,采样器的“拉伸移动”具有不寻常和不受欢迎的特性。我们用一个已知均值和协方差结构的n维相关高斯玩具问题和一个多变量版本的Rosenbrock问题来证明这一点。目视检查痕迹图表明烧蚀期很短。经过仔细检查,我们发现目标分布的均值和方差与已知值不匹配,并且链需要很长时间才能收敛。当n大于50时,这个问题变得更加严重。我们还应用了适用于集成方法的不同诊断方法来确定是否缺乏收敛性。诊断方法包括Gelman-Rubin法、海德堡-韦尔奇检验、综合自相关和接受率。单个步行者的轨迹图似乎也很有用。因此,我们得出结论,拉伸移动应谨慎使用中至高维。我们还提出了一些启发式结果来解释这种行为。
{"title":"Properties of the affine-invariant ensemble sampler's ‘stretch move’ in high dimensions","authors":"David Huijser,&nbsp;Jesse Goodman,&nbsp;Brendon J. Brewer","doi":"10.1111/anzs.12358","DOIUrl":"10.1111/anzs.12358","url":null,"abstract":"<div>\u0000 \u0000 <p>We present theoretical and practical properties of the affine-invariant ensemble sampler Markov Chain Monte Carlo method. In high dimensions, the sampler's ‘stretch move’ has unusual and undesirable properties. We demonstrate this with an <i>n</i>-dimensional correlated Gaussian toy problem with a known mean and covariance structure, and a multivariate version of the Rosenbrock problem. Visual inspection of a trace plots suggests the burn-in period is short. Upon closer inspection, we discover the mean and the variance of the target distribution do not match the known values, and the chain takes a very long time to converge. This problem becomes severe as <i>n</i> increases beyond 50. We also applied different diagnostics adapted to be applicable to ensemble methods to determine any lack of convergence. The diagnostics include the Gelman–Rubin method, the Heidelberger–Welch test, the integrated autocorrelation and the acceptance rate. The trace plot of individual walkers appears to be useful as well. We therefore conclude that the stretch move should be used with caution in moderate to high dimensions. We also present some heuristic results explaining this behaviour.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"64 1","pages":"1-26"},"PeriodicalIF":1.1,"publicationDate":"2022-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88726225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Global implicit function theorems and the online expectation–maximisation algorithm 全局隐函数定理和在线期望最大化算法
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-24 DOI: 10.1111/anzs.12356
Hien Duy Nguyen, Florence Forbes
The expectation–maximisation (EM) algorithm framework is an important tool for statistical computation. Due to the changing nature of data, online and mini‐batch variants of EM and EM‐like algorithms have become increasingly popular. The consistency of the estimator sequences that are produced by these EM variants often rely on an assumption regarding the continuous differentiability of a parameter update function. In many cases, the parameter update function is not in closed form and may only be defined implicitly, which makes the verification of the continuous differentiability property difficult. We demonstrate how a global implicit function theorem can be used to verify such properties in the cases of finite mixtures of distributions in the exponential family, and more generally, when the component‐specific distributions admit data augmentation schemes, within the exponential family. We then illustrate the use of such a theorem in the cases of mixtures of beta distributions, gamma distributions, fully visible Boltzmann machines and Student distributions. Via numerical simulations, we provide empirical evidence towards the consistency of the online EM algorithm parameter estimates in such cases.
期望最大化(EM)算法框架是统计计算的重要工具。由于数据性质的变化,EM和类EM算法的在线和小批量变体变得越来越流行。由这些EM变量产生的估计序列的一致性通常依赖于关于参数更新函数的连续可微性的假设。在许多情况下,参数更新函数不是封闭形式,只能隐式定义,这使得连续可微性的验证变得困难。我们演示了如何使用全局隐函数定理来验证指数族分布的有限混合情况下的这些性质,更一般地说,当成分特定分布允许数据增强方案时,在指数族内。然后,我们说明了在β分布、γ分布、完全可见玻尔兹曼机和学生分布的混合情况下使用这个定理。通过数值模拟,我们为在线EM算法参数估计在这种情况下的一致性提供了经验证据。
{"title":"Global implicit function theorems and the online expectation–maximisation algorithm","authors":"Hien Duy Nguyen,&nbsp;Florence Forbes","doi":"10.1111/anzs.12356","DOIUrl":"10.1111/anzs.12356","url":null,"abstract":"The expectation–maximisation (EM) algorithm framework is an important tool for statistical computation. Due to the changing nature of data, online and mini‐batch variants of EM and EM‐like algorithms have become increasingly popular. The consistency of the estimator sequences that are produced by these EM variants often rely on an assumption regarding the continuous differentiability of a parameter update function. In many cases, the parameter update function is not in closed form and may only be defined implicitly, which makes the verification of the continuous differentiability property difficult. We demonstrate how a global implicit function theorem can be used to verify such properties in the cases of finite mixtures of distributions in the exponential family, and more generally, when the component‐specific distributions admit data augmentation schemes, within the exponential family. We then illustrate the use of such a theorem in the cases of mixtures of beta distributions, gamma distributions, fully visible Boltzmann machines and Student distributions. Via numerical simulations, we provide empirical evidence towards the consistency of the online EM algorithm parameter estimates in such cases.","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"64 2","pages":"255-281"},"PeriodicalIF":1.1,"publicationDate":"2022-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83582209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Sufficient dimension reduction for clustered data via finite mixture modelling 通过有限混合模型对聚类数据进行足够的降维
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-22 DOI: 10.1111/anzs.12349
F.K.C. Hui, L.H. Nghiem

Sufficient dimension reduction (SDR) is an attractive approach to regression modelling. However, despite its rich literature and growing popularity in application, surprisingly little research has been done on how to perform SDR for clustered data, for example as is commonly arises in longitudinal studies. Indeed, current popular SDR methods have been mostly based on a marginal estimating equation approach. In this article, we propose a new approach to SDR for clustered data based on a combination of finite mixture modelling and mixed effects regression. Finite mixture models offer a flexible means of estimating the fixed effects central subspace, based on slicing the space up and probabilistically clustering observations to each slice (mixture component). Dimension reduction is achieved by having the mixing proportions vary only through the sufficient fixed effect predictors. We then incorporate random effects as a natural means of accounting for correlations within clusters. We employ a Monte Carlo expectation–maximisation algorithm to estimate the model parameters and fixed effects central subspace, and discuss methods for associated uncertainty quantification and prediction. Simulation studies demonstrate that our approach performs strongly against both estimating equation methods for estimating the fixed effects central subspace, and SDR methods which do not account for within-cluster correlation. Finally, we apply the proposed approach to a data set on air pollutant monitoring across 13 stations in the Eastern United States.

充分降维(SDR)是一种有吸引力的回归建模方法。然而,尽管其文献丰富,应用日益普及,但令人惊讶的是,关于如何对聚类数据执行SDR的研究却很少,例如在纵向研究中常见的研究。事实上,目前流行的SDR方法大多基于边际估计方程方法。在本文中,我们提出了一种基于有限混合建模和混合效应回归相结合的聚类数据SDR新方法。有限混合模型提供了一种灵活的方法来估计固定效应的中心子空间,基于对空间的分割和对每个切片(混合分量)的概率聚类观察。只有通过足够的固定效应预测因子,混合比例才会发生变化,从而实现降维。然后,我们将随机效应作为计算集群内相关性的自然手段。我们采用蒙特卡罗期望最大化算法来估计模型参数和固定效应中心子空间,并讨论了相关的不确定性量化和预测方法。仿真研究表明,我们的方法对估计固定效应中心子空间的估计方程方法和不考虑簇内相关性的SDR方法都有很强的性能。最后,我们将提出的方法应用于美国东部13个站点的空气污染物监测数据集。
{"title":"Sufficient dimension reduction for clustered data via finite mixture modelling","authors":"F.K.C. Hui,&nbsp;L.H. Nghiem","doi":"10.1111/anzs.12349","DOIUrl":"10.1111/anzs.12349","url":null,"abstract":"<div>\u0000 \u0000 <p>Sufficient dimension reduction (SDR) is an attractive approach to regression modelling. However, despite its rich literature and growing popularity in application, surprisingly little research has been done on how to perform SDR for clustered data, for example as is commonly arises in longitudinal studies. Indeed, current popular SDR methods have been mostly based on a marginal estimating equation approach. In this article, we propose a new approach to SDR for clustered data based on a combination of finite mixture modelling and mixed effects regression. Finite mixture models offer a flexible means of estimating the fixed effects central subspace, based on slicing the space up and probabilistically clustering observations to each slice (mixture component). Dimension reduction is achieved by having the mixing proportions vary only through the sufficient fixed effect predictors. We then incorporate random effects as a natural means of accounting for correlations within clusters. We employ a Monte Carlo expectation–maximisation algorithm to estimate the model parameters and fixed effects central subspace, and discuss methods for associated uncertainty quantification and prediction. Simulation studies demonstrate that our approach performs strongly against both estimating equation methods for estimating the fixed effects central subspace, and SDR methods which do not account for within-cluster correlation. Finally, we apply the proposed approach to a data set on air pollutant monitoring across 13 stations in the Eastern United States.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"64 2","pages":"133-157"},"PeriodicalIF":1.1,"publicationDate":"2022-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73971724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Bayesian credible intervals for population attributable risk from case–control, cohort and cross-sectional studies 来自病例对照、队列和横断面研究的人群归因风险的贝叶斯可信区间
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-17 DOI: 10.1111/anzs.12352
Sarah Pirikahu, Geoffrey Jones, Martin L. Hazelton

Population attributable risk (PAR) and population attributable fraction (PAF) are used in epidemiology to predict the impact of removing a risk factor from the population. Until recently, no standard approach for calculating confidence intervals or the variance for PAR in particular was available in the literature. Previously we outlined a fully Bayesian approach to provide credible intervals for the PAR and PAF from a cross-sectional study, where the data was presented in the form of a 2×2 table. However, extensions to cater for other frequently used study designs were not provided. In this paper we provide methodology to calculate credible intervals for the PAR and PAF for case–control and cohort studies. Additionally, we extend the cross-sectional example to allow for the incorporation of uncertainty that arises when an imperfect diagnostic test is used. In all these situations the model becomes over-parameterised, or non-identifiable, which can result in standard ‘off-the-shelf’ Markov Chain Monte Carlo (MCMC) updaters taking a long time to converge or even failing altogether. We adapt an importance sampling methodology to overcome this problem, and propose some novel MCMC samplers that take into consideration the shape of the posterior ridge to aid in the convergence of the Markov chain.

人口归因风险(PAR)和人口归因分数(PAF)在流行病学中用于预测从人群中去除危险因素的影响。直到最近,在文献中还没有计算置信区间或PAR方差的标准方法。之前,我们概述了一种完全贝叶斯方法,从横断面研究中为PAR和PAF提供可信的区间,其中数据以2×2表的形式呈现。但是,没有提供扩展以满足其他常用的研究设计。在本文中,我们提供了计算病例对照和队列研究的PAR和PAF可信区间的方法。此外,我们扩展了横断面的例子,以允许合并不确定性,当一个不完善的诊断测试是使用。在所有这些情况下,模型变得过度参数化或不可识别,这可能导致标准的“现成”马尔可夫链蒙特卡罗(MCMC)更新需要很长时间才能收敛,甚至完全失败。为了克服这一问题,我们采用了一种重要采样方法,并提出了一些考虑后脊形状的新型MCMC采样器,以帮助马尔可夫链收敛。
{"title":"Bayesian credible intervals for population attributable risk from case–control, cohort and cross-sectional studies","authors":"Sarah Pirikahu,&nbsp;Geoffrey Jones,&nbsp;Martin L. Hazelton","doi":"10.1111/anzs.12352","DOIUrl":"10.1111/anzs.12352","url":null,"abstract":"<div>\u0000 \u0000 <p>Population attributable risk (PAR) and population attributable fraction (PAF) are used in epidemiology to predict the impact of removing a risk factor from the population. Until recently, no standard approach for calculating confidence intervals or the variance for PAR in particular was available in the literature. Previously we outlined a fully Bayesian approach to provide credible intervals for the PAR and PAF from a cross-sectional study, where the data was presented in the form of a 2×2 table. However, extensions to cater for other frequently used study designs were not provided. In this paper we provide methodology to calculate credible intervals for the PAR and PAF for case–control and cohort studies. Additionally, we extend the cross-sectional example to allow for the incorporation of uncertainty that arises when an imperfect diagnostic test is used. In all these situations the model becomes over-parameterised, or non-identifiable, which can result in standard ‘off-the-shelf’ Markov Chain Monte Carlo (MCMC) updaters taking a long time to converge or even failing altogether. We adapt an importance sampling methodology to overcome this problem, and propose some novel MCMC samplers that take into consideration the shape of the posterior ridge to aid in the convergence of the Markov chain.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"63 4","pages":"639-657"},"PeriodicalIF":1.1,"publicationDate":"2022-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79829223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measuring the values of cricket players 衡量板球运动员的价值
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-15 DOI: 10.1111/anzs.12353
Pranjal Chandrakar, Shubhabrata Das

Sports franchises that participate in team sports can make better decisions regarding their players’ financial compensation, renewal of the contracts, bidding strategies during the auction, etc., if they can adequately assess the value or worth of their players. Evaluating the value of a player in a team sport is difficult because various team members play different roles. In this study, we resolve this by measuring the value of a player in terms of how his inclusion in the team affects the team's probability of winning. With this notion of value, we develop a technique to measure the worth of a cricket player for his franchise. To illustrate this technique, we evaluate the values of cricket players who play in the Indian Premier League. We also study the relationship between players’ values and their salaries. We find that a few popular players earn disproportionately more than others. This disproportionality in the income of popular players cannot be justified by their performance alone, as adjudged by their values in this work. We attribute the disproportionality in the income to the factors not captured via conventional yardsticks, including leadership or brand value.

如果能够充分评估球员的价值或价值,参与团队运动的体育特许经营机构可以在球员的经济补偿、合同续约、拍卖期间的竞标策略等方面做出更好的决策。评估团队运动中球员的价值是困难的,因为不同的团队成员扮演不同的角色。在本研究中,我们通过衡量球员的价值来解决这个问题,即球员加入球队对球队获胜概率的影响。有了这个价值的概念,我们开发了一种技术来衡量一个板球运动员对他的球队的价值。为了说明这一技术,我们评估了在印度超级联赛中打球的板球运动员的价值观。我们还研究了球员的价值和薪水之间的关系。我们发现一些受欢迎的球员比其他人赚得多得不成比例。受欢迎球员收入的不均衡不能仅仅通过他们的表现来证明,正如他们在这项工作中的价值观所判断的那样。我们将收入的不均衡归因于传统标准无法捕捉到的因素,包括领导力或品牌价值。
{"title":"Measuring the values of cricket players","authors":"Pranjal Chandrakar,&nbsp;Shubhabrata Das","doi":"10.1111/anzs.12353","DOIUrl":"10.1111/anzs.12353","url":null,"abstract":"<div>\u0000 \u0000 <p>Sports franchises that participate in team sports can make better decisions regarding their players’ financial compensation, renewal of the contracts, bidding strategies during the auction, etc., if they can adequately assess the value or worth of their players. Evaluating the value of a player in a team sport is difficult because various team members play different roles. In this study, we resolve this by measuring the value of a player in terms of how his inclusion in the team affects the team's probability of winning. With this notion of value, we develop a technique to measure the worth of a cricket player for his franchise. To illustrate this technique, we evaluate the values of cricket players who play in the Indian Premier League. We also study the relationship between players’ values and their salaries. We find that a few popular players earn disproportionately more than others. This disproportionality in the income of popular players cannot be justified by their performance alone, as adjudged by their values in this work. We attribute the disproportionality in the income to the factors not captured via conventional yardsticks, including leadership or brand value.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"63 4","pages":"565-578"},"PeriodicalIF":1.1,"publicationDate":"2022-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74128017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection boundary for a sparse gamma scale mixture model 稀疏伽玛尺度混合模型的检测边界
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-11 DOI: 10.1111/anzs.12347
Michael I. Stewart

We derive the detection boundary for the one-sided version of the gamma scale mixture model where the contaminating component has a larger mean than the known reference distribution. We also derive an adaptive test which is able to almost uniformly attain the best possible performance in terms of detection of local alternatives.

我们导出了单侧版本的伽马尺度混合模型的检测边界,其中污染成分的平均值大于已知的参考分布。我们还推导了一种自适应测试,该测试能够在检测局部替代方案方面几乎一致地获得最佳性能。
{"title":"Detection boundary for a sparse gamma scale mixture model","authors":"Michael I. Stewart","doi":"10.1111/anzs.12347","DOIUrl":"10.1111/anzs.12347","url":null,"abstract":"<div>\u0000 \u0000 <p>We derive the detection boundary for the one-sided version of the gamma scale mixture model where the contaminating component has a larger mean than the known reference distribution. We also derive an adaptive test which is able to almost uniformly attain the best possible performance in terms of detection of local alternatives.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"64 2","pages":"282-296"},"PeriodicalIF":1.1,"publicationDate":"2022-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77949170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Odds-symmetry model for cumulative probabilities and decomposition of a conditional symmetry model in square contingency tables 平方列联表中累积概率的奇数-对称模型及条件对称模型的分解
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2021-12-06 DOI: 10.1111/anzs.12346
Shuji Ando

For the analysis of square contingency tables, it is necessary to estimate an unknown distribution with high confidence from an obtained observation. For that purpose, we need to introduce a statistical model that fits the data well and has parsimony. This study proposes asymmetry models based on cumulative probabilities for square contingency tables with the same row and column ordinal classifications. In the proposed models, the odds, for all i<j, that an observation will fall in row category i or below, and column category j or above, instead of row category j or above, and column category i or below, depend on only row category i or column category j. This is notwithstanding that the odds are constant without relying on row and column categories under the conditional symmetry (CS) model. The proposed models constantly hold when the CS model holds. However, the converse is not necessarily true. This study also shows that it is necessary to satisfy the extended marginal homogeneity model, in addition to the proposed models, to satisfy the CS model. These decomposition theorems explain why the CS model does not hold. The proposed models provide a better fit for application to a single data set of real-world occupational data for father-and-son dyads.

对于平方列联表的分析,需要从已获得的观测值中估计出具有高置信度的未知分布。为此,我们需要引入一种能够很好地拟合数据并具有简约性的统计模型。本研究提出了基于累积概率的方形列联表的不对称模型,具有相同的行和列顺序分类。在提出的模型中,对于所有i<j,一个观测值将落在第i行类别或以下,列类别j或以上,而不是行类别j或以上,列类别i或以下的几率,仅取决于行类别i或列类别j。尽管在条件对称(CS)模型下,几率是恒定的,不依赖于行和列类别。当CS模型成立时,所提出的模型一直成立。然而,反过来未必正确。研究还表明,在满足CS模型的基础上,还需要满足扩展边际均匀性模型。这些分解定理解释了为什么CS模型不成立。所提出的模型更适合应用于父子二人组真实职业数据的单一数据集。
{"title":"Odds-symmetry model for cumulative probabilities and decomposition of a conditional symmetry model in square contingency tables","authors":"Shuji Ando","doi":"10.1111/anzs.12346","DOIUrl":"10.1111/anzs.12346","url":null,"abstract":"<div>\u0000 \u0000 <p>For the analysis of square contingency tables, it is necessary to estimate an unknown distribution with high confidence from an obtained observation. For that purpose, we need to introduce a statistical model that fits the data well and has parsimony. This study proposes asymmetry models based on cumulative probabilities for square contingency tables with the same row and column ordinal classifications. In the proposed models, the odds, for all <i>i</i>&lt;<i>j</i>, that an observation will fall in row category <i>i</i> or below, and column category <i>j</i> or above, instead of row category <i>j</i> or above, and column category <i>i</i> or below, depend on only row category <i>i</i> or column category <i>j</i>. This is notwithstanding that the odds are constant without relying on row and column categories under the conditional symmetry (CS) model. The proposed models constantly hold when the CS model holds. However, the converse is not necessarily true. This study also shows that it is necessary to satisfy the extended marginal homogeneity model, in addition to the proposed models, to satisfy the CS model. These decomposition theorems explain why the CS model does not hold. The proposed models provide a better fit for application to a single data set of real-world occupational data for father-and-son dyads.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"63 4","pages":"674-684"},"PeriodicalIF":1.1,"publicationDate":"2021-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77244237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Australian & New Zealand Journal of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1