首页 > 最新文献

Australian & New Zealand Journal of Statistics最新文献

英文 中文
A class of kth-order dependence-driven random coefficient mixed thinning integer-valued autoregressive process to analyse epileptic seizure data and COVID-19 data 一类用于分析癫痫发作数据和 COVID-19 数据的 kth 阶依赖性驱动随机系数混合稀疏整数值自回归过程
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2024-04-08 DOI: 10.1111/anzs.12411
Xiufang Liu, Dehui Wang, Huaping Chen, Lifang Zhao, Liang Liu

Data related to the counting of elements of variable character are frequently encountered in time series studies. This paper brings forward a new class of k$$ k $$th-order dependence-driven random coefficient mixed thinning integer-valued autoregressive time series model (DDRCMTINAR(k$$ k $$)) to deal with such data. Stationarity and ergodicity properties of the proposed model are derived in detail. The unknown parameters are estimated by conditional least squares, and modified quasi-likelihood and asymptotic normality of the obtained parameter estimators is established. The performances of the adopted estimate methods are checked via simulations, which present that modified quasi-likelihood estimators perform better than the conditional least squares considering the proportion of within-Ω$$ Omega $$ estimates in certain regions of the parameter space. The validity and practical utility of the model are investigated by epileptic seizure data and COVID-19 data of suspected cases in China.

摘要 在时间序列研究中经常会遇到与变量元素计数有关的数据。本文提出了一类新的三阶依赖驱动随机系数混合稀疏整数值自回归时间序列模型(DDRCMTINAR())来处理这类数据。详细推导了所提模型的平稳性和遍历性。用条件最小二乘法估计未知参数,并建立了修正准似然法和所获参数估计值的渐近正态性。通过模拟检验了所采用的估计方法的性能,结果表明,考虑到参数空间某些区域内估计值的比例,修正的准似然估计值的性能优于条件最小二乘法。该模型的有效性和实用性通过中国癫痫发作数据和 COVID-19 疑似病例数据进行了研究。
{"title":"A class of kth-order dependence-driven random coefficient mixed thinning integer-valued autoregressive process to analyse epileptic seizure data and COVID-19 data","authors":"Xiufang Liu,&nbsp;Dehui Wang,&nbsp;Huaping Chen,&nbsp;Lifang Zhao,&nbsp;Liang Liu","doi":"10.1111/anzs.12411","DOIUrl":"10.1111/anzs.12411","url":null,"abstract":"<div>\u0000 \u0000 <p>Data related to the counting of elements of variable character are frequently encountered in time series studies. This paper brings forward a new class of <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>k</mi>\u0000 </mrow>\u0000 <annotation>$$ k $$</annotation>\u0000 </semantics></math>th-order dependence-driven random coefficient mixed thinning integer-valued autoregressive time series model (DDRCMTINAR(<span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>k</mi>\u0000 </mrow>\u0000 <annotation>$$ k $$</annotation>\u0000 </semantics></math>)) to deal with such data. Stationarity and ergodicity properties of the proposed model are derived in detail. The unknown parameters are estimated by conditional least squares, and modified quasi-likelihood and asymptotic normality of the obtained parameter estimators is established. The performances of the adopted estimate methods are checked via simulations, which present that modified quasi-likelihood estimators perform better than the conditional least squares considering the proportion of within-<span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>Ω</mi>\u0000 </mrow>\u0000 <annotation>$$ Omega $$</annotation>\u0000 </semantics></math> estimates in certain regions of the parameter space. The validity and practical utility of the model are investigated by epileptic seizure data and COVID-19 data of suspected cases in China.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140599831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian hypothesis tests with diffuse priors: Can we have our cake and eat it too? 具有扩散先验的贝叶斯假设检验:我们能既吃蛋糕又吃蛋糕吗?
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2024-03-19 DOI: 10.1111/anzs.12410
J. T. Ormerod, M. Stewart, W. Yu, S. E. Romanes

We propose a new class of priors for Bayesian hypothesis testing, which we name ‘cake priors’. These priors circumvent the Jeffreys–Lindley paradox (also called Bartlett's paradox) a problem associated with the use of diffuse priors leading to nonsensical statistical inferences. Cake priors allow the use of diffuse priors (having one's cake) while achieving theoretically justified inferences (eating it too). We demonstrate this methodology for Bayesian hypotheses tests for various common scenarios. The resulting Bayesian test statistic takes the form of a penalised likelihood ratio test statistic. Under typical regularity conditions, we show that Bayesian hypothesis tests based on cake priors are Chernoff consistent, that is, achieve zero type I and II error probabilities asymptotically. We also discuss Lindley's paradox and argue that the paradox occurs with small and vanishing probability as sample size increases.

我们提出了一类新的贝叶斯假设检验先验,并将其命名为 "蛋糕先验"。这些先验值规避了杰弗里斯-林德利悖论(又称巴特利悖论),这是一个与使用扩散先验值导致不合理统计推断有关的问题。蛋糕先验允许使用扩散先验(拥有自己的蛋糕),同时实现理论上合理的推论(吃蛋糕)。我们针对各种常见情况的贝叶斯假设检验演示了这种方法。由此得出的贝叶斯检验统计量采用了惩罚似然比检验统计量的形式。在典型的正则条件下,我们证明了基于饼先验的贝叶斯假设检验是切尔诺夫一致的,即渐进地达到零I型和II型误差概率。我们还讨论了林德利悖论,并论证了随着样本量的增加,该悖论出现的概率很小,甚至消失。
{"title":"Bayesian hypothesis tests with diffuse priors: Can we have our cake and eat it too?","authors":"J. T. Ormerod,&nbsp;M. Stewart,&nbsp;W. Yu,&nbsp;S. E. Romanes","doi":"10.1111/anzs.12410","DOIUrl":"10.1111/anzs.12410","url":null,"abstract":"<p>We propose a new class of priors for Bayesian hypothesis testing, which we name ‘cake priors’. These priors circumvent the Jeffreys–Lindley paradox (also called Bartlett's paradox) a problem associated with the use of diffuse priors leading to nonsensical statistical inferences. Cake priors allow the use of diffuse priors (having one's cake) while achieving theoretically justified inferences (eating it too). We demonstrate this methodology for Bayesian hypotheses tests for various common scenarios. The resulting Bayesian test statistic takes the form of a penalised likelihood ratio test statistic. Under typical regularity conditions, we show that Bayesian hypothesis tests based on cake priors are Chernoff consistent, that is, achieve zero type I and II error probabilities asymptotically. We also discuss Lindley's paradox and argue that the paradox occurs with small and vanishing probability as sample size increases.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2024-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12410","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140182403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Asymptotics for the conditional self-weighted M $$ M $$ estimator of GRCA( p $$ p $$ ) models and its statistical inference GRCA(p$$ p$$) 模型的条件自加权 M$$ M$$ 估计器的渐近性及其统计推论
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2024-02-21 DOI: 10.1111/anzs.12408
Chi Yao, Wei Yu, Xuejun Wang

Under the p$$ p $$-order generalised random coefficient autoregressive (GRCA(p$$ p $$)) model with random coefficients Φt,$$ {boldsymbol{Phi}}_t, $$ we propose a conditional self-weighted M$$ M $$ estimator of EΦt$$ mathrm{E}{boldsymbol{Phi}}_t $$. We investigate the asymptotic normality of this estimator with possibly heavy-tailed random variables. Furthermore, a Wald test statistic is constructed for the linear restriction on the parameters. In addition, the simulation experiments are carried out to assess the finite sample performance of theoretical results. Finally, a real data analysis about the increase (%) in the number of construction projects this year over the same period of last year is provided.

摘要在具有随机系数的-阶广义随机系数自回归(GRCA())模型下,我们提出了一个条件自加权估计器。 我们研究了该估计器在可能存在重尾随机变量的情况下的渐近正态性。此外,我们还构建了参数线性限制的 Wald 检验统计量。此外,我们还进行了模拟实验,以评估理论结果的有限样本性能。最后,提供了有关今年建筑项目数量比去年同期增长(%)的真实数据分析。
{"title":"Asymptotics for the conditional self-weighted \u0000 \u0000 \u0000 M\u0000 \u0000 $$ M $$\u0000 estimator of GRCA(\u0000 \u0000 \u0000 p\u0000 \u0000 $$ p $$\u0000 ) models and its statistical inference","authors":"Chi Yao,&nbsp;Wei Yu,&nbsp;Xuejun Wang","doi":"10.1111/anzs.12408","DOIUrl":"10.1111/anzs.12408","url":null,"abstract":"<div>\u0000 \u0000 <p>Under the <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow>\u0000 <annotation>$$ p $$</annotation>\u0000 </semantics></math>-order generalised random coefficient autoregressive (GRCA(<math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow>\u0000 <annotation>$$ p $$</annotation>\u0000 </semantics></math>)) model with random coefficients <math>\u0000 <semantics>\u0000 <mrow>\u0000 <msub>\u0000 <mrow>\u0000 <mi>Φ</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>t</mi>\u0000 </mrow>\u0000 </msub>\u0000 <mo>,</mo>\u0000 </mrow>\u0000 <annotation>$$ {boldsymbol{Phi}}_t, $$</annotation>\u0000 </semantics></math> we propose a conditional self-weighted <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>M</mi>\u0000 </mrow>\u0000 <annotation>$$ M $$</annotation>\u0000 </semantics></math> estimator of <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>E</mi>\u0000 <msub>\u0000 <mrow>\u0000 <mi>Φ</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>t</mi>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 <annotation>$$ mathrm{E}{boldsymbol{Phi}}_t $$</annotation>\u0000 </semantics></math>. We investigate the asymptotic normality of this estimator with possibly heavy-tailed random variables. Furthermore, a Wald test statistic is constructed for the linear restriction on the parameters. In addition, the simulation experiments are carried out to assess the finite sample performance of theoretical results. Finally, a real data analysis about the increase (%) in the number of construction projects this year over the same period of last year is provided.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2024-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139954304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unified robust estimation 统一稳健估算
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2024-02-20 DOI: 10.1111/anzs.12409
Zhu Wang

Robust estimation is primarily concerned with providing reliable parameter estimates in the presence of outliers. Numerous robust loss functions have been proposed in regression and classification, along with various computing algorithms. In modern penalised generalised linear models (GLMs), however, there is limited research on robust estimation that can provide weights to determine the outlier status of the observations. This article proposes a unified framework based on a large family of loss functions, a composite of concave and convex functions (CC-family). Properties of the CC-family are investigated, and CC-estimation is innovatively conducted via the iteratively reweighted convex optimisation (IRCO), which is a generalisation of the iteratively reweighted least squares in robust linear regression. For robust GLM, the IRCO becomes the iteratively reweighted GLM. The unified framework contains penalised estimation and robust support vector machine (SVM) and is demonstrated with a variety of data applications.

摘要稳健估计主要涉及在存在异常值的情况下提供可靠的参数估计。在回归和分类中提出了许多稳健损失函数以及各种计算算法。然而,在现代惩罚性广义线性模型(GLM)中,能提供权重以确定观测值离群状态的稳健估计研究还很有限。本文提出了一个基于损失函数大家族的统一框架,即凹函数和凸函数的复合体(CC-family)。本文研究了 CC 系列的特性,并通过迭代加权凸优化(IRCO)创新性地进行了 CC 估计,IRCO 是稳健线性回归中迭代加权最小二乘法的概括。对于稳健 GLM,IRCO 成为迭代重权 GLM。该统一框架包含惩罚估计和稳健支持向量机(SVM),并通过各种数据应用进行了演示。
{"title":"Unified robust estimation","authors":"Zhu Wang","doi":"10.1111/anzs.12409","DOIUrl":"10.1111/anzs.12409","url":null,"abstract":"<div>\u0000 \u0000 <p>Robust estimation is primarily concerned with providing reliable parameter estimates in the presence of outliers. Numerous robust loss functions have been proposed in regression and classification, along with various computing algorithms. In modern penalised generalised linear models (GLMs), however, there is limited research on robust estimation that can provide weights to determine the outlier status of the observations. This article proposes a unified framework based on a large family of loss functions, a composite of concave and convex functions (CC-family). Properties of the CC-family are investigated, and CC-estimation is innovatively conducted via the iteratively reweighted convex optimisation (IRCO), which is a generalisation of the iteratively reweighted least squares in robust linear regression. For robust GLM, the IRCO becomes the iteratively reweighted GLM. The unified framework contains penalised estimation and robust support vector machine (SVM) and is demonstrated with a variety of data applications.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2024-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139953816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Latent heterogeneity in COVID-19 hospitalisations: a cluster-weighted approach to analyse mortality COVID-19 住院病例的潜在异质性:采用聚类加权法分析死亡率
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2024-02-13 DOI: 10.1111/anzs.12407
Paolo Berta, Salvatore Ingrassia, Giorgio Vittadini, Daniele Spinelli

The COVID-19 pandemic caused an unprecedented excess mortality. Since 2020, many studies have focussed on the characteristics of COVID-19 patients who did not survive. From the statistical point of view, what seems to dominate is the large heterogeneity of the populations affected by COVID-19 and the extreme difficulty in identifying subpopulations who died affected by a plurality of contemporary characteristics. In this paper, we propose an extremely flexible approach based on a cluster-weighted model, which allows us to identify latent groups of patients sharing similar characteristics at the moment of hospitalisation as well as a similar mortality. We focus on one of the hardest hit areas in Italy and study the heterogeneity in the population of patients affected by COVID-19 using administrative data on hospitalisations in the first wave of the pandemic. Results highlighted that a model-based clustering approach is essential to understand the complexity of the COVID-19 patients treated by hospitals and who die during hospitalisation.

COVID-19 大流行造成了前所未有的超额死亡率。自 2020 年以来,许多研究重点关注 COVID-19 未存活患者的特征。从统计学的角度来看,受 COVID-19 影响的人群具有很大的异质性,要识别受多种当代特征影响而死亡的亚人群极其困难。在本文中,我们提出了一种基于聚类加权模型的极为灵活的方法,该方法允许我们识别在住院时具有相似特征以及相似死亡率的潜在患者群体。我们将重点放在意大利的重灾区之一,并利用大流行第一波住院治疗的行政数据研究了受 COVID-19 影响的患者群体的异质性。研究结果表明,基于模型的聚类方法对于了解接受医院治疗并在住院期间死亡的 COVID-19 患者的复杂性至关重要。
{"title":"Latent heterogeneity in COVID-19 hospitalisations: a cluster-weighted approach to analyse mortality","authors":"Paolo Berta,&nbsp;Salvatore Ingrassia,&nbsp;Giorgio Vittadini,&nbsp;Daniele Spinelli","doi":"10.1111/anzs.12407","DOIUrl":"10.1111/anzs.12407","url":null,"abstract":"<p>The COVID-19 pandemic caused an unprecedented excess mortality. Since 2020, many studies have focussed on the characteristics of COVID-19 patients who did not survive. From the statistical point of view, what seems to dominate is the large heterogeneity of the populations affected by COVID-19 and the extreme difficulty in identifying subpopulations who died affected by a plurality of contemporary characteristics. In this paper, we propose an extremely flexible approach based on a cluster-weighted model, which allows us to identify latent groups of patients sharing similar characteristics at the moment of hospitalisation as well as a similar mortality. We focus on one of the hardest hit areas in Italy and study the heterogeneity in the population of patients affected by COVID-19 using administrative data on hospitalisations in the first wave of the pandemic. Results highlighted that a model-based clustering approach is essential to understand the complexity of the COVID-19 patients treated by hospitals and who die during hospitalisation.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12407","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139772597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel response model and target selection method with applications to marketing 应用于市场营销的新型响应模型和目标选择方法
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2024-01-18 DOI: 10.1111/anzs.12406
Y. Cai

Response models used in marketing are not always constructed for later marketing optimisation, which often results in unsatisfactory results in target selection for future marketing activities. To solve this problem, we develop a new binary response model and a new marketing target selection method. The proposed model can predict multiple propensity scores per customer through customer-specific propensity score distributions, which is not possible with existing response models, filling a gap in the literature. The target selection method can determine the best propensity scores from those predicted by the proposed model and use them to select customers for further marketing activities. Our simulation results and application to real marketing data confirm that the performance of the proposed model in target selection is significantly better than that of the existing models, including some popular machine learning methods, which indicate that our method can be very useful in practice.

市场营销中使用的响应模型并不总是为以后的市场营销优化而构建的,这往往会导致未来市场营销活动的目标选择结果不尽如人意。为了解决这个问题,我们开发了一种新的二元响应模型和一种新的营销目标选择方法。所提出的模型可以通过特定客户的倾向得分分布来预测每个客户的多个倾向得分,这是现有响应模型所无法实现的,填补了文献空白。目标选择方法可从所提模型预测的倾向得分中确定最佳倾向得分,并利用这些倾向得分选择客户开展进一步营销活动。我们的仿真结果和对真实营销数据的应用证实,建议模型在目标选择方面的性能明显优于现有模型,包括一些流行的机器学习方法,这表明我们的方法在实践中非常有用。
{"title":"A novel response model and target selection method with applications to marketing","authors":"Y. Cai","doi":"10.1111/anzs.12406","DOIUrl":"10.1111/anzs.12406","url":null,"abstract":"<p>Response models used in marketing are not always constructed for later marketing optimisation, which often results in unsatisfactory results in target selection for future marketing activities. To solve this problem, we develop a new binary response model and a new marketing target selection method. The proposed model can predict multiple propensity scores per customer through customer-specific propensity score distributions, which is not possible with existing response models, filling a gap in the literature. The target selection method can determine the best propensity scores from those predicted by the proposed model and use them to select customers for further marketing activities. Our simulation results and application to real marketing data confirm that the performance of the proposed model in target selection is significantly better than that of the existing models, including some popular machine learning methods, which indicate that our method can be very useful in practice.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12406","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139515508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying changes in the distribution of income from higher-order moments with an application to Australia 从高阶矩确定收入分配的变化并应用于澳大利亚
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2024-01-17 DOI: 10.1111/anzs.12405
Vance L. Martin, Jialu Shi, Yang Song, Wenying Yao

Changes in the distribution of income over time are identified based on an adjusted two-sample version of the Neyman smooth test by using subsampling methods to approximate the sampling distribution of the test statistic when samples are not independent of each other. A range of Monte Carlo experiments show that the approach corrects for size distortions arising from dependent samples as well as generating monotonic power functions. Applying the approach to studying the distribution of income in Australia over the business cycle and the Global Financial Crisis, the empirical results highlight the importance of higher-order moments and demonstrate that business cycles are not all alike as the relative strengths of higher-order moments vary over phases of the cycle.

在样本互不独立的情况下,使用子抽样方法近似检验统计量的抽样分布,根据调整后的奈曼平滑检验的双样本版本,确定收入分布随时间的变化。一系列蒙特卡罗实验表明,该方法可以纠正因依赖样本而产生的大小失真,并生成单调的幂函数。应用该方法研究澳大利亚在商业周期和全球金融危机期间的收入分配情况,实证结果突出了高阶矩的重要性,并表明商业周期并不都是一样的,因为高阶矩的相对强度随周期的不同阶段而变化。
{"title":"Identifying changes in the distribution of income from higher-order moments with an application to Australia","authors":"Vance L. Martin,&nbsp;Jialu Shi,&nbsp;Yang Song,&nbsp;Wenying Yao","doi":"10.1111/anzs.12405","DOIUrl":"10.1111/anzs.12405","url":null,"abstract":"<p>Changes in the distribution of income over time are identified based on an adjusted two-sample version of the Neyman smooth test by using subsampling methods to approximate the sampling distribution of the test statistic when samples are not independent of each other. A range of Monte Carlo experiments show that the approach corrects for size distortions arising from dependent samples as well as generating monotonic power functions. Applying the approach to studying the distribution of income in Australia over the business cycle and the Global Financial Crisis, the empirical results highlight the importance of higher-order moments and demonstrate that business cycles are not all alike as the relative strengths of higher-order moments vary over phases of the cycle.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12405","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139515413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exact testing for heteroscedasticity in a two-way layout in variety frost trials when incorporating a covariate 在品种霜冻试验的双向布局中,在纳入协变量时对异方差进行精确测试
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2024-01-01 DOI: 10.1111/anzs.12404
Angelika A. Pilkington, Brenton R. Clarke, Dean A. Diepeveen
Two-way layouts are common in grain industry research where it is often the case that there are one or more covariates. It is widely recognised that when estimating fixed effect parameters, one should also examine for possible extra error variance structure. An exact test for heteroscedasticity, when there is a covariate, is illustrated for a data set from frost trials in Western Australia. While the general algebra for the test is known, albeit in past literature, there are computational aspects of implementing the test for the two way when there are covariates. In this scenario the test is shown to have greater power than the industry standard, and because of its exact size, is preferable to use of the restricted maximum likelihood ratio test (REMLRT) based on the approximate asymptotic distribution in this instance. Formulation of the exact test considered here involves creation of appropriate contrasts in the experimental design. This is illustrated using specific choices of observations corresponding to an index set in the linear model for the two-way layout. Also an algorithm supplied complements the test. Comparisons of size and power then ensue. The test has natural extensions when there are unbalanced data, and more than one covariate may be present. Results can be extended to Balanced Incomplete Block Designs.
双向布局在谷物产业研究中很常见,因为谷物产业研究通常存在一个或多个协变量。人们普遍认为,在估计固定效应参数时,还应检查可能存在的额外误差方差结构。本文以西澳大利亚霜冻试验的数据集为例,说明了在存在协变量的情况下对异方差进行精确检验的方法。尽管该检验的一般代数在过去的文献中已为人所知,但当存在协变量时,对双向检验的实施还涉及计算问题。在这种情况下,检验结果表明比行业标准具有更大的功率,而且由于其精确的规模,在这种情况下比使用基于近似渐近分布的限制性最大似然比检验(REMLRT)更为可取。本文所考虑的精确检验方法包括在实验设计中建立适当的对比。我们将使用与双向布局线性模型中的指标集相对应的观测数据的具体选择来说明这一点。此外,还提供了一种对检验进行补充的算法。然后对规模和功率进行比较。当存在不平衡数据,并且可能存在一个以上的协变量时,该检验具有自然的扩展性。结果可扩展到平衡不完全区组设计。
{"title":"Exact testing for heteroscedasticity in a two-way layout in variety frost trials when incorporating a covariate","authors":"Angelika A. Pilkington, Brenton R. Clarke, Dean A. Diepeveen","doi":"10.1111/anzs.12404","DOIUrl":"https://doi.org/10.1111/anzs.12404","url":null,"abstract":"Two-way layouts are common in grain industry research where it is often the case that there are one or more covariates. It is widely recognised that when estimating fixed effect parameters, one should also examine for possible extra error variance structure. An exact test for heteroscedasticity, when there is a covariate, is illustrated for a data set from frost trials in Western Australia. While the general algebra for the test is known, albeit in past literature, there are computational aspects of implementing the test for the two way when there are covariates. In this scenario the test is shown to have greater power than the industry standard, and because of its exact size, is preferable to use of the restricted maximum likelihood ratio test (REMLRT) based on the approximate asymptotic distribution in this instance. Formulation of the exact test considered here involves creation of appropriate contrasts in the experimental design. This is illustrated using specific choices of observations corresponding to an index set in the linear model for the two-way layout. Also an algorithm supplied complements the test. Comparisons of size and power then ensue. The test has natural extensions when there are unbalanced data, and more than one covariate may be present. Results can be extended to Balanced Incomplete Block Designs.","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139078183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exact likelihoods for N-mixture models with time-to-detection data 具有时间检测数据的 N 混合物模型的精确似然值
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2023-12-11 DOI: 10.1111/anzs.12401
Linda M. Haines, Res Altwegg, D. L. Borchers
This paper is concerned with the formulation of N�$$ N $$�-mixture models for estimating the abundance and probability of detection of a species from binary response, count and time-to-detection data. A modelling framework, which encompasses time-to-first-detection within the context of detection/non-detection and time-to-each-detection and time-to-first-detection within the context of count data, is introduced. Two observation processes which depend on whether or not double counting is assumed to occur are also considered. The main focus of the paper is on the derivation of explicit forms for the likelihoods associated with each of the proposed models. Closed-form expressions for the likelihoods associated with time-to-detection data are new and are developed from the theory of order statistics. A key finding of the study is that, based on the assumption of no double counting, the likelihoods associated with times-to-detection together with count data are the product of the likelihood for the counts alone and a term which depends on the detection probability parameter. This result demonstrates that, in this case, recording times-to-detection could well improve precision in estimation over recording counts alone. In contrast, for the double counting protocol with exponential arrival times, no information was found to be gained by recording times-to-detection in addition to the count data. An R package and an accompanying vignette are also introduced in order to complement the algebraic results and to demonstrate the use of the models in practice.
本文涉及 N$$ N$ 混合模型的建立,用于从二元响应、计数和检测时间数据中估计物种的丰度和检测概率。本文介绍了一个建模框架,其中包括检测/未检测背景下的首次检测时间,以及计数数据背景下的每次检测时间和首次检测时间。此外,还考虑了取决于是否假设发生重复计数的两个观测过程。本文的主要重点是推导与每个建议模型相关的似然的明确形式。与时间检测数据相关的似然的闭式表达是新的,是从阶次统计理论中发展出来的。研究的一个重要发现是,基于无重复计数的假设,与检测时间和计数数据相关的似然值是单独计数似然值与一个取决于检测概率参数的项的乘积。这一结果表明,在这种情况下,记录检测时间比单独记录计数更能提高估算精度。与此相反,对于指数到达时间的双重计数协议,除了计数数据外,记录检测时间也无法获得任何信息。为了补充代数结果并演示模型的实际应用,我们还介绍了一个 R 软件包和随附的小故事。
{"title":"Exact likelihoods for N-mixture models with time-to-detection data","authors":"Linda M. Haines, Res Altwegg, D. L. Borchers","doi":"10.1111/anzs.12401","DOIUrl":"https://doi.org/10.1111/anzs.12401","url":null,"abstract":"This paper is concerned with the formulation of <math altimg=\"urn:x-wiley:anzs:media:anzs12401:anzs12401-math-0001\" display=\"inline\" location=\"graphic/anzs12401-math-0001.png\" overflow=\"scroll\">\u0000<semantics>\u0000<mrow>\u0000<mi>N</mi>\u0000</mrow>\u0000$$ N $$</annotation>\u0000</semantics></math>-mixture models for estimating the abundance and probability of detection of a species from binary response, count and time-to-detection data. A modelling framework, which encompasses time-to-first-detection within the context of detection/non-detection and time-to-each-detection and time-to-first-detection within the context of count data, is introduced. Two observation processes which depend on whether or not double counting is assumed to occur are also considered. The main focus of the paper is on the derivation of explicit forms for the likelihoods associated with each of the proposed models. Closed-form expressions for the likelihoods associated with time-to-detection data are new and are developed from the theory of order statistics. A key finding of the study is that, based on the assumption of no double counting, the likelihoods associated with times-to-detection together with count data are the product of the likelihood for the counts alone and a term which depends on the detection probability parameter. This result demonstrates that, in this case, recording times-to-detection could well improve precision in estimation over recording counts alone. In contrast, for the double counting protocol with exponential arrival times, no information was found to be gained by recording times-to-detection in addition to the count data. An <span style=\"font-family:sans-serif\">R</span> package and an accompanying vignette are also introduced in order to complement the algebraic results and to demonstrate the use of the models in practice.","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2023-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138567024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model averaged tail area confidence intervals in nested linear regression models 嵌套线性回归模型中的模型平均尾区置信区间
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2023-12-07 DOI: 10.1111/anzs.12402
Paul Kabaila, Ayesha Perera
The performance, in terms of coverage and expected length, of the model averaged tail area (MATA) confidence interval, proposed by Turek & Fletcher (2012, Computational Statistics & Data Analysis, 56, 2809–2815), depends greatly on the data-based model weights used in its construction. We generalise the computationally convenient exact formulae due to Kabaila, Welsh & Abeysekera (2016, Scandinavian Journal of Statistics, 43, 35–48) for the coverage and expected length of this confidence interval for two nested linear regression models to the case of two or more nested linear regression models. This permits the numerical assessment of the performance, in terms of coverage probability and scaled expected length, of the MATA confidence interval for any given data-based model weights in the context of three or more nested linear regression models. We illustrate this numerical assessment of performance of the MATA confidence interval, for model weights based on any given Generalised Information Criterion, in the context of three nested linear regression models using the real life ‘Cholesterol’ data. This provides a very informative further exploration of the influence of these model weights on the performance of this confidence interval.
Turek & Fletcher(2012,Computational Statistics & Data Analysis,56,2809-2815)提出的模型平均尾区(MATA)置信区间在覆盖率和预期长度方面的性能,在很大程度上取决于其构建过程中使用的基于数据的模型权重。我们将 Kabaila、Welshamp &; Abeysekera(2016,《斯堪的纳维亚统计杂志》,43,35-48)提出的计算方便的精确公式,用于两个嵌套线性回归模型的覆盖范围和该置信区间的预期长度,推广到两个或更多嵌套线性回归模型的情况。这样,在三个或更多嵌套线性回归模型的情况下,对于任何给定的基于数据的模型权重,MATA 置信区间在覆盖概率和按比例预期长度方面的性能都可以进行数值评估。我们利用现实生活中的 "胆固醇 "数据,在三个嵌套线性回归模型的背景下,针对基于任何给定广义信息准则的模型权重,对 MATA 置信区间的性能进行了数值评估。这为进一步探索这些模型权重对置信区间性能的影响提供了非常丰富的信息。
{"title":"Model averaged tail area confidence intervals in nested linear regression models","authors":"Paul Kabaila, Ayesha Perera","doi":"10.1111/anzs.12402","DOIUrl":"https://doi.org/10.1111/anzs.12402","url":null,"abstract":"The performance, in terms of coverage and expected length, of the model averaged tail area (MATA) confidence interval, proposed by Turek &amp; Fletcher (2012, <i>Computational Statistics &amp; Data Analysis</i>, 56, 2809–2815), depends greatly on the data-based model weights used in its construction. We generalise the computationally convenient exact formulae due to Kabaila, Welsh &amp; Abeysekera (2016, <i>Scandinavian Journal of Statistics</i>, 43, 35–48) for the coverage and expected length of this confidence interval for two nested linear regression models to the case of two or more nested linear regression models. This permits the numerical assessment of the performance, in terms of coverage probability and scaled expected length, of the MATA confidence interval for any given data-based model weights in the context of three or more nested linear regression models. We illustrate this numerical assessment of performance of the MATA confidence interval, for model weights based on any given Generalised Information Criterion, in the context of three nested linear regression models using the real life ‘Cholesterol’ data. This provides a very informative further exploration of the influence of these model weights on the performance of this confidence interval.","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2023-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138548308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Australian & New Zealand Journal of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1