首页 > 最新文献

Australian & New Zealand Journal of Statistics最新文献

英文 中文
Asymptotics for the conditional self-weighted M $$ M $$ estimator of GRCA( p $$ p $$ ) models and its statistical inference GRCA(p$$ p$$) 模型的条件自加权 M$$ M$$ 估计器的渐近性及其统计推论
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-02-21 DOI: 10.1111/anzs.12408
Chi Yao, Wei Yu, Xuejun Wang

Under the p$$ p $$-order generalised random coefficient autoregressive (GRCA(p$$ p $$)) model with random coefficients Φt,$$ {boldsymbol{Phi}}_t, $$ we propose a conditional self-weighted M$$ M $$ estimator of EΦt$$ mathrm{E}{boldsymbol{Phi}}_t $$. We investigate the asymptotic normality of this estimator with possibly heavy-tailed random variables. Furthermore, a Wald test statistic is constructed for the linear restriction on the parameters. In addition, the simulation experiments are carried out to assess the finite sample performance of theoretical results. Finally, a real data analysis about the increase (%) in the number of construction projects this year over the same period of last year is provided.

摘要在具有随机系数的-阶广义随机系数自回归(GRCA())模型下,我们提出了一个条件自加权估计器。 我们研究了该估计器在可能存在重尾随机变量的情况下的渐近正态性。此外,我们还构建了参数线性限制的 Wald 检验统计量。此外,我们还进行了模拟实验,以评估理论结果的有限样本性能。最后,提供了有关今年建筑项目数量比去年同期增长(%)的真实数据分析。
{"title":"Asymptotics for the conditional self-weighted \u0000 \u0000 \u0000 M\u0000 \u0000 $$ M $$\u0000 estimator of GRCA(\u0000 \u0000 \u0000 p\u0000 \u0000 $$ p $$\u0000 ) models and its statistical inference","authors":"Chi Yao,&nbsp;Wei Yu,&nbsp;Xuejun Wang","doi":"10.1111/anzs.12408","DOIUrl":"10.1111/anzs.12408","url":null,"abstract":"<div>\u0000 \u0000 <p>Under the <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow>\u0000 <annotation>$$ p $$</annotation>\u0000 </semantics></math>-order generalised random coefficient autoregressive (GRCA(<math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow>\u0000 <annotation>$$ p $$</annotation>\u0000 </semantics></math>)) model with random coefficients <math>\u0000 <semantics>\u0000 <mrow>\u0000 <msub>\u0000 <mrow>\u0000 <mi>Φ</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>t</mi>\u0000 </mrow>\u0000 </msub>\u0000 <mo>,</mo>\u0000 </mrow>\u0000 <annotation>$$ {boldsymbol{Phi}}_t, $$</annotation>\u0000 </semantics></math> we propose a conditional self-weighted <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>M</mi>\u0000 </mrow>\u0000 <annotation>$$ M $$</annotation>\u0000 </semantics></math> estimator of <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>E</mi>\u0000 <msub>\u0000 <mrow>\u0000 <mi>Φ</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>t</mi>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 <annotation>$$ mathrm{E}{boldsymbol{Phi}}_t $$</annotation>\u0000 </semantics></math>. We investigate the asymptotic normality of this estimator with possibly heavy-tailed random variables. Furthermore, a Wald test statistic is constructed for the linear restriction on the parameters. In addition, the simulation experiments are carried out to assess the finite sample performance of theoretical results. Finally, a real data analysis about the increase (%) in the number of construction projects this year over the same period of last year is provided.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 1","pages":"103-124"},"PeriodicalIF":1.1,"publicationDate":"2024-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139954304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unified robust estimation 统一稳健估算
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-02-20 DOI: 10.1111/anzs.12409
Zhu Wang

Robust estimation is primarily concerned with providing reliable parameter estimates in the presence of outliers. Numerous robust loss functions have been proposed in regression and classification, along with various computing algorithms. In modern penalised generalised linear models (GLMs), however, there is limited research on robust estimation that can provide weights to determine the outlier status of the observations. This article proposes a unified framework based on a large family of loss functions, a composite of concave and convex functions (CC-family). Properties of the CC-family are investigated, and CC-estimation is innovatively conducted via the iteratively reweighted convex optimisation (IRCO), which is a generalisation of the iteratively reweighted least squares in robust linear regression. For robust GLM, the IRCO becomes the iteratively reweighted GLM. The unified framework contains penalised estimation and robust support vector machine (SVM) and is demonstrated with a variety of data applications.

摘要稳健估计主要涉及在存在异常值的情况下提供可靠的参数估计。在回归和分类中提出了许多稳健损失函数以及各种计算算法。然而,在现代惩罚性广义线性模型(GLM)中,能提供权重以确定观测值离群状态的稳健估计研究还很有限。本文提出了一个基于损失函数大家族的统一框架,即凹函数和凸函数的复合体(CC-family)。本文研究了 CC 系列的特性,并通过迭代加权凸优化(IRCO)创新性地进行了 CC 估计,IRCO 是稳健线性回归中迭代加权最小二乘法的概括。对于稳健 GLM,IRCO 成为迭代重权 GLM。该统一框架包含惩罚估计和稳健支持向量机(SVM),并通过各种数据应用进行了演示。
{"title":"Unified robust estimation","authors":"Zhu Wang","doi":"10.1111/anzs.12409","DOIUrl":"10.1111/anzs.12409","url":null,"abstract":"<div>\u0000 \u0000 <p>Robust estimation is primarily concerned with providing reliable parameter estimates in the presence of outliers. Numerous robust loss functions have been proposed in regression and classification, along with various computing algorithms. In modern penalised generalised linear models (GLMs), however, there is limited research on robust estimation that can provide weights to determine the outlier status of the observations. This article proposes a unified framework based on a large family of loss functions, a composite of concave and convex functions (CC-family). Properties of the CC-family are investigated, and CC-estimation is innovatively conducted via the iteratively reweighted convex optimisation (IRCO), which is a generalisation of the iteratively reweighted least squares in robust linear regression. For robust GLM, the IRCO becomes the iteratively reweighted GLM. The unified framework contains penalised estimation and robust support vector machine (SVM) and is demonstrated with a variety of data applications.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 1","pages":"77-102"},"PeriodicalIF":1.1,"publicationDate":"2024-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139953816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Latent heterogeneity in COVID-19 hospitalisations: a cluster-weighted approach to analyse mortality COVID-19 住院病例的潜在异质性:采用聚类加权法分析死亡率
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-02-13 DOI: 10.1111/anzs.12407
Paolo Berta, Salvatore Ingrassia, Giorgio Vittadini, Daniele Spinelli

The COVID-19 pandemic caused an unprecedented excess mortality. Since 2020, many studies have focussed on the characteristics of COVID-19 patients who did not survive. From the statistical point of view, what seems to dominate is the large heterogeneity of the populations affected by COVID-19 and the extreme difficulty in identifying subpopulations who died affected by a plurality of contemporary characteristics. In this paper, we propose an extremely flexible approach based on a cluster-weighted model, which allows us to identify latent groups of patients sharing similar characteristics at the moment of hospitalisation as well as a similar mortality. We focus on one of the hardest hit areas in Italy and study the heterogeneity in the population of patients affected by COVID-19 using administrative data on hospitalisations in the first wave of the pandemic. Results highlighted that a model-based clustering approach is essential to understand the complexity of the COVID-19 patients treated by hospitals and who die during hospitalisation.

COVID-19 大流行造成了前所未有的超额死亡率。自 2020 年以来,许多研究重点关注 COVID-19 未存活患者的特征。从统计学的角度来看,受 COVID-19 影响的人群具有很大的异质性,要识别受多种当代特征影响而死亡的亚人群极其困难。在本文中,我们提出了一种基于聚类加权模型的极为灵活的方法,该方法允许我们识别在住院时具有相似特征以及相似死亡率的潜在患者群体。我们将重点放在意大利的重灾区之一,并利用大流行第一波住院治疗的行政数据研究了受 COVID-19 影响的患者群体的异质性。研究结果表明,基于模型的聚类方法对于了解接受医院治疗并在住院期间死亡的 COVID-19 患者的复杂性至关重要。
{"title":"Latent heterogeneity in COVID-19 hospitalisations: a cluster-weighted approach to analyse mortality","authors":"Paolo Berta,&nbsp;Salvatore Ingrassia,&nbsp;Giorgio Vittadini,&nbsp;Daniele Spinelli","doi":"10.1111/anzs.12407","DOIUrl":"10.1111/anzs.12407","url":null,"abstract":"<p>The COVID-19 pandemic caused an unprecedented excess mortality. Since 2020, many studies have focussed on the characteristics of COVID-19 patients who did not survive. From the statistical point of view, what seems to dominate is the large heterogeneity of the populations affected by COVID-19 and the extreme difficulty in identifying subpopulations who died affected by a plurality of contemporary characteristics. In this paper, we propose an extremely flexible approach based on a cluster-weighted model, which allows us to identify latent groups of patients sharing similar characteristics at the moment of hospitalisation as well as a similar mortality. We focus on one of the hardest hit areas in Italy and study the heterogeneity in the population of patients affected by COVID-19 using administrative data on hospitalisations in the first wave of the pandemic. Results highlighted that a model-based clustering approach is essential to understand the complexity of the COVID-19 patients treated by hospitals and who die during hospitalisation.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 1","pages":"1-20"},"PeriodicalIF":1.1,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12407","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139772597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel response model and target selection method with applications to marketing 应用于市场营销的新型响应模型和目标选择方法
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-01-18 DOI: 10.1111/anzs.12406
Y. Cai

Response models used in marketing are not always constructed for later marketing optimisation, which often results in unsatisfactory results in target selection for future marketing activities. To solve this problem, we develop a new binary response model and a new marketing target selection method. The proposed model can predict multiple propensity scores per customer through customer-specific propensity score distributions, which is not possible with existing response models, filling a gap in the literature. The target selection method can determine the best propensity scores from those predicted by the proposed model and use them to select customers for further marketing activities. Our simulation results and application to real marketing data confirm that the performance of the proposed model in target selection is significantly better than that of the existing models, including some popular machine learning methods, which indicate that our method can be very useful in practice.

市场营销中使用的响应模型并不总是为以后的市场营销优化而构建的,这往往会导致未来市场营销活动的目标选择结果不尽如人意。为了解决这个问题,我们开发了一种新的二元响应模型和一种新的营销目标选择方法。所提出的模型可以通过特定客户的倾向得分分布来预测每个客户的多个倾向得分,这是现有响应模型所无法实现的,填补了文献空白。目标选择方法可从所提模型预测的倾向得分中确定最佳倾向得分,并利用这些倾向得分选择客户开展进一步营销活动。我们的仿真结果和对真实营销数据的应用证实,建议模型在目标选择方面的性能明显优于现有模型,包括一些流行的机器学习方法,这表明我们的方法在实践中非常有用。
{"title":"A novel response model and target selection method with applications to marketing","authors":"Y. Cai","doi":"10.1111/anzs.12406","DOIUrl":"10.1111/anzs.12406","url":null,"abstract":"<p>Response models used in marketing are not always constructed for later marketing optimisation, which often results in unsatisfactory results in target selection for future marketing activities. To solve this problem, we develop a new binary response model and a new marketing target selection method. The proposed model can predict multiple propensity scores per customer through customer-specific propensity score distributions, which is not possible with existing response models, filling a gap in the literature. The target selection method can determine the best propensity scores from those predicted by the proposed model and use them to select customers for further marketing activities. Our simulation results and application to real marketing data confirm that the performance of the proposed model in target selection is significantly better than that of the existing models, including some popular machine learning methods, which indicate that our method can be very useful in practice.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 1","pages":"48-76"},"PeriodicalIF":1.1,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12406","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139515508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying changes in the distribution of income from higher-order moments with an application to Australia 从高阶矩确定收入分配的变化并应用于澳大利亚
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-01-17 DOI: 10.1111/anzs.12405
Vance L. Martin, Jialu Shi, Yang Song, Wenying Yao

Changes in the distribution of income over time are identified based on an adjusted two-sample version of the Neyman smooth test by using subsampling methods to approximate the sampling distribution of the test statistic when samples are not independent of each other. A range of Monte Carlo experiments show that the approach corrects for size distortions arising from dependent samples as well as generating monotonic power functions. Applying the approach to studying the distribution of income in Australia over the business cycle and the Global Financial Crisis, the empirical results highlight the importance of higher-order moments and demonstrate that business cycles are not all alike as the relative strengths of higher-order moments vary over phases of the cycle.

在样本互不独立的情况下,使用子抽样方法近似检验统计量的抽样分布,根据调整后的奈曼平滑检验的双样本版本,确定收入分布随时间的变化。一系列蒙特卡罗实验表明,该方法可以纠正因依赖样本而产生的大小失真,并生成单调的幂函数。应用该方法研究澳大利亚在商业周期和全球金融危机期间的收入分配情况,实证结果突出了高阶矩的重要性,并表明商业周期并不都是一样的,因为高阶矩的相对强度随周期的不同阶段而变化。
{"title":"Identifying changes in the distribution of income from higher-order moments with an application to Australia","authors":"Vance L. Martin,&nbsp;Jialu Shi,&nbsp;Yang Song,&nbsp;Wenying Yao","doi":"10.1111/anzs.12405","DOIUrl":"10.1111/anzs.12405","url":null,"abstract":"<p>Changes in the distribution of income over time are identified based on an adjusted two-sample version of the Neyman smooth test by using subsampling methods to approximate the sampling distribution of the test statistic when samples are not independent of each other. A range of Monte Carlo experiments show that the approach corrects for size distortions arising from dependent samples as well as generating monotonic power functions. Applying the approach to studying the distribution of income in Australia over the business cycle and the Global Financial Crisis, the empirical results highlight the importance of higher-order moments and demonstrate that business cycles are not all alike as the relative strengths of higher-order moments vary over phases of the cycle.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 1","pages":"21-47"},"PeriodicalIF":1.1,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12405","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139515413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exact testing for heteroscedasticity in a two-way layout in variety frost trials when incorporating a covariate 在品种霜冻试验的双向布局中,在纳入协变量时对异方差进行精确测试
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-01-01 DOI: 10.1111/anzs.12404
Angelika A. Pilkington, Brenton R. Clarke, Dean A. Diepeveen

Two-way layouts are common in grain industry research where it is often the case that there are one or more covariates. It is widely recognised that when estimating fixed effect parameters, one should also examine for possible extra error variance structure. An exact test for heteroscedasticity, when there is a covariate, is illustrated for a data set from frost trials in Western Australia. While the general algebra for the test is known, albeit in past literature, there are computational aspects of implementing the test for the two way when there are covariates. In this scenario the test is shown to have greater power than the industry standard, and because of its exact size, is preferable to use of the restricted maximum likelihood ratio test (REMLRT) based on the approximate asymptotic distribution in this instance. Formulation of the exact test considered here involves creation of appropriate contrasts in the experimental design. This is illustrated using specific choices of observations corresponding to an index set in the linear model for the two-way layout. Also an algorithm supplied complements the test. Comparisons of size and power then ensue. The test has natural extensions when there are unbalanced data, and more than one covariate may be present. Results can be extended to Balanced Incomplete Block Designs.

双向布局在谷物产业研究中很常见,因为谷物产业研究通常存在一个或多个协变量。人们普遍认为,在估计固定效应参数时,还应检查可能存在的额外误差方差结构。本文以西澳大利亚霜冻试验的数据集为例,说明了在存在协变量的情况下对异方差进行精确检验的方法。尽管该检验的一般代数在过去的文献中已为人所知,但当存在协变量时,对双向检验的实施还涉及计算问题。在这种情况下,检验结果表明比行业标准具有更大的功率,而且由于其精确的规模,在这种情况下比使用基于近似渐近分布的限制性最大似然比检验(REMLRT)更为可取。本文所考虑的精确检验方法包括在实验设计中建立适当的对比。我们将使用与双向布局线性模型中的指标集相对应的观测数据的具体选择来说明这一点。此外,还提供了一种对检验进行补充的算法。然后对规模和功率进行比较。当存在不平衡数据,并且可能存在一个以上的协变量时,该检验具有自然的扩展性。结果可扩展到平衡不完全区组设计。
{"title":"Exact testing for heteroscedasticity in a two-way layout in variety frost trials when incorporating a covariate","authors":"Angelika A. Pilkington,&nbsp;Brenton R. Clarke,&nbsp;Dean A. Diepeveen","doi":"10.1111/anzs.12404","DOIUrl":"10.1111/anzs.12404","url":null,"abstract":"<p>Two-way layouts are common in grain industry research where it is often the case that there are one or more covariates. It is widely recognised that when estimating fixed effect parameters, one should also examine for possible extra error variance structure. An exact test for heteroscedasticity, when there is a covariate, is illustrated for a data set from frost trials in Western Australia. While the general algebra for the test is known, albeit in past literature, there are computational aspects of implementing the test for the two way when there are covariates. In this scenario the test is shown to have greater power than the industry standard, and because of its exact size, is preferable to use of the restricted maximum likelihood ratio test (REMLRT) based on the approximate asymptotic distribution in this instance. Formulation of the exact test considered here involves creation of appropriate contrasts in the experimental design. This is illustrated using specific choices of observations corresponding to an index set in the linear model for the two-way layout. Also an algorithm supplied complements the test. Comparisons of size and power then ensue. The test has natural extensions when there are unbalanced data, and more than one covariate may be present. Results can be extended to Balanced Incomplete Block Designs.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"65 4","pages":"309-326"},"PeriodicalIF":0.8,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12404","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139078183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exact likelihoods for N-mixture models with time-to-detection data 具有时间检测数据的 N 混合物模型的精确似然值
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-12-11 DOI: 10.1111/anzs.12401
Linda M. Haines, Res Altwegg, D. L. Borchers

This paper is concerned with the formulation of N$$ N $$-mixture models for estimating the abundance and probability of detection of a species from binary response, count and time-to-detection data. A modelling framework, which encompasses time-to-first-detection within the context of detection/non-detection and time-to-each-detection and time-to-first-detection within the context of count data, is introduced. Two observation processes which depend on whether or not double counting is assumed to occur are also considered. The main focus of the paper is on the derivation of explicit forms for the likelihoods associated with each of the proposed models. Closed-form expressions for the likelihoods associated with time-to-detection data are new and are developed from the theory of order statistics. A key finding of the study is that, based on the assumption of no double counting, the likelihoods associated with times-to-detection together with count data are the product of the likelihood for the counts alone and a term which depends on the detection probability parameter. This result demonstrates that, in this case, recording times-to-detection could well improve precision in estimation over recording counts alone. In contrast, for the double counting protocol with exponential arrival times, no information was found to be gained by recording times-to-detection in addition to the count data. An R package and an accompanying vignette are also introduced in order to complement the algebraic results and to demonstrate the use of the models in practice.

本文涉及 N$$ N$ 混合模型的建立,用于从二元响应、计数和检测时间数据中估计物种的丰度和检测概率。本文介绍了一个建模框架,其中包括检测/未检测背景下的首次检测时间,以及计数数据背景下的每次检测时间和首次检测时间。此外,还考虑了取决于是否假设发生重复计数的两个观测过程。本文的主要重点是推导与每个建议模型相关的似然的明确形式。与时间检测数据相关的似然的闭式表达是新的,是从阶次统计理论中发展出来的。研究的一个重要发现是,基于无重复计数的假设,与检测时间和计数数据相关的似然值是单独计数似然值与一个取决于检测概率参数的项的乘积。这一结果表明,在这种情况下,记录检测时间比单独记录计数更能提高估算精度。与此相反,对于指数到达时间的双重计数协议,除了计数数据外,记录检测时间也无法获得任何信息。为了补充代数结果并演示模型的实际应用,我们还介绍了一个 R 软件包和随附的小故事。
{"title":"Exact likelihoods for N-mixture models with time-to-detection data","authors":"Linda M. Haines,&nbsp;Res Altwegg,&nbsp;D. L. Borchers","doi":"10.1111/anzs.12401","DOIUrl":"10.1111/anzs.12401","url":null,"abstract":"<div>\u0000 \u0000 <p>This paper is concerned with the formulation of <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>N</mi>\u0000 </mrow>\u0000 <annotation>$$ N $$</annotation>\u0000 </semantics></math>-mixture models for estimating the abundance and probability of detection of a species from binary response, count and time-to-detection data. A modelling framework, which encompasses time-to-first-detection within the context of detection/non-detection and time-to-each-detection and time-to-first-detection within the context of count data, is introduced. Two observation processes which depend on whether or not double counting is assumed to occur are also considered. The main focus of the paper is on the derivation of explicit forms for the likelihoods associated with each of the proposed models. Closed-form expressions for the likelihoods associated with time-to-detection data are new and are developed from the theory of order statistics. A key finding of the study is that, based on the assumption of no double counting, the likelihoods associated with times-to-detection together with count data are the product of the likelihood for the counts alone and a term which depends on the detection probability parameter. This result demonstrates that, in this case, recording times-to-detection could well improve precision in estimation over recording counts alone. In contrast, for the double counting protocol with exponential arrival times, no information was found to be gained by recording times-to-detection in addition to the count data. An <span>R</span> package and an accompanying vignette are also introduced in order to complement the algebraic results and to demonstrate the use of the models in practice.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"65 4","pages":"327-343"},"PeriodicalIF":0.8,"publicationDate":"2023-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138567024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model averaged tail area confidence intervals in nested linear regression models 嵌套线性回归模型中的模型平均尾区置信区间
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-12-07 DOI: 10.1111/anzs.12402
Paul Kabaila, Ayesha Perera

The performance, in terms of coverage and expected length, of the model averaged tail area (MATA) confidence interval, proposed by Turek & Fletcher (2012, Computational Statistics & Data Analysis, 56, 2809–2815), depends greatly on the data-based model weights used in its construction. We generalise the computationally convenient exact formulae due to Kabaila, Welsh & Abeysekera (2016, Scandinavian Journal of Statistics, 43, 35–48) for the coverage and expected length of this confidence interval for two nested linear regression models to the case of two or more nested linear regression models. This permits the numerical assessment of the performance, in terms of coverage probability and scaled expected length, of the MATA confidence interval for any given data-based model weights in the context of three or more nested linear regression models. We illustrate this numerical assessment of performance of the MATA confidence interval, for model weights based on any given Generalised Information Criterion, in the context of three nested linear regression models using the real life ‘Cholesterol’ data. This provides a very informative further exploration of the influence of these model weights on the performance of this confidence interval.

Turek & Fletcher(2012,Computational Statistics & Data Analysis,56,2809-2815)提出的模型平均尾区(MATA)置信区间在覆盖率和预期长度方面的性能,在很大程度上取决于其构建过程中使用的基于数据的模型权重。我们将 Kabaila、Welshamp &; Abeysekera(2016,《斯堪的纳维亚统计杂志》,43,35-48)提出的计算方便的精确公式,用于两个嵌套线性回归模型的覆盖范围和该置信区间的预期长度,推广到两个或更多嵌套线性回归模型的情况。这样,在三个或更多嵌套线性回归模型的情况下,对于任何给定的基于数据的模型权重,MATA 置信区间在覆盖概率和按比例预期长度方面的性能都可以进行数值评估。我们利用现实生活中的 "胆固醇 "数据,在三个嵌套线性回归模型的背景下,针对基于任何给定广义信息准则的模型权重,对 MATA 置信区间的性能进行了数值评估。这为进一步探索这些模型权重对置信区间性能的影响提供了非常丰富的信息。
{"title":"Model averaged tail area confidence intervals in nested linear regression models","authors":"Paul Kabaila,&nbsp;Ayesha Perera","doi":"10.1111/anzs.12402","DOIUrl":"10.1111/anzs.12402","url":null,"abstract":"<div>\u0000 \u0000 <p>The performance, in terms of coverage and expected length, of the model averaged tail area (MATA) confidence interval, proposed by Turek &amp; Fletcher (2012, <i>Computational Statistics &amp; Data Analysis</i>, 56, 2809–2815), depends greatly on the data-based model weights used in its construction. We generalise the computationally convenient exact formulae due to Kabaila, Welsh &amp; Abeysekera (2016, <i>Scandinavian Journal of Statistics</i>, 43, 35–48) for the coverage and expected length of this confidence interval for two nested linear regression models to the case of two or more nested linear regression models. This permits the numerical assessment of the performance, in terms of coverage probability and scaled expected length, of the MATA confidence interval for any given data-based model weights in the context of three or more nested linear regression models. We illustrate this numerical assessment of performance of the MATA confidence interval, for model weights based on any given Generalised Information Criterion, in the context of three nested linear regression models using the real life ‘Cholesterol’ data. This provides a very informative further exploration of the influence of these model weights on the performance of this confidence interval.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"65 4","pages":"364-378"},"PeriodicalIF":0.8,"publicationDate":"2023-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138548308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The role of pairwise matching in experimental design for an incidence outcome 成对匹配在实验设计中对发生率结果的作用
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-11-27 DOI: 10.1111/anzs.12403
Adam Kapelner, Abba M. Krieger, David Azriel

We consider the problem of evaluating designs for a two-arm randomised experiment with an incidence (binary) outcome under a non-parametric general response model. Our two main results are that the a priori pair matching design is (1) the optimal design as measured by mean squared error among all block designs which includes complete randomisation. And (2), this pair-matching design is minimax, that is, it provides the lowest mean squared error under an adversarial response model. Theoretical results are supported by simulations and clinical trial data where we demonstrate the superior performance of pairwise matching designs under realistic conditions.

我们考虑在非参数一般反应模型下评估具有发生率(二元)结果的双臂随机实验设计的问题。我们的两个主要结果是,先验配对设计是(1)在包括完全随机化的所有块设计中,以均方误差衡量的最佳设计。(2)这种配对设计是minimax的,即在对抗响应模型下,它提供了最小的均方误差。理论结果得到了模拟和临床试验数据的支持,在这些数据中,我们证明了在现实条件下成对匹配设计的优越性能。
{"title":"The role of pairwise matching in experimental design for an incidence outcome","authors":"Adam Kapelner,&nbsp;Abba M. Krieger,&nbsp;David Azriel","doi":"10.1111/anzs.12403","DOIUrl":"10.1111/anzs.12403","url":null,"abstract":"<div>\u0000 \u0000 <p>We consider the problem of evaluating designs for a two-arm randomised experiment with an incidence (binary) outcome under a non-parametric general response model. Our two main results are that the a priori pair matching design is (1) the optimal design as measured by mean squared error among all block designs which includes complete randomisation. And (2), this pair-matching design is minimax, that is, it provides the lowest mean squared error under an adversarial response model. Theoretical results are supported by simulations and clinical trial data where we demonstrate the superior performance of pairwise matching designs under realistic conditions.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"65 4","pages":"379-393"},"PeriodicalIF":0.8,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138517672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measurement errors in semi-parametric generalised regression models 半参数广义回归模型的测量误差
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-10-11 DOI: 10.1111/anzs.12400
Mohammad W. Hattab, David Ruppert

Regression models that ignore measurement error in predictors may produce highly biased estimates leading to erroneous inferences. It is well known that it is extremely difficult to take measurement error into account in Gaussian non-parametric regression. This problem becomes even more difficult when considering other families such as binary, Poisson and negative binomial regression. We present a novel method aiming to correct for measurement error when estimating regression functions. Our approach is sufficiently flexible to cover virtually all distributions and link functions regularly considered in generalised linear models. This approach depends on approximating the first and the second moment of the response after integrating out the true unobserved predictors in any semi-parametric generalised regression model. By the latter is meant a model with both linear and non-parametric effects that are connected to the mean response by a link function and with a response distribution in an exponential family or quasi-likelihood model. Unlike previous methods, the method we now propose is not restricted to truncated splines and can utilise various basis functions. Moreover, it can operate without making any distributional assumption about the unobserved predictor. Through extensive simulation studies, we study the performance of our method under many scenarios.

忽略预测量测量误差的回归模型可能产生高度偏倚的估计,从而导致错误的推断。众所周知,在高斯非参数回归中很难考虑测量误差。当考虑到其他的类,如二元回归、泊松回归和负二项回归时,这个问题变得更加困难。我们提出了一种新的方法来校正回归函数估计时的测量误差。我们的方法足够灵活,几乎涵盖了广义线性模型中经常考虑的所有分布和链接函数。这种方法依赖于在任何半参数广义回归模型中积分出真实的未观察到的预测因子后逼近响应的第一和第二时刻。后者是指具有线性和非参数效应的模型,这些效应通过连接函数与平均响应相连接,并具有指数族或准似然模型中的响应分布。与以前的方法不同,我们现在提出的方法不局限于截断样条,可以利用各种基函数。此外,它可以在没有对未观察到的预测器做出任何分布假设的情况下运行。通过大量的仿真研究,我们研究了该方法在多种场景下的性能。
{"title":"Measurement errors in semi-parametric generalised regression models","authors":"Mohammad W. Hattab,&nbsp;David Ruppert","doi":"10.1111/anzs.12400","DOIUrl":"10.1111/anzs.12400","url":null,"abstract":"<div>\u0000 \u0000 <p>Regression models that ignore measurement error in predictors may produce highly biased estimates leading to erroneous inferences. It is well known that it is extremely difficult to take measurement error into account in Gaussian non-parametric regression. This problem becomes even more difficult when considering other families such as binary, Poisson and negative binomial regression. We present a novel method aiming to correct for measurement error when estimating regression functions. Our approach is sufficiently flexible to cover virtually all distributions and link functions regularly considered in generalised linear models. This approach depends on approximating the first and the second moment of the response after integrating out the true unobserved predictors in any semi-parametric generalised regression model. By the latter is meant a model with both linear and non-parametric effects that are connected to the mean response by a link function and with a response distribution in an exponential family or quasi-likelihood model. Unlike previous methods, the method we now propose is not restricted to truncated splines and can utilise various basis functions. Moreover, it can operate without making any distributional assumption about the unobserved predictor. Through extensive simulation studies, we study the performance of our method under many scenarios.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"65 4","pages":"344-363"},"PeriodicalIF":0.8,"publicationDate":"2023-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136212682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Australian & New Zealand Journal of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1