首页 > 最新文献

Australian & New Zealand Journal of Statistics最新文献

英文 中文
Non-parametric depth-based tests for the multivariate location problem 多变量定位问题的非参数深度测试
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-06-29 DOI: 10.1111/anzs.12328
Sakineh Dehghan, Mohammad Reza Faridrohani

In this paper, using the notion of data depth, we describe two classes of affine invariant test statistics for the one-sample location problem. The tests are implemented through the idea of permutation tests. The performance of the test against some competitors is investigated with an extensive simulation study. It is observed that the tests perform well when compared to their competitors for a wide spectrum of alternatives. If the proposed test is defined based on a moment-free depth function, then it is not inherently required to have finite moments of any order and the tests have broader applicability than some of the existing tests. The robustness property of the proposed tests is considered with a simulation study. Finally, we apply the tests to a real data example.

本文利用数据深度的概念,描述了单样本定位问题的两类仿射不变检验统计量。这些测试是通过排列测试的思想实现的。对一些竞争对手的测试性能进行了广泛的模拟研究。可以观察到,与竞争对手相比,这些测试在广泛的替代方案中表现良好。如果提议的测试是基于无矩深度函数定义的,那么它本质上不需要具有任何阶的有限矩,并且测试比现有的一些测试具有更广泛的适用性。对所提出的测试方法进行了鲁棒性仿真研究。最后,我们将测试应用到一个真实的数据示例中。
{"title":"Non-parametric depth-based tests for the multivariate location problem","authors":"Sakineh Dehghan,&nbsp;Mohammad Reza Faridrohani","doi":"10.1111/anzs.12328","DOIUrl":"10.1111/anzs.12328","url":null,"abstract":"<div>\u0000 \u0000 <p>In this paper, using the notion of data depth, we describe two classes of affine invariant test statistics for the one-sample location problem. The tests are implemented through the idea of permutation tests. The performance of the test against some competitors is investigated with an extensive simulation study. It is observed that the tests perform well when compared to their competitors for a wide spectrum of alternatives. If the proposed test is defined based on a moment-free depth function, then it is not inherently required to have finite moments of any order and the tests have broader applicability than some of the existing tests. The robustness property of the proposed tests is considered with a simulation study. Finally, we apply the tests to a real data example.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12328","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80013153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
New moderation methods of higher school certificate assessments: a case study of the New South Wales practice 高等学校证书评估的新适度方法:新南威尔士州实践的案例研究
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-06-29 DOI: 10.1111/anzs.12317
Yanlin Shi

The Higher School Certificate (HSC) is the credential awarded to secondary school students in New South Wales (NSW), Australia. This paper reviews the current moderation process of the HSC and introduces and compares a range of modern statistical methods. With a comprehensive analysis of the complete 2013–2016 HSC results, we show that the monotone spline regression with the Huber loss function consistently beats the existing moderation method. With its simple structure, fast execution and improved effectiveness, this new moderation model is an ideal replacement of the in-force quadratic for the HSC practice in NSW.

高等学校证书(HSC)是颁发给澳大利亚新南威尔士州(NSW)中学生的证书。本文回顾了当前HSC的调节过程,并介绍和比较了一系列现代统计方法。通过对2013-2016年完整的HSC结果的综合分析,我们发现带有Huber损失函数的单调样条回归始终优于现有的调节方法。该模型结构简单,执行速度快,效率高,是新南威尔士州HSC实践中有效二次元的理想替代品。
{"title":"New moderation methods of higher school certificate assessments: a case study of the New South Wales practice","authors":"Yanlin Shi","doi":"10.1111/anzs.12317","DOIUrl":"10.1111/anzs.12317","url":null,"abstract":"<div>\u0000 \u0000 <p>The Higher School Certificate (HSC) is the credential awarded to secondary school students in New South Wales (NSW), Australia. This paper reviews the current moderation process of the HSC and introduces and compares a range of modern statistical methods. With a comprehensive analysis of the complete 2013–2016 HSC results, we show that the monotone spline regression with the Huber loss function consistently beats the existing moderation method. With its simple structure, fast execution and improved effectiveness, this new moderation model is an ideal replacement of the in-force quadratic for the HSC practice in NSW.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12317","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87803813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting the old-age dependency ratio to determine a sustainable pension age 预测老年抚养比,确定可持续的领取养老金年龄
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-06-29 DOI: 10.1111/anzs.12330
Rob J. Hyndman, Yijun Zeng, Han Lin Shang

We forecast the old-age dependency ratio for Australia under various pension age proposals, and estimate a pension age scheme that will provide a stable old-age dependency ratio at a specified level. Our approach involves a stochastic population forecasting method based on coherent functional data models for mortality, fertility and net migration, which we use to simulate the future age-structure of the population. Our results suggest that the Australian pension age should be increased to 68 by 2030, 69 by 2036 and 70 by 2050, in order to maintain the old-age dependency ratio at 23%, just above the 2018 level. Our general approach can easily be extended to other target levels of the old-age dependency ratio and to other countries.

我们预测了澳大利亚在不同养老金年龄建议下的老年抚养比,并估计了一种养老金年龄计划,该计划将在特定水平上提供稳定的老年抚养比。我们的方法涉及一种随机人口预测方法,该方法基于死亡率、生育率和净迁移的连贯功能数据模型,我们使用该模型来模拟人口的未来年龄结构。我们的研究结果表明,到2030年,澳大利亚的养老金年龄应该提高到68岁,到2036年提高到69岁,到2050年提高到70岁,以使老年抚养比保持在23%,略高于2018年的水平。我们的一般方法可以很容易地推广到老年抚养比率的其他目标水平和其他国家。
{"title":"Forecasting the old-age dependency ratio to determine a sustainable pension age","authors":"Rob J. Hyndman,&nbsp;Yijun Zeng,&nbsp;Han Lin Shang","doi":"10.1111/anzs.12330","DOIUrl":"10.1111/anzs.12330","url":null,"abstract":"<div>\u0000 \u0000 <p>We forecast the old-age dependency ratio for Australia under various pension age proposals, and estimate a pension age scheme that will provide a stable old-age dependency ratio at a specified level. Our approach involves a stochastic population forecasting method based on coherent functional data models for mortality, fertility and net migration, which we use to simulate the future age-structure of the population. Our results suggest that the Australian pension age should be increased to 68 by 2030, 69 by 2036 and 70 by 2050, in order to maintain the old-age dependency ratio at 23%, just above the 2018 level. Our general approach can easily be extended to other target levels of the old-age dependency ratio and to other countries.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12330","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77019366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A shared parameter mixture model for longitudinal income data with missing responses and zero rounding 具有缺失响应和零舍入的纵向收入数据共享参数混合模型
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-06-17 DOI: 10.1111/anzs.12323
Francis K.C. Hui, Howard D. Bondell
The analysis of longitudinal income data is often made challenging for several reasons. For example, in a national Australian survey on income over time, a non‐negligible proportion of responses are missing, and it is believed the missingness mechanism is non‐ignorable. Also, there are a large number of reported zero incomes, some of which may be true zeros (corresponding to individuals who legitimately do not earn an income), while some may be false zeros (corresponding to individuals choosing to round their income to zero). We propose a new shared parameter mixture (SPM) model for analysing semi‐continuous longitudinal income data, which addresses the two challenges of income non‐response and zero rounding. This is accomplished by jointly modelling an individual's underlying income together with the probability of missingness and rounding to zero, where both probabilities are permitted to vary in a smooth manner with their underlying non‐zero income. Applying the SPM model to the Australian income survey reveals that on average, older female individuals and individuals with a long‐term health condition are considerably less likely to earn an income, while income tended to be highest for male individuals on fixed‐term/permanent job contracts between ages 50 and 60. Furthermore there is evidence of both zero rounding, and conditional on the assumed missingness mechanism, individuals with incomes at the higher and lower ends are more likely to not report their income.
由于几个原因,纵向收入数据的分析往往具有挑战性。例如,在澳大利亚一项关于长期收入的全国性调查中,有很大比例的回答是缺失的,人们认为缺失机制是不可忽视的。此外,还有大量报告的零收入,其中一些可能是真零(对应于合法不赚取收入的个人),而一些可能是假零(对应于个人选择将其收入四舍五入为零)。本文提出了一种新的用于分析半连续纵向收入数据的共享参数混合(SPM)模型,该模型解决了收入无响应和零舍入的两个挑战。这是通过将个人的潜在收入与缺失概率一起建模并四舍五入到零来实现的,在这种情况下,两种概率都可以随其潜在的非零收入平稳变化。将SPM模型应用于澳大利亚收入调查表明,平均而言,老年女性个人和有长期健康问题的个人赚取收入的可能性要小得多,而50至60岁之间签订定期/长期工作合同的男性个人的收入往往最高。此外,有证据表明,零四舍五入和假设缺失机制的条件下,收入较高和较低的个人更有可能不报告他们的收入。
{"title":"A shared parameter mixture model for longitudinal income data with missing responses and zero rounding","authors":"Francis K.C. Hui,&nbsp;Howard D. Bondell","doi":"10.1111/anzs.12323","DOIUrl":"10.1111/anzs.12323","url":null,"abstract":"The analysis of longitudinal income data is often made challenging for several reasons. For example, in a national Australian survey on income over time, a non‐negligible proportion of responses are missing, and it is believed the missingness mechanism is non‐ignorable. Also, there are a large number of reported zero incomes, some of which may be true zeros (corresponding to individuals who legitimately do not earn an income), while some may be false zeros (corresponding to individuals choosing to round their income to zero). We propose a new shared parameter mixture (SPM) model for analysing semi‐continuous longitudinal income data, which addresses the two challenges of income non‐response and zero rounding. This is accomplished by jointly modelling an individual's underlying income together with the probability of missingness and rounding to zero, where both probabilities are permitted to vary in a smooth manner with their underlying non‐zero income. Applying the SPM model to the Australian income survey reveals that on average, older female individuals and individuals with a long‐term health condition are considerably less likely to earn an income, while income tended to be highest for male individuals on fixed‐term/permanent job contracts between ages 50 and 60. Furthermore there is evidence of both zero rounding, and conditional on the assumed missingness mechanism, individuals with incomes at the higher and lower ends are more likely to not report their income.","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12323","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89592499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adversarial risk analysis for first-price sealed-bid auctions 首价密封拍卖的对抗风险分析
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-06-01 DOI: 10.1111/anzs.12315
Muhammad Ejaz, Chaitanya Joshi, Stephen Joe

Adversarial risk analysis (ARA) is an upcoming methodology that is considered to have advantages over the traditional decision-theoretic and game-theoretic approaches. ARA solutions for first-price sealed-bid (FPSB) auctions have been found but only under strong assumptions which make the model somewhat unrealistic. In this paper, we use ARA methodology to model FPSB auctions using more realistic assumptions. We define a new utility function that considers bidders’ wealth, we assume a reserve price and find solutions not only for risk-neutral but also for risk-averse as well as risk-seeking bidders. We model the problem using ARA for non-strategic play and level-k thinking solution concepts.

对抗风险分析(ARA)是一种新兴的方法,被认为比传统的决策理论和博弈论方法具有优势。已经找到了首价密封投标(FPSB)拍卖的ARA解决方案,但仅在强有力的假设下,这使得该模型有些不现实。在本文中,我们使用ARA方法使用更现实的假设来模拟FPSB拍卖。我们定义了一个考虑投标人财富的新效用函数,我们假设了一个保留价格,并找到了解决方案,不仅适用于风险中性的投标人,也适用于风险厌恶者和风险寻求者。我们使用ARA对非战略性玩法和k级思维解决方案概念进行建模。
{"title":"Adversarial risk analysis for first-price sealed-bid auctions","authors":"Muhammad Ejaz,&nbsp;Chaitanya Joshi,&nbsp;Stephen Joe","doi":"10.1111/anzs.12315","DOIUrl":"10.1111/anzs.12315","url":null,"abstract":"<div>\u0000 \u0000 <p>Adversarial risk analysis (ARA) is an upcoming methodology that is considered to have advantages over the traditional decision-theoretic and game-theoretic approaches. ARA solutions for first-price sealed-bid (FPSB) auctions have been found but only under strong assumptions which make the model somewhat unrealistic. In this paper, we use ARA methodology to model FPSB auctions using more realistic assumptions. We define a new utility function that considers bidders’ wealth, we assume a reserve price and find solutions not only for risk-neutral but also for risk-averse as well as risk-seeking bidders. We model the problem using ARA for non-strategic play and level-<i>k</i> thinking solution concepts.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12315","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75026347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
On distance based goodness of fit tests for missing data when missing occurs at random 缺失数据随机发生时基于距离的拟合优度检验
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-05-30 DOI: 10.1111/anzs.12313
Subhra Sankar Dhar, Ujjwal Das

Various non-parametric goodness of fit tests have already been investigated in the literature. However, those tests are rarely used in the case of missing observations. We here study the goodness of fit test for missing data based on Lp distances along with Kolmogorov–Smirnov and Cramer–von-Mises distances when missingness occurs at random. The asymptotic distributions of the proposed test statistics have been derived under contiguous alternatives that enable us to investigate the asymptotic local power of the tests. We also study the performance of the tests for finite samples using simulation, and the tests perform well for those cases. The usefulness of the tests is illustrated on three real data sets.

各种非参数拟合优度检验已经在文献中进行了研究。但是,在缺少观测值的情况下很少使用这些测试。本文研究了随机缺失发生时,基于Lp距离以及Kolmogorov-Smirnov和crmer - von- mises距离对缺失数据的拟合优度检验。所提出的检验统计量的渐近分布已经在相邻的备选项下得到,使我们能够研究检验的渐近局部幂。本文还对有限样本下的测试性能进行了仿真研究,结果表明,在这种情况下,测试效果良好。在三个实际数据集上说明了测试的有效性。
{"title":"On distance based goodness of fit tests for missing data when missing occurs at random","authors":"Subhra Sankar Dhar,&nbsp;Ujjwal Das","doi":"10.1111/anzs.12313","DOIUrl":"10.1111/anzs.12313","url":null,"abstract":"<div>\u0000 \u0000 <p>Various non-parametric goodness of fit tests have already been investigated in the literature. However, those tests are rarely used in the case of missing observations. We here study the goodness of fit test for missing data based on <i>L</i><sub><i>p</i></sub> distances along with Kolmogorov–Smirnov and Cramer–von-Mises distances when missingness occurs at random. The asymptotic distributions of the proposed test statistics have been derived under contiguous alternatives that enable us to investigate the asymptotic local power of the tests. We also study the performance of the tests for finite samples using simulation, and the tests perform well for those cases. The usefulness of the tests is illustrated on three real data sets.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12313","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81486366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian decision rules to classification problems 分类问题的贝叶斯决策规则
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-05-24 DOI: 10.1111/anzs.12325
Yuqi Long, Xingzhong Xu

In this paper, we analysed classification rules under Bayesian decision theory. The setup we considered here is fairly general, which can represent all possible parametric models. The Bayes classification rule we investigated minimises the Bayes risk under general loss functions. Among the existing literatures, the 0-1 loss function appears most frequently, under which the Bayes classification rule is determined by the posterior predictive densities. Theoretically, we extended the Bernstein–von Mises theorem to the multiple-sample case. On this basis, the oracle property of Bayes classification rule has been discussed in detail, which refers to the convergence of the Bayes classification rule to the one built from the true distributions, as the sample size tends to infinity. Simulations show that the Bayes classification rules do have some advantages over the traditional classifiers, especially when the number of features approaches the sample size.

本文分析了贝叶斯决策理论下的分类规则。我们在这里考虑的设置是相当普遍的,它可以表示所有可能的参数模型。我们研究的贝叶斯分类规则在一般损失函数下最小化贝叶斯风险。在现有文献中,出现频率最高的是0-1损失函数,在该损失函数下,贝叶斯分类规则由后验预测密度决定。在理论上,我们将Bernstein-von Mises定理推广到多样本情况。在此基础上,详细讨论了贝叶斯分类规则的oracle性,即贝叶斯分类规则在样本量趋于无穷大时收敛于由真实分布构建的分类规则。仿真表明,贝叶斯分类规则确实比传统分类器有一些优势,特别是当特征数量接近样本量时。
{"title":"Bayesian decision rules to classification problems","authors":"Yuqi Long,&nbsp;Xingzhong Xu","doi":"10.1111/anzs.12325","DOIUrl":"10.1111/anzs.12325","url":null,"abstract":"<div>\u0000 \u0000 <p>In this paper, we analysed classification rules under Bayesian decision theory. The setup we considered here is fairly general, which can represent all possible parametric models. The Bayes classification rule we investigated minimises the Bayes risk under general loss functions. Among the existing literatures, the 0-1 loss function appears most frequently, under which the Bayes classification rule is determined by the posterior predictive densities. Theoretically, we extended the Bernstein–von Mises theorem to the multiple-sample case. On this basis, the oracle property of Bayes classification rule has been discussed in detail, which refers to the convergence of the Bayes classification rule to the one built from the true distributions, as the sample size tends to infinity. Simulations show that the Bayes classification rules do have some advantages over the traditional classifiers, especially when the number of features approaches the sample size.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12325","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89269715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Globally intensity-reweighted estimators for K- and pair correlation functions K-和对相关函数的全局强度重加权估计
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-05-17 DOI: 10.1111/anzs.12318
Thomas Shaw, Jesper M⊘ller, Rasmus Plenge Waagepetersen

We introduce new estimators of the inhomogeneous K-function and the pair correlation function of a spatial point process as well as the cross K-function and the cross pair correlation function of a bivariate spatial point process under the assumption of second-order intensity-reweighted stationarity. These estimators rely on a ‘global’ normalisation factor which depends on an aggregation of the intensity function, while the existing estimators depend ‘locally’ on the intensity function at the individual observed points. The advantages of our new global estimators over the existing local estimators are demonstrated by theoretical considerations and a simulation study.

在二阶强度重加权平稳性的假设下,引入了空间点过程的非齐次k函数和对相关函数的新估计,以及二元空间点过程的交叉k函数和交叉对相关函数的新估计。这些估计依赖于一个“全局”归一化因子,该因子依赖于强度函数的集合,而现有的估计依赖于单个观测点的“局部”强度函数。通过理论分析和仿真研究,证明了我们的全局估计器相对于现有的局部估计器的优越性。
{"title":"Globally intensity-reweighted estimators for K- and pair correlation functions","authors":"Thomas Shaw,&nbsp;Jesper M⊘ller,&nbsp;Rasmus Plenge Waagepetersen","doi":"10.1111/anzs.12318","DOIUrl":"10.1111/anzs.12318","url":null,"abstract":"<div>\u0000 \u0000 <p>We introduce new estimators of the inhomogeneous <i>K</i>-function and the pair correlation function of a spatial point process as well as the cross <i>K</i>-function and the cross pair correlation function of a bivariate spatial point process under the assumption of second-order intensity-reweighted stationarity. These estimators rely on a ‘global’ normalisation factor which depends on an aggregation of the intensity function, while the existing estimators depend ‘locally’ on the intensity function at the individual observed points. The advantages of our new global estimators over the existing local estimators are demonstrated by theoretical considerations and a simulation study.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12318","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75341203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A few statistical principles for data science 数据科学的一些统计原则
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-05-08 DOI: 10.1111/anzs.12324
Noel Cressie

In any other circumstance, it might make sense to define the extent of the terrain (Data Science) first, and then locate and describe the landmarks (Principles). But this data revolution we are experiencing defies a cadastral survey. Areas are continually being annexed into Data Science. For example, biometrics was traditionally statistics for agriculture in all its forms but now, in Data Science, it means the study of characteristics that can be used to identify an individual. Examples of non-intrusive measurements include height, weight, fingerprints, retina scan, voice, photograph/video (facial landmarks and facial expressions) and gait. A multivariate analysis of such data would be a complex project for a statistician, but a software engineer might appear to have no trouble with it at all. In any applied-statistics project, the statistician worries about uncertainty and quantifies it by modelling data as realisations generated from a probability space. Another approach to uncertainty quantification is to find similar data sets, and then use the variability of results between these data sets to capture the uncertainty. Both approaches allow ‘error bars’ to be put on estimates obtained from the original data set, although the interpretations are different. A third approach, that concentrates on giving a single answer and gives up on uncertainty quantification, could be considered as Data Engineering, although it has staked a claim in the Data Science terrain. This article presents a few (actually nine) statistical principles for data scientists that have helped me, and continue to help me, when I work on complex interdisciplinary projects.

在任何其他情况下,首先定义地形的范围(数据科学),然后定位和描述地标(原则)可能是有意义的。但我们正在经历的这场数据革命与地籍调查背道而驰。数据科学领域不断被吞并。例如,生物计量学传统上是各种形式的农业统计,但现在,在数据科学中,它意味着对可用于识别个体的特征的研究。非侵入式测量的例子包括身高、体重、指纹、视网膜扫描、声音、照片/视频(面部标志和面部表情)和步态。对于统计学家来说,对这些数据进行多变量分析将是一个复杂的项目,但软件工程师似乎完全没有问题。在任何应用统计学项目中,统计学家都担心不确定性,并通过将数据建模为从概率空间生成的实现来量化不确定性。不确定性量化的另一种方法是找到相似的数据集,然后利用这些数据集之间结果的可变性来捕捉不确定性。这两种方法都允许在从原始数据集获得的估计值上放置“误差条”,尽管解释不同。第三种方法,专注于给出单一答案,放弃不确定性量化,可以被认为是数据工程,尽管它在数据科学领域占有一席之地。本文为数据科学家提供了一些(实际上是9条)统计原则,当我从事复杂的跨学科项目时,这些原则已经并将继续帮助我。
{"title":"A few statistical principles for data science","authors":"Noel Cressie","doi":"10.1111/anzs.12324","DOIUrl":"10.1111/anzs.12324","url":null,"abstract":"<div>\u0000 \u0000 <p>In any other circumstance, it might make sense to define the extent of the terrain (Data Science) first, and then locate and describe the landmarks (Principles). But this data revolution we are experiencing defies a cadastral survey. Areas are continually being annexed into Data Science. For example, biometrics was traditionally statistics for agriculture in all its forms but now, in Data Science, it means the study of characteristics that can be used to identify an individual. Examples of non-intrusive measurements include height, weight, fingerprints, retina scan, voice, photograph/video (facial landmarks and facial expressions) and gait. A multivariate analysis of such data would be a complex project for a statistician, but a software engineer might appear to have no trouble with it at all. In any applied-statistics project, the statistician worries about uncertainty and quantifies it by modelling data as realisations generated from a probability space. Another approach to uncertainty quantification is to find similar data sets, and then use the variability of results between these data sets to capture the uncertainty. Both approaches allow ‘error bars’ to be put on estimates obtained from the original data set, although the interpretations are different. A third approach, that concentrates on giving a single answer and gives up on uncertainty quantification, could be considered as Data Engineering, although it has staked a claim in the Data Science terrain. This article presents a few (actually nine) statistical principles for data scientists that have helped me, and continue to help me, when I work on complex interdisciplinary projects.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12324","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82540496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Information criteria for inhomogeneous spatial point processes 非齐次空间点过程的信息准则
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-05-08 DOI: 10.1111/anzs.12327
Achmad Choiruddin, Jean-François Coeurjolly, Rasmus Waagepetersen

The theoretical foundation for a number of model selection criteria is established in the context of inhomogeneous point processes and under various asymptotic settings: infill, increasing domain and combinations of these. For inhomogeneous Poisson processes we consider Akaike's information criterion and the Bayesian information criterion, and in particular we identify the point process analogue of ‘sample size’ needed for the Bayesian information criterion. Considering general inhomogeneous point processes we derive new composite likelihood and composite Bayesian information criteria for selecting a regression model for the intensity function. The proposed model selection criteria are evaluated using simulations of Poisson processes and cluster point processes.

在非齐次点过程和各种渐近设置下建立了许多模型选择准则的理论基础:填充,增加域和这些的组合。对于非齐次泊松过程,我们考虑赤池的信息准则和贝叶斯信息准则,特别是我们确定了贝叶斯信息准则所需的“样本量”的点过程模拟。考虑到一般的非齐次点过程,我们提出了新的复合似然和复合贝叶斯信息准则来选择强度函数的回归模型。利用泊松过程和聚类点过程的模拟对所提出的模型选择准则进行了评估。
{"title":"Information criteria for inhomogeneous spatial point processes","authors":"Achmad Choiruddin,&nbsp;Jean-François Coeurjolly,&nbsp;Rasmus Waagepetersen","doi":"10.1111/anzs.12327","DOIUrl":"10.1111/anzs.12327","url":null,"abstract":"<div>\u0000 \u0000 <p>The theoretical foundation for a number of model selection criteria is established in the context of inhomogeneous point processes and under various asymptotic settings: infill, increasing domain and combinations of these. For inhomogeneous Poisson processes we consider Akaike's information criterion and the Bayesian information criterion, and in particular we identify the point process analogue of ‘sample size’ needed for the Bayesian information criterion. Considering general inhomogeneous point processes we derive new composite likelihood and composite Bayesian information criteria for selecting a regression model for the intensity function. The proposed model selection criteria are evaluated using simulations of Poisson processes and cluster point processes.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12327","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80429406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
期刊
Australian & New Zealand Journal of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1