首页 > 最新文献

Australian & New Zealand Journal of Statistics最新文献

英文 中文
Variable selection using penalised likelihoods for point patterns on a linear network 使用惩罚似然对线性网络上的点模式进行变量选择
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-10-18 DOI: 10.1111/anzs.12341
Suman Rakshit, Greg McSwiggan, Gopalan Nair, Adrian Baddeley

Motivated by the analysis of a comprehensive database of road traffic accidents, we investigate methods of variable selection for spatial point process models on a linear network. The original data may include explanatory spatial covariates, such as road curvature, and ‘mark’ variables attributed to individual accidents, such as accident severity. The treatment of mark variables is new. Variable selection is applied to the canonical covariates, which may include spatial covariate effects, mark effects and mark-covariate interactions. We approximate the likelihood of the point process model by that of a generalised linear model, in such a way that spatial covariates and marks are both associated with canonical covariates. We impose a convex penalty on the log likelihood, principally the elastic-net penalty, and maximise the penalised loglikelihood by cyclic coordinate ascent. A simulation study compares the performances of the lasso, ridge regression and elastic-net methods of variable selection on their ability to select variables correctly, and on their bias and standard error. Standard techniques for selecting the regularisation parameter γ often yielded unsatisfactory results. We propose two new rules for selecting γ which are designed to have better performance. The methods are tested on a small dataset on crimes in a Chicago neighbourhood, and applied to a large dataset of road traffic accidents in Western Australia.

通过对道路交通事故综合数据库的分析,研究了线性网络空间点过程模型的变量选择方法。原始数据可能包括解释性空间协变量,如道路曲率,以及归因于个别事故的“标记”变量,如事故严重程度。标记变量的处理是新的。变量选择应用于典型协变量,其中可能包括空间协变量效应、标记效应和标记-协变量相互作用。我们通过广义线性模型近似点过程模型的似然,以这样一种方式,空间协变量和标记都与正则协变量相关联。我们在对数似然上施加一个凸惩罚,主要是弹性网惩罚,并通过循环坐标上升最大化惩罚的对数似然。仿真研究比较了套索、脊回归和弹性网三种变量选择方法正确选择变量的能力,以及它们的偏差和标准误差。选择正则化参数γ的标准技术常常产生不满意的结果。我们提出了两个新的选择γ的规则,它们具有更好的性能。这些方法在芝加哥社区的一个小型犯罪数据集上进行了测试,并应用于西澳大利亚州的一个大型道路交通事故数据集。
{"title":"Variable selection using penalised likelihoods for point patterns on a linear network","authors":"Suman Rakshit,&nbsp;Greg McSwiggan,&nbsp;Gopalan Nair,&nbsp;Adrian Baddeley","doi":"10.1111/anzs.12341","DOIUrl":"10.1111/anzs.12341","url":null,"abstract":"<div>\u0000 \u0000 <p>Motivated by the analysis of a comprehensive database of road traffic accidents, we investigate methods of variable selection for spatial point process models on a linear network. The original data may include explanatory spatial covariates, such as road curvature, and ‘mark’ variables attributed to individual accidents, such as accident severity. The treatment of mark variables is new. Variable selection is applied to the canonical covariates, which may include spatial covariate effects, mark effects and mark-covariate interactions. We approximate the likelihood of the point process model by that of a generalised linear model, in such a way that spatial covariates and marks are both associated with canonical covariates. We impose a convex penalty on the log likelihood, principally the elastic-net penalty, and maximise the penalised loglikelihood by cyclic coordinate ascent. A simulation study compares the performances of the lasso, ridge regression and elastic-net methods of variable selection on their ability to select variables correctly, and on their bias and standard error. Standard techniques for selecting the regularisation parameter <i>γ</i> often yielded unsatisfactory results. We propose two new rules for selecting <i>γ</i> which are designed to have better performance. The methods are tested on a small dataset on crimes in a Chicago neighbourhood, and applied to a large dataset of road traffic accidents in Western Australia.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90533201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
ECM algorithm for estimating vector ARMA model with variance gamma distribution and possible unbounded density 用ECM算法估计具有方差分布和可能无界密度的向量ARMA模型
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-10-18 DOI: 10.1111/anzs.12340
Thanakorn Nitithumbundit, Jennifer S.K. Chan

The simultaneous analysis of several financial time series is salient in portfolio setting and risk management. This paper proposes a novel alternating expectation conditional maximisation (AECM) algorithm to estimate the vector autoregressive moving average (VARMA) model with variance gamma (VG) error distribution in the multivariate skewed setting. We explain why the VARMA-VG model is suitable for high-frequency returns (HFRs) because VG distribution provides thick tails to capture the high kurtosis in the data and unbounded central density further captures the majority of near-zero HFRs. The distribution can also be expressed in normal-mean-variance mixtures to facilitate model implementation using the Bayesian or expectation maximisation (EM) approach. We adopt the EM approach to avoid the time-consuming Markov chain Monto Carlo sampling and solve the unbounded density problem in the classical maximum likelihood estimation. We conduct extensive simulation studies to evaluate the accuracy of the proposed AECM estimator and apply the models to analyse the dependency between two HFR series from the time zones that only differ by one hour.

同时分析多个金融时间序列在投资组合设置和风险管理中具有重要意义。本文提出了一种新的交替期望条件最大化(AECM)算法,用于估计多元偏态设置下具有方差伽玛(VG)误差分布的向量自回归移动平均(VARMA)模型。我们解释了为什么VARMA-VG模型适用于高频回报(HFRs),因为VG分布提供了厚尾来捕获数据中的高峰度,无界中心密度进一步捕获了大多数接近零的HFRs。分布也可以用正态-均值-方差混合表示,以方便使用贝叶斯或期望最大化(EM)方法实现模型。采用EM方法避免了耗时的马尔可夫链蒙特卡罗采样,解决了经典极大似然估计中的无界密度问题。我们进行了广泛的模拟研究,以评估所提出的AECM估计器的准确性,并应用模型来分析两个仅相差一小时的时区HFR序列之间的相关性。
{"title":"ECM algorithm for estimating vector ARMA model with variance gamma distribution and possible unbounded density","authors":"Thanakorn Nitithumbundit,&nbsp;Jennifer S.K. Chan","doi":"10.1111/anzs.12340","DOIUrl":"https://doi.org/10.1111/anzs.12340","url":null,"abstract":"<div>\u0000 \u0000 <p>The simultaneous analysis of several financial time series is salient in portfolio setting and risk management. This paper proposes a novel alternating expectation conditional maximisation (AECM) algorithm to estimate the vector autoregressive moving average (VARMA) model with variance gamma (VG) error distribution in the multivariate skewed setting. We explain why the VARMA-VG model is suitable for high-frequency returns (HFRs) because VG distribution provides thick tails to capture the high kurtosis in the data and unbounded central density further captures the majority of near-zero HFRs. The distribution can also be expressed in normal-mean-variance mixtures to facilitate model implementation using the Bayesian or expectation maximisation (EM) approach. We adopt the EM approach to avoid the time-consuming Markov chain Monto Carlo sampling and solve the unbounded density problem in the classical maximum likelihood estimation. We conduct extensive simulation studies to evaluate the accuracy of the proposed AECM estimator and apply the models to analyse the dependency between two HFR series from the time zones that only differ by one hour.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"137538704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Inverse G-Wishart distribution and variational message passing 逆G-Wishart分布与变分消息传递
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-10-07 DOI: 10.1111/anzs.12339
Luca Maestrini, Matt P. Wand

Message passing on a factor graph is a powerful paradigm for the coding of approximate inference algorithms for arbitrarily large graphical models. The notion of a factor graph fragment allows for compartmentalisation of algebra and computer code. We show that the Inverse G-Wishart family of distributions enables fundamental variational message passing factor graph fragments to be expressed elegantly and succinctly. Such fragments arise in models for which approximate inference concerning covariance matrix or variance parameters is made, and are ubiquitous in contemporary statistics and machine learning.

在因子图上传递消息是为任意大型图形模型编写近似推理算法的强大范例。因子图片段的概念允许代数和计算机代码的划分。我们证明了逆G-Wishart分布族使基本变分消息传递因子图片段能够优雅而简洁地表达。这种片段出现在对协方差矩阵或方差参数进行近似推理的模型中,在当代统计学和机器学习中无处不在。
{"title":"The Inverse G-Wishart distribution and variational message passing","authors":"Luca Maestrini,&nbsp;Matt P. Wand","doi":"10.1111/anzs.12339","DOIUrl":"10.1111/anzs.12339","url":null,"abstract":"<div>\u0000 \u0000 <p>Message passing on a factor graph is a powerful paradigm for the coding of approximate inference algorithms for arbitrarily large graphical models. The notion of a factor graph fragment allows for compartmentalisation of algebra and computer code. We show that the Inverse G-Wishart family of distributions enables fundamental variational message passing factor graph fragments to be expressed elegantly and succinctly. Such fragments arise in models for which approximate inference concerning covariance matrix or variance parameters is made, and are ubiquitous in contemporary statistics and machine learning.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81925035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
An adequacy approach for deciding the number of clusters for OTRIMLE robust Gaussian mixture-based clustering 基于OTRIMLE鲁棒高斯混合聚类的聚类数量决定的充分性方法
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-09-03 DOI: 10.1111/anzs.12338
Christian Hennig, Pietro Coretto

We introduce a new approach to deciding the number of clusters. The approach is applied to Optimally Tuned Robust Improper Maximum Likelihood Estimation (OTRIMLE; Coretto & Hennig, Journal of the American Statistical Association 111, 1648–1659) of a Gaussian mixture model allowing for observations to be classified as ‘noise’, but it can be applied to other clustering methods as well. The quality of a clustering is assessed by a statistic Q that measures how close the within-cluster distributions are to elliptical unimodal distributions that have the only mode in the mean. This non-parametric measure allows for non-Gaussian clusters as long as they have a good quality according to Q. The simplicity of a model is assessed by a measure S that prefers a smaller number of clusters unless additional clusters can reduce the estimated noise proportion substantially. The simplest model is then chosen that is adequate for the data in the sense that its observed value of Q is not significantly larger than what is expected for data truly generated from the fitted model, as can be assessed by parametric bootstrap. The approach is compared with model-based clustering using the Bayesian information criterion (BIC) and the integrated complete likelihood (ICL) in a simulation study and on two real data sets.

我们介绍了一种确定集群数量的新方法。该方法应用于最优调谐鲁棒不当极大似然估计(OTRIMLE;Coretto,Hennig,《美国统计协会杂志》(Journal of American Statistical Association),第111期,1648-1659期),他提出了一种高斯混合模型,该模型允许将观测结果归类为“噪声”,但它也可以应用于其他聚类方法。聚类的质量是通过统计量Q来评估的,该统计量Q测量聚类内分布与椭圆单峰分布的接近程度,椭圆单峰分布的唯一模式是在平均值中。这种非参数度量允许非高斯聚类,只要它们根据q具有良好的质量。模型的简单性由度量S评估,该度量S倾向于较少数量的聚类,除非额外的聚类可以大幅降低估计的噪声比例。然后选择最简单的模型,该模型适合于数据,因为其观察到的Q值不会显著大于从拟合模型真正生成的数据的预期值,可以通过参数自举来评估。在仿真研究和两个真实数据集上,将该方法与基于贝叶斯信息准则(BIC)和集成完全似然(ICL)的模型聚类方法进行了比较。
{"title":"An adequacy approach for deciding the number of clusters for OTRIMLE robust Gaussian mixture-based clustering","authors":"Christian Hennig,&nbsp;Pietro Coretto","doi":"10.1111/anzs.12338","DOIUrl":"10.1111/anzs.12338","url":null,"abstract":"<p>We introduce a new approach to deciding the number of clusters. The approach is applied to Optimally Tuned Robust Improper Maximum Likelihood Estimation (OTRIMLE; Coretto &amp; Hennig, <i>Journal of the American Statistical Association</i> <b>111</b>, 1648–1659) of a Gaussian mixture model allowing for observations to be classified as ‘noise’, but it can be applied to other clustering methods as well. The quality of a clustering is assessed by a statistic <i>Q</i> that measures how close the within-cluster distributions are to elliptical unimodal distributions that have the only mode in the mean. This non-parametric measure allows for non-Gaussian clusters as long as they have a good quality according to <i>Q</i>. The simplicity of a model is assessed by a measure <i>S</i> that prefers a smaller number of clusters unless additional clusters can reduce the estimated noise proportion substantially. The simplest model is then chosen that is adequate for the data in the sense that its observed value of <i>Q</i> is not significantly larger than what is expected for data truly generated from the fitted model, as can be assessed by parametric bootstrap. The approach is compared with model-based clustering using the Bayesian information criterion (BIC) and the integrated complete likelihood (ICL) in a simulation study and on two real data sets.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12338","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75692546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
What is the effective sample size of a spatial point process? 空间点过程的有效样本量是多少?
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-07-21 DOI: 10.1111/anzs.12337
Ian W. Renner, David I. Warton, Francis K.C. Hui

Point process models are a natural approach for modelling data that arise as point events. In the case of Poisson counts, these may be fitted easily as a weighted Poisson regression. Point processes lack the notion of sample size. This is problematic for model selection, because various classical criteria such as the Bayesian information criterion (BIC) are a function of the sample size, n, and are derived in an asymptotic framework where n tends to infinity. In this paper, we develop an asymptotic result for Poisson point process models in which the observed number of point events, m, plays the role that sample size does in the classical regression context. Following from this result, we derive a version of BIC for point process models, and when fitted via penalised likelihood, conditions for the LASSO penalty that ensure consistency in estimation and the oracle property. We discuss challenges extending these results to the wider class of Gibbs models, of which the Poisson point process model is a special case.

点过程模型是对作为点事件产生的数据进行建模的自然方法。在泊松计数的情况下,这些可以很容易地拟合为加权泊松回归。点过程缺乏样本大小的概念。这对于模型选择是有问题的,因为各种经典准则,如贝叶斯信息准则(BIC)是样本量n的函数,并且是在n趋于无穷大的渐近框架中导出的。在本文中,我们开发了泊松点过程模型的渐近结果,其中观察到的点事件数m在经典回归环境中起着样本大小的作用。根据这一结果,我们为点过程模型导出了一个版本的BIC,当通过惩罚似然进行拟合时,LASSO惩罚的条件确保了估计和oracle属性的一致性。我们讨论了将这些结果扩展到更广泛的吉布斯模型的挑战,其中泊松点过程模型是一个特例。
{"title":"What is the effective sample size of a spatial point process?","authors":"Ian W. Renner,&nbsp;David I. Warton,&nbsp;Francis K.C. Hui","doi":"10.1111/anzs.12337","DOIUrl":"10.1111/anzs.12337","url":null,"abstract":"<div>\u0000 \u0000 <p>Point process models are a natural approach for modelling data that arise as point events. In the case of Poisson counts, these may be fitted easily as a weighted Poisson regression. Point processes lack the notion of sample size. This is problematic for model selection, because various classical criteria such as the Bayesian information criterion (BIC) are a function of the sample size, <i>n</i>, and are derived in an asymptotic framework where <i>n</i> tends to infinity. In this paper, we develop an asymptotic result for Poisson point process models in which the observed number of point events, <i>m</i>, plays the role that sample size does in the classical regression context. Following from this result, we derive a version of BIC for point process models, and when fitted via penalised likelihood, conditions for the LASSO penalty that ensure consistency in estimation and the oracle property. We discuss challenges extending these results to the wider class of Gibbs models, of which the Poisson point process model is a special case.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12337","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81154600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Anna Karenina and the two envelopes problem 安娜·卡列尼娜和两个信封的问题
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-07-21 DOI: 10.1111/anzs.12329
R. D. Gill

The Anna Karenina principle is named after the opening sentence in the eponymous novel: Happy families are all alike; every unhappy family is unhappy in its own way. The two envelopes problem (TEP) is a much-studied paradox in probability theory, mathematical economics, logic and philosophy. Time and again a new analysis is published in which an author claims finally to explain what actually goes wrong in this paradox. Each author (the present author included) emphasises what is new in their approach and concludes that earlier approaches did not get to the root of the matter. We observe that though a logical argument is only correct if every step is correct, an apparently logical argument which goes astray can be thought of as going astray at different places. This leads to a comparison between the literature on TEP and a successful movie franchise: it generates a succession of sequels, and even prequels, each with a different director who approaches the same basic premise in a personal way. We survey resolutions in the literature with a view to synthesis, correct common errors, and give a new theorem on order properties of an exchangeable pair of random variables, at the heart of most TEP variants and interpretations. A theorem on asymptotic independence between the amount in your envelope and the question whether it is smaller or larger shows that the pathological situation of improper priors or infinite expectation values has consequences as we merely approach such a situation.

安娜·卡列尼娜原则是以同名小说的开头一句话命名的:幸福的家庭都是相似的;不幸的家庭各有各的不幸。双信封问题(TEP)是概率论、数理经济学、逻辑学和哲学中一个被广泛研究的悖论。不断有新的分析发表,其中作者声称最终解释了这个悖论到底出了什么问题。每位作者(包括本作者)都强调了他们方法中的新内容,并得出结论说,以前的方法没有触及问题的根源。我们注意到,虽然逻辑论证只有在每一步都正确的情况下才是正确的,但一个表面上合乎逻辑的论证,如果误入歧途,可以认为是在不同的地方误入歧途。这让我们将TEP的文学作品与成功的电影系列进行比较:它产生了一系列续集,甚至前传,每一部都有不同的导演,以个人的方式处理相同的基本前提。我们调查了文献中的决议,以综合,纠正常见错误,并给出了一个关于可交换随机变量对的阶性质的新定理,这是大多数TEP变体和解释的核心。一个关于你信封里的数量和它是大还是小的问题之间的渐近独立的定理表明,当我们仅仅接近这种情况时,不当先验或无限期望值的病态情况就会产生后果。
{"title":"Anna Karenina and the two envelopes problem","authors":"R. D. Gill","doi":"10.1111/anzs.12329","DOIUrl":"10.1111/anzs.12329","url":null,"abstract":"<div>\u0000 \u0000 <p>The Anna Karenina principle is named after the opening sentence in the eponymous novel: Happy families are all alike; every unhappy family is unhappy in its own way. The two envelopes problem (TEP) is a much-studied paradox in probability theory, mathematical economics, logic and philosophy. Time and again a new analysis is published in which an author claims finally to explain what actually goes wrong in this paradox. Each author (the present author included) emphasises what is new in their approach and concludes that earlier approaches did not get to the root of the matter. We observe that though a logical argument is only correct if every step is correct, an apparently logical argument which goes astray can be thought of as going astray at different places. This leads to a comparison between the literature on TEP and a successful movie franchise: it generates a succession of sequels, and even prequels, each with a different director who approaches the same basic premise in a personal way. We survey resolutions in the literature with a view to synthesis, correct common errors, and give a new theorem on order properties of an exchangeable pair of random variables, at the heart of most TEP variants and interpretations. A theorem on asymptotic independence between the amount in your envelope and the question whether it is smaller or larger shows that the pathological situation of improper priors or infinite expectation values has consequences as we merely approach such a situation.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12329","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80260052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Festschrift for Adrian Baddeley 阿德里安·巴德利的欢宴
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-07-21 DOI: 10.1111/anzs.12322
Martin L. Hazelton, R. Turner

This article introduces a special issue of the Australian and New Zealand Journal of Statistics, being a Festschrift for Adrian Baddeley on the occasion of his 65th birthday.

这篇文章介绍了澳大利亚和新西兰统计杂志的特刊,作为阿德里安·巴德利65岁生日的纪念礼物。
{"title":"A Festschrift for Adrian Baddeley","authors":"Martin L. Hazelton,&nbsp;R. Turner","doi":"10.1111/anzs.12322","DOIUrl":"10.1111/anzs.12322","url":null,"abstract":"<div>\u0000 \u0000 <p>This article introduces a special issue of the Australian and New Zealand Journal of Statistics, being a Festschrift for Adrian Baddeley on the occasion of his 65th birthday.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12322","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73814182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dependent radius marks of Laguerre tessellations: a case study 拉盖尔镶嵌的依赖半径标记:一个案例研究
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-07-21 DOI: 10.1111/anzs.12314
Dietrich Stoyan, Viktor Beneš, Filip Seitl

We study a particular marked three-dimensional point process sample that represents a Laguerre tessellation. It comes from a polycrystalline sample of aluminium alloy material. The ‘points’ are the cell generators while the ‘marks’ are radius marks that control the size and shape of the tessellation cells. Our statistical mark correlation analyses show that the marks of the sample are in clear and plausible spatial correlation: the marks of generators close together tend to be small and similar and the form of the correlation functions does not justify geostatistical marking. We show that a simplified modelling of tessellations by Laguerre tessellations with independent radius marks may lead to wrong results. When we started from the aluminium alloy data and generated random marks by random permutation we obtained tessellations with characteristics quite different from the original ones. We observed similar behaviour for simulated Laguerre tessellations. This fact, which seems to be natural for the given data type, makes fitting of models to empirical Laguerre tessellations quite difficult: the generator points and radius marks have to be modelled simultaneously. This may imply that the reconstruction methods are more efficient than point-process modelling if only samples of similar Laguerre tessellations are needed. We also found that literature recipes for bandwidth choice for estimating correlation functions should be used with care.

我们研究了一个特殊的标记三维点过程样本,它代表了拉盖尔镶嵌。它来自于铝合金材料的多晶样品。“点”是单元生成器,而“标记”是半径标记,控制镶嵌单元的大小和形状。我们的统计标记相关性分析表明,样品的标记具有清晰而合理的空间相关性:靠近在一起的发电机的标记往往小而相似,相关函数的形式不能证明地质统计标记。我们表明,通过具有独立半径标记的拉盖尔镶嵌来简化镶嵌建模可能会导致错误的结果。当我们从铝合金数据出发,通过随机排列产生随机标记时,我们得到了与原始特征截然不同的镶嵌。我们在模拟拉盖尔镶嵌中观察到类似的行为。对于给定的数据类型来说,这似乎是很自然的事实,但这使得模型拟合到经验拉盖尔镶嵌非常困难:生成器点和半径标记必须同时建模。这可能意味着,如果只需要类似拉盖尔镶嵌的样本,重建方法比点过程建模更有效。我们还发现,应该谨慎使用用于估计相关函数的带宽选择的文献配方。
{"title":"Dependent radius marks of Laguerre tessellations: a case study","authors":"Dietrich Stoyan,&nbsp;Viktor Beneš,&nbsp;Filip Seitl","doi":"10.1111/anzs.12314","DOIUrl":"10.1111/anzs.12314","url":null,"abstract":"<div>\u0000 \u0000 <p>We study a particular marked three-dimensional point process sample that represents a Laguerre tessellation. It comes from a polycrystalline sample of aluminium alloy material. The ‘points’ are the cell generators while the ‘marks’ are radius marks that control the size and shape of the tessellation cells. Our statistical mark correlation analyses show that the marks of the sample are in clear and plausible spatial correlation: the marks of generators close together tend to be small and similar and the form of the correlation functions does not justify geostatistical marking. We show that a simplified modelling of tessellations by Laguerre tessellations with independent radius marks may lead to wrong results. When we started from the aluminium alloy data and generated random marks by random permutation we obtained tessellations with characteristics quite different from the original ones. We observed similar behaviour for simulated Laguerre tessellations. This fact, which seems to be natural for the given data type, makes fitting of models to empirical Laguerre tessellations quite difficult: the generator points and radius marks have to be modelled simultaneously. This may imply that the reconstruction methods are more efficient than point-process modelling if only samples of similar Laguerre tessellations are needed. We also found that literature recipes for bandwidth choice for estimating correlation functions should be used with care.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12314","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80348739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Modelling temporal genetic and spatio-temporal residual effects for high-throughput phenotyping data* 高通量表型数据的时间遗传和时空残留效应建模*
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-07-20 DOI: 10.1111/anzs.12336
A. P. Verbyla, J. De Faveri, D. M. Deery, G. J. Rebetzke

High-throughput phenomics data are being collected in both the laboratory and the field. The data are often collected at many time points and there may be spatial variation in the laboratory or field that impacts on the growth of the plants, and that may influence the traits of interest. Modelling the genetic effects is of primary interest in such studies, but these effects might be biased if non-genetic effects present in the experiment are ignored. With data that are collected both in time and space, there may be a need to jointly model these multi-dimensional non-genetic effects. Thus both modelling of genetic effects over time and non-genetic effects over time and space in a one-stage analysis is considered. An experiment that involves field phenomics data with four dimensions, two in space and two in time, provides the vehicle to examine the models. Factor analytic (FA) models are often used for genetic effects for different environments to provide reliable estimates of genetic variances and correlations. As the time dimension defines the environments, FA models are examined for the phenomics data. Reduced rank tensor smoothing splines are presented as a possible approach for modelling the spatio-temporal effects, although an additional term is included for heterogeneity over the two time dimensions. This approach is feasible, although very time-consuming. The process of model selection for the genetic effects is presented including tests, information criteria and diagnostics. Comparisons of more simplistic models are made with the reduced rank tensor spline. This also shows the interplay between the genetic and residual models in model selection.

高通量表型组学数据正在实验室和实地收集。数据通常在许多时间点收集,实验室或田间可能存在空间差异,影响植物的生长,并可能影响感兴趣的性状。模拟遗传效应是这类研究的主要兴趣,但如果忽略实验中存在的非遗传效应,这些效应可能会有偏差。有了在时间和空间上收集的数据,可能有必要共同建立这些多维非遗传效应的模型。因此,在单阶段分析中考虑了遗传效应随时间的建模和非遗传效应随时间和空间的建模。一项涉及四个维度(两个空间维度和两个时间维度)的现场表型组学数据的实验,为检验这些模型提供了工具。因子分析(FA)模型通常用于不同环境的遗传效应,以提供遗传方差和相关性的可靠估计。由于时间维度定义了环境,因此对表型组学数据进行了FA模型检验。降低秩张量平滑样条被提出作为一种可能的方法来模拟时空效应,尽管在两个时间维度上的异质性包括一个额外的术语。这种方法是可行的,尽管非常耗时。介绍了遗传效应的模型选择过程,包括测试、信息标准和诊断。用降阶张量样条对更简单的模型进行了比较。这也说明了遗传模型和残差模型在模型选择中的相互作用。
{"title":"Modelling temporal genetic and spatio-temporal residual effects for high-throughput phenotyping data*","authors":"A. P. Verbyla,&nbsp;J. De Faveri,&nbsp;D. M. Deery,&nbsp;G. J. Rebetzke","doi":"10.1111/anzs.12336","DOIUrl":"10.1111/anzs.12336","url":null,"abstract":"<div>\u0000 \u0000 <p>High-throughput phenomics data are being collected in both the laboratory and the field. The data are often collected at many time points and there may be spatial variation in the laboratory or field that impacts on the growth of the plants, and that may influence the traits of interest. Modelling the genetic effects is of primary interest in such studies, but these effects might be biased if non-genetic effects present in the experiment are ignored. With data that are collected both in time and space, there may be a need to jointly model these multi-dimensional non-genetic effects. Thus both modelling of genetic effects over time and non-genetic effects over time and space in a one-stage analysis is considered. An experiment that involves field phenomics data with four dimensions, two in space and two in time, provides the vehicle to examine the models. Factor analytic (FA) models are often used for genetic effects for different environments to provide reliable estimates of genetic variances and correlations. As the time dimension defines the environments, FA models are examined for the phenomics data. Reduced rank tensor smoothing splines are presented as a possible approach for modelling the spatio-temporal effects, although an additional term is included for heterogeneity over the two time dimensions. This approach is feasible, although very time-consuming. The process of model selection for the genetic effects is presented including tests, information criteria and diagnostics. Comparisons of more simplistic models are made with the reduced rank tensor spline. This also shows the interplay between the genetic and residual models in model selection.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12336","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89156281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Conditional intensity: A powerful tool for modelling and analysing point process data 条件强度:一个强大的工具,用于建模和分析点过程数据
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2021-07-06 DOI: 10.1111/anzs.12331
Peter J. Diggle

The conditional intensity function of a spatial point process describes how the probability that a point of the process occurs ‘at’ a particular point in its carrier space depends on the realisation of the process in the remainder of the carrier space. Provided that the point process is simple, the conditional intensity determines all of the properties of the process, in particular its likelihood function. In this paper, we review the use of the conditional intensity function in the formulation of point process models and in making inferences from point process data, giving separate consideration to temporal, spatial and spatiotemporal settings. We argue that the conditional intensity function should take centre-stage in spatiotemporal point process modelling and analysis.

空间点过程的条件强度函数描述了过程的一个点在其载波空间中的特定点发生的概率如何取决于该过程在载波空间的剩余部分中的实现。假设点过程是简单的,则条件强度决定了过程的所有属性,特别是其似然函数。在本文中,我们回顾了条件强度函数在点过程模型的制定和从点过程数据推断中的使用,并分别考虑了时间、空间和时空设置。我们认为条件强度函数应该在时空点过程建模和分析中占据中心位置。
{"title":"Conditional intensity: A powerful tool for modelling and analysing point process data","authors":"Peter J. Diggle","doi":"10.1111/anzs.12331","DOIUrl":"10.1111/anzs.12331","url":null,"abstract":"<div>\u0000 \u0000 <p>The conditional intensity function of a spatial point process describes how the probability that a point of the process occurs ‘at’ a particular point in its carrier space depends on the realisation of the process in the remainder of the carrier space. Provided that the point process is simple, the conditional intensity determines all of the properties of the process, in particular its likelihood function. In this paper, we review the use of the conditional intensity function in the formulation of point process models and in making inferences from point process data, giving separate consideration to temporal, spatial and spatiotemporal settings. We argue that the conditional intensity function should take centre-stage in spatiotemporal point process modelling and analysis.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2021-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/anzs.12331","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86952633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Australian & New Zealand Journal of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1