Australian & New Zealand Journal of Statistics最新文献

英文中文

Asymptotic Normality of Generalised Edge Frequency Polygon Estimator for Dependent Data 相关数据广义边频多边形估计的渐近正态性

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY

Australian & New Zealand Journal of Statistics

Pub Date : 2026-02-18 DOI: 10.1111/anzs.70036

Yan Wang, Xuejun Wang, Aiting Shen

Density function estimation is a cornerstone of statistical analysis. This paper focuses on the generalised edge frequency polygon estimator, establishing its asymptotic normality for identically distributed $� � � α � � �$ -mixing random variables. This finding complements the asymptotic theory outlined by Dong and Zheng (2001. Generalized edge frequency polygon for density estimation. Statistics and Probability Letters, 55, 137–145). Theoretical results are substantiated through simulations that assess finite-sample performance and an analysis of a real-world dataset.

密度函数估计是统计分析的基础。研究了同分布α $$ alpha $$ -混合随机变量的广义边频多边形估计，建立了其渐近正态性。这一发现补充了Dong和Zheng（2001）提出的渐近理论。密度估计的广义边缘频率多边形。统计与概率快报，55,137-145)。理论结果通过评估有限样本性能的模拟和对现实世界数据集的分析得到证实。

引用次数: 0

R (and Dialects) versus Python for Data Science R（和方言）与Python的数据科学

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY

Australian & New Zealand Journal of Statistics

Pub Date : 2026-02-18 DOI: 10.1111/anzs.70041

Norman Matloff

R and Python are the two dominant language tools for data science today. Which one is better, and in what senses? This paper explores such questions, in terms of aspects such as learning curve, clarity of expression, coding philosophy, high-performance computing capability and so on. Also, the article treats base-R and the tidyverse as two separate ‘dialects’ of R, so that the above comparisons are in many cases tripartite in nature.

R和Python是当今数据科学的两种主要语言工具。哪一个更好，在什么意义上？本文从学习曲线、表达清晰度、编码理念、高性能计算能力等方面对这些问题进行了探讨。此外，本文将base R和tidyverse视为R的两种独立的“方言”，因此上述比较在许多情况下是三方的。

引用次数: 0

Asymptotic Normality of Generalised Edge Frequency Polygon Estimator for Dependent Data 相关数据广义边频多边形估计的渐近正态性

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY

Australian & New Zealand Journal of Statistics

Pub Date : 2026-02-18 DOI: 10.1111/anzs.70036

Yan Wang, Xuejun Wang, Aiting Shen

引用次数: 0

Model Averaging of Partially Linear Multinomial Logit Model for Multi-Categorical Data 多类别数据的部分线性多项式Logit模型的模型平均

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY

Australian & New Zealand Journal of Statistics

Pub Date : 2026-02-12 DOI: 10.1111/anzs.70040

Jialei Liu, Jing Lv

Model averaging for categorical response variables has gained a lot of attention in recent years. To further improve the prediction accuracy, we present a partially linear multinomial logit model averaging (PLMLMA) technique. Our candidate models are built by selecting each continuous covariate in turn as the index variable of the non-parametric function, thus avoiding both the artificial selection of index variables and the curse of dimensionality in estimation. The model averaging weights are determined by minimising the Kullback-Leibler (KL) loss. We demonstrate asymptotic optimality by showing that the KL loss between the true model and the model where the log-odds ratio is estimated by the ‘working’ log-odds ratio is asymptotically equivalent to that of the best but impractical model averaging estimator. Furthermore, we establish the convergence rate of the weight estimator without assuming that the true model is included among the candidate models. The superior performance of the proposed method is evidenced by lower mean squared error (MSE) and higher hit rate (HR) in simulations, outperforming various competitors. We also apply our method to wheat variety classification to illustrate the merits of PLMLMA.

分类响应变量的模型平均方法近年来受到了广泛的关注。为了进一步提高预测精度，提出了一种部分线性多项式logit模型平均（PLMLMA）技术。我们的候选模型是通过依次选择每个连续协变量作为非参数函数的指标变量来构建的，从而避免了指标变量的人为选择和估计中的维数祸害。模型的平均权重是通过最小化Kullback-Leibler （KL）损失来确定的。我们通过显示真实模型和由“工作”对数-比值比估计的对数-比值比的模型之间的KL损失渐近等效于最佳但不切实际的模型平均估计器的KL损失来证明渐近最优性。此外，在不假设候选模型中包含真实模型的情况下，我们建立了权估计器的收敛速率。仿真结果表明，该方法具有较低的均方误差（MSE）和较高的命中率（HR），优于各种竞争对手。并将该方法应用于小麦品种分类，以说明PLMLMA的优点。

{"title":"Model Averaging of Partially Linear Multinomial Logit Model for Multi-Categorical Data","authors":"Jialei Liu, Jing Lv","doi":"10.1111/anzs.70040","DOIUrl":"10.1111/anzs.70040","url":null,"abstract":"<div>\u0000 \u0000 <p>Model averaging for categorical response variables has gained a lot of attention in recent years. To further improve the prediction accuracy, we present a partially linear multinomial logit model averaging (PLMLMA) technique. Our candidate models are built by selecting each continuous covariate in turn as the index variable of the non-parametric function, thus avoiding both the artificial selection of index variables and the curse of dimensionality in estimation. The model averaging weights are determined by minimising the Kullback-Leibler (KL) loss. We demonstrate asymptotic optimality by showing that the KL loss between the true model and the model where the log-odds ratio is estimated by the ‘working’ log-odds ratio is asymptotically equivalent to that of the best but impractical model averaging estimator. Furthermore, we establish the convergence rate of the weight estimator without assuming that the true model is included among the candidate models. The superior performance of the proposed method is evidenced by lower mean squared error (MSE) and higher hit rate (HR) in simulations, outperforming various competitors. We also apply our method to wheat variety classification to illustrate the merits of PLMLMA.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"68 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2026-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146217042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Random-Effects Approach to Regression Involving Many Categorical Predictors and Their Interactions 涉及许多分类预测因子及其相互作用的随机效应回归方法

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY

Australian & New Zealand Journal of Statistics

Pub Date : 2026-01-23 DOI: 10.1111/anzs.70034

Hanmei Sun, Jiangshan Zhang, Jiming Jiang

Linear model prediction with a large number of potential predictors is both statistically and computationally challenging. The traditional approaches are largely based on shrinkage selection/estimation methods, which are applicable even when the number of potential predictors is (much) larger than the sample size. A situation of the latter scenario occurs when the candidate predictors involve many binary indicators corresponding to categories of some categorical predictors as well as their interactions. We propose an alternative approach to the shrinkage prediction methods in such a case based on mixed model prediction, which effectively treats combinations of the categorical effects as random effects. We establish theoretical validity of the proposed method and demonstrate empirically its advantage over the shrinkage methods. We also develop measures of uncertainty for the proposed method and evaluate their performance empirically. A real-data example is considered.

具有大量潜在预测因子的线性模型预测在统计和计算上都具有挑战性。传统的方法主要基于收缩选择/估计方法，即使潜在预测因子的数量（大大）大于样本量也适用。当候选预测因子涉及与某些分类预测因子的类别及其相互作用相对应的许多二元指标时，就会出现后一种情况。我们提出了一种基于混合模型预测的收缩预测方法，该方法有效地将分类效应的组合视为随机效应。我们建立了该方法的理论有效性，并实证证明了其相对于收缩方法的优势。我们还为所提出的方法开发了不确定度度量，并对其性能进行了实证评估。考虑一个实际数据示例。

引用次数: 0

The Impact of R on Statistical Science for Spatial Point Processes R对空间点过程统计科学的影响

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY

Australian & New Zealand Journal of Statistics

Pub Date : 2026-01-23 DOI: 10.1111/anzs.70037

Adrian Baddeley, Ege Rubak, Rolf Turner

A spatial point pattern is a dataset representing the observed locations of things or events, such as disease cases, distant galaxies, trees, crimes, earthquakes or road accidents. The stochastic mechanism that generated the data is called a spatial point process. The statistical analysis of such data has been completely transformed by the availability of R. The R environment has enabled fundamental methodological research to proceed hand in hand with software development, leading to substantial advances in statistical methodology for spatial point processes which were immediately applicable to real data. This article is a broad historical account of the breakthroughs in statistical methodology for spatial point processes that were facilitated by R software, specifically the authors' package spatstat. It focuses on spatial point process modelling, model-fitting, model diagnostics, challenges to statistical inference, lessons to be learned and new challenges for future research.

空间点模式是表示观察到的事物或事件的位置的数据集，例如疾病病例、遥远星系、树木、犯罪、地震或道路事故。产生数据的随机机制称为空间点过程。这种数据的统计分析已经完全被R的可用性所改变。R环境使基本的方法研究与软件开发携手并进，导致空间点过程的统计方法的实质性进展，这些方法立即适用于实际数据。本文对空间点过程的统计方法的突破进行了广泛的历史描述，这些突破是由R软件促进的，特别是作者的spatstat包。它侧重于空间点过程建模、模型拟合、模型诊断、统计推断的挑战、吸取的教训和未来研究的新挑战。

引用次数: 0

Co-Clustering Analysis of Multi-Layer Directed Networks: A Spectral Approach 多层有向网络的共聚类分析：一种谱方法

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY

Australian & New Zealand Journal of Statistics

Pub Date : 2025-12-25 DOI: 10.1111/anzs.70035

Xiangchao Li, Qing Yang

This article focuses on community detection in multi-layer-directed degree-corrected block models. A spectral co-clustering method building upon a bias-adjusted aggregation matrix is proposed to identify the community memberships. We theoretically demonstrate the efficacy of our approach by controlling the proportion of misclustered nodes. Both simulations and real-data analysis provide empirical evidence supporting the effectiveness of our proposed methodology.

本文主要研究多层定向度校正块模型中的社区检测问题。提出了一种基于偏差调整聚合矩阵的谱共聚类方法来识别群落成员。我们通过控制错聚节点的比例从理论上证明了我们的方法的有效性。模拟和实际数据分析都提供了经验证据，支持我们提出的方法的有效性。

引用次数: 0

Addendum to ‘The Incremental Progression From Fixed to Random Factors in the Analysis of Variance: A New Synthesis’ 《方差分析中从固定因素到随机因素的增量进展：一种新的综合》附录

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY

Australian & New Zealand Journal of Statistics

Pub Date : 2025-11-25 DOI: 10.1111/anzs.70022

Anderson, M. J., Gorley, R. N., and Terlizzi, A. (2025). “ The Incremental Progression From Fixed to Random Factors in the Analysis of Variance: A New Synthesis.” Australian & New Zealand Journal of Statistics 67(1): 3–30.

This article was presented on Tuesday, 3 December 2024, as the inaugural live-streamed Australian & New Zealand Journal of Statistics Read Paper at the New Zealand Statistical Association (NZSA) Conference held at Victoria University of Wellington, New Zealand, with discussion, including presentations by invited discussants.

The title of the paper should have been ‘The Incremental Progression From Fixed to Random Factors in the Analysis of Variance: A New Synthesis (with discussion)’.

The written discussion comments and authors' response should have appeared alongside the article upon publication. These are now added to the original article.

安德森，m.j., Gorley, r.n.和Terlizzi， A.（2025）。方差分析中从固定因素到随机因素的渐进式进展：一种新的综合。澳大利亚&新西兰统计杂志67(1):3-30。本文于2024年12月3日（星期二）在新西兰惠灵顿维多利亚大学举行的新西兰统计协会（NZSA）会议上作为《澳大利亚和新西兰统计杂志》的首篇直播阅读论文发表，并进行了讨论，包括特邀嘉宾的演讲。论文的标题应该是“方差分析中从固定因素到随机因素的渐进进展：一种新的综合（含讨论）”。书面讨论意见和作者的回应应该在文章发表时出现在文章旁边。这些现在被添加到原始文章中。

引用次数: 0

Seminal Ideas and Controversies in Statistics. By Roderick J. A. Little, Boca Raton, FL: CRC Press, 2025. 243 pp. AU$120 (Paperback). ISBN: 978-1-032-49356-5 统计学中的开创性思想和争议。罗德里克·j·a·利特尔著，佛罗里达州博卡拉顿：CRC出版社，2025年。243页，120澳元（平装本）。ISBN: 978-1-032-49356-5

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY

Australian & New Zealand Journal of Statistics

Pub Date : 2025-11-06 DOI: 10.1111/anzs.70030

Luke R. Lloyd-Jones

引用次数: 0

Examining the Interface Design of Tidyverse 浅析《Tidyverse》的界面设计

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY

Australian & New Zealand Journal of Statistics

Pub Date : 2025-11-05 DOI: 10.1111/anzs.70031

Emi Tanaka

The tidyverse is a popular meta-package comprising several core R packages to aid in various data science tasks, including data import, manipulation and visualisation. Although functionalities offered by the tidyverse can generally be replicated using other packages, its widespread adoption in both teaching and practice indicates there are factors contributing to its preference, despite some debate over its usage. This suggests that particular aspects, such as interface design, may play a significant role in its selection. Examining the interface design can potentially reveal aspects that aid the design process for developers. While Tidyverse has been lauded for adopting a user-centred design, arguably some elements of the design focus on the work domain instead of the end user. We examine the Tidyverse interface design via the lens of human–computer interaction, with an emphasis on data visualisation and data wrangling, to identify factors that might serve as a model for developers designing their packages. We recommend that developers adopt an iterative design that is informed by user feedback, analysis and complete coverage of the work domain, and ensure perceptual visibility of system constraints and relationships.

tidyverse是一个流行的元包，由几个核心R包组成，可以帮助完成各种数据科学任务，包括数据导入、操作和可视化。尽管tidyverse提供的功能通常可以使用其他软件包复制，但它在教学和实践中的广泛采用表明，尽管对其使用存在一些争议，但仍有一些因素促成了对它的偏爱。这表明，某些方面，如界面设计，可能在其选择中发挥重要作用。检查界面设计可以潜在地揭示有助于开发人员设计过程的方面。虽然Tidyverse因采用以用户为中心的设计而受到称赞，但有争议的是，设计的一些元素关注的是工作领域而不是最终用户。我们通过人机交互的镜头来检查Tidyverse界面设计，重点是数据可视化和数据整理，以确定可能作为开发人员设计其软件包的模型的因素。我们建议开发人员采用由用户反馈、分析和工作域的完整覆盖所通知的迭代设计，并确保系统约束和关系的感知可见性。

{"title":"Examining the Interface Design of Tidyverse","authors":"Emi Tanaka","doi":"10.1111/anzs.70031","DOIUrl":"https://doi.org/10.1111/anzs.70031","url":null,"abstract":"<div>\u0000 \u0000 <p>The <span>tidyverse</span> is a popular meta-package comprising several core <span>R</span> packages to aid in various data science tasks, including data import, manipulation and visualisation. Although functionalities offered by the <span>tidyverse</span> can generally be replicated using other packages, its widespread adoption in both teaching and practice indicates there are factors contributing to its preference, despite some debate over its usage. This suggests that particular aspects, such as interface design, may play a significant role in its selection. Examining the interface design can potentially reveal aspects that aid the design process for developers. While Tidyverse has been lauded for adopting a user-centred design, arguably some elements of the design focus on the work domain instead of the end user. We examine the Tidyverse interface design via the lens of human–computer interaction, with an emphasis on data visualisation and data wrangling, to identify factors that might serve as a model for developers designing their packages. We recommend that developers adopt an iterative design that is informed by user feedback, analysis and complete coverage of the work domain, and ensure perceptual visibility of system constraints and relationships.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"68 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2025-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146091082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Australian & New Zealand Journal of Statistics

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀