首页 > 最新文献

Australian & New Zealand Journal of Statistics最新文献

英文 中文
The role of pairwise matching in experimental design for an incidence outcome 成对匹配在实验设计中对发生率结果的作用
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2023-11-27 DOI: 10.1111/anzs.12403
Adam Kapelner, Abba M. Krieger, David Azriel
We consider the problem of evaluating designs for a two-arm randomised experiment with an incidence (binary) outcome under a non-parametric general response model. Our two main results are that the a priori pair matching design is (1) the optimal design as measured by mean squared error among all block designs which includes complete randomisation. And (2), this pair-matching design is minimax, that is, it provides the lowest mean squared error under an adversarial response model. Theoretical results are supported by simulations and clinical trial data where we demonstrate the superior performance of pairwise matching designs under realistic conditions.
我们考虑在非参数一般反应模型下评估具有发生率(二元)结果的双臂随机实验设计的问题。我们的两个主要结果是,先验配对设计是(1)在包括完全随机化的所有块设计中,以均方误差衡量的最佳设计。(2)这种配对设计是minimax的,即在对抗响应模型下,它提供了最小的均方误差。理论结果得到了模拟和临床试验数据的支持,在这些数据中,我们证明了在现实条件下成对匹配设计的优越性能。
{"title":"The role of pairwise matching in experimental design for an incidence outcome","authors":"Adam Kapelner, Abba M. Krieger, David Azriel","doi":"10.1111/anzs.12403","DOIUrl":"https://doi.org/10.1111/anzs.12403","url":null,"abstract":"We consider the problem of evaluating designs for a two-arm randomised experiment with an incidence (binary) outcome under a non-parametric general response model. Our two main results are that the a priori pair matching design is (1) the optimal design as measured by mean squared error among all block designs which includes complete randomisation. And (2), this pair-matching design is minimax, that is, it provides the lowest mean squared error under an adversarial response model. Theoretical results are supported by simulations and clinical trial data where we demonstrate the superior performance of pairwise matching designs under realistic conditions.","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138517672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Measurement errors in semi‐parametric generalised regression models 半参数广义回归模型的测量误差
4区 数学 Q3 Mathematics Pub Date : 2023-10-11 DOI: 10.1111/anzs.12400
Mohammad W. Hattab, David Ruppert
Summary Regression models that ignore measurement error in predictors may produce highly biased estimates leading to erroneous inferences. It is well known that it is extremely difficult to take measurement error into account in Gaussian non‐parametric regression. This problem becomes even more difficult when considering other families such as binary, Poisson and negative binomial regression. We present a novel method aiming to correct for measurement error when estimating regression functions. Our approach is sufficiently flexible to cover virtually all distributions and link functions regularly considered in generalised linear models. This approach depends on approximating the first and the second moment of the response after integrating out the true unobserved predictors in any semi‐parametric generalised regression model. By the latter is meant a model with both linear and non‐parametric effects that are connected to the mean response by a link function and with a response distribution in an exponential family or quasi‐likelihood model. Unlike previous methods, the method we now propose is not restricted to truncated splines and can utilise various basis functions. Moreover, it can operate without making any distributional assumption about the unobserved predictor. Through extensive simulation studies, we study the performance of our method under many scenarios.
忽略预测量测量误差的回归模型可能产生高度偏倚的估计,导致错误的推断。众所周知,在高斯非参数回归中考虑测量误差是非常困难的。当考虑到其他的类,如二元回归、泊松回归和负二项回归时,这个问题变得更加困难。我们提出了一种新的方法来校正回归函数估计时的测量误差。我们的方法足够灵活,几乎涵盖了广义线性模型中经常考虑的所有分布和链接函数。这种方法依赖于在任何半参数广义回归模型中积分出真实的未观察到的预测因子后,逼近响应的第一和第二时刻。后者是指具有线性和非参数效应的模型,这些效应通过链接函数与平均响应相连接,并具有指数族或准似然模型中的响应分布。与以前的方法不同,我们现在提出的方法不局限于截断样条,可以利用各种基函数。此外,它可以在没有对未观察到的预测器做出任何分布假设的情况下运行。通过大量的仿真研究,我们研究了该方法在多种场景下的性能。
{"title":"Measurement errors in semi‐parametric generalised regression models","authors":"Mohammad W. Hattab, David Ruppert","doi":"10.1111/anzs.12400","DOIUrl":"https://doi.org/10.1111/anzs.12400","url":null,"abstract":"Summary Regression models that ignore measurement error in predictors may produce highly biased estimates leading to erroneous inferences. It is well known that it is extremely difficult to take measurement error into account in Gaussian non‐parametric regression. This problem becomes even more difficult when considering other families such as binary, Poisson and negative binomial regression. We present a novel method aiming to correct for measurement error when estimating regression functions. Our approach is sufficiently flexible to cover virtually all distributions and link functions regularly considered in generalised linear models. This approach depends on approximating the first and the second moment of the response after integrating out the true unobserved predictors in any semi‐parametric generalised regression model. By the latter is meant a model with both linear and non‐parametric effects that are connected to the mean response by a link function and with a response distribution in an exponential family or quasi‐likelihood model. Unlike previous methods, the method we now propose is not restricted to truncated splines and can utilise various basis functions. Moreover, it can operate without making any distributional assumption about the unobserved predictor. Through extensive simulation studies, we study the performance of our method under many scenarios.","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136212682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparisons of distributions of Australian mental health scores 澳大利亚心理健康得分分布的比较
4区 数学 Q3 Mathematics Pub Date : 2023-10-11 DOI: 10.1111/anzs.12399
D. Gunawan, William E. Griffiths, D. Chotikapanich
Summary Bayesian non‐parametric estimates of Australian distributions of mental health scores are obtained to assess how the mental health status of the population has changed over time, and to compare the mental health status of female/male and Aboriginal/non‐Aboriginal population subgroups. First‐order and second‐order stochastic dominance are used to compare distributions, with results presented in terms of the posterior probability of dominance and the posterior probability of no dominance. If a criterion for dominance is satisfied, then, in terms of that criterion, the mental health status of the dominant population is superior to that of the dominated population. If neither distribution is dominant, then the mental health status of neither population is superior in the same sense. Our results suggest mental health has deteriorated in recent years, that males' mental health status is better than that of females, and that non‐Aboriginal health status is better than that of the Aboriginal population.
获得澳大利亚心理健康得分分布的贝叶斯非参数估计,以评估人口的心理健康状况如何随时间变化,并比较女性/男性和土著/非土著人口亚群的心理健康状况。一阶和二阶随机优势被用来比较分布,结果以优势的后验概率和无优势的后验概率表示。如果满足优势的标准,那么,根据该标准,优势群体的心理健康状况优于劣势群体。如果两种分布都不占优势,那么在同样的意义上,两种人口的心理健康状况都不占优势。结果显示,近年来原住民的心理健康状况有所恶化,男性的心理健康状况好于女性,非原住民的心理健康状况好于原住民。
{"title":"Comparisons of distributions of Australian mental health scores","authors":"D. Gunawan, William E. Griffiths, D. Chotikapanich","doi":"10.1111/anzs.12399","DOIUrl":"https://doi.org/10.1111/anzs.12399","url":null,"abstract":"Summary Bayesian non‐parametric estimates of Australian distributions of mental health scores are obtained to assess how the mental health status of the population has changed over time, and to compare the mental health status of female/male and Aboriginal/non‐Aboriginal population subgroups. First‐order and second‐order stochastic dominance are used to compare distributions, with results presented in terms of the posterior probability of dominance and the posterior probability of no dominance. If a criterion for dominance is satisfied, then, in terms of that criterion, the mental health status of the dominant population is superior to that of the dominated population. If neither distribution is dominant, then the mental health status of neither population is superior in the same sense. Our results suggest mental health has deteriorated in recent years, that males' mental health status is better than that of females, and that non‐Aboriginal health status is better than that of the Aboriginal population.","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136212528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Embedding latent class regression and latent class distal outcome models into cluster-weighted latent class analysis: a detailed simulation experiment 将潜在类别回归和潜在类别远端结果模型嵌入聚类加权潜在类别分析:一个详细的模拟实验
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2023-09-22 DOI: 10.1111/anzs.12396
Roberto Di Mari, Antonio Punzo, Zsuzsa Bakk

Usually in latent class (LC) analysis, external predictors are taken to be cluster conditional probability predictors (LC models with external predictors), and/or score conditional probability predictors (LC regression models). In such cases, their distribution is not of interest. Class-specific distribution is of interest in the distal outcome model, when the distribution of the external variables is assumed to depend on LC membership. In this paper, we consider a more general formulation, that embeds both the LC regression and the distal outcome models, as is typically done in cluster-weighted modelling. This allows us to investigate (1) whether the distribution of the external variables differs across classes, (2) whether there are significant direct effects of the external variables on the indicators, by modelling jointly the relationship between the external and the latent variables. We show the advantages of the proposed modelling approach through a set of artificial examples, an extensive simulation study and an empirical application about psychological contracts among employees and employers in Belgium and the Netherlands.

通常在潜在类别(LC)分析中,外部预测因子被视为聚类条件概率预测因子(具有外部预测因子的LC模型)和/或评分条件概率预测函数(LC回归模型)。在这种情况下,他们的分配是不感兴趣的。当假设外部变量的分布取决于LC成员时,类特异性分布在远端结果模型中是感兴趣的。在本文中,我们考虑了一个更通用的公式,它嵌入了LC回归和远端结果模型,就像在聚类加权模型中通常做的那样。这使我们能够通过联合建模外部变量和潜在变量之间的关系,研究(1)外部变量的分布是否在不同类别中不同,(2)外部变量对指标是否有显著的直接影响。我们通过一组人工示例、一项广泛的模拟研究以及比利时和荷兰员工和雇主心理契约的实证应用,展示了所提出的建模方法的优势。
{"title":"Embedding latent class regression and latent class distal outcome models into cluster-weighted latent class analysis: a detailed simulation experiment","authors":"Roberto Di Mari,&nbsp;Antonio Punzo,&nbsp;Zsuzsa Bakk","doi":"10.1111/anzs.12396","DOIUrl":"https://doi.org/10.1111/anzs.12396","url":null,"abstract":"<p>Usually in latent class (LC) analysis, external predictors are taken to be cluster conditional probability predictors (LC models with external predictors), and/or score conditional probability predictors (LC regression models). In such cases, their distribution is not of interest. Class-specific distribution is of interest in the distal outcome model, when the distribution of the external variables is assumed to depend on LC membership. In this paper, we consider a more general formulation, that embeds both the LC regression and the distal outcome models, as is typically done in cluster-weighted modelling. This allows us to investigate (1) whether the distribution of the external variables differs across classes, (2) whether there are significant direct effects of the external variables on the indicators, by modelling jointly the relationship between the external and the latent variables. We show the advantages of the proposed modelling approach through a set of artificial examples, an extensive simulation study and an empirical application about psychological contracts among employees and employers in Belgium and the Netherlands.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2023-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12396","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50141418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian modelling of effects of prenatal alcohol exposure on child cognition based on data from multiple cohorts 基于多个队列数据的产前酒精暴露对儿童认知影响的贝叶斯建模
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2023-09-08 DOI: 10.1111/anzs.12397
Khue-Dung Dang, Louise M. Ryan, Tugba Akkaya Hocagil, Richard J. Cook, Gale A. Richardson, Nancy L. Day, Claire D. Coles, Heather Carmichael Olson, Sandra W. Jacobson, Joseph L. Jacobson

High levels of prenatal alcohol exposure (PAE) result in significant cognitive deficits in children, but the exact nature of the dose-response relationship is less well understood. To investigate this relationship, data were assembled from six longitudinal birth cohort studies examining the effects of PAE on cognitive outcomes from early school age through adolescence. Structural equation models (SEMs) are a natural approach to consider, because of the way they conceptualise multiple observed outcomes as relating to an underlying latent variable of interest, which can then be modelled as a function of exposure and other predictors of interest. However, conventional SEMs could not be fitted in this context because slightly different outcome measures were used in the six studies. In this paper we propose a multi-group Bayesian SEM that maps the unobserved cognition variable to a broad range of observed outcomes. The relation between these variables and PAE is then examined while controlling for potential confounders via propensity score adjustment. By examining different possible dose-response functions, the proposed framework is used to investigate whether there is a threshold PAE level that results in minimal cognitive deficit.

高水平的产前酒精暴露(PAE)会导致儿童出现显著的认知缺陷,但剂量-反应关系的确切性质尚不清楚。为了研究这种关系,从六项纵向出生队列研究中收集了数据,这些研究考察了从学龄早期到青春期PAE对认知结果的影响。结构方程模型是一种自然的考虑方法,因为它们将多个观察到的结果概念化为与潜在的感兴趣变量有关,然后可以将其建模为暴露和其他感兴趣预测因素的函数。然而,由于六项研究中使用的结果指标略有不同,因此传统的SEMs无法适用于这种情况。在本文中,我们提出了一种多组贝叶斯SEM,将未观察到的认知变量映射到广泛的观察结果。然后检查这些变量与PAE之间的关系,同时通过倾向评分调整来控制潜在的混杂因素。通过检查不同可能的剂量反应函数,所提出的框架用于研究是否存在导致最小认知缺陷的阈值PAE水平。
{"title":"Bayesian modelling of effects of prenatal alcohol exposure on child cognition based on data from multiple cohorts","authors":"Khue-Dung Dang,&nbsp;Louise M. Ryan,&nbsp;Tugba Akkaya Hocagil,&nbsp;Richard J. Cook,&nbsp;Gale A. Richardson,&nbsp;Nancy L. Day,&nbsp;Claire D. Coles,&nbsp;Heather Carmichael Olson,&nbsp;Sandra W. Jacobson,&nbsp;Joseph L. Jacobson","doi":"10.1111/anzs.12397","DOIUrl":"https://doi.org/10.1111/anzs.12397","url":null,"abstract":"<div>\u0000 \u0000 <p>High levels of prenatal alcohol exposure (PAE) result in significant cognitive deficits in children, but the exact nature of the dose-response relationship is less well understood. To investigate this relationship, data were assembled from six longitudinal birth cohort studies examining the effects of PAE on cognitive outcomes from early school age through adolescence. Structural equation models (SEMs) are a natural approach to consider, because of the way they conceptualise multiple observed outcomes as relating to an underlying latent variable of interest, which can then be modelled as a function of exposure and other predictors of interest. However, conventional SEMs could not be fitted in this context because slightly different outcome measures were used in the six studies. In this paper we propose a multi-group Bayesian SEM that maps the unobserved cognition variable to a broad range of observed outcomes. The relation between these variables and PAE is then examined while controlling for potential confounders via propensity score adjustment. By examining different possible dose-response functions, the proposed framework is used to investigate whether there is a threshold PAE level that results in minimal cognitive deficit.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2023-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50125566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The multivariate component zero-inflated Poisson model for correlated count data analysis 用于相关计数数据分析的多元零膨胀泊松模型
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2023-08-27 DOI: 10.1111/anzs.12395
Qin Wu, Guo-Liang Tian, Tao Li, Man-Lai Tang, Chi Zhang

Multivariate zero-inflated Poisson (ZIP) distributions are important tools for modelling and analysing correlated count data with extra zeros. Unfortunately, existing multivariate ZIP distributions consider only the overall zero-inflation while the component zero-inflation is not well addressed. This paper proposes a flexible multivariate ZIP distribution, called the multivariate component ZIP distribution, in which both the overall and component zero-inflations are taken into account. Likelihood-based inference procedures including the calculation of maximum likelihood estimates of parameters in the model without and with covariates are provided. Simulation studies indicate that the performance of the proposed methods on the multivariate component ZIP model is satisfactory. The Australia health care utilisation data set is analysed to demonstrate that the new distribution is more appropriate than the existing multivariate ZIP distributions.

多变量零膨胀泊松(ZIP)分布是建模和分析具有额外零的相关计数数据的重要工具。不幸的是,现有的多变量ZIP分布只考虑总体零通货膨胀,而零通货膨胀分量没有得到很好的解决。本文提出了一种灵活的多元ZIP分布,称为多元分量ZIP分布,其中同时考虑了整体膨胀和零分量膨胀。提供了基于似然的推理程序,包括在没有协变量和有协变量的情况下计算模型中参数的最大似然估计。仿真研究表明,所提出的方法在多元分量ZIP模型上的性能是令人满意的。对澳大利亚医疗保健利用率数据集进行了分析,以证明新的分布比现有的多变量ZIP分布更合适。
{"title":"The multivariate component zero-inflated Poisson model for correlated count data analysis","authors":"Qin Wu,&nbsp;Guo-Liang Tian,&nbsp;Tao Li,&nbsp;Man-Lai Tang,&nbsp;Chi Zhang","doi":"10.1111/anzs.12395","DOIUrl":"https://doi.org/10.1111/anzs.12395","url":null,"abstract":"<div>\u0000 \u0000 <p>Multivariate zero-inflated Poisson (ZIP) distributions are important tools for modelling and analysing correlated count data with extra zeros. Unfortunately, existing multivariate ZIP distributions consider only the overall zero-inflation while the component zero-inflation is not well addressed. This paper proposes a flexible multivariate ZIP distribution, called the multivariate component ZIP distribution, in which both the overall and component zero-inflations are taken into account. Likelihood-based inference procedures including the calculation of maximum likelihood estimates of parameters in the model without and with covariates are provided. Simulation studies indicate that the performance of the proposed methods on the multivariate component ZIP model is satisfactory. The Australia health care utilisation data set is analysed to demonstrate that the new distribution is more appropriate than the existing multivariate ZIP distributions.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2023-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50145271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Short-term forecasting with a computationally efficient nonparametric transfer function model 一种计算高效的非参数传递函数模型的短期预测
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2023-08-01 DOI: 10.1111/anzs.12394
Jun. M. Liu

In this paper a semi-parametric approach is developed to model non-linear relationships in time series data using polynomial splines. Polynomial splines require very little assumption about the functional form of the underlying relationship, so they are very flexible and can be used to model highly non-linear relationships. Polynomial splines are also computationally very efficient. The serial correlation in the data is accounted for by modelling the noise as an autoregressive integrated moving average (ARIMA) process, by doing so, the efficiency in nonparametric estimation is improved and correct inferences can be obtained. The explicit structure of the ARIMA model allows the correlation information to be used to improve forecasting performance. An algorithm is developed to automatically select and estimate the polynomial spline model and the ARIMA model through backfitting. This method is applied on a real-life data set to forecast hourly electricity usage. The non-linear effect of temperature on hourly electricity usage is allowed to be different at different hours of the day and days of the week. The forecasting performance of the developed method is evaluated in post-sample forecasting and compared with several well-accepted models. The results show the performance of the proposed model is comparable with a long short-term memory deep learning model.

本文提出了一种使用多项式样条对时间序列数据中的非线性关系进行建模的半参数方法。多项式样条几乎不需要对基础关系的函数形式进行假设,因此它们非常灵活,可以用于建模高度非线性的关系。多项式样条在计算上也是非常有效的。通过将噪声建模为自回归积分移动平均(ARIMA)过程来解释数据中的序列相关性,通过这样做,提高了非参数估计的效率,并可以获得正确的推断。ARIMA模型的显式结构允许使用相关性信息来提高预测性能。开发了一种通过反拟合自动选择和估计多项式样条模型和ARIMA模型的算法。该方法应用于实际数据集,以预测每小时用电量。允许温度对每小时用电量的非线性影响在一天中的不同时间和一周中的不同日子是不同的。在样本后预测中评估了所开发方法的预测性能,并与几种公认的模型进行了比较。结果表明,该模型的性能与长短期记忆深度学习模型相当。
{"title":"Short-term forecasting with a computationally efficient nonparametric transfer function model","authors":"Jun. M. Liu","doi":"10.1111/anzs.12394","DOIUrl":"https://doi.org/10.1111/anzs.12394","url":null,"abstract":"<div>\u0000 \u0000 <p>In this paper a semi-parametric approach is developed to model non-linear relationships in time series data using polynomial splines. Polynomial splines require very little assumption about the functional form of the underlying relationship, so they are very flexible and can be used to model highly non-linear relationships. Polynomial splines are also computationally very efficient. The serial correlation in the data is accounted for by modelling the noise as an autoregressive integrated moving average (ARIMA) process, by doing so, the efficiency in nonparametric estimation is improved and correct inferences can be obtained. The explicit structure of the ARIMA model allows the correlation information to be used to improve forecasting performance. An algorithm is developed to automatically select and estimate the polynomial spline model and the ARIMA model through backfitting. This method is applied on a real-life data set to forecast hourly electricity usage. The non-linear effect of temperature on hourly electricity usage is allowed to be different at different hours of the day and days of the week. The forecasting performance of the developed method is evaluated in post-sample forecasting and compared with several well-accepted models. The results show the performance of the proposed model is comparable with a long short-term memory deep learning model.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50114984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Asymptotics of M-estimator in multivariate linear regression models for a class of random errors 一类随机误差的多元线性回归模型中M-估计的渐近性
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2023-07-21 DOI: 10.1111/anzs.12393
Yi Wu, Wei Yu, Xuejun Wang

It is known that linear regression models have immense applications in various areas such as engineering technology, economics and social sciences. In this paper, we investigate the asymptotic properties of M-estimator in multivariate linear regression model based on a class of random errors satisfying a generalised Bernstein-type inequality. By using the generalised Bernstein-type inequality, we obtain a general result on almost sure convergence for a class of random variables and then obtain the strong consistency for the M-estimator in multivariate linear regression models under some mild conditions. The result extends or improves some existing ones in the literature. Moreover, we also consider the case when the dimension $p$ tends to infinity by establishing the rate of almost sure convergence for a class of random variables satisfying generalised Bernstein-type inequality. Some numerical simulations are also provided to verify the validity of the theoretical results.

众所周知,线性回归模型在工程技术、经济和社会科学等各个领域都有着巨大的应用。本文研究了基于一类满足广义Bernstein型不等式的随机误差的多元线性回归模型中M-估计量的渐近性质。利用广义Bernstein型不等式,我们得到了一类随机变量几乎肯定收敛的一般结果,并在一些温和条件下得到了多元线性回归模型中M-估计量的强一致性。该结果扩展或改进了文献中已有的一些结果。此外,我们还通过建立一类满足广义Bernstein型不等式的随机变量的几乎肯定收敛率,来考虑维数$p$趋于无穷大的情况。通过数值模拟验证了理论结果的正确性。
{"title":"Asymptotics of M-estimator in multivariate linear regression models for a class of random errors","authors":"Yi Wu,&nbsp;Wei Yu,&nbsp;Xuejun Wang","doi":"10.1111/anzs.12393","DOIUrl":"https://doi.org/10.1111/anzs.12393","url":null,"abstract":"<div>\u0000 \u0000 <p>It is known that linear regression models have immense applications in various areas such as engineering technology, economics and social sciences. In this paper, we investigate the asymptotic properties of <i>M</i>-estimator in multivariate linear regression model based on a class of random errors satisfying a generalised Bernstein-type inequality. By using the generalised Bernstein-type inequality, we obtain a general result on almost sure convergence for a class of random variables and then obtain the strong consistency for the <i>M</i>-estimator in multivariate linear regression models under some mild conditions. The result extends or improves some existing ones in the literature. Moreover, we also consider the case when the dimension $p$ tends to infinity by establishing the rate of almost sure convergence for a class of random variables satisfying generalised Bernstein-type inequality. Some numerical simulations are also provided to verify the validity of the theoretical results.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50148711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the selection of predictors by using greedy algorithms and information theoretic criteria 利用贪婪算法和信息论准则选择预测因子
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2023-06-29 DOI: 10.1111/anzs.12387
Fangyao Li, Christopher M. Triggs, Ciprian Doru Giurcăneanu

We discuss the use of the following greedy algorithms in the prediction of multivariate time series: Matching Pursuit Algorithm (MPA), Orthogonal Matching Pursuit (OMP), Relaxed Matching Pursuit (RMP), Frank–Wolfe Algorithm (FWA) and Constrained Matching Pursuit (CMP). The last two are known to be solvers for the lasso problem. Some of the algorithms are well-known (e.g. OMP), while others are less popular (e.g. RMP). We provide a unified presentation of all the algorithms, and evaluate their computational complexity for the high-dimensional case and for the big data case. We show how 12 information theoretic (IT) criteria can be used jointly with the greedy algorithms. As part of this effort, we derive new theoretical results that allow modification of the IT criteria such that to be compatible with RMP. The prediction capabilities are tested in experiments with two data sets. The first one involves air pollution data measured in Auckland (New Zealand) and the second one concerns the House Price Index in England (the United Kingdom).

我们讨论了以下贪婪算法在多变量时间序列预测中的应用:匹配追踪算法(MPA)、正交匹配追踪(OMP)、松弛匹配追踪(RMP)、Frank–Wolfe算法(FWA)和约束匹配追踪(CMP)。最后两个已知是套索问题的求解器。一些算法是众所周知的(例如OMP),而另一些算法则不那么流行(例如RMP)。我们提供了所有算法的统一表示,并评估了它们在高维情况和大数据情况下的计算复杂性。我们展示了如何将12个信息论(IT)准则与贪婪算法结合使用。作为这项工作的一部分,我们得出了新的理论结果,允许修改IT标准,使其与RMP兼容。在两个数据集的实验中测试了预测能力。第一个涉及奥克兰(新西兰)的空气污染数据,第二个涉及英国(英国)的房价指数。
{"title":"On the selection of predictors by using greedy algorithms and information theoretic criteria","authors":"Fangyao Li,&nbsp;Christopher M. Triggs,&nbsp;Ciprian Doru Giurcăneanu","doi":"10.1111/anzs.12387","DOIUrl":"https://doi.org/10.1111/anzs.12387","url":null,"abstract":"<p>We discuss the use of the following greedy algorithms in the prediction of multivariate time series: Matching Pursuit Algorithm (MPA), Orthogonal Matching Pursuit (OMP), Relaxed Matching Pursuit (RMP), Frank–Wolfe Algorithm (FWA) and Constrained Matching Pursuit (CMP). The last two are known to be solvers for the lasso problem. Some of the algorithms are well-known (e.g. OMP), while others are less popular (e.g. RMP). We provide a unified presentation of all the algorithms, and evaluate their computational complexity for the high-dimensional case and for the big data case. We show how 12 information theoretic (IT) criteria can be used jointly with the greedy algorithms. As part of this effort, we derive new theoretical results that allow modification of the IT criteria such that to be compatible with RMP. The prediction capabilities are tested in experiments with two data sets. The first one involves air pollution data measured in Auckland (New Zealand) and the second one concerns the House Price Index in England (the United Kingdom).</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12387","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50155532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Visual assessment of matrix-variate normality 矩阵变量正态性的可视化评估
IF 1.1 4区 数学 Q3 Mathematics Pub Date : 2023-06-17 DOI: 10.1111/anzs.12388
Nikola Počuča, Michael P.B. Gallaugher, Katharine M. Clark, Paul D. McNicholas

In recent years, the analysis of three-way data has become ever more prevalent in the literature. It is becoming increasingly common to analyse such data by means of matrix-variate distributions, the most prevalent of which is the matrix-variate normal distribution. Although many methods exist for assessing multivariate normality, there is a relative paucity of approaches for assessing matrix-variate normality. Herein, a new visual method is proposed for assessing matrix-variate normality by means of a distance–distance plot. In addition, a testing procedure is discussed to be used in tandem with the proposed visual method. The proposed approach is illustrated via simulated data as well as an application on analysing handwritten digits.

近年来,对三元数据的分析在文献中变得越来越普遍。通过矩阵变量分布来分析这些数据变得越来越普遍,其中最普遍的是矩阵变量正态分布。尽管存在许多评估多元正态性的方法,但评估矩阵多元正态的方法相对较少。本文提出了一种新的视觉方法,通过距离-距离图来评估矩阵变量的正态性。此外,还讨论了与所提出的视觉方法一起使用的测试程序。通过模拟数据以及在手写数字分析中的应用,说明了所提出的方法。
{"title":"Visual assessment of matrix-variate normality","authors":"Nikola Počuča,&nbsp;Michael P.B. Gallaugher,&nbsp;Katharine M. Clark,&nbsp;Paul D. McNicholas","doi":"10.1111/anzs.12388","DOIUrl":"https://doi.org/10.1111/anzs.12388","url":null,"abstract":"<div>\u0000 \u0000 <p>In recent years, the analysis of three-way data has become ever more prevalent in the literature. It is becoming increasingly common to analyse such data by means of matrix-variate distributions, the most prevalent of which is the matrix-variate normal distribution. Although many methods exist for assessing multivariate normality, there is a relative paucity of approaches for assessing matrix-variate normality. Herein, a new visual method is proposed for assessing matrix-variate normality by means of a distance–distance plot. In addition, a testing procedure is discussed to be used in tandem with the proposed visual method. The proposed approach is illustrated via simulated data as well as an application on analysing handwritten digits.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2023-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50151748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Australian & New Zealand Journal of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1