首页 > 最新文献

Statistica Neerlandica最新文献

英文 中文
Linear Regression Models with Multiplicative Distortions under New Identifiability Conditions 新的可辨识性条件下具有乘法扭曲的线性回归模型
IF 1.5 3区 数学 Q2 Mathematics Pub Date : 2023-05-11 DOI: 10.1111/stan.12304
Jun Zhang, Bingqing Lin, Yan Zhou
This paper considers linear regression models when neither the response variable nor the covariates can be directly observed, but are measured with multiplicative distortion measurement errors. We propose new identifiability conditions for the distortion functions via the varying coefficient models, then moment‐based estimators of parameters in the model are proposed by using the estimated varying coefficient functions. This method does not require the independence condition between the confounding variables and the unobserved response and variables. We establish the connections among the varying coefficient based estimators, the conditional mean calibration and the conditional absolute mean calibration. We study the asymptotic results of these proposed estimators, and discuss their asymptotic efficiencies. Lastly, we make some comparisons among the proposed estimators through the simulation. These methods are applied to analyze a real dataset for an illustration.
{"title":"Linear Regression Models with Multiplicative Distortions under New Identifiability Conditions","authors":"Jun Zhang, Bingqing Lin, Yan Zhou","doi":"10.1111/stan.12304","DOIUrl":"https://doi.org/10.1111/stan.12304","url":null,"abstract":"This paper considers linear regression models when neither the response variable nor the covariates can be directly observed, but are measured with multiplicative distortion measurement errors. We propose new identifiability conditions for the distortion functions via the varying coefficient models, then moment‐based estimators of parameters in the model are proposed by using the estimated varying coefficient functions. This method does not require the independence condition between the confounding variables and the unobserved response and variables. We establish the connections among the varying coefficient based estimators, the conditional mean calibration and the conditional absolute mean calibration. We study the asymptotic results of these proposed estimators, and discuss their asymptotic efficiencies. Lastly, we make some comparisons among the proposed estimators through the simulation. These methods are applied to analyze a real dataset for an illustration.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75687111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
The optimal input‐independent baseline for binary classification: The Dutch Draw 二元分类的最佳输入无关基线:荷兰平局
3区 数学 Q2 Mathematics Pub Date : 2023-05-01 DOI: 10.1111/stan.12297
Joris Pries, Etienne van de Bijl, Jan Klein, Sandjai Bhulai, Rob van der Mei
Before any binary classification model is taken into practice, it is important to validate its performance on a proper test set. Without a frame of reference given by a baseline method, it is impossible to determine if a score is “good” or “bad.” The goal of this paper is to examine all baseline methods that are independent of feature values and determine which model is the “best” and why. By identifying which baseline models are optimal, a crucial selection decision in the evaluation process is simplified. We prove that the recently proposed Dutch Draw baseline is the best input‐independent classifier (independent of feature values) for all order‐invariant measures (independent of sequence order) assuming that the samples are randomly shuffled. This means that the Dutch Draw baseline is the optimal baseline under these intuitive requirements and should therefore be used in practice.
在将任何二元分类模型付诸实践之前,重要的是要在适当的测试集上验证其性能。如果没有基准方法提供的参考框架,就不可能确定分数是“好”还是“坏”。本文的目标是检查所有独立于特征值的基线方法,并确定哪个模型是“最好的”,以及为什么。通过确定哪些基线模型是最优的,可以简化评估过程中的关键选择决策。我们证明了最近提出的Dutch Draw基线是所有阶不变度量(与序列顺序无关)的最佳输入无关分类器(与特征值无关),假设样本是随机洗牌的。这意味着荷兰抽签基线是这些直观要求下的最佳基线,因此应该在实践中使用。
{"title":"The optimal input‐independent baseline for binary classification: The Dutch Draw","authors":"Joris Pries, Etienne van de Bijl, Jan Klein, Sandjai Bhulai, Rob van der Mei","doi":"10.1111/stan.12297","DOIUrl":"https://doi.org/10.1111/stan.12297","url":null,"abstract":"Before any binary classification model is taken into practice, it is important to validate its performance on a proper test set. Without a frame of reference given by a baseline method, it is impossible to determine if a score is “good” or “bad.” The goal of this paper is to examine all baseline methods that are independent of feature values and determine which model is the “best” and why. By identifying which baseline models are optimal, a crucial selection decision in the evaluation process is simplified. We prove that the recently proposed Dutch Draw baseline is the best input‐independent classifier (independent of feature values) for all order‐invariant measures (independent of sequence order) assuming that the samples are randomly shuffled. This means that the Dutch Draw baseline is the optimal baseline under these intuitive requirements and should therefore be used in practice.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136012239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
New closed‐form efficient estimator for multivariate gamma distribution 多元伽玛分布的新闭形有效估计
IF 1.5 3区 数学 Q2 Mathematics Pub Date : 2023-04-28 DOI: 10.1111/stan.12299
Yu-Hyeong Jang, Jun Zhao, Hyoung-Moon Kim, Kyusang Yu, Sunghoon Kwon, Sunghwan Kim
Maximum likelihood estimation is used widely in classical statistics. However, except in a few cases, it does not have a closed form. Furthermore, it takes time to derive the maximum likelihood estimator (MLE) owing to the use of iterative methods such as Newton–Raphson. Nonetheless, this estimation method has several advantages, chief among them being the invariance property and asymptotic normality. Based on the first approximation to the solution of the likelihood equation, we obtain an estimator that has the same asymptotic behavior as the MLE for multivariate gamma distribution. The newly proposed estimator, denoted as MLECE$$ {mathrm{MLE}}_{mathrm{CE}} $$ , is also in closed form as long as the n$$ sqrt{n} $$ ‐consistent initial estimator is in the closed form. Hence, we develop some closed‐form n$$ sqrt{n} $$ ‐consistent estimators for multivariate gamma distribution to improve the small‐sample property. MLECE$$ {mathrm{MLE}}_{mathrm{CE}} $$ is an alternative to MLE and performs better compared to MLE in terms of computation time, especially for large datasets, and stability. For the bivariate gamma distribution, the MLECE$$ {mathrm{MLE}}_{mathrm{CE}} $$ is over 130 times faster than the MLE, and as the sample size increasing, the MLECE$$ {mathrm{MLE}}_{mathrm{CE}} $$ is over 200 times faster than the MLE. Owing to the instant calculation of the proposed estimator, it can be used in state–space modeling or real‐time processing models.
极大似然估计在经典统计学中应用广泛。然而,除了少数情况外,它没有封闭形式。此外,由于使用Newton-Raphson等迭代方法,导出最大似然估计量(MLE)需要时间。然而,这种估计方法有几个优点,其中最主要的是不变性和渐近正态性。基于似然方程解的第一次近似,我们得到了一个与多元伽玛分布的最大似然值具有相同渐近特性的估计量。新提出的估计量,表示为MLECE $$ {mathrm{MLE}}_{mathrm{CE}} $$,只要n $$ sqrt{n} $$‐一致的初始估计量是封闭形式,它也是封闭形式。因此,我们为多元伽玛分布开发了一些封闭形式的n $$ sqrt{n} $$一致估计,以改善小样本性质。MLECE $$ {mathrm{MLE}}_{mathrm{CE}} $$是MLE的替代方案,在计算时间(特别是对于大型数据集)和稳定性方面比MLE表现更好。对于二元gamma分布,MLECE $$ {mathrm{MLE}}_{mathrm{CE}} $$比MLE快130倍以上,随着样本量的增加,MLECE $$ {mathrm{MLE}}_{mathrm{CE}} $$比MLE快200倍以上。由于所提出的估计器可即时计算,因此可用于状态空间建模或实时处理模型。
{"title":"New closed‐form efficient estimator for multivariate gamma distribution","authors":"Yu-Hyeong Jang, Jun Zhao, Hyoung-Moon Kim, Kyusang Yu, Sunghoon Kwon, Sunghwan Kim","doi":"10.1111/stan.12299","DOIUrl":"https://doi.org/10.1111/stan.12299","url":null,"abstract":"Maximum likelihood estimation is used widely in classical statistics. However, except in a few cases, it does not have a closed form. Furthermore, it takes time to derive the maximum likelihood estimator (MLE) owing to the use of iterative methods such as Newton–Raphson. Nonetheless, this estimation method has several advantages, chief among them being the invariance property and asymptotic normality. Based on the first approximation to the solution of the likelihood equation, we obtain an estimator that has the same asymptotic behavior as the MLE for multivariate gamma distribution. The newly proposed estimator, denoted as MLECE$$ {mathrm{MLE}}_{mathrm{CE}} $$ , is also in closed form as long as the n$$ sqrt{n} $$ ‐consistent initial estimator is in the closed form. Hence, we develop some closed‐form n$$ sqrt{n} $$ ‐consistent estimators for multivariate gamma distribution to improve the small‐sample property. MLECE$$ {mathrm{MLE}}_{mathrm{CE}} $$ is an alternative to MLE and performs better compared to MLE in terms of computation time, especially for large datasets, and stability. For the bivariate gamma distribution, the MLECE$$ {mathrm{MLE}}_{mathrm{CE}} $$ is over 130 times faster than the MLE, and as the sample size increasing, the MLECE$$ {mathrm{MLE}}_{mathrm{CE}} $$ is over 200 times faster than the MLE. Owing to the instant calculation of the proposed estimator, it can be used in state–space modeling or real‐time processing models.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73572282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The Multilateral Spatial Integer‐valued Process of order 1 1阶的多边空间整值过程
IF 1.5 3区 数学 Q2 Mathematics Pub Date : 2023-04-25 DOI: 10.1111/stan.12298
D. Karlis, Azmi Chutoo, N. Mamode Khan, V. Jowaheer
In spatial count data analysis, modeling with a multilateral lattice structure presents some important challenges. They include both the model construction and the estimation of the model parameters, since the structure accommodates the left, right, top, bottom, and diagonal site effects. Thus, the multilateral spatial process unifies all the popular spatial subclasses that include the unilateral, Rook, Bishop, and Queen models and, hence, makes it suitable for a wide variety of applications. This paper introduces a first‐order multilateral integer‐valued spatial process, based on a binomial thinning mechanism and some innovation term, under both stationary and nonstationary conditions. The estimation of parameters is handled by the conditional maximum likelihood estimation (CML) approach. Simulation experiments are implemented to assess the consistency of the CML estimators in the stationary and nonstationary multilateral spatial model and its subclasses, based on different grid sizes and under both covariate and noncovariate designs. The proposed model, along with its subclasses are applied to real datasets.
{"title":"The Multilateral Spatial Integer‐valued Process of order 1","authors":"D. Karlis, Azmi Chutoo, N. Mamode Khan, V. Jowaheer","doi":"10.1111/stan.12298","DOIUrl":"https://doi.org/10.1111/stan.12298","url":null,"abstract":"In spatial count data analysis, modeling with a multilateral lattice structure presents some important challenges. They include both the model construction and the estimation of the model parameters, since the structure accommodates the left, right, top, bottom, and diagonal site effects. Thus, the multilateral spatial process unifies all the popular spatial subclasses that include the unilateral, Rook, Bishop, and Queen models and, hence, makes it suitable for a wide variety of applications. This paper introduces a first‐order multilateral integer‐valued spatial process, based on a binomial thinning mechanism and some innovation term, under both stationary and nonstationary conditions. The estimation of parameters is handled by the conditional maximum likelihood estimation (CML) approach. Simulation experiments are implemented to assess the consistency of the CML estimators in the stationary and nonstationary multilateral spatial model and its subclasses, based on different grid sizes and under both covariate and noncovariate designs. The proposed model, along with its subclasses are applied to real datasets.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85976129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stochastic comparisons of largest claim amounts from heterogeneous portfolios 异质投资组合中最大索赔金额的随机比较
IF 1.5 3区 数学 Q2 Mathematics Pub Date : 2023-04-20 DOI: 10.1111/stan.12296
Pradip Kundu, Amarjit Kundu, Biplab Hawlader
This paper investigates stochastic comparisons of largest claim amounts of two sets of independent or interdependent portfolios in the sense of some stochastic orders. Let random variable Xi$$ {X}_i $$ ( i=1,…,n$$ i=1,dots, n $$ ) with distribution function F(x;αi)$$ Fleft(x;{alpha}_iright) $$ , represents the claim amount for ith risk of a portfolio. Here two largest claim amounts are compared considering that the claim variables follow a general semiparametric family of distributions having the property that the survival function F‾(x;α)$$ overline{F}left(x;alpha right) $$ is increasing in α$$ alpha $$ or is increasing and convex/concave in α$$ alpha $$ . The results obtained in this paper apply to a large class of well‐known distributions including the family of exponentiated/generalized distributions (e.g., exponentiated exponential, Weibull, gamma and Pareto family), Rayleigh distribution and Marshall–Olkin family of distributions. As a direct consequence of some main theorems, we also obtained the results for scale family of distributions. Several numerical examples are provided to illustrate the results.
本文研究了两组独立或相互依赖的投资组合在一定随机顺序下最大索赔额的随机比较问题。$$ {X}_i $$ (i=1,…,n$$ i=1,dots, n $$ ),分布函数F(x;αi)$$ Fleft(x;{alpha}_iright) $$ ,表示投资组合风险的索赔金额。这里比较两个最大的索赔金额,考虑索赔变量遵循一般的半参数分布族,其性质是生存函数F (x;α)$$ overline{F}left(x;alpha right) $$ 在α中增加$$ alpha $$ 或在α中呈递增和凸/凹$$ alpha $$ . 本文所得到的结果适用于一大类众所周知的分布,包括指数/广义分布族(如指数分布族、Weibull分布族、gamma分布族和Pareto分布族)、Rayleigh分布族和Marshall-Olkin分布族。作为一些主要定理的直接推论,我们也得到了分布的尺度族的结果。给出了几个数值算例来说明结果。
{"title":"Stochastic comparisons of largest claim amounts from heterogeneous portfolios","authors":"Pradip Kundu, Amarjit Kundu, Biplab Hawlader","doi":"10.1111/stan.12296","DOIUrl":"https://doi.org/10.1111/stan.12296","url":null,"abstract":"This paper investigates stochastic comparisons of largest claim amounts of two sets of independent or interdependent portfolios in the sense of some stochastic orders. Let random variable Xi$$ {X}_i $$ ( i=1,…,n$$ i=1,dots, n $$ ) with distribution function F(x;αi)$$ Fleft(x;{alpha}_iright) $$ , represents the claim amount for ith risk of a portfolio. Here two largest claim amounts are compared considering that the claim variables follow a general semiparametric family of distributions having the property that the survival function F‾(x;α)$$ overline{F}left(x;alpha right) $$ is increasing in α$$ alpha $$ or is increasing and convex/concave in α$$ alpha $$ . The results obtained in this paper apply to a large class of well‐known distributions including the family of exponentiated/generalized distributions (e.g., exponentiated exponential, Weibull, gamma and Pareto family), Rayleigh distribution and Marshall–Olkin family of distributions. As a direct consequence of some main theorems, we also obtained the results for scale family of distributions. Several numerical examples are provided to illustrate the results.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74143095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Analysis of cross‐over experiments with count data in the presence of carry‐over effects 对存在结转效应的计数数据进行交叉实验分析
IF 1.5 3区 数学 Q2 Mathematics Pub Date : 2023-04-16 DOI: 10.1111/stan.12295
Nelson Alirio Cruz, Luis Alberto López Pérez, Oscar Orlando Melo
This paper presents an experimental cross‐over design whose response variable is a count that belongs to the Poisson distribution. The methodology is extended to data with overdispersion or subdispersion. We present the theoretical development for analysis of cases with few treatments and a few periods. In this case, we consider the log‐linear link for estimation effects and the Delta method for the asymptotic inference of the estimators. When the number of periods and sequences increases, we propose an extension of the previous methodology, using the generalized linear models. In this extension, cross‐over designs for count data include treatments, sequences, time effects, covariables, and any correlation structure. The most important result of the methodology is that it allows the detection of significant factors within the cross‐over design when the response variable belongs to the exponential family, especially the treatment effects. Finally, we present the analysis of data obtained in a student hydration study and a simulation study. We show a comparison between the usual methods of analysis and those obtained in the present work, demonstrating the advantage over the usual methods in situations with carry‐over presence.
本文提出了一种实验交叉设计,其响应变量为属于泊松分布的计数。该方法可扩展到具有过色散或次色散的数据。我们提出了对几种治疗方法和几种时期的病例分析的理论发展。在这种情况下,我们考虑对数线性联系的估计效果和Delta方法的渐近推断的估计量。当周期和序列的数量增加时,我们提出使用广义线性模型扩展先前的方法。在这个扩展中,计数数据的交叉设计包括处理,序列,时间效应,协变量和任何相关结构。该方法最重要的结果是,当响应变量属于指数族时,它允许在交叉设计中检测显着因素,特别是处理效果。最后,我们对学生水化研究和模拟研究中获得的数据进行了分析。我们展示了通常的分析方法与在当前工作中获得的分析方法之间的比较,证明了在结转存在的情况下比通常方法的优势。
{"title":"Analysis of cross‐over experiments with count data in the presence of carry‐over effects","authors":"Nelson Alirio Cruz, Luis Alberto López Pérez, Oscar Orlando Melo","doi":"10.1111/stan.12295","DOIUrl":"https://doi.org/10.1111/stan.12295","url":null,"abstract":"This paper presents an experimental cross‐over design whose response variable is a count that belongs to the Poisson distribution. The methodology is extended to data with overdispersion or subdispersion. We present the theoretical development for analysis of cases with few treatments and a few periods. In this case, we consider the log‐linear link for estimation effects and the Delta method for the asymptotic inference of the estimators. When the number of periods and sequences increases, we propose an extension of the previous methodology, using the generalized linear models. In this extension, cross‐over designs for count data include treatments, sequences, time effects, covariables, and any correlation structure. The most important result of the methodology is that it allows the detection of significant factors within the cross‐over design when the response variable belongs to the exponential family, especially the treatment effects. Finally, we present the analysis of data obtained in a student hydration study and a simulation study. We show a comparison between the usual methods of analysis and those obtained in the present work, demonstrating the advantage over the usual methods in situations with carry‐over presence.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89772271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating function method for nonnegative autoregressive models 非负自回归模型的估计函数方法
IF 1.5 3区 数学 Q2 Mathematics Pub Date : 2023-03-31 DOI: 10.1111/stan.12294
E. Hari, Prasad N. Balakrishna, E. H. Prasad
A stationary sequence of nonnegative random variables generated by autoregressive (AR) models may be used to describe the inter‐arrival times between events in counting processes. Even though, several such models are available in the literature, there is no unified approach to estimate their parameters. In this paper, we propose a class of combined estimating function method to estimate the model parameters of AR models with gamma marginals. The proposed method is compared with other estimation procedures and are illustrated by simulation and data analysis.
由自回归(AR)模型产生的非负随机变量的平稳序列可用于描述计数过程中事件之间的间隔到达时间。尽管文献中有几个这样的模型,但没有统一的方法来估计它们的参数。本文提出了一类组合估计函数方法来估计具有伽马边际的AR模型的模型参数。将该方法与其他估计方法进行了比较,并通过仿真和数据分析加以说明。
{"title":"Estimating function method for nonnegative autoregressive models","authors":"E. Hari, Prasad N. Balakrishna, E. H. Prasad","doi":"10.1111/stan.12294","DOIUrl":"https://doi.org/10.1111/stan.12294","url":null,"abstract":"A stationary sequence of nonnegative random variables generated by autoregressive (AR) models may be used to describe the inter‐arrival times between events in counting processes. Even though, several such models are available in the literature, there is no unified approach to estimate their parameters. In this paper, we propose a class of combined estimating function method to estimate the model parameters of AR models with gamma marginals. The proposed method is compared with other estimation procedures and are illustrated by simulation and data analysis.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86547801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A robust mixed‐effects parametric quantile regression model for continuous proportions: Quantifying the constraints to vitality in cushion plants 连续比例的鲁棒混合效应参数分位数回归模型:量化缓冲植物活力的约束
IF 1.5 3区 数学 Q2 Mathematics Pub Date : 2023-03-29 DOI: 10.1111/stan.12293
D. Burger, Sean van der Merwe, E. Lesaffre, P. C. le Roux, Morgan J. Raath‐Krüger
There is no literature on outlier‐robust parametric mixed‐effects quantile regression models for continuous proportion data as an alternative to systematically identifying and eliminating outliers. To fill this gap, we formulate a robust method by extending the recently proposed fixed‐effects quantile regression model based on the heavy‐tailed Johnson‐ t$$ t $$ distribution for continuous proportion data to the mixed‐effects modeling context, using a Bayesian approach. Our proposed method is motivated by and used to model the extreme quantiles of the vitality of cushion plants to provide insights into the ecology of the system in which the plants are dominant. We conducted a simulation study to assess the new method's performance and robustness to outliers. We show that the new model has good accuracy and confidence interval coverage properties and is remarkably robust to outliers. In contrast, our study demonstrates that the current approach in the literature for modeling hierarchically structured bounded data's quantiles is susceptible to outliers, especially when modeling the extreme quantiles. We conclude that the proposed model is an appropriate robust alternative to the current approach for modeling the quantiles of correlated continuous proportions when outliers are present in the data.
目前还没有关于连续比例数据的异常值-鲁棒参数混合效应分位数回归模型作为系统识别和消除异常值的替代方法的文献。为了填补这一空白,我们通过使用贝叶斯方法,将最近提出的基于连续比例数据的重尾Johnson - t $$ t $$分布的固定效应分位数回归模型扩展到混合效应建模环境,从而制定了一种鲁棒方法。我们提出的方法是由缓冲植物活力的极端分位数模型驱动的,并用于对植物占主导地位的系统的生态学提供见解。我们进行了仿真研究,以评估新方法的性能和对异常值的鲁棒性。结果表明,新模型具有良好的精度和置信区间覆盖性能,对异常值具有显著的鲁棒性。相比之下,我们的研究表明,目前文献中用于分层结构有界数据分位数建模的方法容易受到异常值的影响,特别是在建模极端分位数时。我们得出的结论是,当数据中存在异常值时,所提出的模型是对相关连续比例的分位数建模的当前方法的适当鲁棒替代方法。
{"title":"A robust mixed‐effects parametric quantile regression model for continuous proportions: Quantifying the constraints to vitality in cushion plants","authors":"D. Burger, Sean van der Merwe, E. Lesaffre, P. C. le Roux, Morgan J. Raath‐Krüger","doi":"10.1111/stan.12293","DOIUrl":"https://doi.org/10.1111/stan.12293","url":null,"abstract":"There is no literature on outlier‐robust parametric mixed‐effects quantile regression models for continuous proportion data as an alternative to systematically identifying and eliminating outliers. To fill this gap, we formulate a robust method by extending the recently proposed fixed‐effects quantile regression model based on the heavy‐tailed Johnson‐ t$$ t $$ distribution for continuous proportion data to the mixed‐effects modeling context, using a Bayesian approach. Our proposed method is motivated by and used to model the extreme quantiles of the vitality of cushion plants to provide insights into the ecology of the system in which the plants are dominant. We conducted a simulation study to assess the new method's performance and robustness to outliers. We show that the new model has good accuracy and confidence interval coverage properties and is remarkably robust to outliers. In contrast, our study demonstrates that the current approach in the literature for modeling hierarchically structured bounded data's quantiles is susceptible to outliers, especially when modeling the extreme quantiles. We conclude that the proposed model is an appropriate robust alternative to the current approach for modeling the quantiles of correlated continuous proportions when outliers are present in the data.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90397245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A partial posterior p value test for multilevel mediation 多水平中介的部分后验p值检验
IF 1.5 3区 数学 Q2 Mathematics Pub Date : 2023-03-16 DOI: 10.1111/stan.12291
Kyle Cox, Ben Kelcey
A variety of inferential tests are available for single and multilevel mediation but most come with notable limitations that balance tradeoffs between power and Type I error. We extend the partial posterior p value method (p3 method) to test multilevel mediation. This contemporary resampling‐based composite approach is specifically suited for complex null hypotheses. We develop the p3 method and investigate its performance within the context of two‐level cluster‐randomized multilevel mediation studies. Similar to its performance in single‐level studies, we found that the p3 method performed well relative to other mediation tests suggesting it provides a judicious balance between Type I error rate and power. While bias‐corrected bootstrapping achieved the best overall performance, the p3 method serves as an alternative tool for researchers investigating multilevel mediation that is especially useful when conducting a priori power analyses. To encourage utilization, we provide R code for implementing the p3 method.
可用于单层和多层中介的各种推断测试,但大多数都有明显的限制,需要在功率和I型误差之间进行权衡。我们扩展了部分后验p值法(p3法)来检验多层次中介。这种基于重新采样的当代复合方法特别适合于复杂的零假设。我们开发了p3方法,并在两级聚类随机多水平中介研究的背景下研究了其性能。与其在单水平研究中的表现相似,我们发现p3方法相对于其他中介测试表现良好,这表明它在I型错误率和功率之间提供了明智的平衡。虽然偏差校正的自举实现了最佳的整体性能,但p3方法作为研究人员调查多层次中介的替代工具,在进行先验功率分析时特别有用。为了鼓励使用,我们提供了实现p3方法的R代码。
{"title":"A partial posterior p value test for multilevel mediation","authors":"Kyle Cox, Ben Kelcey","doi":"10.1111/stan.12291","DOIUrl":"https://doi.org/10.1111/stan.12291","url":null,"abstract":"A variety of inferential tests are available for single and multilevel mediation but most come with notable limitations that balance tradeoffs between power and Type I error. We extend the partial posterior p value method (p3 method) to test multilevel mediation. This contemporary resampling‐based composite approach is specifically suited for complex null hypotheses. We develop the p3 method and investigate its performance within the context of two‐level cluster‐randomized multilevel mediation studies. Similar to its performance in single‐level studies, we found that the p3 method performed well relative to other mediation tests suggesting it provides a judicious balance between Type I error rate and power. While bias‐corrected bootstrapping achieved the best overall performance, the p3 method serves as an alternative tool for researchers investigating multilevel mediation that is especially useful when conducting a priori power analyses. To encourage utilization, we provide R code for implementing the p3 method.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80357683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A portmanteau test for the iid hypothesis iid假设的组合检验
IF 1.5 3区 数学 Q2 Mathematics Pub Date : 2023-02-28 DOI: 10.1111/stan.12290
Ricardo Bórquez
In this paper, we introduce a new portmanteau test for the iid hypothesis, where the elements of the sample are allowed to take values in a general space (e.g., a function space). We study the finite sample properties of our test, evaluating its performance in terms of empirical size and power. In particular, we compare the empirical power of our test with the power of other tests in the literature designed to work in a specific data setting, as the one‐way analysis of variance test used in experimental data analysis and three portmanteau tests used in time series analysis. In every case, we found conditions where our test outperformed in power to the competing test. Finally, to illustrate the usefulness of our test, we implement it on two real‐world applications based on function‐valued data.
在本文中,我们为iid假设引入了一个新的组合检验,其中样本的元素允许在一般空间(例如,函数空间)中取值。我们研究了我们的测试的有限样本性质,在经验规模和功率方面评估其性能。特别地,我们将我们的检验的经验力量与文献中设计用于特定数据设置的其他检验的力量进行了比较,如实验数据分析中使用的单向方差分析检验和时间序列分析中使用的三个组合检验。在每种情况下,我们都发现我们的测试在功率方面优于竞争测试的条件。最后,为了说明我们的测试的有用性,我们在两个基于函数值数据的实际应用程序中实现了它。
{"title":"A portmanteau test for the iid hypothesis","authors":"Ricardo Bórquez","doi":"10.1111/stan.12290","DOIUrl":"https://doi.org/10.1111/stan.12290","url":null,"abstract":"In this paper, we introduce a new portmanteau test for the iid hypothesis, where the elements of the sample are allowed to take values in a general space (e.g., a function space). We study the finite sample properties of our test, evaluating its performance in terms of empirical size and power. In particular, we compare the empirical power of our test with the power of other tests in the literature designed to work in a specific data setting, as the one‐way analysis of variance test used in experimental data analysis and three portmanteau tests used in time series analysis. In every case, we found conditions where our test outperformed in power to the competing test. Finally, to illustrate the usefulness of our test, we implement it on two real‐world applications based on function‐valued data.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86661955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistica Neerlandica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1