首页 > 最新文献

Test最新文献

英文 中文
Two-step semiparametric empirical likelihood inference from capture–recapture data with missing covariates 从具有缺失协变量的捕获-再捕获数据中进行两步半参数经验似然推断
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-02-14 DOI: 10.1007/s11749-024-00921-1
Yang Liu, Yukun Liu, Pengfei Li, Riquan Zhang

Missing covariates are not uncommon in capture–recapture studies. When covariate information is missing at random in capture–recapture data, an empirical full likelihood method has been demonstrated to outperform conditional-likelihood-based methods in abundance estimation. However, the fully observed covariates must be discrete, and the method is not directly applicable to continuous-time capture–recapture data. Based on the Binomial and Poisson regression models, we propose a two-step semiparametric empirical likelihood approach for abundance estimation in the presence of missing covariates, regardless of whether the fully observed covariates are discrete or continuous. We show that the maximum semiparametric empirical likelihood estimators for the underlying parameters and the abundance are asymptotically normal, and more efficient than the counterpart for a completely known non-missingness probability. After scaling, the empirical likelihood ratio test statistic for abundance follows a limiting chi-square distribution with one degree of freedom. The proposed approach is further extended to one-inflated count regression models, and a score-like test is constructed to assess whether one-inflation exists among the number of captures. Our simulation shows that, compared with the previous method, the proposed method not only performs better in correcting bias, but also has a more accurate coverage in the presence of fully observed continuous covariates, although there may be a slight efficiency loss when the fully observed covariates are only discrete. The performance of the new method is illustrated by analyses of the yellow-bellied prinia data and the rana pretiosa data.

在捕获-再捕获研究中,协变量缺失的情况并不少见。当捕获-再捕获数据中的协变量信息随机缺失时,经验全似然法在丰度估计中的表现优于基于条件似然法的方法。然而,完全观测到的协变量必须是离散的,该方法不能直接用于连续时间的捕获-再捕获数据。基于二项回归和泊松回归模型,我们提出了一种两步半参数经验似然法,用于缺失协变量情况下的丰度估计,无论完全观测到的协变量是离散的还是连续的。我们证明,基础参数和丰度的最大半参数经验似然估计值是渐近正态的,比完全已知非缺失概率的对应估计值更有效。缩放后,丰度的经验似然比检验统计量遵循一个自由度的极限奇平方分布。所提出的方法进一步扩展到单膨胀计数回归模型,并构建了一个类似分数的检验来评估捕获数量中是否存在单膨胀。我们的模拟结果表明,与之前的方法相比,所提出的方法不仅在纠正偏差方面表现更好,而且在完全观测到连续协变量的情况下具有更准确的覆盖范围,不过当完全观测到的协变量只是离散协变量时,可能会有轻微的效率损失。新方法的性能通过对黄腹角雉数据和滇金丝猴数据的分析得到了说明。
{"title":"Two-step semiparametric empirical likelihood inference from capture–recapture data with missing covariates","authors":"Yang Liu, Yukun Liu, Pengfei Li, Riquan Zhang","doi":"10.1007/s11749-024-00921-1","DOIUrl":"https://doi.org/10.1007/s11749-024-00921-1","url":null,"abstract":"<p>Missing covariates are not uncommon in capture–recapture studies. When covariate information is missing at random in capture–recapture data, an empirical full likelihood method has been demonstrated to outperform conditional-likelihood-based methods in abundance estimation. However, the fully observed covariates must be discrete, and the method is not directly applicable to continuous-time capture–recapture data. Based on the Binomial and Poisson regression models, we propose a two-step semiparametric empirical likelihood approach for abundance estimation in the presence of missing covariates, regardless of whether the fully observed covariates are discrete or continuous. We show that the maximum semiparametric empirical likelihood estimators for the underlying parameters and the abundance are asymptotically normal, and more efficient than the counterpart for a completely known non-missingness probability. After scaling, the empirical likelihood ratio test statistic for abundance follows a limiting chi-square distribution with one degree of freedom. The proposed approach is further extended to one-inflated count regression models, and a score-like test is constructed to assess whether one-inflation exists among the number of captures. Our simulation shows that, compared with the previous method, the proposed method not only performs better in correcting bias, but also has a more accurate coverage in the presence of fully observed continuous covariates, although there may be a slight efficiency loss when the fully observed covariates are only discrete. The performance of the new method is illustrated by analyses of the yellow-bellied prinia data and the rana pretiosa data.</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"73 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139773396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A copula formulation for multivariate latent Markov models 多元潜马尔可夫模型的共轭公式
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-02-07 DOI: 10.1007/s11749-024-00919-9
Alfonso Russo, Alessio Farcomeni

We specify a general formulation for multivariate latent Markov models for panel data, where outcomes are possibly of mixed-type (categorical, discrete, continuous). Conditionally on a time-varying discrete latent variable and covariates, the joint distribution of outcomes simultaneously observed is expressed through a parametric copula. We therefore do not make any conditional independence assumption. The observed likelihood is maximized by means of an expectation–maximization algorithm. In a simulation study, we argue how modeling the residual contemporary dependence might be crucial in order to avoid bias in the parameter estimates. We illustrate through an original application to assessment of poverty through direct and indirect indicators in a cohort of Italian households.

我们为面板数据的多变量潜马尔可夫模型提出了一个通用公式,其中的结果可能是混合型的(分类、离散、连续)。以时变离散潜变量和协变量为条件,同时观测到的结果的联合分布通过参数 copula 表示。因此,我们不做任何条件独立性假设。通过期望最大化算法使观察到的可能性最大化。在一项模拟研究中,我们论证了剩余当代依赖性建模对于避免参数估计偏差的重要性。我们通过对意大利家庭队列中的直接和间接指标进行贫困评估的原始应用来说明这一点。
{"title":"A copula formulation for multivariate latent Markov models","authors":"Alfonso Russo, Alessio Farcomeni","doi":"10.1007/s11749-024-00919-9","DOIUrl":"https://doi.org/10.1007/s11749-024-00919-9","url":null,"abstract":"<p>We specify a general formulation for multivariate latent Markov models for panel data, where outcomes are possibly of mixed-type (categorical, discrete, continuous). Conditionally on a time-varying discrete latent variable and covariates, the joint distribution of outcomes simultaneously observed is expressed through a parametric copula. We therefore do not make any conditional independence assumption. The observed likelihood is maximized by means of an expectation–maximization algorithm. In a simulation study, we argue how modeling the residual contemporary dependence might be crucial in order to avoid bias in the parameter estimates. We illustrate through an original application to assessment of poverty through direct and indirect indicators in a cohort of Italian households.\u0000</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"36 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139769593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Change-point detection in a tensor regression model 张量回归模型中的变化点检测
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-02-06 DOI: 10.1007/s11749-023-00915-5
Mai Ghannam, Sévérien Nkurunziza

In this paper, we consider an inference problem in a tensor regression model with one change-point. Specifically, we consider a general hypothesis testing problem on a tensor parameter and the studied testing problem includes as a special case the problem about the absence of a change-point. To this end, we derive the unrestricted estimator (UE) and the restricted estimator (RE) as well as the joint asymptotic normality of the UE and RE. Thanks to the established asymptotic normality, we derive a test for testing the hypothesized restriction. We also derive the asymptotic power of the proposed test and we prove that the established test is consistent. Beyond the complexity of the testing problem in the tensor model, we consider a very general case where the tensor error term and the regressors do not need to be independent and the dependence structure of the outer-product of the tensor error term and regressors is as weak as that of an (mathcal {L}^2-) mixingale. Further, to study the performance of the proposed methods in small and moderate sample sizes, we present some simulation results that corroborate the theoretical results. Finally, to illustrate the application of the proposed methods, we test the non-existence of a change-point in some fMRI neuro-imaging data.

在本文中,我们考虑了有一个变化点的张量回归模型中的推理问题。具体来说,我们考虑的是关于张量参数的一般假设检验问题,所研究的检验问题包括作为特例的无变化点问题。为此,我们推导了非限制估计器(UE)和限制估计器(RE),以及 UE 和 RE 的联合渐近正态性。利用已建立的渐近正态性,我们推导出一个检验假设限制的检验方法。我们还推导出了所提检验的渐近幂,并证明所建立的检验是一致的。除了张量模型中检验问题的复杂性之外,我们还考虑了一种非常普遍的情况,即张量误差项和回归项不需要是独立的,张量误差项和回归项的外积的依赖结构与 (mathcal {L}^2-) mixingale 的依赖结构一样弱。此外,为了研究建议的方法在小样本量和中等样本量下的性能,我们给出了一些模拟结果,以证实理论结果。最后,为了说明所提方法的应用,我们测试了一些 fMRI 神经成像数据中变化点的不存在性。
{"title":"Change-point detection in a tensor regression model","authors":"Mai Ghannam, Sévérien Nkurunziza","doi":"10.1007/s11749-023-00915-5","DOIUrl":"https://doi.org/10.1007/s11749-023-00915-5","url":null,"abstract":"<p>In this paper, we consider an inference problem in a tensor regression model with one change-point. Specifically, we consider a general hypothesis testing problem on a tensor parameter and the studied testing problem includes as a special case the problem about the absence of a change-point. To this end, we derive the unrestricted estimator (UE) and the restricted estimator (RE) as well as the joint asymptotic normality of the UE and RE. Thanks to the established asymptotic normality, we derive a test for testing the hypothesized restriction. We also derive the asymptotic power of the proposed test and we prove that the established test is consistent. Beyond the complexity of the testing problem in the tensor model, we consider a very general case where the tensor error term and the regressors do not need to be independent and the dependence structure of the outer-product of the tensor error term and regressors is as weak as that of an <span>(mathcal {L}^2-)</span> mixingale. Further, to study the performance of the proposed methods in small and moderate sample sizes, we present some simulation results that corroborate the theoretical results. Finally, to illustrate the application of the proposed methods, we test the non-existence of a change-point in some fMRI neuro-imaging data.\u0000</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"29 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139769742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian sample size determination for detecting heterogeneity in multi-site replication studies 贝叶斯样本量确定法检测多站点复制研究中的异质性
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-01-23 DOI: 10.1007/s11749-023-00916-4
Konstantinos Bourazas, Guido Consonni, Laura Deldossi

An ongoing “replication crisis” calls into question scientific discoveries across a variety of disciplines ranging from life to social sciences. Replication studies aim to investigate the validity of findings in published research, and try to assess whether the latter are statistically consistent with those in the replications. While the majority of replication projects are based on a single experiment, multiple independent replications of the same experiment conducted simultaneously at different sites are becoming more frequent. In connection with these types of projects, we deal with testing heterogeneity among sites; specifically, we focus on sample size determination suitable to deliver compelling evidence once the experimental data are gathered.

持续的 "复制危机 "使从生命科学到社会科学等各学科的科学发现受到质疑。复制研究旨在调查已发表研究结果的有效性,并试图评估后者在统计学上是否与复制的结果一致。虽然大多数复制项目都是以单项实验为基础,但在不同地点同时对同一实验进行多项独立复制的情况也越来越多。对于这些类型的项目,我们要处理的问题是测试不同地点之间的异质性;具体来说,我们的重点是确定样本量,以便在收集到实验数据后提供令人信服的证据。
{"title":"Bayesian sample size determination for detecting heterogeneity in multi-site replication studies","authors":"Konstantinos Bourazas, Guido Consonni, Laura Deldossi","doi":"10.1007/s11749-023-00916-4","DOIUrl":"https://doi.org/10.1007/s11749-023-00916-4","url":null,"abstract":"<p>An ongoing “replication crisis” calls into question scientific discoveries across a variety of disciplines ranging from life to social sciences. Replication studies aim to investigate the validity of findings in published research, and try to assess whether the latter are statistically consistent with those in the replications. While the majority of replication projects are based on a single experiment, multiple independent replications of the same experiment conducted simultaneously at different sites are becoming more frequent. In connection with these types of projects, we deal with testing heterogeneity among sites; specifically, we focus on sample size determination suitable to deliver compelling evidence once the experimental data are gathered.</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"7 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139553795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comments on: Shape-based functional data analysis 评论基于形状的功能数据分析
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-01-18 DOI: 10.1007/s11749-023-00914-6
J. E. Borgert, J. S. Marron

This discussion paper applauds the authors for their impactful contribution to functional data analysis (FDA). Their primary insight lies in a formal mathematical definition of the “shape” of a curve, which they connect to familiar intuitive notions through a number of examples. Notably, the paper highlights the pitfalls of less well-thought-out curve registration approaches. The authors’ application of COVID-19 data enriches the discussion, highlighting the work’s practical relevance. We discuss connections of this work with object-oriented data analysis and propose enhancements to the authors’ shape-based functional principal component analysis. Additionally, we illustrate the practical significance of adaptive alignment with an example from our own research.

本讨论稿对作者为函数数据分析 (FDA) 所做的有影响力的贡献表示赞赏。他们的主要见解在于对曲线 "形状 "的正式数学定义,并通过大量实例将其与熟悉的直观概念联系起来。值得注意的是,论文强调了一些考虑不周的曲线注册方法存在的缺陷。作者对 COVID-19 数据的应用丰富了讨论内容,突出了这项工作的实用性。我们讨论了这项工作与面向对象数据分析的联系,并对作者基于形状的功能主成分分析提出了改进建议。此外,我们还以自己的研究为例,说明了自适应配准的实际意义。
{"title":"Comments on: Shape-based functional data analysis","authors":"J. E. Borgert, J. S. Marron","doi":"10.1007/s11749-023-00914-6","DOIUrl":"https://doi.org/10.1007/s11749-023-00914-6","url":null,"abstract":"<p>This discussion paper applauds the authors for their impactful contribution to functional data analysis (FDA). Their primary insight lies in a formal mathematical definition of the “shape” of a curve, which they connect to familiar intuitive notions through a number of examples. Notably, the paper highlights the pitfalls of less well-thought-out curve registration approaches. The authors’ application of COVID-19 data enriches the discussion, highlighting the work’s practical relevance. We discuss connections of this work with object-oriented data analysis and propose enhancements to the authors’ shape-based functional principal component analysis. Additionally, we illustrate the practical significance of adaptive alignment with an example from our own research.</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"11 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139499873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LRD spectral analysis of multifractional functional time series on manifolds 流形上多分量函数时间序列的 LRD 频谱分析
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-01-12 DOI: 10.1007/s11749-023-00913-7
Diana P. Ovalle–Muñoz, M. Dolores Ruiz–Medina

This paper addresses the estimation of the second-order structure of a manifold cross-time random field (RF) displaying spatially varying Long Range Dependence (LRD), adopting the functional time series framework introduced in Ruiz-Medina (Fract Calc Appl Anal 25:1426–1458, 2022). Conditions for the asymptotic unbiasedness of the integrated periodogram operator in the Hilbert–Schmidt operator norm are derived beyond structural assumptions. Weak-consistent estimation of the long-memory operator is achieved under a semiparametric functional spectral framework in the Gaussian context. The case where the projected manifold process can display Short Range Dependence (SRD) and LRD at different manifold scales is also analyzed. The performance of both estimation procedures is illustrated in the simulation study, in the context of multifractionally integrated spherical functional autoregressive–moving average (SPHARMA(p,q)) processes.

本文采用 Ruiz-Medina (Fract Calc Appl Anal 25:1426-1458, 2022) 中引入的函数时间序列框架,探讨了流形跨时间随机场(RF)二阶结构的估计问题,该随机场显示了空间变化的长程依赖性(LRD)。在希尔伯特-施密特(Hilbert-Schmidt)算子规范之外,还推导出了积分周期图算子的渐近无偏性条件。在高斯背景下的半参数函数谱框架下实现了长记忆算子的弱一致性估计。此外,还分析了投影流形过程在不同流形尺度上显示短程依赖性(SRD)和长程依赖性的情况。在多分量积分球形函数自回归移动平均(SPHARMA(p,q))过程的背景下,模拟研究说明了这两种估计程序的性能。
{"title":"LRD spectral analysis of multifractional functional time series on manifolds","authors":"Diana P. Ovalle–Muñoz, M. Dolores Ruiz–Medina","doi":"10.1007/s11749-023-00913-7","DOIUrl":"https://doi.org/10.1007/s11749-023-00913-7","url":null,"abstract":"<p>This paper addresses the estimation of the second-order structure of a manifold cross-time random field (RF) displaying spatially varying Long Range Dependence (LRD), adopting the functional time series framework introduced in Ruiz-Medina (Fract Calc Appl Anal 25:1426–1458, 2022). Conditions for the asymptotic unbiasedness of the integrated periodogram operator in the Hilbert–Schmidt operator norm are derived beyond structural assumptions. Weak-consistent estimation of the long-memory operator is achieved under a semiparametric functional spectral framework in the Gaussian context. The case where the projected manifold process can display Short Range Dependence (SRD) and LRD at different manifold scales is also analyzed. The performance of both estimation procedures is illustrated in the simulation study, in the context of multifractionally integrated spherical functional autoregressive–moving average (SPHARMA(p,q)) processes.</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"13 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139461192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On variability of the mean remaining lifetime at random age 关于随机年龄下平均剩余寿命的变化
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-01-12 DOI: 10.1007/s11749-023-00917-3
Majid Asadi, Maxim Finkelstein

In this short communication, we discuss the remaining lifetime and the mean remaining lifetime (MRL) of an item with a random age. We show that the MRL at random age is closely related to some well-known variability measures. First, we provide a decomposition result showing that the MRL at random age, similar to other variability measures, has a covariance representation. Under the proportional hazards (PH) model, we show that the MRL, depending on the parameter of proportionality, subsumes the Gini’s mean difference and the cumulative residual entropy as special cases. It is also shown that, under the PH model, the MRL can be expressed via the equilibrium distribution and the mean number of events in the generalized Pólya process.

在这篇短文中,我们讨论了随机年龄下物品的剩余寿命和平均剩余寿命(MRL)。我们表明,随机年龄的 MRL 与一些著名的变异度量密切相关。首先,我们提供了一个分解结果,表明随机年龄的 MRL 与其他变异度量类似,具有协方差表示。在比例危险(PH)模型下,我们表明,根据比例参数的不同,MRL 包含基尼均值差和累积残差熵这两个特例。我们还证明,在 PH 模型下,MRL 可以通过广义波利亚过程中的平衡分布和事件平均数量来表示。
{"title":"On variability of the mean remaining lifetime at random age","authors":"Majid Asadi, Maxim Finkelstein","doi":"10.1007/s11749-023-00917-3","DOIUrl":"https://doi.org/10.1007/s11749-023-00917-3","url":null,"abstract":"<p>In this short communication, we discuss the remaining lifetime and the mean remaining lifetime (MRL) of an item with a random age. We show that the MRL at random age is closely related to some well-known variability measures. First, we provide a decomposition result showing that the MRL at random age, similar to other variability measures, has a covariance representation. Under the proportional hazards (PH) model, we show that the MRL, depending on the parameter of proportionality, subsumes the Gini’s mean difference and the cumulative residual entropy as special cases. It is also shown that, under the PH model, the MRL can be expressed via the equilibrium distribution and the mean number of events in the generalized Pólya process.\u0000</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"6 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139461602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A generalized Hosmer–Lemeshow goodness-of-fit test for a family of generalized linear models 广义线性模型族的广义 Hosmer-Lemeshow 拟合度检验
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-12-19 DOI: 10.1007/s11749-023-00912-8

Abstract

Generalized linear models (GLMs) are very widely used, but formal goodness-of-fit (GOF) tests for the overall fit of the model seem to be in wide use only for certain classes of GLMs. We develop and apply a new goodness-of-fit test, similar to the well-known and commonly used Hosmer–Lemeshow (HL) test, that can be used with a wide variety of GLMs. The test statistic is a variant of the HL statistic, but we rigorously derive an asymptotically correct sampling distribution using methods of Stute and Zhu (Scand J Stat 29(3):535–545, 2002) and demonstrate its consistency. We compare the performance of our new test with other GOF tests for GLMs, including a naive direct application of the HL test to the Poisson problem. Our test provides competitive or comparable power in various simulation settings and we identify a situation where a naive version of the test fails to hold its size. Our generalized HL test is straightforward to implement and interpret and an R package is publicly available.

摘要 广义线性模型(GLM)的应用非常广泛,但对模型总体拟合程度的正式拟合优度(GOF)检验似乎只广泛用于某些类别的 GLM。我们开发并应用了一种新的拟合优度检验,类似于著名且常用的 Hosmer-Lemeshow (HL) 检验,可用于多种 GLM。该检验统计量是 HL 统计量的变种,但我们使用 Stute 和 Zhu 的方法(Scand J Stat 29(3):535-545, 2002)严格推导出了渐近正确的抽样分布,并证明了其一致性。我们将新检验方法的性能与 GLM 的其他 GOF 检验方法进行了比较,包括将 HL 检验方法直接应用于泊松问题的天真检验方法。我们的检验在各种模拟环境中都具有竞争力或可比性,而且我们还发现了一种情况,即天真版本的检验无法保持其规模。我们的广义 HL 检验可以直接实施和解释,并且有一个公开的 R 软件包。
{"title":"A generalized Hosmer–Lemeshow goodness-of-fit test for a family of generalized linear models","authors":"","doi":"10.1007/s11749-023-00912-8","DOIUrl":"https://doi.org/10.1007/s11749-023-00912-8","url":null,"abstract":"<h3>Abstract</h3> <p>Generalized linear models (GLMs) are very widely used, but formal goodness-of-fit (GOF) tests for the overall fit of the model seem to be in wide use only for certain classes of GLMs. We develop and apply a new goodness-of-fit test, similar to the well-known and commonly used Hosmer–Lemeshow (HL) test, that can be used with a wide variety of GLMs. The test statistic is a variant of the HL statistic, but we rigorously derive an asymptotically correct sampling distribution using methods of Stute and Zhu (Scand J Stat 29(3):535–545, 2002) and demonstrate its consistency. We compare the performance of our new test with other GOF tests for GLMs, including a naive direct application of the HL test to the Poisson problem. Our test provides competitive or comparable power in various simulation settings and we identify a situation where a naive version of the test fails to hold its size. Our generalized HL test is straightforward to implement and interpret and an <span>R</span> package is publicly available.</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"887 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2023-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138741449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rejoinder on: statistical inference and large-scale multiple testing for high-dimensional regression models 再论:高维回归模型的统计推断和大规模多重测试
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-12-18 DOI: 10.1007/s11749-023-00911-9
T. Tony Cai, Zijian Guo, Yin Xia
{"title":"Rejoinder on: statistical inference and large-scale multiple testing for high-dimensional regression models","authors":"T. Tony Cai, Zijian Guo, Yin Xia","doi":"10.1007/s11749-023-00911-9","DOIUrl":"https://doi.org/10.1007/s11749-023-00911-9","url":null,"abstract":"","PeriodicalId":51189,"journal":{"name":"Test","volume":"74 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138717037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Specification procedures for multivariate stable-Paretian laws for independent and for conditionally heteroskedastic data 独立数据和条件异方差数据的多变量稳定-Paretian定律的规范程序
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-12-15 DOI: 10.1007/s11749-023-00909-3
Simos G. Meintanis, John P. Nolan, Charl Pretorius

We consider goodness-of-fit methods for multivariate symmetric and asymmetric stable Paretian random vectors in arbitrary dimension. The methods are based on the empirical characteristic function and are implemented both in the i.i.d. context as well as for innovations in GARCH models. Asymptotic properties of the proposed procedures are discussed, while the finite-sample properties are illustrated by means of an extensive Monte Carlo study. The procedures are also applied to real data from the financial markets.

我们考虑了任意维度的多元对称和非对称稳定帕累托随机向量的拟合优度方法。这些方法以经验特征函数为基础,在 i.i.d. 情况下以及针对 GARCH 模型中的创新时均可实施。我们讨论了所建议程序的渐近特性,并通过大量蒙特卡罗研究说明了其有限样本特性。这些程序还应用于金融市场的真实数据。
{"title":"Specification procedures for multivariate stable-Paretian laws for independent and for conditionally heteroskedastic data","authors":"Simos G. Meintanis, John P. Nolan, Charl Pretorius","doi":"10.1007/s11749-023-00909-3","DOIUrl":"https://doi.org/10.1007/s11749-023-00909-3","url":null,"abstract":"<p>We consider goodness-of-fit methods for multivariate symmetric and asymmetric stable Paretian random vectors in arbitrary dimension. The methods are based on the empirical characteristic function and are implemented both in the i.i.d. context as well as for innovations in GARCH models. Asymptotic properties of the proposed procedures are discussed, while the finite-sample properties are illustrated by means of an extensive Monte Carlo study. The procedures are also applied to real data from the financial markets.\u0000</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"38 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138690329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Test
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1