首页 > 最新文献

Annals of the Institute of Statistical Mathematics最新文献

英文 中文
Empirical likelihood MLE for joint modeling right censored survival data with longitudinal covariates 对带有纵向协变量的右删失生存数据进行联合建模的经验似然 MLE
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-04-29 DOI: 10.1007/s10463-024-00899-5
Jian-Jian Ren, Yuyin Shi

Up to now, almost all existing methods for joint modeling survival data and longitudinal data rely on parametric/semiparametric assumptions on longitudinal covariate process, and the resulting inferences critically depend on the validity of these assumptions that are difficult to verify in practice. The kernel method-based procedures rely on choices of kernel function and bandwidth, and none of the existing methods provides estimate for the baseline distribution in proportional hazards model. This article proposes a proportional hazards model for joint modeling right censored survival data and intensive longitudinal data taking into account of within-subject historic change patterns. Without any parametric/semiparametric assumptions or use of kernel method, we derive empirical likelihood-based maximum likelihood estimators and partial likelihood estimators for the regression parameter and the baseline distribution function. We develop stable computing algorithms and present some simulation results. Analyses of real dataset are conducted for smoking cessation data and liver disease data.

迄今为止,几乎所有现有的生存数据和纵向数据联合建模方法都依赖于对纵向协变量过程的参数/半参数假设,而由此得出的推论关键取决于这些假设的有效性,而这些假设在实践中很难验证。基于核方法的程序依赖于核函数和带宽的选择,而现有的方法都不能提供比例危险模型中基线分布的估计。本文提出了一种比例危险模型,用于对右删减生存数据和密集纵向数据进行联合建模,并考虑到了研究对象内部的历史变化模式。在没有任何参数/半参数假设或使用核方法的情况下,我们为回归参数和基线分布函数推导出了基于经验似然的最大似然估计量和偏似然估计量。我们开发了稳定的计算算法,并展示了一些模拟结果。我们对戒烟数据和肝病数据的真实数据集进行了分析。
{"title":"Empirical likelihood MLE for joint modeling right censored survival data with longitudinal covariates","authors":"Jian-Jian Ren,&nbsp;Yuyin Shi","doi":"10.1007/s10463-024-00899-5","DOIUrl":"10.1007/s10463-024-00899-5","url":null,"abstract":"<div><p>Up to now, almost all existing methods for joint modeling survival data and longitudinal data rely on parametric/semiparametric assumptions on longitudinal covariate process, and the resulting inferences critically depend on the validity of these assumptions that are difficult to verify in practice. The kernel method-based procedures rely on choices of kernel function and bandwidth, and none of the existing methods provides estimate for the baseline distribution in proportional hazards model. This article proposes a proportional hazards model for joint modeling right censored survival data and intensive longitudinal data taking into account of within-subject historic change patterns. Without any parametric/semiparametric assumptions or use of kernel method, we derive empirical likelihood-based maximum likelihood estimators and partial likelihood estimators for the regression parameter and the baseline distribution function. We develop stable computing algorithms and present some simulation results. Analyses of real dataset are conducted for smoking cessation data and liver disease data.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"76 4","pages":"617 - 648"},"PeriodicalIF":0.8,"publicationDate":"2024-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140826716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing the coverage probabilities of fixed-margin confidence intervals for the tail conditional allocation 评估尾部条件分配的固定边际置信区间的覆盖概率
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-04-23 DOI: 10.1007/s10463-024-00904-x
N. V. Gribkova, J. Su, R. Zitikis

The tail conditional allocation plays an important role in a number of areas, including economics, finance, insurance, and management. Fixed-margin confidence intervals and the assessment of their coverage probabilities are of much interest. In this paper, we offer a convenient way to achieve these goals via resampling. The theoretical part of the paper, which is technically demanding, is rigorously established under minimal conditions to facilitate the widest practical use. A simulation-based study and an analysis of real data illustrate the performance of the developed methodology.

尾部条件分配在经济、金融、保险和管理等多个领域发挥着重要作用。固定边际置信区间及其覆盖概率的评估备受关注。在本文中,我们提供了一种通过重采样实现这些目标的便捷方法。本文的理论部分对技术要求很高,但为了便于最广泛的实际应用,我们在最低条件下建立了严格的理论。基于模拟的研究和对真实数据的分析说明了所开发方法的性能。
{"title":"Assessing the coverage probabilities of fixed-margin confidence intervals for the tail conditional allocation","authors":"N. V. Gribkova,&nbsp;J. Su,&nbsp;R. Zitikis","doi":"10.1007/s10463-024-00904-x","DOIUrl":"10.1007/s10463-024-00904-x","url":null,"abstract":"<div><p>The tail conditional allocation plays an important role in a number of areas, including economics, finance, insurance, and management. Fixed-margin confidence intervals and the assessment of their coverage probabilities are of much interest. In this paper, we offer a convenient way to achieve these goals via resampling. The theoretical part of the paper, which is technically demanding, is rigorously established under minimal conditions to facilitate the widest practical use. A simulation-based study and an analysis of real data illustrate the performance of the developed methodology.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"76 5","pages":"821 - 850"},"PeriodicalIF":0.8,"publicationDate":"2024-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140798164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quasi-maximum likelihood estimation and penalized estimation under non-standard conditions 非标准条件下的准极大似然估计和惩罚性估计
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-04-23 DOI: 10.1007/s10463-024-00901-0
Junichiro Yoshida, Nakahiro Yoshida

The purpose of this article is to develop a general parametric estimation theory that allows the derivation of the limit distribution of estimators in non-regular models where the true parameter value may lie on the boundary of the parameter space or where even identifiability fails. For that, we propose a more general local approximation of the parameter space (at the true value) than previous studies. This estimation theory is comprehensive in that it can handle penalized estimation as well as quasi-maximum likelihood estimation (in the ergodic or non-ergodic statistics) under such non-regular models. In penalized estimation, depending on the boundary constraint, even the concave Bridge estimator does not necessarily give selection consistency. Therefore, we describe some sufficient condition for selection consistency, precisely evaluating the balance between the boundary constraint and the form of the penalty. An example is penalized MLE of variance components of random effects in linear mixed models.

本文的目的是发展一种一般参数估计理论,在非规则模型中,真参数值可能位于参数空间的边界上,或者甚至在可识别性失效的情况下,可以推导出估计子的极限分布。为此,我们提出了比以往研究更通用的参数空间(真值)局部近似方法。这种估计理论是全面的,因为它可以在这种非规则模型下处理惩罚估计和准极大似然估计(在啮合或非啮合统计中)。在惩罚估计中,根据边界约束,即使是凹桥估计器也不一定能给出选择一致性。因此,我们描述了选择一致性的一些充分条件,精确评估了边界约束和惩罚形式之间的平衡。一个例子是线性混合模型中随机效应方差分量的惩罚 MLE。
{"title":"Quasi-maximum likelihood estimation and penalized estimation under non-standard conditions","authors":"Junichiro Yoshida,&nbsp;Nakahiro Yoshida","doi":"10.1007/s10463-024-00901-0","DOIUrl":"10.1007/s10463-024-00901-0","url":null,"abstract":"<div><p>The purpose of this article is to develop a general parametric estimation theory that allows the derivation of the limit distribution of estimators in non-regular models where the true parameter value may lie on the boundary of the parameter space or where even identifiability fails. For that, we propose a more general local approximation of the parameter space (at the true value) than previous studies. This estimation theory is comprehensive in that it can handle penalized estimation as well as quasi-maximum likelihood estimation (in the ergodic or non-ergodic statistics) under such non-regular models. In penalized estimation, depending on the boundary constraint, even the concave Bridge estimator does not necessarily give selection consistency. Therefore, we describe some sufficient condition for selection consistency, precisely evaluating the balance between the boundary constraint and the form of the penalty. An example is penalized MLE of variance components of random effects in linear mixed models.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"76 5","pages":"711 - 763"},"PeriodicalIF":0.8,"publicationDate":"2024-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140798165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A delineation of new classes of exponential dispersion models supported on the set of nonnegative integers 非负整数集支持的指数离散模型新类别划分
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-04-22 DOI: 10.1007/s10463-024-00903-y
Shaul K. Bar-Lev, Gérard Letac, Ad Ridder

The aim of this paper is to delineate a set of new classes of natural exponential families and their associated exponential dispersion models whose probability distributions are supported on the set of nonnegative integers with positive mass on 0 and 1. The new classes are obtained by considering a specific form of their variance functions. We show that the distributions of all these classes are supported on nonnegative integers, that they are infinitely divisible, and that they are skewed to the right, leptokurtic, over-dispersed, and zero-inflated (relative to the Poisson class). Accordingly, these new classes significantly enrich the set of probability models for modeling zero-inflated and over-dispersed count data. Furthermore, we elaborate on numerical techniques how to compute the distributions of our classes, and apply these to an actual data experiment.

本文旨在划分一组新的自然指数族及其相关指数离散模型,它们的概率分布都支持在非负整数集合上,0 和 1 的质量为正。我们证明,所有这些类别的分布都支持非负整数,它们是无限可分的,并且(相对于泊松类)向右倾斜、畸变、过度分散和零膨胀。因此,这些新类别极大地丰富了零膨胀和过度分散计数数据建模的概率模型集。此外,我们还阐述了如何计算我们的类别分布的数值技术,并将其应用于实际数据实验。
{"title":"A delineation of new classes of exponential dispersion models supported on the set of nonnegative integers","authors":"Shaul K. Bar-Lev,&nbsp;Gérard Letac,&nbsp;Ad Ridder","doi":"10.1007/s10463-024-00903-y","DOIUrl":"10.1007/s10463-024-00903-y","url":null,"abstract":"<div><p>The aim of this paper is to delineate a set of new classes of natural exponential families and their associated exponential dispersion models whose probability distributions are supported on the set of nonnegative integers with positive mass on 0 and 1. The new classes are obtained by considering a specific form of their variance functions. We show that the distributions of all these classes are supported on nonnegative integers, that they are infinitely divisible, and that they are skewed to the right, leptokurtic, over-dispersed, and zero-inflated (relative to the Poisson class). Accordingly, these new classes significantly enrich the set of probability models for modeling zero-inflated and over-dispersed count data. Furthermore, we elaborate on numerical techniques how to compute the distributions of our classes, and apply these to an actual data experiment.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"76 4","pages":"679 - 709"},"PeriodicalIF":0.8,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140673175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the universal consistency of an over-parametrized deep neural network estimate learned by gradient descent 论梯度下降法学习的过参数化深度神经网络估计的普遍一致性
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-04-08 DOI: 10.1007/s10463-024-00898-6
Selina Drews, Michael Kohler

Estimation of a multivariate regression function from independent and identically distributed data is considered. An estimate is defined which fits a deep neural network consisting of a large number of fully connected neural networks, which are computed in parallel, via gradient descent to the data. The estimate is over-parametrized in the sense that the number of its parameters is much larger than the sample size. It is shown that with a suitable random initialization of the network, a sufficiently small gradient descent step size, and a number of gradient descent steps that slightly exceed the reciprocal of this step size, the estimate is universally consistent. This means that the expected (L_2) error converges to zero for all distributions of the data where the response variable is square integrable.

研究考虑了从独立且同分布的数据中估计多元回归函数。我们定义了一个估计值,该估计值与一个深度神经网络相匹配,该网络由大量全连接神经网络组成,通过梯度下降对数据进行并行计算。从参数数量远大于样本量的意义上讲,该估计值是过参数化的。研究表明,如果对网络进行适当的随机初始化,梯度下降的步长足够小,梯度下降的步数略微超过步长的倒数,则估计结果是普遍一致的。这意味着,对于响应变量可平方整数的所有数据分布,预期的 (L_2) 误差都会趋近于零。
{"title":"On the universal consistency of an over-parametrized deep neural network estimate learned by gradient descent","authors":"Selina Drews,&nbsp;Michael Kohler","doi":"10.1007/s10463-024-00898-6","DOIUrl":"10.1007/s10463-024-00898-6","url":null,"abstract":"<div><p>Estimation of a multivariate regression function from independent and identically distributed data is considered. An estimate is defined which fits a deep neural network consisting of a large number of fully connected neural networks, which are computed in parallel, via gradient descent to the data. The estimate is over-parametrized in the sense that the number of its parameters is much larger than the sample size. It is shown that with a suitable random initialization of the network, a sufficiently small gradient descent step size, and a number of gradient descent steps that slightly exceed the reciprocal of this step size, the estimate is universally consistent. This means that the expected <span>(L_2)</span> error converges to zero for all distributions of the data where the response variable is square integrable.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"76 3","pages":"361 - 391"},"PeriodicalIF":0.8,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140565854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel two-sample test within the space of symmetric positive definite matrix distributions and its application in finance 对称正定矩阵分布空间内的新型双样本检验及其在金融领域的应用
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-04-08 DOI: 10.1007/s10463-024-00902-z
Žikica Lukić, Bojana Milošević

This paper introduces a novel two-sample test for a broad class of orthogonally invariant positive definite symmetric matrix distributions. Our test is the first of its kind, and we derive its asymptotic distribution. To estimate the test power, we use a warp-speed bootstrap method and consider the most common matrix distributions. We provide several real data examples, including the data for main cryptocurrencies and stock data of major US companies. The real data examples demonstrate the applicability of our test in the context closely related to algorithmic trading. The popularity of matrix distributions in many applications and the need for such a test in the literature are reconciled by our findings.

本文针对一大类正交不变正定对称矩阵分布提出了一种新颖的双样本检验。我们的检验是首个此类检验,并推导了其渐近分布。为了估计检验功率,我们使用了经速引导法,并考虑了最常见的矩阵分布。我们提供了几个真实数据示例,包括主要加密货币的数据和主要美国公司的股票数据。真实数据示例证明了我们的测试在与算法交易密切相关的环境中的适用性。我们的研究发现,矩阵分布在许多应用中的普及性和文献中对这种检验的需求是一致的。
{"title":"A novel two-sample test within the space of symmetric positive definite matrix distributions and its application in finance","authors":"Žikica Lukić,&nbsp;Bojana Milošević","doi":"10.1007/s10463-024-00902-z","DOIUrl":"10.1007/s10463-024-00902-z","url":null,"abstract":"<div><p>This paper introduces a novel two-sample test for a broad class of orthogonally invariant positive definite symmetric matrix distributions. Our test is the first of its kind, and we derive its asymptotic distribution. To estimate the test power, we use a warp-speed bootstrap method and consider the most common matrix distributions. We provide several real data examples, including the data for main cryptocurrencies and stock data of major US companies. The real data examples demonstrate the applicability of our test in the context closely related to algorithmic trading. The popularity of matrix distributions in many applications and the need for such a test in the literature are reconciled by our findings.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"76 5","pages":"797 - 820"},"PeriodicalIF":0.8,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140565855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical inference for random T-tessellations models. Application to agricultural landscape modeling 随机 T 型网格模型的统计推断。在农业景观建模中的应用
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-04-05 DOI: 10.1007/s10463-023-00893-3
Katarzyna Adamczyk-Chauvat, Mouna Kassa, Julien Papaïx, Kiên Kiêu, Radu S. Stoica

The Gibbsian T-tessellation models allow the representation of a wide range of spatial patterns. This paper proposes an integrated approach for statistical inference. Model parameters are estimated via Monte Carlo maximum likelihood. The simulations needed for likelihood computation are produced using an adapted Metropolis-Hastings-Green dynamics. In order to reduce the computational costs, a pseudolikelihood estimate is derived and then used for the initialization of the likelihood optimization. Model assessment is based on global envelope tests applied to the set of functional statistics of tessellation. Finally, a real data application is presented. This application analyzes three French agricultural landscapes. The Gibbs T-tessellation models simultaneously provide a morphological and statistical characterization of these data.

Gibbsian T-tessellation模型可以表示多种空间模式。本文提出了一种综合的统计推断方法。模型参数通过蒙特卡罗最大似然法进行估计。似然计算所需的模拟是通过改编的 Metropolis-Hastings-Green 动力学产生的。为了降低计算成本,先推导出伪似然估计值,然后用于似然优化的初始化。模型评估基于应用于镶嵌功能统计集的全局包络测试。最后,介绍了一个真实数据应用。该应用分析了三个法国农业景观。Gibbs T-tessellation模型同时提供了这些数据的形态和统计特征。
{"title":"Statistical inference for random T-tessellations models. Application to agricultural landscape modeling","authors":"Katarzyna Adamczyk-Chauvat,&nbsp;Mouna Kassa,&nbsp;Julien Papaïx,&nbsp;Kiên Kiêu,&nbsp;Radu S. Stoica","doi":"10.1007/s10463-023-00893-3","DOIUrl":"10.1007/s10463-023-00893-3","url":null,"abstract":"<div><p>The Gibbsian T-tessellation models allow the representation of a wide range of spatial patterns. This paper proposes an integrated approach for statistical inference. Model parameters are estimated via Monte Carlo maximum likelihood. The simulations needed for likelihood computation are produced using an adapted Metropolis-Hastings-Green dynamics. In order to reduce the computational costs, a pseudolikelihood estimate is derived and then used for the initialization of the likelihood optimization. Model assessment is based on global envelope tests applied to the set of functional statistics of tessellation. Finally, a real data application is presented. This application analyzes three French agricultural landscapes. The Gibbs T-tessellation models simultaneously provide a morphological and statistical characterization of these data.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"76 3","pages":"447 - 479"},"PeriodicalIF":0.8,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140565954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using the growth curve model in classification of repeated measurements 在重复测量分类中使用生长曲线模型
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-03-29 DOI: 10.1007/s10463-024-00900-1
Dietrich von Rosen, Martin Singull

In this paper, discrimination between two populations following the growth curve model is considered. A likelihood-based classification procedure is established, in the sense that we compare the two likelihoods given that the new observation belongs to respective population. The possibility to classify the new observation as belonging to an unknown population is discussed, which is shown to be natural when considering growth curves. Several examples and simulations are given to emphasize this possibility.

本文考虑了按照增长曲线模型区分两个种群的问题。本文建立了一个基于似然法的分类程序,即在新观测数据属于各自种群的情况下,比较两个似然法。我们还讨论了将新观测数据归类为未知种群的可能性,这在考虑增长曲线时是很自然的。我们给出了几个例子和模拟来强调这种可能性。
{"title":"Using the growth curve model in classification of repeated measurements","authors":"Dietrich von Rosen,&nbsp;Martin Singull","doi":"10.1007/s10463-024-00900-1","DOIUrl":"10.1007/s10463-024-00900-1","url":null,"abstract":"<div><p>In this paper, discrimination between two populations following the growth curve model is considered. A likelihood-based classification procedure is established, in the sense that we compare the two likelihoods given that the new observation belongs to respective population. The possibility to classify the new observation as belonging to an unknown population is discussed, which is shown to be natural when considering growth curves. Several examples and simulations are given to emphasize this possibility.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"76 3","pages":"511 - 534"},"PeriodicalIF":0.8,"publicationDate":"2024-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140367528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-sample hypothesis testing of high-dimensional mean vectors under covariance heterogeneity 协方差异质性下高维均值向量的多样本假设检验
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-03-22 DOI: 10.1007/s10463-024-00896-8
Lixiu Wu, Jiang Hu

In this paper, we focus on the hypothesis testing problem of the mean vectors of high-dimensional data in the multi-sample case. We propose two maximum-type statistics and apply a parametric bootstrap technique to compute the critical values. Unlike previous hypothesis testing methods that heavily depend on the structural assumptions of the unknown covariance matrix, the proposed methods accommodate a general covariance structure. Additionally, we introduce screening-based testing procedures to enhance the power of our tests. These test procedures do not require the use of approximate limiting distributions for the test statistics. Finally, we obtain and verify the theoretical properties through simulation studies.

本文主要研究多样本情况下高维数据均值向量的假设检验问题。我们提出了两种最大类型统计量,并应用参数自举技术计算临界值。以往的假设检验方法在很大程度上依赖于未知协方差矩阵的结构假设,与之不同的是,我们提出的方法适用于一般的协方差结构。此外,我们还引入了基于筛选的测试程序,以增强我们的测试能力。这些检验程序无需使用检验统计量的近似极限分布。最后,我们通过模拟研究获得并验证了理论特性。
{"title":"Multi-sample hypothesis testing of high-dimensional mean vectors under covariance heterogeneity","authors":"Lixiu Wu,&nbsp;Jiang Hu","doi":"10.1007/s10463-024-00896-8","DOIUrl":"10.1007/s10463-024-00896-8","url":null,"abstract":"<div><p>In this paper, we focus on the hypothesis testing problem of the mean vectors of high-dimensional data in the multi-sample case. We propose two maximum-type statistics and apply a parametric bootstrap technique to compute the critical values. Unlike previous hypothesis testing methods that heavily depend on the structural assumptions of the unknown covariance matrix, the proposed methods accommodate a general covariance structure. Additionally, we introduce screening-based testing procedures to enhance the power of our tests. These test procedures do not require the use of approximate limiting distributions for the test statistics. Finally, we obtain and verify the theoretical properties through simulation studies.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"76 4","pages":"579 - 615"},"PeriodicalIF":0.8,"publicationDate":"2024-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140204420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data segmentation for time series based on a general moving sum approach 基于一般移动总和方法的时间序列数据分割
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-03-14 DOI: 10.1007/s10463-023-00892-4
Claudia Kirch, Kerstin Reckruehm

We consider the multiple change point problem in a general framework based on estimating equations. This extends classical sample mean-based methodology to include robust methods but also different types of changes such as changes in linear regression or changes in count data including Poisson autoregressive time series. In this framework, we derive a general theory proving consistency for the number of change points and rates of convergence for the estimators of the locations of the change points. More precisely, two different types of MOSUM (moving sum) statistics are considered: A MOSUM-Wald statistic based on differences of local estimators and a MOSUM-score statistic based on a global inspection parameter. The latter is usually computationally less involved in particular in nonlinear problems where no closed form of the estimator is known such that numerical methods are required. Finally, we evaluate the methodology by some simulations as well as using geophysical well-log data.

我们在基于估计方程的一般框架中考虑了多变化点问题。这扩展了基于样本平均数的经典方法,不仅包括稳健方法,还包括不同类型的变化,如线性回归中的变化或包括泊松自回归时间序列在内的计数数据变化。在这一框架下,我们推导出一种一般理论,证明了变化点数量的一致性和变化点位置估计值的收敛率。更确切地说,我们考虑了两种不同类型的 MOSUM(移动总和)统计量:一种是基于局部估计值差异的 MOSUM-Wald 统计量,另一种是基于全局检验参数的 MOSUM-score 统计量。后者的计算量通常较小,尤其是在非线性问题中,由于不知道估计器的封闭形式,因此需要使用数值方法。最后,我们通过一些模拟以及地球物理井记录数据对该方法进行了评估。
{"title":"Data segmentation for time series based on a general moving sum approach","authors":"Claudia Kirch,&nbsp;Kerstin Reckruehm","doi":"10.1007/s10463-023-00892-4","DOIUrl":"10.1007/s10463-023-00892-4","url":null,"abstract":"<div><p>We consider the multiple change point problem in a general framework based on estimating equations. This extends classical sample mean-based methodology to include robust methods but also different types of changes such as changes in linear regression or changes in count data including Poisson autoregressive time series. In this framework, we derive a general theory proving consistency for the number of change points and rates of convergence for the estimators of the locations of the change points. More precisely, two different types of MOSUM (moving sum) statistics are considered: A MOSUM-Wald statistic based on differences of local estimators and a MOSUM-score statistic based on a global inspection parameter. The latter is usually computationally less involved in particular in nonlinear problems where no closed form of the estimator is known such that numerical methods are required. Finally, we evaluate the methodology by some simulations as well as using geophysical well-log data.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"76 3","pages":"393 - 421"},"PeriodicalIF":0.8,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140124482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annals of the Institute of Statistical Mathematics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1