首页 > 最新文献

Journal of Statistical Planning and Inference最新文献

英文 中文
Homogeneity testing under finite mixtures of multivariate Poisson distributions 多元泊松分布有限混合下的均匀性检验
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2026-07-01 Epub Date: 2025-11-27 DOI: 10.1016/j.jspi.2025.106369
Guanfu Liu , Yuejiao Fu
The finite mixtures of multivariate Poisson (FMMP) distributions have wide applications in the real world. Testing for homogeneity under the FMMP models is important, however, there is no generic solution to this problem as far as we know. In this paper, we propose an EM-test for homogeneity under the FMMP models to fulfill the gap. We establish the strong consistency of the maximum likelihood estimator for the mixing distribution by relaxing two conditions required in existing literature. The null limiting distribution of the proposed test is studied, and based on the limiting distribution, a resampling procedure is constructed to approximate the p-value of the test. The loss of the strong identifiability for the multivariate Poisson distribution poses a significant challenge in deriving the null limiting distribution. Finally, simulation studies and real-data analysis demonstrate the good performance of the proposed test.
多元泊松分布的有限混合在现实世界中有着广泛的应用。在FMMP模型下测试同质性是很重要的,然而,据我们所知,这个问题没有通用的解决方案。在本文中,我们提出了FMMP模型下的同质性的em检验来填补这一空白。通过放宽现有文献中所要求的两个条件,建立了混合分布的极大似然估计量的强相合性。研究了该检验的零极限分布,并基于该极限分布构造了近似检验p值的重采样程序。多元泊松分布的强可辨识性的丧失对零极限分布的推导提出了重大挑战。最后,仿真研究和实际数据分析验证了该方法的良好性能。
{"title":"Homogeneity testing under finite mixtures of multivariate Poisson distributions","authors":"Guanfu Liu ,&nbsp;Yuejiao Fu","doi":"10.1016/j.jspi.2025.106369","DOIUrl":"10.1016/j.jspi.2025.106369","url":null,"abstract":"<div><div>The finite mixtures of multivariate Poisson (FMMP) distributions have wide applications in the real world. Testing for homogeneity under the FMMP models is important, however, there is no generic solution to this problem as far as we know. In this paper, we propose an EM-test for homogeneity under the FMMP models to fulfill the gap. We establish the strong consistency of the maximum likelihood estimator for the mixing distribution by relaxing two conditions required in existing literature. The null limiting distribution of the proposed test is studied, and based on the limiting distribution, a resampling procedure is constructed to approximate the <span><math><mi>p</mi></math></span>-value of the test. The loss of the strong identifiability for the multivariate Poisson distribution poses a significant challenge in deriving the null limiting distribution. Finally, simulation studies and real-data analysis demonstrate the good performance of the proposed test.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"243 ","pages":"Article 106369"},"PeriodicalIF":0.8,"publicationDate":"2026-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145610584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Max–min experimental designs for comparing pairs of treatments with binary outcomes 对具有二元结果的处理进行配对比较的最大最小实验设计
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2026-07-01 Epub Date: 2026-01-20 DOI: 10.1016/j.jspi.2026.106379
Satya Prakash Singh , Ori Davidov
In many experimental settings, the primary focus is the comparisons of pairs of treatments. This paper addresses the problem of optimally allocating experimental units to treatment groups when responses are binary. The proposed approach employs a power-based max–min approach that identifies the optimal experimental design in various experimental settings.
在许多实验环境中,主要关注的是对治疗方法的比较。本文解决了当反应是二元的情况下,如何将实验单元最优地分配给治疗组的问题。所提出的方法采用基于功率的最大最小方法来识别各种实验设置中的最佳实验设计。
{"title":"Max–min experimental designs for comparing pairs of treatments with binary outcomes","authors":"Satya Prakash Singh ,&nbsp;Ori Davidov","doi":"10.1016/j.jspi.2026.106379","DOIUrl":"10.1016/j.jspi.2026.106379","url":null,"abstract":"<div><div>In many experimental settings, the primary focus is the comparisons of pairs of treatments. This paper addresses the problem of optimally allocating experimental units to treatment groups when responses are binary. The proposed approach employs a power-based max–min approach that identifies the optimal experimental design in various experimental settings.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"243 ","pages":"Article 106379"},"PeriodicalIF":0.8,"publicationDate":"2026-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust and consistent model evaluation criteria in high-dimensional regression 高维回归中稳健一致的模型评价准则
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2026-05-01 Epub Date: 2025-10-28 DOI: 10.1016/j.jspi.2025.106358
Sumito Kurata, Kei Hirose
Most of the regularization methods such as the LASSO have one (or more) regularization parameter(s), and to select the value of the regularization parameter is essentially equal to select a model. Thus, to obtain a model suitable for the data and phenomenon, we need to determine an adequate value of the regularization parameter. Regarding the determination of the regularization parameter in the linear regression model, we often apply the information criteria like the AIC and BIC, however, it has been pointed out that these criteria are sensitive to outliers and tend not to perform well in high-dimensional settings. Outliers generally have a negative effect on not only estimation but also model selection, consequently, it is important to employ a selection method with robustness against outliers. In addition, when the number of explanatory variables is quite large, most conventional criteria are prone to select unnecessary explanatory variables. In this paper, we propose model evaluation criteria based on the statistical divergence with excellence in robustness in both of parametric estimation and model selection, by applying the quasi-Bayesian procedure. Our proposed criteria achieve the selection consistency even in high-dimensional settings due to precise approximation, simultaneously with robustness. We also investigate the conditions for establishing robustness and consistency, and provide an appropriate example of the divergence and penalty term that can achieve the desirable properties. We finally report the results of some numerical examples to verify that the proposed criteria perform robust and consistent variable selection compared with the conventional selection methods.
大多数正则化方法(如LASSO)都有一个(或多个)正则化参数,选择正则化参数的值本质上等于选择一个模型。因此,为了获得适合于数据和现象的模型,我们需要确定一个适当的正则化参数值。对于线性回归模型中正则化参数的确定,我们通常采用AIC和BIC等信息准则,但已有研究指出,这些准则对异常值敏感,在高维环境下往往表现不佳。异常值不仅对估计有负面影响,而且对模型选择也有负面影响,因此,采用对异常值具有鲁棒性的选择方法非常重要。此外,当解释变量的数量相当大时,大多数常规标准容易选择不必要的解释变量。本文应用拟贝叶斯过程,提出了基于统计散度的模型评价准则,该准则在参数估计和模型选择上都具有较好的鲁棒性。我们提出的标准即使在高维环境下,由于精确的近似,也能实现选择一致性,同时具有鲁棒性。我们还研究了建立鲁棒性和一致性的条件,并提供了一个适当的散度和惩罚项的例子,可以达到期望的性质。最后给出了一些数值算例,验证了所提出的准则与传统的选择方法相比具有鲁棒性和一致性。
{"title":"Robust and consistent model evaluation criteria in high-dimensional regression","authors":"Sumito Kurata,&nbsp;Kei Hirose","doi":"10.1016/j.jspi.2025.106358","DOIUrl":"10.1016/j.jspi.2025.106358","url":null,"abstract":"<div><div>Most of the regularization methods such as the LASSO have one (or more) regularization parameter(s), and to select the value of the regularization parameter is essentially equal to select a model. Thus, to obtain a model suitable for the data and phenomenon, we need to determine an adequate value of the regularization parameter. Regarding the determination of the regularization parameter in the linear regression model, we often apply the information criteria like the AIC and BIC, however, it has been pointed out that these criteria are sensitive to outliers and tend not to perform well in high-dimensional settings. Outliers generally have a negative effect on not only estimation but also model selection, consequently, it is important to employ a selection method with robustness against outliers. In addition, when the number of explanatory variables is quite large, most conventional criteria are prone to select unnecessary explanatory variables. In this paper, we propose model evaluation criteria based on the statistical divergence with excellence in robustness in both of parametric estimation and model selection, by applying the quasi-Bayesian procedure. Our proposed criteria achieve the selection consistency even in high-dimensional settings due to precise approximation, simultaneously with robustness. We also investigate the conditions for establishing robustness and consistency, and provide an appropriate example of the divergence and penalty term that can achieve the desirable properties. We finally report the results of some numerical examples to verify that the proposed criteria perform robust and consistent variable selection compared with the conventional selection methods.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"242 ","pages":"Article 106358"},"PeriodicalIF":0.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145415645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inference for trend functions in partially linear models 部分线性模型中趋势函数的推断
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2026-05-01 Epub Date: 2025-09-02 DOI: 10.1016/j.jspi.2025.106338
Sijie Zheng , Xiaojun Song
A nonparametric test is developed to determine whether the trend of a partially linear model (PLM) with dependent errors and locally stationary regressors follows a specific parametric form. The test is asymptotically normal under the null hypothesis of correct trend specification and is consistent against various alternatives that deviate from the null hypothesis. The testing power against two classes of local alternatives approaching the null at different rates is derived, along with the asymptotic distribution of the test under fixed alternatives. We also propose a wild bootstrap procedure to better approximate the finite sample null distribution of the test. Statistical inference is performed on the trend specification in the Phillips curve and ozone concentration.
提出了一种非参数检验方法,以确定具有相关误差和局部平稳回归量的部分线性模型(PLM)的趋势是否遵循特定参数形式。在正确趋势规范的零假设下,检验是渐近正态的,并且对于偏离零假设的各种替代方案是一致的。导出了针对两类以不同速率接近零的局部选择的检验能力,以及在固定选择下检验的渐近分布。我们还提出了一个野生自举过程,以更好地近似检验的有限样本零分布。对菲利普斯曲线的趋势规范和臭氧浓度进行了统计推断。
{"title":"Inference for trend functions in partially linear models","authors":"Sijie Zheng ,&nbsp;Xiaojun Song","doi":"10.1016/j.jspi.2025.106338","DOIUrl":"10.1016/j.jspi.2025.106338","url":null,"abstract":"<div><div>A nonparametric test is developed to determine whether the trend of a partially linear model (PLM) with dependent errors and locally stationary regressors follows a specific parametric form. The test is asymptotically normal under the null hypothesis of correct trend specification and is consistent against various alternatives that deviate from the null hypothesis. The testing power against two classes of local alternatives approaching the null at different rates is derived, along with the asymptotic distribution of the test under fixed alternatives. We also propose a wild bootstrap procedure to better approximate the finite sample null distribution of the test. Statistical inference is performed on the trend specification in the Phillips curve and ozone concentration.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"242 ","pages":"Article 106338"},"PeriodicalIF":0.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145004499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The k-sample Behrens-Fisher problem for high-dimensional data with model free assumption 具有无模型假设的高维数据k样本Behrens-Fisher问题
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2026-05-01 Epub Date: 2025-09-27 DOI: 10.1016/j.jspi.2025.106354
Yanbo Pei, Xiaoxiao Ren, Baoxue Zhang
The problem of testing the equality of k-sample mean vectors with different covariance matrices, known as the Behrens-Fisher (BF) problem for k-sample, is a significant issue in statistics. Hu and Bai (2017) proposed a test statistic that operates under a factor-like model structure assumption and demonstrated its normal limit. Building on this work, we further explore the asymptotic properties of the test statistic. We prove that the asymptotic null distribution of the test statistic is a Chi-square-type mixture distribution under a model-free assumption and establish its asymptotic power under a full alternative hypothesis. Moreover, we show that the asymptotic null distribution of the test statistic is either normal or a weighted sum of normal and Chi-square random variables, depending on the convergence rate of the eigenvalues of the covariance matrix with model free assumption. To address practical challenges in high-dimensional data, we propose a new weighted bootstrap procedure that is simple to implement. Simulation studies demonstrate that our proposed test procedure outperforms existing methods in terms of size control under various settings. Furthermore, real data applications illustrate the applicability of our test procedure to a variety of high-dimensional data analysis problems.
用不同的协方差矩阵检验k-样本均值向量是否相等的问题,被称为k-样本的Behrens-Fisher (BF)问题,是统计学中的一个重要问题。Hu和Bai(2017)提出了在类因子模型结构假设下运行的检验统计量,并证明了其正常极限。在此基础上,我们进一步探讨了检验统计量的渐近性质。在无模型假设下证明了检验统计量的渐近零分布是一个卡方型混合分布,在完全备择假设下证明了检验统计量的渐近幂。此外,我们证明了检验统计量的渐近零分布要么是正态分布,要么是正态和卡方随机变量的加权和,这取决于在无模型假设下协方差矩阵的特征值的收敛速度。为了解决高维数据中的实际挑战,我们提出了一种新的加权自举过程,该过程易于实现。仿真研究表明,我们提出的测试程序在各种设置下的尺寸控制方面优于现有方法。此外,实际数据应用说明了我们的测试程序对各种高维数据分析问题的适用性。
{"title":"The k-sample Behrens-Fisher problem for high-dimensional data with model free assumption","authors":"Yanbo Pei,&nbsp;Xiaoxiao Ren,&nbsp;Baoxue Zhang","doi":"10.1016/j.jspi.2025.106354","DOIUrl":"10.1016/j.jspi.2025.106354","url":null,"abstract":"<div><div>The problem of testing the equality of <em>k</em>-sample mean vectors with different covariance matrices, known as the Behrens-Fisher (BF) problem for <em>k</em>-sample, is a significant issue in statistics. Hu and Bai (2017) proposed a test statistic that operates under a factor-like model structure assumption and demonstrated its normal limit. Building on this work, we further explore the asymptotic properties of the test statistic. We prove that the asymptotic null distribution of the test statistic is a Chi-square-type mixture distribution under a model-free assumption and establish its asymptotic power under a full alternative hypothesis. Moreover, we show that the asymptotic null distribution of the test statistic is either normal or a weighted sum of normal and Chi-square random variables, depending on the convergence rate of the eigenvalues of the covariance matrix with model free assumption. To address practical challenges in high-dimensional data, we propose a new weighted bootstrap procedure that is simple to implement. Simulation studies demonstrate that our proposed test procedure outperforms existing methods in terms of size control under various settings. Furthermore, real data applications illustrate the applicability of our test procedure to a variety of high-dimensional data analysis problems.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"242 ","pages":"Article 106354"},"PeriodicalIF":0.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145220648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing goodness-of-fit for sparse categories using Rényi divergence 利用rsamnyi散度评估稀疏分类的拟合优度
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2026-05-01 Epub Date: 2025-09-19 DOI: 10.1016/j.jspi.2025.106350
Raul Matsushita , Gabriel Gomes , Regina Da Fonseca , Eduardo Nakano , Roberto Vila
We present the Rényi divergence as a statistic for assessing goodness-of-fit in sparse frequency tables, where small expected counts can undermine the reliability of the traditional chi-square test. The Rényi divergence with index in (0,1) is a natural choice because it circumvents division-related issues by small frequencies. Our main result demonstrates that the Rényi statistic asymptotically follows a chi-square distribution. Through theoretical insights and Monte Carlo simulations, we evaluate the performance of the Rényi statistic across various values of the divergence index. We find that smaller index values improve the alignment of the Rényi statistic with the chi-square distribution and enhance its performance in sparse data settings. Additionally, the Rényi statistic exhibits good power properties in detecting deviations from the null hypothesis under these conditions. To illustrate its practical applicability, we present two real-world data analyses, highlighting the robustness of the Rényi divergence in scenarios involving sparse categories.
我们将r尼散度作为稀疏频率表中评估拟合优度的统计量,其中较小的期望计数会破坏传统卡方检验的可靠性。指数为(0,1)的r nyi散度是一种自然选择,因为它通过小频率规避了与除法相关的问题。我们的主要结果表明,r逍遥统计量渐近地遵循卡方分布。通过理论分析和蒙特卡罗模拟,我们评估了rsamnyi统计在不同散度指数值上的性能。我们发现,较小的指数值改善了rsami统计量与卡方分布的一致性,并提高了其在稀疏数据设置中的性能。此外,在这些条件下,rsamnyi统计量在检测零假设偏差方面表现出良好的功率特性。为了说明它的实际适用性,我们提出了两个真实世界的数据分析,强调了在涉及稀疏类别的情况下rsamnyi分歧的鲁棒性。
{"title":"Assessing goodness-of-fit for sparse categories using Rényi divergence","authors":"Raul Matsushita ,&nbsp;Gabriel Gomes ,&nbsp;Regina Da Fonseca ,&nbsp;Eduardo Nakano ,&nbsp;Roberto Vila","doi":"10.1016/j.jspi.2025.106350","DOIUrl":"10.1016/j.jspi.2025.106350","url":null,"abstract":"<div><div>We present the Rényi divergence as a statistic for assessing goodness-of-fit in sparse frequency tables, where small expected counts can undermine the reliability of the traditional chi-square test. The Rényi divergence with index in (0,1) is a natural choice because it circumvents division-related issues by small frequencies. Our main result demonstrates that the Rényi statistic asymptotically follows a chi-square distribution. Through theoretical insights and Monte Carlo simulations, we evaluate the performance of the Rényi statistic across various values of the divergence index. We find that smaller index values improve the alignment of the Rényi statistic with the chi-square distribution and enhance its performance in sparse data settings. Additionally, the Rényi statistic exhibits good power properties in detecting deviations from the null hypothesis under these conditions. To illustrate its practical applicability, we present two real-world data analyses, highlighting the robustness of the Rényi divergence in scenarios involving sparse categories.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"242 ","pages":"Article 106350"},"PeriodicalIF":0.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145105738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tuning differential evolution algorithm for constructing uniform projection designs 构造均匀投影设计的差分进化优化算法
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2026-05-01 Epub Date: 2025-10-21 DOI: 10.1016/j.jspi.2025.106356
Samuel Onyambu, Hongquan Xu
Space-filling designs are extensively used in computer experiments to analyze complex systems. Among these, uniform projection designs stand out for their desirable low-dimensional projection properties and robustness against other criteria. However, no efficient algorithm currently exists for generating such designs. This study explores the construction of uniform projection designs using a differential evolution (DE) algorithm. DE, an evolutionary algorithm, is known for its simplicity, robustness, and effectiveness in solving complex optimization problems, though its performance is highly sensitive to several hyperparameters. Our goal is to investigate the structure of the hyperparameter space, evaluate the contribution of each hyperparameter, and provide guidelines for optimal hyperparameter settings across various scenarios. To achieve this, we conduct a comprehensive comparison of different experimental designs and surrogate models.
空间填充设计在分析复杂系统的计算机实验中被广泛使用。其中,均匀投影设计以其理想的低维投影特性和对其他标准的鲁棒性而脱颖而出。然而,目前还没有有效的算法来生成这样的设计。本研究探讨了使用差分进化(DE)算法构建均匀投影设计。DE是一种进化算法,以其简单性、鲁棒性和解决复杂优化问题的有效性而闻名,尽管它的性能对几个超参数非常敏感。我们的目标是研究超参数空间的结构,评估每个超参数的贡献,并为各种场景下的最佳超参数设置提供指导。为了实现这一点,我们对不同的实验设计和替代模型进行了全面的比较。
{"title":"Tuning differential evolution algorithm for constructing uniform projection designs","authors":"Samuel Onyambu,&nbsp;Hongquan Xu","doi":"10.1016/j.jspi.2025.106356","DOIUrl":"10.1016/j.jspi.2025.106356","url":null,"abstract":"<div><div>Space-filling designs are extensively used in computer experiments to analyze complex systems. Among these, uniform projection designs stand out for their desirable low-dimensional projection properties and robustness against other criteria. However, no efficient algorithm currently exists for generating such designs. This study explores the construction of uniform projection designs using a differential evolution (DE) algorithm. DE, an evolutionary algorithm, is known for its simplicity, robustness, and effectiveness in solving complex optimization problems, though its performance is highly sensitive to several hyperparameters. Our goal is to investigate the structure of the hyperparameter space, evaluate the contribution of each hyperparameter, and provide guidelines for optimal hyperparameter settings across various scenarios. To achieve this, we conduct a comprehensive comparison of different experimental designs and surrogate models.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"242 ","pages":"Article 106356"},"PeriodicalIF":0.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145362408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Structured regularization covariance estimation in tensor-valued data analysis 张量值数据分析中的结构化正则化协方差估计
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2026-05-01 Epub Date: 2025-09-06 DOI: 10.1016/j.jspi.2025.106337
Jiangyan Wang, Yang Ren, Jinguan Lin
Covariance estimation poses a crucial challenge in high-dimensional data analysis, especially when traditional methods (e.g., sample covariance) are inaccurate, particularly with small sample sizes. A promising solution is to exploit inherent data structures such as low-rankness, sparsity, or smoothness. For tensor data (multi-dimensional arrays), structured regularization aids in dimensionality reduction. This paper introduces novel regularization methods for tensor covariance estimation, specifically applying banded and tapering structures to the covariance matrix. We use Kronecker Product Canonical Polyadic (KPCP) decomposition to approximate large matrices via the Kronecker product of smaller matrices. A split resampling scheme is employed to select parameters for the KPCP decomposition from noisy data. This leads to two methods: KPCP-TB-R (Triply Banded-Resampling) and KPCP-TT-R (Triply Tapering-Resampling). Additionally, sparse (thresholding) and multi-structured regularization approaches are introduced for comparison. The effectiveness and robustness of the proposed methods are validated through extensive simulations and applied to monthly export trade volume data.
协方差估计在高维数据分析中提出了一个至关重要的挑战,特别是当传统方法(例如样本协方差)不准确时,特别是在小样本量的情况下。一个有希望的解决方案是利用固有的数据结构,如低秩、稀疏性或平滑性。对于张量数据(多维数组),结构化正则化有助于降维。介绍了一种新的正则化方法用于张量协方差估计,特别是对协方差矩阵应用带状结构和锥形结构。我们使用Kronecker积正则多进分解(KPCP)通过较小矩阵的Kronecker积来近似大矩阵。采用分割重采样的方法从噪声数据中选择KPCP分解的参数。这导致了两种方法:kcpp - tb - r(三重带重采样)和kcpp - tt - r(三重锥形重采样)。此外,还介绍了稀疏(阈值)和多结构正则化方法进行比较。通过大量的模拟和月度出口贸易量数据验证了所提出方法的有效性和鲁棒性。
{"title":"Structured regularization covariance estimation in tensor-valued data analysis","authors":"Jiangyan Wang,&nbsp;Yang Ren,&nbsp;Jinguan Lin","doi":"10.1016/j.jspi.2025.106337","DOIUrl":"10.1016/j.jspi.2025.106337","url":null,"abstract":"<div><div>Covariance estimation poses a crucial challenge in high-dimensional data analysis, especially when traditional methods (e.g., sample covariance) are inaccurate, particularly with small sample sizes. A promising solution is to exploit inherent data structures such as low-rankness, sparsity, or smoothness. For tensor data (multi-dimensional arrays), structured regularization aids in dimensionality reduction. This paper introduces novel regularization methods for tensor covariance estimation, specifically applying banded and tapering structures to the covariance matrix. We use Kronecker Product Canonical Polyadic (KPCP) decomposition to approximate large matrices via the Kronecker product of smaller matrices. A split resampling scheme is employed to select parameters for the KPCP decomposition from noisy data. This leads to two methods: KPCP-TB-R (Triply Banded-Resampling) and KPCP-TT-R (Triply Tapering-Resampling). Additionally, sparse (thresholding) and multi-structured regularization approaches are introduced for comparison. The effectiveness and robustness of the proposed methods are validated through extensive simulations and applied to monthly export trade volume data.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"242 ","pages":"Article 106337"},"PeriodicalIF":0.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145049542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Orthogonal Latin hypercube designs with hidden low-dimensional projection 具有隐藏低维投影的正交拉丁超立方体设计
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2026-05-01 Epub Date: 2025-09-24 DOI: 10.1016/j.jspi.2025.106349
Tian-fang Zhang , Yue-ru Yan , Fasheng Sun
Orthogonal Latin hypercube designs are widely used in computer experiments because of their attractive properties. In this article, we develop a new grouping method to construct such designs. Compared to the existing results, the new constructed designs can accommodate more factors with the same runsize, which means they are more cost-effective. Moreover, the resulting designs possess not only orthogonality, but also appealing space-filling properties in low dimensions, which make them very suitable for computer experiments.
正交拉丁超立方体设计由于其诱人的特性在计算机实验中得到了广泛的应用。在本文中,我们开发了一种新的分组方法来构造这样的设计。与现有的结果相比,新构建的设计可以在相同的运行尺寸下容纳更多的因素,这意味着它们更具成本效益。此外,所得到的设计不仅具有正交性,而且在低维空间中具有吸引人的空间填充特性,这使它们非常适合计算机实验。
{"title":"Orthogonal Latin hypercube designs with hidden low-dimensional projection","authors":"Tian-fang Zhang ,&nbsp;Yue-ru Yan ,&nbsp;Fasheng Sun","doi":"10.1016/j.jspi.2025.106349","DOIUrl":"10.1016/j.jspi.2025.106349","url":null,"abstract":"<div><div>Orthogonal Latin hypercube designs are widely used in computer experiments because of their attractive properties. In this article, we develop a new grouping method to construct such designs. Compared to the existing results, the new constructed designs can accommodate more factors with the same runsize, which means they are more cost-effective. Moreover, the resulting designs possess not only orthogonality, but also appealing space-filling properties in low dimensions, which make them very suitable for computer experiments.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"242 ","pages":"Article 106349"},"PeriodicalIF":0.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145220650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust and computationally efficient gradient-based estimation 稳健且计算效率高的梯度估计
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2026-05-01 Epub Date: 2025-09-22 DOI: 10.1016/j.jspi.2025.106351
Yibo Yan , Xiaozhou Wang , Riquan Zhang
In this paper, we propose a class of estimators based on the robust and computationally efficient gradient estimation for both low- and high-dimensional risk minimization framework. The gradient estimation in this work is constructed using a series of newly proposed univariate robust and efficient mean estimators. Our proposed estimators are obtained iteratively using a variant of the gradient descent method, where the update direction is determined by a robust and computationally efficient gradient. These estimators not only have explicit expressions and can be obtained through arithmetic operations but are also robust to arbitrary outliers in common statistical models. Theoretically, we establish the convergence of the algorithms and derive non-asymptotic error bounds for these iterative estimators. Specifically, we apply our methods to linear and logistic regression models, achieving robust parameter estimates and corresponding excess risk bounds. Unlike previous work, our theoretical results rely on a magnitude function of the outliers, which captures the extent of their deviation from the inliers. Finally, we present extensive simulation experiments on both low- and high-dimensional linear models to demonstrate the superior performance of our proposed estimators compared to several baseline methods.
本文针对低维和高维风险最小化框架,提出了一类基于鲁棒性和计算效率高的梯度估计。本文中的梯度估计是使用一系列新提出的单变量稳健高效均值估计量来构造的。我们提出的估计量是使用梯度下降法的一种变体迭代获得的,其中更新方向由一个鲁棒且计算效率高的梯度决定。这些估计量不仅具有显式表达式,可以通过算术运算得到,而且对常见统计模型中的任意离群值具有鲁棒性。在理论上,我们建立了算法的收敛性,并推导了这些迭代估计的非渐近误差界。具体来说,我们将我们的方法应用于线性和逻辑回归模型,实现了鲁棒参数估计和相应的超额风险界限。与以前的工作不同,我们的理论结果依赖于离群值的大小函数,它捕获了离群值与内线的偏差程度。最后,我们在低维和高维线性模型上进行了广泛的模拟实验,以证明与几种基线方法相比,我们提出的估计器具有优越的性能。
{"title":"Robust and computationally efficient gradient-based estimation","authors":"Yibo Yan ,&nbsp;Xiaozhou Wang ,&nbsp;Riquan Zhang","doi":"10.1016/j.jspi.2025.106351","DOIUrl":"10.1016/j.jspi.2025.106351","url":null,"abstract":"<div><div>In this paper, we propose a class of estimators based on the robust and computationally efficient gradient estimation for both low- and high-dimensional risk minimization framework. The gradient estimation in this work is constructed using a series of newly proposed univariate robust and efficient mean estimators. Our proposed estimators are obtained iteratively using a variant of the gradient descent method, where the update direction is determined by a robust and computationally efficient gradient. These estimators not only have explicit expressions and can be obtained through arithmetic operations but are also robust to arbitrary outliers in common statistical models. Theoretically, we establish the convergence of the algorithms and derive non-asymptotic error bounds for these iterative estimators. Specifically, we apply our methods to linear and logistic regression models, achieving robust parameter estimates and corresponding excess risk bounds. Unlike previous work, our theoretical results rely on a magnitude function of the outliers, which captures the extent of their deviation from the inliers. Finally, we present extensive simulation experiments on both low- and high-dimensional linear models to demonstrate the superior performance of our proposed estimators compared to several baseline methods.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"242 ","pages":"Article 106351"},"PeriodicalIF":0.8,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145158353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Statistical Planning and Inference
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1