首页 > 最新文献

Statistical Papers最新文献

英文 中文
Matrix-variate generalized linear model with measurement error 带测量误差的矩阵变量广义线性模型
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-06 DOI: 10.1007/s00362-024-01540-6
Tianqi Sun, Weiyu Li, Lu Lin

Matrix-variate generalized linear model (mvGLM) has been investigated successfully under the framework of tensor generalized linear model, because matrix-form data can be regarded as a specific tensor (2-dimension). But there are few works focusing on matrix-form data with measurement error (ME), since tensor in conjunction with ME is relatively complex in structure. In this paper we introduce a mvGLM to primarily explore the influence of ME in the model with matrix-form data. We calculate the asymptotic bias based on error-prone mvGLM, and then develop bias-correction methods to tackle the affect of ME. Statistical properties for all methods are established, and the practical performance of all methods is further evaluated in analysis on synthetic and real data sets.

在张量广义线性模型的框架下,矩阵变量广义线性模型(mvGLM)已经得到了成功的研究,因为矩阵形式的数据可以被视为一个特定的张量(二维)。但是,由于带有测量误差(ME)的张量结构相对复杂,因此关注带有测量误差(ME)的矩阵形式数据的研究很少。在本文中,我们引入了 mvGLM,主要探讨 ME 在矩阵形式数据模型中的影响。我们基于易出错的 mvGLM 计算渐近偏差,然后开发偏差修正方法来解决 ME 的影响。我们建立了所有方法的统计特性,并通过对合成数据集和真实数据集的分析进一步评估了所有方法的实际性能。
{"title":"Matrix-variate generalized linear model with measurement error","authors":"Tianqi Sun, Weiyu Li, Lu Lin","doi":"10.1007/s00362-024-01540-6","DOIUrl":"https://doi.org/10.1007/s00362-024-01540-6","url":null,"abstract":"<p>Matrix-variate generalized linear model (mvGLM) has been investigated successfully under the framework of tensor generalized linear model, because matrix-form data can be regarded as a specific tensor (2-dimension). But there are few works focusing on matrix-form data with measurement error (ME), since tensor in conjunction with ME is relatively complex in structure. In this paper we introduce a mvGLM to primarily explore the influence of ME in the model with matrix-form data. We calculate the asymptotic bias based on error-prone mvGLM, and then develop bias-correction methods to tackle the affect of ME. Statistical properties for all methods are established, and the practical performance of all methods is further evaluated in analysis on synthetic and real data sets.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"55 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140602947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Some practical and theoretical issues related to the quantile estimators 与量化估计器有关的一些实践和理论问题
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-05 DOI: 10.1007/s00362-024-01543-3
Dagmara Dudek, Anna Kuczmaszewska

The paper contains the comparative analysis of the efficiency of different qunatile estimators for various distributions. Additionally, we show strong consistency of different quantile estimators and we study the Bahadur representation for each of the quantile estimators, when the sample is taken from NA, (varphi ), (rho ^*), (rho )-mixing population.

本文对各种分布的不同量化估计器的效率进行了比较分析。此外,我们还展示了不同量化估计器的强一致性,并研究了当样本取自 NA、(varphi )、(rho ^*)、(rho )-混合人群时,每个量化估计器的 Bahadur 表示。
{"title":"Some practical and theoretical issues related to the quantile estimators","authors":"Dagmara Dudek, Anna Kuczmaszewska","doi":"10.1007/s00362-024-01543-3","DOIUrl":"https://doi.org/10.1007/s00362-024-01543-3","url":null,"abstract":"<p>The paper contains the comparative analysis of the efficiency of different qunatile estimators for various distributions. Additionally, we show strong consistency of different quantile estimators and we study the Bahadur representation for each of the quantile estimators, when the sample is taken from NA, <span>(varphi )</span>, <span>(rho ^*)</span>, <span>(rho )</span>-mixing population.\u0000</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"46 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140573472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The exponentiated exponentially weighted moving average control chart 指数加权移动平均控制图
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-03 DOI: 10.1007/s00362-024-01544-2
Vasileios Alevizakos, Arpita Chatterjee, Kashinath Chatterjee, Christos Koukouvinos

Memory-type control charts are widely used for monitoring small to moderate shifts in the process parameter(s). In the present article, we present an exponentiated exponentially weighted moving average (Exp-EWMA) control chart that weights the past observations of a process using an exponentiated function. We evaluated the run-length characteristics of the Exp-EWMA chart via Monte Carlo simulations. A comparison study versus the CUSUM, EWMA and extended EWMA (EEWMA) charts under similar in-control (IC) run-length properties demonstrates that the Exp-EWMA chart is more effective for detecting small and, under certain circumstances, moderate shifts for both the zero-state and steady-state cases. Moreover, the Exp-EWMA chart has better zero-state out-of-control (OOC) performance than an EWMA chart with smoothing parameter equal to the limit to the infinity of the exponentiated function, while the two charts perform similarly for the steady-state case. Finally, it is shown that the Exp-EWMA chart is more IC robust than its competitors under several non-normal distributions. Two examples are provided to explain the implementation of the proposed chart

记忆型控制图被广泛用于监控过程参数的小幅至中幅变化。在本文中,我们介绍了一种指数加权指数加权移动平均(Exp-EWMA)控制图,它使用指数函数对过程的过去观测值进行加权。我们通过蒙特卡罗模拟评估了 Exp-EWMA 控制图的运行长度特性。与 CUSUM、EWMA 和扩展 EWMA (EEWMA) 控制图相比,在类似的控制内 (IC) 运行长度特性下进行的比较研究表明,Exp-EWMA 控制图在检测零状态和稳态情况下的微小偏移以及某些情况下的适度偏移方面更为有效。此外,Exp-EWMA 图表的零状态失控(OOC)性能优于平滑参数等于指数化函数无穷大极限的 EWMA 图表,而这两种图表在稳态情况下的性能相似。最后,Exp-EWMA 图表在几种非正态分布情况下的 IC 稳健性优于其他同类图表。本文提供了两个示例来说明所建议图表的实施情况
{"title":"The exponentiated exponentially weighted moving average control chart","authors":"Vasileios Alevizakos, Arpita Chatterjee, Kashinath Chatterjee, Christos Koukouvinos","doi":"10.1007/s00362-024-01544-2","DOIUrl":"https://doi.org/10.1007/s00362-024-01544-2","url":null,"abstract":"<p>Memory-type control charts are widely used for monitoring small to moderate shifts in the process parameter(s). In the present article, we present an exponentiated exponentially weighted moving average (Exp-EWMA) control chart that weights the past observations of a process using an exponentiated function. We evaluated the run-length characteristics of the Exp-EWMA chart via Monte Carlo simulations. A comparison study versus the CUSUM, EWMA and extended EWMA (EEWMA) charts under similar in-control (IC) run-length properties demonstrates that the Exp-EWMA chart is more effective for detecting small and, under certain circumstances, moderate shifts for both the zero-state and steady-state cases. Moreover, the Exp-EWMA chart has better zero-state out-of-control (OOC) performance than an EWMA chart with smoothing parameter equal to the limit to the infinity of the exponentiated function, while the two charts perform similarly for the steady-state case. Finally, it is shown that the Exp-EWMA chart is more IC robust than its competitors under several non-normal distributions. Two examples are provided to explain the implementation of the proposed chart</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"40 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140573634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A sequential feature selection approach to change point detection in mean-shift change point models 均值偏移变化点模型中变化点检测的顺序特征选择方法
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-03 DOI: 10.1007/s00362-024-01548-y

Abstract

Change point detection is an important area of scientific research and has applications in a wide range of fields. In this paper, we propose a sequential change point detection (SCPD) procedure for mean-shift change point models. Unlike classical feature selection based approaches, the SCPD method detects change points in the order of the conditional change sizes and makes full use of the identified change points information. The extended Bayesian information criterion (EBIC) is employed as the stopping rule in the SCPD procedure. We investigate the theoretical property of the procedure and compare its performance with other methods existing in the literature. It is established that the SCPD procedure has the property of detection consistency. Simulation studies and real data analyses demonstrate that the SCPD procedure has the edge over the other methods in terms of detection accuracy and robustness.

摘要 变更点检测是科学研究的一个重要领域,在许多领域都有应用。本文针对均值偏移变化点模型提出了一种序列变化点检测(SCPD)程序。与基于特征选择的经典方法不同,SCPD 方法按照条件变化大小的顺序检测变化点,并充分利用已识别的变化点信息。扩展贝叶斯信息准则(EBIC)被用作 SCPD 程序的停止规则。我们研究了该程序的理论属性,并将其性能与文献中已有的其他方法进行了比较。结果表明,SCPD 程序具有检测一致性的特性。模拟研究和实际数据分析表明,SCPD 程序在检测精度和鲁棒性方面优于其他方法。
{"title":"A sequential feature selection approach to change point detection in mean-shift change point models","authors":"","doi":"10.1007/s00362-024-01548-y","DOIUrl":"https://doi.org/10.1007/s00362-024-01548-y","url":null,"abstract":"<h3>Abstract</h3> <p>Change point detection is an important area of scientific research and has applications in a wide range of fields. In this paper, we propose a sequential change point detection (SCPD) procedure for mean-shift change point models. Unlike classical feature selection based approaches, the SCPD method detects change points in the order of the conditional change sizes and makes full use of the identified change points information. The extended Bayesian information criterion (EBIC) is employed as the stopping rule in the SCPD procedure. We investigate the theoretical property of the procedure and compare its performance with other methods existing in the literature. It is established that the SCPD procedure has the property of detection consistency. Simulation studies and real data analyses demonstrate that the SCPD procedure has the edge over the other methods in terms of detection accuracy and robustness.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"33 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140573739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hypothesis testing for varying coefficient models in tail index regression 尾部指数回归中不同系数模型的假设检验
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-02 DOI: 10.1007/s00362-024-01538-0
Koki Momoki, Takuma Yoshida

This study examines the varying coefficient model in tail index regression. The varying coefficient model is an efficient semiparametric model that avoids the curse of dimensionality when including large covariates in the model. In fact, the varying coefficient model is useful in mean, quantile, and other regressions. The tail index regression is not an exception. However, the varying coefficient model is flexible, but leaner and simpler models are preferred for applications. Therefore, it is important to evaluate whether the estimated coefficient function varies significantly with covariates. If the effect of the non-linearity of the model is weak, the varying coefficient structure is reduced to a simpler model, such as a constant or zero. Accordingly, the hypothesis test for model assessment in the varying coefficient model has been discussed in mean and quantile regression. However, there are no results in tail index regression. In this study, we investigate the asymptotic properties of an estimator and provide a hypothesis testing method for varying coefficient models for tail index regression.

本研究探讨了尾部指数回归中的变化系数模型。变化系数模型是一种高效的半参数模型,当模型中包含大量协变量时,它可以避免维度诅咒。事实上,变化系数模型在均值回归、量化回归和其他回归中都很有用。尾指数回归也不例外。然而,变化系数模型是灵活的,但在应用中更倾向于采用更精简、更简单的模型。因此,评估估计的系数函数是否随协变量的变化而显著变化非常重要。如果模型的非线性影响较弱,则可将变化系数结构简化为更简单的模型,如常数或零。因此,在均值回归和量回归中讨论了变化系数模型中模型评估的假设检验。但是,在尾指数回归中还没有结果。在本研究中,我们研究了估计器的渐近特性,并为尾指数回归的变化系数模型提供了一种假设检验方法。
{"title":"Hypothesis testing for varying coefficient models in tail index regression","authors":"Koki Momoki, Takuma Yoshida","doi":"10.1007/s00362-024-01538-0","DOIUrl":"https://doi.org/10.1007/s00362-024-01538-0","url":null,"abstract":"<p>This study examines the varying coefficient model in tail index regression. The varying coefficient model is an efficient semiparametric model that avoids the curse of dimensionality when including large covariates in the model. In fact, the varying coefficient model is useful in mean, quantile, and other regressions. The tail index regression is not an exception. However, the varying coefficient model is flexible, but leaner and simpler models are preferred for applications. Therefore, it is important to evaluate whether the estimated coefficient function varies significantly with covariates. If the effect of the non-linearity of the model is weak, the varying coefficient structure is reduced to a simpler model, such as a constant or zero. Accordingly, the hypothesis test for model assessment in the varying coefficient model has been discussed in mean and quantile regression. However, there are no results in tail index regression. In this study, we investigate the asymptotic properties of an estimator and provide a hypothesis testing method for varying coefficient models for tail index regression.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"41 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140573805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Minimum contrast for the first-order intensity estimation of spatial and spatio-temporal point processes 空间和时空点过程一阶强度估计的最小对比度
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-03-26 DOI: 10.1007/s00362-024-01541-5
Nicoletta D’Angelo, Giada Adelfio

In this paper, we harness a result in point process theory, specifically the expectation of the weighted K-function, where the weighting is done by the true first-order intensity function. This theoretical result can be employed as an estimation method to derive parameter estimates for a particular model assumed for the data. The underlying motivation is to avoid the difficulties associated with dealing with complex likelihoods in point process models and their maximization. The exploited result makes our method theoretically applicable to any model specification. In this paper, we restrict our study to Poisson models, whose likelihood represents the base for many more complex point process models. In this context, our proposed method can estimate the vector of local parameters that correspond to the points within the analyzed point pattern without introducing any additional complexity compared to the global estimation. We illustrate the method through simulation studies for both purely spatial and spatio-temporal point processes and show complex scenarios based on the Poisson model through the analysis of two real datasets concerning environmental problems.

在本文中,我们利用了点过程理论中的一个结果,特别是加权 K 函数的期望,其中加权是由真实的一阶强度函数完成的。这一理论结果可作为一种估算方法,用于推导为数据假设的特定模型的参数估计。其根本动机在于避免处理点过程模型中复杂似然及其最大化所带来的困难。所利用的结果使我们的方法在理论上适用于任何模型规范。在本文中,我们的研究仅限于泊松模型,而泊松模型的似然是许多更复杂的点过程模型的基础。在这种情况下,我们提出的方法可以估算出与分析点模式中的点相对应的局部参数向量,与全局估算相比,不会带来任何额外的复杂性。我们通过对纯空间点过程和时空点过程的模拟研究来说明该方法,并通过分析两个有关环境问题的真实数据集来展示基于泊松模型的复杂情景。
{"title":"Minimum contrast for the first-order intensity estimation of spatial and spatio-temporal point processes","authors":"Nicoletta D’Angelo, Giada Adelfio","doi":"10.1007/s00362-024-01541-5","DOIUrl":"https://doi.org/10.1007/s00362-024-01541-5","url":null,"abstract":"<p>In this paper, we harness a result in point process theory, specifically the expectation of the weighted <i>K</i>-function, where the weighting is done by the true first-order intensity function. This theoretical result can be employed as an estimation method to derive parameter estimates for a particular model assumed for the data. The underlying motivation is to avoid the difficulties associated with dealing with complex likelihoods in point process models and their maximization. The exploited result makes our method theoretically applicable to any model specification. In this paper, we restrict our study to Poisson models, whose likelihood represents the base for many more complex point process models. In this context, our proposed method can estimate the vector of local parameters that correspond to the points within the analyzed point pattern without introducing any additional complexity compared to the global estimation. We illustrate the method through simulation studies for both purely spatial and spatio-temporal point processes and show complex scenarios based on the Poisson model through the analysis of two real datasets concerning environmental problems.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"20 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140883224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The resampling method via representative points 通过代表点重新取样的方法
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-03-18 DOI: 10.1007/s00362-024-01536-2
Long-Hao Xu, Yinan Li, Kai-Tai Fang

The bootstrap method relies on resampling from the empirical distribution to provide inferences about the population with a distribution F. The empirical distribution serves as an approximation to the population. It is possible, however, to resample from another approximating distribution of F to conduct simulation-based inferences. In this paper, we utilize representative points to form an alternative approximating distribution of F for resampling. The representative points in terms of minimum mean squared error from F have been widely applied to numerical integration, simulation, and the problems of grouping, quantization, and classification. The method of resampling via representative points can be used to estimate the sampling distribution of a statistic of interest. A basic theory for the proposed method is established. We prove the convergence of higher-order moments of the new approximating distribution of F, and establish the consistency of sampling distribution approximation in the cases of the sample mean and sample variance under the Kolmogorov metric and Mallows–Wasserstein metric. Based on some numerical studies, it has been shown that the proposed resampling method improves the nonparametric bootstrap in terms of confidence intervals for mean and variance.

自举法依赖于从经验分布中重新取样来推断具有分布 F 的群体。不过,也可以从 F 的另一个近似分布中重新取样,进行基于模拟的推断。在本文中,我们利用代表点来形成 F 的另一种近似分布,以进行重新采样。从 F 的最小均方误差来看,代表点已被广泛应用于数值积分、模拟以及分组、量化和分类等问题。通过代表点重新取样的方法可用于估计相关统计量的取样分布。我们建立了拟议方法的基本理论。我们证明了 F 的新近似分布的高阶矩的收敛性,并在 Kolmogorov 公制和 Mallows-Wasserstein 公制下建立了样本均值和样本方差情况下抽样分布近似的一致性。基于一些数值研究表明,所提出的重采样方法在均值和方差的置信区间方面改进了非参数引导法。
{"title":"The resampling method via representative points","authors":"Long-Hao Xu, Yinan Li, Kai-Tai Fang","doi":"10.1007/s00362-024-01536-2","DOIUrl":"https://doi.org/10.1007/s00362-024-01536-2","url":null,"abstract":"<p>The bootstrap method relies on resampling from the empirical distribution to provide inferences about the population with a distribution <i>F</i>. The empirical distribution serves as an approximation to the population. It is possible, however, to resample from another approximating distribution of <i>F</i> to conduct simulation-based inferences. In this paper, we utilize representative points to form an alternative approximating distribution of <i>F</i> for resampling. The representative points in terms of minimum mean squared error from <i>F</i> have been widely applied to numerical integration, simulation, and the problems of grouping, quantization, and classification. The method of resampling via representative points can be used to estimate the sampling distribution of a statistic of interest. A basic theory for the proposed method is established. We prove the convergence of higher-order moments of the new approximating distribution of <i>F</i>, and establish the consistency of sampling distribution approximation in the cases of the sample mean and sample variance under the Kolmogorov metric and Mallows–Wasserstein metric. Based on some numerical studies, it has been shown that the proposed resampling method improves the nonparametric bootstrap in terms of confidence intervals for mean and variance.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"84 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140147633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An heuristic scree plot criterion for the number of factors 因素数量的启发式克里图标准
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-03-18 DOI: 10.1007/s00362-023-01517-x

Abstract

Cattel’s (Multivar Behav Res 1:245–276, 1966) heuristic determines the number of factors as the elbow point between ‘steep’ and ‘not steep’ in the scree plot. In contrast, an elbow is by definition absent in points on a hyberbole with corresponding equisized surfaces. We formalize this heuristic and propose a criterion to determine the number of factors by comparing surfaces under the scree plot. Monte Carlo simulations shows that the finite-sample properties of our proposed criterion outperform benchmarks in the dynamic factor model literature.

摘要 卡特尔(Multivar Behav Res 1:245-276,1966 年)的启发式方法将因子数确定为树状图中 "陡峭 "与 "不陡峭 "之间的肘点。与此相反,根据定义,在具有相应等值化表面的小交叉点上不存在肘点。我们将这一启发式方法正式化,并提出了一个标准,通过比较克里图下的表面来确定因子的数量。蒙特卡罗模拟显示,我们提出的标准的有限样本属性优于动态因子模型文献中的基准。
{"title":"An heuristic scree plot criterion for the number of factors","authors":"","doi":"10.1007/s00362-023-01517-x","DOIUrl":"https://doi.org/10.1007/s00362-023-01517-x","url":null,"abstract":"<h3>Abstract</h3> <p>Cattel’s (Multivar Behav Res 1:245–276, 1966) heuristic determines the number of factors as the elbow point between ‘steep’ and ‘not steep’ in the scree plot. In contrast, an elbow is by definition absent in points on a hyberbole with corresponding equisized surfaces. We formalize this heuristic and propose a criterion to determine the number of factors by comparing surfaces under the scree plot. Monte Carlo simulations shows that the finite-sample properties of our proposed criterion outperform benchmarks in the dynamic factor model literature.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"44 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140147725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A semi-orthogonal nonnegative matrix tri-factorization algorithm for overlapping community detection 用于重叠群落检测的半正交非负矩阵三因子化算法
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-03-14 DOI: 10.1007/s00362-024-01537-1
Zhaoyang Li, Yuehan Yang

In this paper, we focus on overlapping community detection and propose an efficient semi-orthogonal nonnegative matrix tri-factorization (semi-ONMTF) algorithm. This method factorizes a matrix X into an orthogonal matrix U, a nonnegative matrix B, and a transposed matrix (U^mathrm {scriptscriptstyle T} ). We use the Cayley Transformation to maintain strict orthogonality of U that each iteration stays on the Stiefel Manifold. This algorithm is computationally efficient because the solutions of U and B are simplified into a matrix-wise update algorithm. Applying this method, we detect overlapping communities by the belonging coefficient vector and analyse associations between communities by the unweighted network of communities. We conduct simulations and applications to show that the proposed method has wide applicability. In a real data example, we apply the semi-ONMTF to a stock data set and construct a directed association network of companies. Based on the modularity for directed and overlapping communities, we obtain five overlapping communities, 17 overlapping nodes, and five outlier nodes in the network. We also discuss the associations between communities, providing insights into the overlapping community detection on the stock market network.

本文的重点是重叠群落检测,并提出了一种高效的半正交非负矩阵三因子化(semi-ONMTF)算法。该方法将矩阵 X 分解为一个正交矩阵 U、一个非负矩阵 B 和一个转置矩阵(U^mathrm {scriptscriptstyle T} )。我们使用凯利变换(Cayley Transformation)来保持 U 的严格正交性,使每次迭代都保持在 Stiefel Manifold 上。这种算法的计算效率很高,因为 U 和 B 的解被简化为矩阵更新算法。应用这种方法,我们可以通过归属系数向量检测重叠群落,并通过非加权群落网络分析群落间的关联。我们通过模拟和应用表明,所提出的方法具有广泛的适用性。在一个真实数据示例中,我们将半ONMTF应用于股票数据集,并构建了公司的有向关联网络。根据有向和重叠群落的模块性,我们得到了网络中的 5 个重叠群落、17 个重叠节点和 5 个离群节点。我们还讨论了社群之间的关联,为在股票市场网络上检测重叠社群提供了启示。
{"title":"A semi-orthogonal nonnegative matrix tri-factorization algorithm for overlapping community detection","authors":"Zhaoyang Li, Yuehan Yang","doi":"10.1007/s00362-024-01537-1","DOIUrl":"https://doi.org/10.1007/s00362-024-01537-1","url":null,"abstract":"<p>In this paper, we focus on overlapping community detection and propose an efficient semi-orthogonal nonnegative matrix tri-factorization (semi-ONMTF) algorithm. This method factorizes a matrix <i>X</i> into an orthogonal matrix <i>U</i>, a nonnegative matrix <i>B</i>, and a transposed matrix <span>(U^mathrm {scriptscriptstyle T} )</span>. We use the Cayley Transformation to maintain strict orthogonality of <i>U</i> that each iteration stays on the Stiefel Manifold. This algorithm is computationally efficient because the solutions of <i>U</i> and <i>B</i> are simplified into a matrix-wise update algorithm. Applying this method, we detect overlapping communities by the belonging coefficient vector and analyse associations between communities by the unweighted network of communities. We conduct simulations and applications to show that the proposed method has wide applicability. In a real data example, we apply the semi-ONMTF to a stock data set and construct a directed association network of companies. Based on the modularity for directed and overlapping communities, we obtain five overlapping communities, 17 overlapping nodes, and five outlier nodes in the network. We also discuss the associations between communities, providing insights into the overlapping community detection on the stock market network.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"395 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140147637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical simulations with LR random fuzzy numbers 使用 LR 随机模糊数进行统计模拟
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-03-08 DOI: 10.1007/s00362-024-01533-5
Abbas Parchami, Przemyslaw Grzegorzewski, Maciej Romaniuk

Computer simulations are a powerful tool in many fields of research. This also applies to the broadly understood analysis of experimental data, which are frequently burdened with multiple imperfections. Often the underlying imprecision or vagueness can be suitably described in terms of fuzzy numbers which enable also the capture of subjectivity. On the other hand, due to the random nature of the experimental data, the tools for their description must take into account their statistical nature. In this way, we come to random fuzzy numbers that model fuzzy data and are also solidly formalized within the probabilistic setting. In this contribution, we introduce the so-called LR random fuzzy numbers that can be used in various Monte-Carlo simulations on fuzzy data. The proposed method of generating fuzzy numbers with membership functions given by probability densities is both simple and rich, well-grounded mathematically, and has a high application potential.

计算机模拟是许多研究领域的有力工具。这同样适用于对实验数据的广义分析,因为实验数据往往存在多种不完善之处。通常情况下,可以用模糊数来适当地描述潜在的不精确性或模糊性,模糊数还可以捕捉主观性。另一方面,由于实验数据的随机性,对其进行描述的工具必须考虑其统计性质。这样,我们就得出了能模拟模糊数据的随机模糊数,并在概率论环境中将其形式化。在本文中,我们介绍了所谓的 LR 随机模糊数,它可用于对模糊数据进行各种蒙特卡洛模拟。所提出的生成模糊数的方法,其成员函数由概率密度给出,既简单又丰富,具有坚实的数学基础,应用潜力很大。
{"title":"Statistical simulations with LR random fuzzy numbers","authors":"Abbas Parchami, Przemyslaw Grzegorzewski, Maciej Romaniuk","doi":"10.1007/s00362-024-01533-5","DOIUrl":"https://doi.org/10.1007/s00362-024-01533-5","url":null,"abstract":"<p>Computer simulations are a powerful tool in many fields of research. This also applies to the broadly understood analysis of experimental data, which are frequently burdened with multiple imperfections. Often the underlying imprecision or vagueness can be suitably described in terms of fuzzy numbers which enable also the capture of subjectivity. On the other hand, due to the random nature of the experimental data, the tools for their description must take into account their statistical nature. In this way, we come to random fuzzy numbers that model fuzzy data and are also solidly formalized within the probabilistic setting. In this contribution, we introduce the so-called LR random fuzzy numbers that can be used in various Monte-Carlo simulations on fuzzy data. The proposed method of generating fuzzy numbers with membership functions given by probability densities is both simple and rich, well-grounded mathematically, and has a high application potential.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"23 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140070462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistical Papers
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1