首页 > 最新文献

Journal of Multivariate Analysis最新文献

英文 中文
Copula-based conditional tail indices 基于copula的条件尾指标
IF 1.6 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-11-24 DOI: 10.1016/j.jmva.2023.105268
Vincenzo Coia , Harry Joe , Natalia Nolde

Consider a multivariate distribution of (X,Y), where X is a vector of predictor variables and Y is a response variable. Results are obtained for comparing the conditional and marginal tail indices, ξY|X(x) and ξY, based on conditional distributions {FY|X(|x)} and marginal distribution FY, respectively. For a multivariate distribution based on a copula, the conditional tail index can be decomposed into a product of copula-based conditional tail indices and the marginal tail index. In some applications, one may want ξY|X(x) to be non-constant, and some new copula families are derived to facilitate this.

考虑(X,Y)的多变量分布,其中X是预测变量的向量,Y是响应变量。分别根据条件分布{FY|X(⋅| X)}和边际分布FY,比较条件尾指数ξY|X(X)和边际尾指数ξY的结果。对于基于copula的多元分布,条件尾指数可以分解为基于copula的条件尾指数与边际尾指数的乘积。在某些应用中,人们可能希望ξY|X(X)是非常数,为了实现这一点,推导了一些新的联结族。
{"title":"Copula-based conditional tail indices","authors":"Vincenzo Coia ,&nbsp;Harry Joe ,&nbsp;Natalia Nolde","doi":"10.1016/j.jmva.2023.105268","DOIUrl":"10.1016/j.jmva.2023.105268","url":null,"abstract":"<div><p><span>Consider a multivariate distribution of </span><span><math><mrow><mo>(</mo><mi>X</mi><mo>,</mo><mi>Y</mi><mo>)</mo></mrow></math></span>, where <span><math><mi>X</mi></math></span><span> is a vector of predictor variables and </span><span><math><mi>Y</mi></math></span><span> is a response variable. Results are obtained for comparing the conditional and marginal tail indices, </span><span><math><mrow><msub><mrow><mi>ξ</mi></mrow><mrow><mi>Y</mi><mo>|</mo><mi>X</mi></mrow></msub><mrow><mo>(</mo><mi>x</mi><mo>)</mo></mrow></mrow></math></span> and <span><math><msub><mrow><mi>ξ</mi></mrow><mrow><mi>Y</mi></mrow></msub></math></span>, based on conditional distributions <span><math><mrow><mo>{</mo><msub><mrow><mi>F</mi></mrow><mrow><mi>Y</mi><mo>|</mo><mi>X</mi></mrow></msub><mrow><mo>(</mo><mi>⋅</mi><mo>|</mo><mi>x</mi><mo>)</mo></mrow><mo>}</mo></mrow></math></span> and marginal distribution <span><math><msub><mrow><mi>F</mi></mrow><mrow><mi>Y</mi></mrow></msub></math></span><span>, respectively. For a multivariate distribution based on a copula, the conditional tail index can be decomposed into a product of copula-based conditional tail indices and the marginal tail index. In some applications, one may want </span><span><math><mrow><msub><mrow><mi>ξ</mi></mrow><mrow><mi>Y</mi><mo>|</mo><mi>X</mi></mrow></msub><mrow><mo>(</mo><mi>x</mi><mo>)</mo></mrow></mrow></math></span> to be non-constant, and some new copula families are derived to facilitate this.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"201 ","pages":"Article 105268"},"PeriodicalIF":1.6,"publicationDate":"2023-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138503933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel positive dependence property and its impact on a popular class of concordance measures 一个新的正相关性质及其对一类流行的一致性测度的影响
IF 1.6 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-11-23 DOI: 10.1016/j.jmva.2023.105259
Sebastian Fuchs, Marco Tschimpke

A novel positive dependence property is introduced, called positive measure inducing (PMI for short), being fulfilled by numerous copula classes, including Gaussian, Student t, Fréchet, Farlie–Gumbel–Morgenstern and Frank copulas; it is conjectured that even all positive quadrant dependent Archimedean copulas meet this property. From a geometric viewpoint, a PMI copula concentrates more mass near the main diagonal than in the opposite diagonal. A striking feature of PMI copulas is that they impose an ordering on a certain class of copula-induced measures of concordance, the latter originating in Edwards et al. (2004) and including Spearman’s rho ρ and Gini’s gamma γ, leading to numerous new inequalities such as 3γ2ρ. The measures of concordance within this class are estimated using (classical) empirical copulas and the intrinsic construction via empirical checkerboard copulas, and the estimators’ asymptotic behavior is determined. Building upon the presented inequalities, asymptotic tests are constructed having the potential of being used for detecting whether the underlying dependence structure of a given sample is PMI, which in turn can be used for excluding certain copula families from model building. The excellent performance of the tests is demonstrated in a simulation study and by means of a real-data example.

引入了一种新的正相关性质,称为正测度诱导(PMI),它被许多联结函数类所满足,包括Gaussian、Student t、fr、Farlie-Gumbel-Morgenstern和Frank联结函数;我们推测,甚至所有正象限相关的阿基米德连都满足这个性质。从几何角度来看,PMI联结体在主对角线附近比在相反对角线上集中更多的质量。PMI copula的一个显著特征是,它们对某一类copula诱导的一致性度量施加了排序,后者起源于Edwards等人(2004),包括Spearman的ρ和Gini的γ γ,导致了许多新的不等式,如3γ≥2ρ。利用(经典)经验copuls和经验棋盘copuls的固有构造估计了该类内的一致性测度,并确定了估计量的渐近性。基于所提出的不等式,构造渐近检验具有用于检测给定样本的潜在依赖结构是否为PMI的潜力,这反过来可用于从模型构建中排除某些联结族。通过仿真研究和实例验证了该方法的良好性能。
{"title":"A novel positive dependence property and its impact on a popular class of concordance measures","authors":"Sebastian Fuchs,&nbsp;Marco Tschimpke","doi":"10.1016/j.jmva.2023.105259","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105259","url":null,"abstract":"<div><p>A novel positive dependence property is introduced, called positive measure inducing (PMI for short), being fulfilled by numerous copula classes, including Gaussian, Student <span><math><mi>t</mi></math></span>, Fréchet, Farlie–Gumbel–Morgenstern and Frank copulas; it is conjectured that even all positive quadrant dependent Archimedean copulas meet this property. From a geometric viewpoint, a PMI copula concentrates more mass near the main diagonal than in the opposite diagonal. A striking feature of PMI copulas is that they impose an ordering on a certain class of copula-induced measures of concordance, the latter originating in Edwards et al. (2004) and including Spearman’s rho <span><math><mi>ρ</mi></math></span> and Gini’s gamma <span><math><mi>γ</mi></math></span>, leading to numerous new inequalities such as <span><math><mrow><mn>3</mn><mi>γ</mi><mo>≥</mo><mn>2</mn><mi>ρ</mi></mrow></math></span>. The measures of concordance within this class are estimated using (classical) empirical copulas and the intrinsic construction via empirical checkerboard copulas, and the estimators’ asymptotic behavior is determined. Building upon the presented inequalities, asymptotic tests are constructed having the potential of being used for detecting whether the underlying dependence structure of a given sample is PMI, which in turn can be used for excluding certain copula families from model building. The excellent performance of the tests is demonstrated in a simulation study and by means of a real-data example.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"200 ","pages":"Article 105259"},"PeriodicalIF":1.6,"publicationDate":"2023-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X23001057/pdfft?md5=f90554c17c7d483e5a17da19024d99eb&pid=1-s2.0-S0047259X23001057-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138439175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Matrix-valued isotropic covariance functions with local extrema 具有局部极值的矩阵值各向同性协方差函数
IF 1.6 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-11-18 DOI: 10.1016/j.jmva.2023.105250
Alfredo Alegría , Xavier Emery

Multivariate random fields are commonly used in spatial statistics and natural science to model coregionalized variables. In this context, the matrix-valued covariance function plays a central role in capturing their spatial continuity and interdependence. This study aims to contribute to the literature on covariance modeling by proposing new parametric families of isotropic matrix-valued functions exhibiting non-monotonic behaviors, namely hole effects and cross-dimples. The benefit of the proposed models is shown on a bivariate data set consisting of concentrations of airborne particulate matter.

多元随机场常用于空间统计和自然科学中对共区域化变量进行建模。在这种情况下,矩阵值协方差函数在捕捉它们的空间连续性和相互依赖性方面起着核心作用。本研究旨在通过提出具有非单调行为(即空穴效应和交叉凹陷)的各向同性矩阵值函数的新参数族,为协方差建模的文献做出贡献。在由空气中颗粒物浓度组成的双变量数据集上显示了所提出模型的效益。
{"title":"Matrix-valued isotropic covariance functions with local extrema","authors":"Alfredo Alegría ,&nbsp;Xavier Emery","doi":"10.1016/j.jmva.2023.105250","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105250","url":null,"abstract":"<div><p>Multivariate random fields are commonly used in spatial statistics<span><span> and natural science to model coregionalized variables. In this context, the matrix-valued covariance function<span> plays a central role in capturing their spatial continuity and interdependence. This study aims to contribute to the literature on covariance modeling by proposing new parametric families of isotropic matrix-valued functions exhibiting non-monotonic behaviors, namely hole effects and cross-dimples. The benefit of the proposed models is shown on a </span></span>bivariate data set consisting of concentrations of airborne particulate matter.</span></p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"200 ","pages":"Article 105250"},"PeriodicalIF":1.6,"publicationDate":"2023-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138396036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Asymptotic properties of hierarchical clustering in high-dimensional settings 高维环境下层次聚类的渐近性质
IF 1.6 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-11-14 DOI: 10.1016/j.jmva.2023.105251
Kento Egashira , Kazuyoshi Yata , Makoto Aoshima

In this study, three asymptotic behaviors of hierarchical clustering are defined and studied with strict conditions under several asymptotic settings, from large samples to high dimensionality, when having two independent populations. We proceed with the current comprehension of the asymptotic properties of hierarchical clustering in high-dimensional, low-sample-size (HDLSS) settings. For high-dimensional data, the asymptotic properties of hierarchical clustering are demonstrated under mild and practical settings, and we present simulation studies and hierarchical clustering performance discussions. Furthermore, hierarchical clustering was theoretically investigated when both the dimension and sample size approach infinity, and we generalized a latent number of populations considering hierarchical clustering in multiclass HDLSS settings.

本文在两个独立总体的情况下,定义并研究了从大样本到高维数的几种渐近设置下,层次聚类的三种渐近行为。我们继续当前的理解在高维,低样本大小(HDLSS)设置的层次聚类的渐近性质。对于高维数据,在温和和实际的环境下证明了层次聚类的渐近特性,并进行了仿真研究和层次聚类性能的讨论。此外,从理论上研究了维数和样本量都趋近于无穷大时的层次聚类,并在多类HDLSS设置中推广了考虑层次聚类的潜在总体数。
{"title":"Asymptotic properties of hierarchical clustering in high-dimensional settings","authors":"Kento Egashira ,&nbsp;Kazuyoshi Yata ,&nbsp;Makoto Aoshima","doi":"10.1016/j.jmva.2023.105251","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105251","url":null,"abstract":"<div><p>In this study, three asymptotic behaviors of hierarchical clustering are defined and studied with strict conditions under several asymptotic settings, from large samples to high dimensionality, when having two independent populations. We proceed with the current comprehension of the asymptotic properties of hierarchical clustering in high-dimensional, low-sample-size (HDLSS) settings. For high-dimensional data, the asymptotic properties of hierarchical clustering are demonstrated under mild and practical settings, and we present simulation studies and hierarchical clustering performance discussions. Furthermore, hierarchical clustering was theoretically investigated when both the dimension and sample size approach infinity, and we generalized a latent number of populations considering hierarchical clustering in multiclass HDLSS settings.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"199 ","pages":"Article 105251"},"PeriodicalIF":1.6,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X23000970/pdfft?md5=8ddd59ad8fdac0f31ad39835b3a16f61&pid=1-s2.0-S0047259X23000970-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134656717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical performance of quantile tensor regression with convex regularization 凸正则化分位数张量回归的统计性能
IF 1.6 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-11-14 DOI: 10.1016/j.jmva.2023.105249
Wenqi Lu , Zhongyi Zhu , Rui Li , Heng Lian

In this paper, we consider high-dimensional quantile tensor regression using a general convex decomposable regularizer and analyze the statistical performances of the estimator. The rates are stated in terms of the intrinsic dimension of the estimation problem, which is, roughly speaking, the dimension of the smallest subspace that contains the true coefficient. Previously, convex regularized tensor regression has been studied with a least squares loss, Gaussian tensorial predictors and Gaussian errors, with rates that depend on the Gaussian width of a convex set. Our results extend the previous work to nonsmooth quantile loss. To deal with the non-Gaussian setting, we use the concept of Rademacher complexity with appropriate concentration inequalities instead of the Gaussian width. For the multi-linear nuclear norm penalty, our Orlicz norm bound for the operator norm of a random matrix may be of independent interest. We validate the theoretical guarantees in numerical experiments. We also demonstrate advantage of quantile regression over mean regression, and compare the performance of convex regularization method and nonconvex decomposition method in solving quantile tensor regression problem in simulation studies.

本文利用一般凸可分解正则化器考虑高维分位数张量回归,并分析了该估计器的统计性能。速率是用估计问题的固有维数来表示的,粗略地说,就是包含真系数的最小子空间的维数。以前,凸正则化张量回归已经研究了最小二乘损失,高斯张量预测和高斯误差,其速率取决于凸集的高斯宽度。我们的结果将以前的工作扩展到非光滑分位数损失。为了处理非高斯设置,我们使用Rademacher复杂度的概念和适当的浓度不等式来代替高斯宽度。对于多线性核范数惩罚,随机矩阵的算子范数的Orlicz范数界可能是独立的。通过数值实验验证了理论保证。在仿真研究中,比较了凸正则化方法和非凸分解方法在求解分位数张量回归问题中的性能。
{"title":"Statistical performance of quantile tensor regression with convex regularization","authors":"Wenqi Lu ,&nbsp;Zhongyi Zhu ,&nbsp;Rui Li ,&nbsp;Heng Lian","doi":"10.1016/j.jmva.2023.105249","DOIUrl":"10.1016/j.jmva.2023.105249","url":null,"abstract":"<div><p><span><span>In this paper, we consider high-dimensional quantile<span> tensor regression using a general convex decomposable regularizer and analyze the statistical performances of the estimator. The rates are stated in terms of the intrinsic dimension of the estimation problem, which is, roughly speaking, the dimension of the smallest subspace that contains the true coefficient. Previously, convex regularized tensor regression has been studied with a least squares loss, Gaussian tensorial predictors and Gaussian errors, with rates that depend on the Gaussian width of a convex set. Our results extend the previous work to nonsmooth quantile loss. To deal with the non-Gaussian setting, we use the concept of </span></span>Rademacher<span><span> complexity with appropriate concentration inequalities instead of the Gaussian width. For the multi-linear nuclear norm penalty, our Orlicz norm bound for the operator norm of a random matrix may be of independent interest. We validate the theoretical guarantees in numerical experiments. We also demonstrate advantage of quantile regression over mean regression, and compare the performance of convex </span>regularization method and nonconvex </span></span>decomposition method in solving quantile tensor regression problem in simulation studies.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"200 ","pages":"Article 105249"},"PeriodicalIF":1.6,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135764061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Non-asymptotic robustness analysis of regression depth median 回归深度中位数的非渐近稳健性分析
IF 1.6 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-11-04 DOI: 10.1016/j.jmva.2023.105247
Yijun Zuo

The maximum depth estimator (aka depth median) (βRD) induced from regression depth (RD) of Rousseeuw and Hubert (1999) is one of the most prevailing estimators in regression. It possesses outstanding robustness similar to the univariate location counterpart. Indeed, βRD can, asymptotically, resist up to 33% contamination without breakdown, in contrast to the 0% for the traditional (least squares and least absolute deviations) estimators (see Van Aelst and Rousseeuw (2000)). The results from Van Aelst and Rousseeuw (2000) are pioneering, yet they are limited to regression-symmetric populations (with a strictly positive density), the ϵ-contamination, maximum-bias model, and in asymptotical sense. With a fixed finite-sample size practice, the most prevailing measure of robustness for estimators is the finite-sample breakdown point (FSBP) (Donoho and Huber, 1983). Despite many attempts made in the literature, only sporadic partial results on FSBP for βRD were obtained whereas an exact FSBP for βRD remained open in the last twenty-plus years. Furthermore, is the asymptotic breakdown value 1/3 (the limit of an increasing sequence of finite-sample breakdown values) relevant in the finite-sample practice? (Or what is the difference between the finite-sample and the limit breakdown values?). Such discussions are yet to be given in the literature. This article addresses the above issues, revealing an intrinsic connection between the regression depth of βRD and the newly obtained exact FSBP. It justifies the employment of βRD as a robust alternative to the traditional estimators and demonstrates the necessity and the merit of using the FSBP in finite-sample real practice.

由Rousseeuw和Hubert(1999)的回归深度(RD)导出的最大深度估计量(又称深度中位数)(βRD *)是回归中最流行的估计量之一。它具有与单变量定位对应物相似的出色鲁棒性。事实上,βRD *可以渐近地抵抗高达33%的污染而不破裂,而传统的(最小二乘和最小绝对偏差)估计器则为0%(见Van Aelst和Rousseeuw(2000))。Van Aelst和Rousseeuw(2000)的结果是开创性的,但它们仅限于回归对称种群(具有严格的正密度),ϵ-contamination,最大偏差模型和渐近意义。对于固定的有限样本大小的实践,对于估计器来说,最普遍的鲁棒性度量是有限样本击穿点(FSBP) (Donoho和Huber, 1983)。尽管在文献中做了许多尝试,但在βRD *的FSBP上只获得了零星的部分结果,而βRD *的确切FSBP在过去的20多年里仍然是开放的。此外,在有限样本实践中,渐近击穿值1/3(有限样本击穿值递增序列的极限)是否相关?(或者有限样本和极限击穿值之间有什么区别?)这样的讨论还没有在文献中给出。本文解决了上述问题,揭示了βRD *的回归深度与新获得的精确FSBP之间的内在联系。它证明了βRD *作为传统估计器的鲁棒替代,并证明了在有限样本实际实践中使用FSBP的必要性和优点。
{"title":"Non-asymptotic robustness analysis of regression depth median","authors":"Yijun Zuo","doi":"10.1016/j.jmva.2023.105247","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105247","url":null,"abstract":"<div><p>The maximum depth estimator (aka depth median) (<span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span>) induced from regression depth (RD) of Rousseeuw and Hubert (1999) is one of the most prevailing estimators in regression. It possesses outstanding robustness similar to the univariate location counterpart. Indeed, <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> can, asymptotically, resist up to 33% contamination without breakdown, in contrast to the 0% for the traditional (least squares and least absolute deviations) estimators (see Van Aelst and Rousseeuw (2000)). The results from Van Aelst and Rousseeuw (2000) are pioneering, yet they are limited to regression-symmetric populations (with a strictly positive density), the <span><math><mi>ϵ</mi></math></span>-contamination, maximum-bias model, and in asymptotical sense. With a fixed finite-sample size practice, the most prevailing measure of robustness for estimators is the finite-sample breakdown point (FSBP) (Donoho and Huber, 1983). Despite many attempts made in the literature, only sporadic partial results on FSBP for <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> were obtained whereas an exact FSBP for <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> remained open in the last twenty-plus years. Furthermore, is the asymptotic breakdown value <span><math><mrow><mn>1</mn><mo>/</mo><mn>3</mn></mrow></math></span> (the limit of an increasing sequence of finite-sample breakdown values) relevant in the finite-sample practice? (Or what is the difference between the finite-sample and the limit breakdown values?). Such discussions are yet to be given in the literature. This article addresses the above issues, revealing an intrinsic connection between the regression depth of <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> and the newly obtained exact FSBP. It justifies the employment of <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> as a robust alternative to the traditional estimators and demonstrates the necessity and the merit of using the FSBP in finite-sample real practice.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"199 ","pages":"Article 105247"},"PeriodicalIF":1.6,"publicationDate":"2023-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X23000933/pdfft?md5=41b0163d4b47acc16c5399dda63160ea&pid=1-s2.0-S0047259X23000933-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91987809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
On moments of truncated multivariate normal/independent distributions 关于截断多元正态/独立分布的矩
IF 1.6 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-11-02 DOI: 10.1016/j.jmva.2023.105248
Tsung-I Lin , Wan-Lun Wang

Multivariate normal/independent (MNI) distributions contain many renowned heavy-tailed distributions such as the multivariate t, multivariate slash, multivariate contaminated normal, multivariate variance-gamma, and multivariate double exponential distributions. A frequent problem encountered in statistical analysis is the occurrence of truncated observations and non-normality such that theoretical moments are required for the estimation of the truncated multivariate normal/independent (TMNI) distributions. This paper is dedicated to deriving explicit expressions for the moments of the TMNI distributions with supports confined within a hyper-rectangle. A Monte Carlo experiment is undertaken to validate to the correctness of the proposed formulae for five selected members of the TMNI distributions. R scripts and data to reproduce the results are available in the GitHub repository.

多变量正态/独立(MNI)分布包含许多著名的重尾分布,如多变量t分布、多变量斜线分布、多变量污染正态分布、多变量方差-伽马分布和多变量双指数分布。统计分析中经常遇到的一个问题是出现截断观测值和非正态性,因此需要理论矩来估计截断多元正态/独立(TMNI)分布。本文致力于推导出在超矩形内支承的TMNI分布的矩的显式表达式。通过蒙特卡罗实验,对五个选定的TMNI分布进行了验证。在GitHub存储库中可以获得用于复制结果的R脚本和数据。
{"title":"On moments of truncated multivariate normal/independent distributions","authors":"Tsung-I Lin ,&nbsp;Wan-Lun Wang","doi":"10.1016/j.jmva.2023.105248","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105248","url":null,"abstract":"<div><p>Multivariate normal/independent (MNI) distributions contain many renowned heavy-tailed distributions such as the multivariate <span><math><mi>t</mi></math></span>, multivariate slash, multivariate contaminated normal, multivariate variance-gamma, and multivariate double exponential distributions. A frequent problem encountered in statistical analysis is the occurrence of truncated observations and non-normality such that theoretical moments are required for the estimation of the truncated multivariate normal/independent (TMNI) distributions. This paper is dedicated to deriving explicit expressions for the moments of the TMNI distributions with supports confined within a hyper-rectangle. A Monte Carlo experiment is undertaken to validate to the correctness of the proposed formulae for five selected members of the TMNI distributions. <span>R</span> scripts and data to reproduce the results are available in the GitHub repository.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"199 ","pages":"Article 105248"},"PeriodicalIF":1.6,"publicationDate":"2023-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X23000945/pdfft?md5=3cb17094e738a982bd8a526ea82d616f&pid=1-s2.0-S0047259X23000945-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91987808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantile-based MANOVA: A new tool for inferring multivariate data in factorial designs 基于分位数的方差分析:在析因设计中推断多变量数据的新工具
IF 1.6 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-10-27 DOI: 10.1016/j.jmva.2023.105246
Marléne Baumeister , Marc Ditzhaus , Markus Pauly

Multivariate analysis-of-variance (MANOVA) is a well established tool to examine multivariate endpoints. While classical approaches depend on restrictive assumptions like normality and homogeneity, there is a recent trend to more general and flexible procedures. In this paper, we proceed on this path, but do not follow the typical mean-focused perspective. Instead we consider general quantiles, in particular the median, for a more robust multivariate analysis. The resulting methodology is applicable for all kind of factorial designs and shown to be asymptotically valid. Our theoretical results are complemented by an extensive simulation study for small and moderate sample sizes. An illustrative data analysis is also presented.

多变量方差分析(MANOVA)是检验多变量终点的成熟工具。虽然经典方法依赖于限制性假设,如正态性和同质性,但最近的趋势是更通用和灵活的程序。在本文中,我们沿着这条道路前进,但不遵循典型的以均值为中心的观点。相反,我们考虑一般分位数,特别是中位数,以进行更稳健的多变量分析。所得到的方法适用于所有类型的析因设计,并证明是渐近有效的。我们的理论结果是补充了广泛的模拟研究小和中等样本量。并给出了一个说明性的数据分析。
{"title":"Quantile-based MANOVA: A new tool for inferring multivariate data in factorial designs","authors":"Marléne Baumeister ,&nbsp;Marc Ditzhaus ,&nbsp;Markus Pauly","doi":"10.1016/j.jmva.2023.105246","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105246","url":null,"abstract":"<div><p>Multivariate analysis-of-variance (MANOVA) is a well established tool to examine multivariate endpoints. While classical approaches depend on restrictive assumptions like normality and homogeneity, there is a recent trend to more general and flexible procedures. In this paper, we proceed on this path, but do not follow the typical mean-focused perspective. Instead we consider general quantiles, in particular the median, for a more robust multivariate analysis. The resulting methodology is applicable for all kind of factorial designs and shown to be asymptotically valid. Our theoretical results are complemented by an extensive simulation study for small and moderate sample sizes. An illustrative data analysis is also presented.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"199 ","pages":"Article 105246"},"PeriodicalIF":1.6,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X23000921/pdfft?md5=1d25d366a55a9ade017b7d42c7b49a4c&pid=1-s2.0-S0047259X23000921-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91987804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Flexible nonlinear inference and change-point testing of high-dimensional spectral density matrices 高维谱密度矩阵的柔性非线性推理与变点测试
IF 1.6 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-10-21 DOI: 10.1016/j.jmva.2023.105245
Ansgar Steland

This paper studies a flexible approach to analyze high-dimensional nonlinear time series of unconstrained dimension based on linear statistics calculated from spectral average statistics of bilinear forms and nonlinear transformations of lag-window (i.e. band-regularized) spectral density matrix estimators. That class of statistics includes, among others, smoothed periodograms, nonlinear statistics such as coherency, long-run-variance estimators and contrast statistics related to factorial effects as special cases. Especially, we introduce the class of nonlinear spectral averages of the spectral density matrix. Having in mind big data settings, we study a sampling design which includes a sparse sampling scheme. Gaussian approximations with optimal rate are derived for nonlinear time series of growing dimension for these frequency domain statistics and the underlying lag-window (cross-) spectral estimator under non-stationarity. For change-testing (self-standardized) CUSUM statistics are examined. Further, a specific wild bootstrap procedure is proposed to estimate critical values. Simulation studies and an application to SP500 financial returns are provided in a supplement to this paper.

本文研究了一种基于双线性形式的谱平均统计量计算的线性统计量和滞后窗(即带正则化)谱密度矩阵估计量的非线性变换来分析无约束维高维非线性时间序列的灵活方法。这类统计包括平滑周期图、非线性统计(如相干性)、长期运行方差估计和与因子效应相关的对比统计(作为特殊情况)。特别地,我们引入了谱密度矩阵的一类非线性谱平均。考虑到大数据环境,我们研究了一种包括稀疏采样方案的采样设计。针对这些频域统计量和非平稳下的滞后窗(交叉)谱估计量,导出了具有最优速率的非线性时间序列高斯逼近。对于变更测试(自我标准化),将检查CUSUM统计数据。进一步,提出了一种特定的野自举方法来估计临界值。本文的补充部分提供了对标准普尔500指数财务回报的模拟研究和应用。
{"title":"Flexible nonlinear inference and change-point testing of high-dimensional spectral density matrices","authors":"Ansgar Steland","doi":"10.1016/j.jmva.2023.105245","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105245","url":null,"abstract":"<div><p>This paper studies a flexible approach to analyze high-dimensional nonlinear time series of unconstrained dimension based on linear statistics calculated from spectral average statistics of bilinear forms and nonlinear transformations of lag-window (i.e. band-regularized) spectral density matrix estimators. That class of statistics includes, among others, smoothed periodograms, nonlinear statistics such as coherency, long-run-variance estimators and contrast statistics related to factorial effects as special cases. Especially, we introduce the class of nonlinear spectral averages of the spectral density matrix. Having in mind big data settings, we study a sampling design which includes a sparse sampling scheme. Gaussian approximations with optimal rate are derived for nonlinear time series of growing dimension for these frequency domain statistics and the underlying lag-window (cross-) spectral estimator under non-stationarity. For change-testing (self-standardized) CUSUM statistics are examined. Further, a specific wild bootstrap procedure is proposed to estimate critical values. Simulation studies and an application to SP500 financial returns are provided in a supplement to this paper.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"199 ","pages":"Article 105245"},"PeriodicalIF":1.6,"publicationDate":"2023-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X2300091X/pdfft?md5=df7e5644d46331b672b17462b8020fb3&pid=1-s2.0-S0047259X2300091X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91987805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large factor model estimation by nuclear norm plus ℓ1 norm penalization 核范数加1范数惩罚的大因子模型估计
IF 1.6 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-10-19 DOI: 10.1016/j.jmva.2023.105244
Matteo Farnè, Angela Montanari

This paper provides a comprehensive estimation framework via nuclear norm plus 1 norm penalization for high-dimensional approximate factor models with a sparse residual covariance. The underlying assumptions allow for non-pervasive latent eigenvalues and a prominent residual covariance pattern. In that context, existing approaches based on principal components may lead to misestimate the latent rank. On the contrary, the proposed optimization strategy recovers with high probability both the covariance matrix components and the latent rank and the residual sparsity pattern. Conditioning on the recovered low rank and sparse matrix varieties, we derive the finite sample covariance matrix estimators with the tightest error bound in minimax sense and we prove that the ensuing estimators of factor loadings and scores via Bartlett’s and Thomson’s methods have the same property. The asymptotic rates for those estimators of factor loadings and scores are also provided.

针对残差稀疏的高维近似因子模型,提出了一种核范数加1范数惩罚的综合估计框架。基本假设允许非普遍的潜在特征值和突出的残差协方差模式。在这种情况下,现有的基于主成分的方法可能导致对潜在秩的错误估计。相反,所提出的优化策略可以高概率地恢复协方差矩阵分量以及潜在秩和残差稀疏度模式。在恢复的低秩和稀疏矩阵变异的条件下,我们导出了误差界在极小极大意义上最紧的有限样本协方差矩阵估计量,并证明了随后的因子负荷和分数的Bartlett和Thomson方法估计量具有相同的性质。对这些因子负荷和分数的估计也给出了渐近率。
{"title":"Large factor model estimation by nuclear norm plus ℓ1 norm penalization","authors":"Matteo Farnè,&nbsp;Angela Montanari","doi":"10.1016/j.jmva.2023.105244","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105244","url":null,"abstract":"<div><p>This paper provides a comprehensive estimation framework via nuclear norm plus <span><math><msub><mrow><mi>ℓ</mi></mrow><mrow><mn>1</mn></mrow></msub></math></span> norm penalization for high-dimensional approximate factor models with a sparse residual covariance. The underlying assumptions allow for non-pervasive latent eigenvalues and a prominent residual covariance pattern. In that context, existing approaches based on principal components may lead to misestimate the latent rank. On the contrary, the proposed optimization strategy recovers with high probability both the covariance matrix components and the latent rank and the residual sparsity pattern. Conditioning on the recovered low rank and sparse matrix varieties, we derive the finite sample covariance matrix estimators with the tightest error bound in minimax sense and we prove that the ensuing estimators of factor loadings and scores via Bartlett’s and Thomson’s methods have the same property. The asymptotic rates for those estimators of factor loadings and scores are also provided.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"199 ","pages":"Article 105244"},"PeriodicalIF":1.6,"publicationDate":"2023-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X23000908/pdfft?md5=728de694d0d649b95d2f5a00e75117a5&pid=1-s2.0-S0047259X23000908-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91987803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Multivariate Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1