Pub Date : 2023-11-24DOI: 10.1016/j.jmva.2023.105268
Vincenzo Coia , Harry Joe , Natalia Nolde
Consider a multivariate distribution of , where is a vector of predictor variables and is a response variable. Results are obtained for comparing the conditional and marginal tail indices, and , based on conditional distributions and marginal distribution , respectively. For a multivariate distribution based on a copula, the conditional tail index can be decomposed into a product of copula-based conditional tail indices and the marginal tail index. In some applications, one may want to be non-constant, and some new copula families are derived to facilitate this.
{"title":"Copula-based conditional tail indices","authors":"Vincenzo Coia , Harry Joe , Natalia Nolde","doi":"10.1016/j.jmva.2023.105268","DOIUrl":"10.1016/j.jmva.2023.105268","url":null,"abstract":"<div><p><span>Consider a multivariate distribution of </span><span><math><mrow><mo>(</mo><mi>X</mi><mo>,</mo><mi>Y</mi><mo>)</mo></mrow></math></span>, where <span><math><mi>X</mi></math></span><span> is a vector of predictor variables and </span><span><math><mi>Y</mi></math></span><span> is a response variable. Results are obtained for comparing the conditional and marginal tail indices, </span><span><math><mrow><msub><mrow><mi>ξ</mi></mrow><mrow><mi>Y</mi><mo>|</mo><mi>X</mi></mrow></msub><mrow><mo>(</mo><mi>x</mi><mo>)</mo></mrow></mrow></math></span> and <span><math><msub><mrow><mi>ξ</mi></mrow><mrow><mi>Y</mi></mrow></msub></math></span>, based on conditional distributions <span><math><mrow><mo>{</mo><msub><mrow><mi>F</mi></mrow><mrow><mi>Y</mi><mo>|</mo><mi>X</mi></mrow></msub><mrow><mo>(</mo><mi>⋅</mi><mo>|</mo><mi>x</mi><mo>)</mo></mrow><mo>}</mo></mrow></math></span> and marginal distribution <span><math><msub><mrow><mi>F</mi></mrow><mrow><mi>Y</mi></mrow></msub></math></span><span>, respectively. For a multivariate distribution based on a copula, the conditional tail index can be decomposed into a product of copula-based conditional tail indices and the marginal tail index. In some applications, one may want </span><span><math><mrow><msub><mrow><mi>ξ</mi></mrow><mrow><mi>Y</mi><mo>|</mo><mi>X</mi></mrow></msub><mrow><mo>(</mo><mi>x</mi><mo>)</mo></mrow></mrow></math></span> to be non-constant, and some new copula families are derived to facilitate this.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"201 ","pages":"Article 105268"},"PeriodicalIF":1.6,"publicationDate":"2023-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138503933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-11-23DOI: 10.1016/j.jmva.2023.105259
Sebastian Fuchs, Marco Tschimpke
A novel positive dependence property is introduced, called positive measure inducing (PMI for short), being fulfilled by numerous copula classes, including Gaussian, Student , Fréchet, Farlie–Gumbel–Morgenstern and Frank copulas; it is conjectured that even all positive quadrant dependent Archimedean copulas meet this property. From a geometric viewpoint, a PMI copula concentrates more mass near the main diagonal than in the opposite diagonal. A striking feature of PMI copulas is that they impose an ordering on a certain class of copula-induced measures of concordance, the latter originating in Edwards et al. (2004) and including Spearman’s rho and Gini’s gamma , leading to numerous new inequalities such as . The measures of concordance within this class are estimated using (classical) empirical copulas and the intrinsic construction via empirical checkerboard copulas, and the estimators’ asymptotic behavior is determined. Building upon the presented inequalities, asymptotic tests are constructed having the potential of being used for detecting whether the underlying dependence structure of a given sample is PMI, which in turn can be used for excluding certain copula families from model building. The excellent performance of the tests is demonstrated in a simulation study and by means of a real-data example.
{"title":"A novel positive dependence property and its impact on a popular class of concordance measures","authors":"Sebastian Fuchs, Marco Tschimpke","doi":"10.1016/j.jmva.2023.105259","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105259","url":null,"abstract":"<div><p>A novel positive dependence property is introduced, called positive measure inducing (PMI for short), being fulfilled by numerous copula classes, including Gaussian, Student <span><math><mi>t</mi></math></span>, Fréchet, Farlie–Gumbel–Morgenstern and Frank copulas; it is conjectured that even all positive quadrant dependent Archimedean copulas meet this property. From a geometric viewpoint, a PMI copula concentrates more mass near the main diagonal than in the opposite diagonal. A striking feature of PMI copulas is that they impose an ordering on a certain class of copula-induced measures of concordance, the latter originating in Edwards et al. (2004) and including Spearman’s rho <span><math><mi>ρ</mi></math></span> and Gini’s gamma <span><math><mi>γ</mi></math></span>, leading to numerous new inequalities such as <span><math><mrow><mn>3</mn><mi>γ</mi><mo>≥</mo><mn>2</mn><mi>ρ</mi></mrow></math></span>. The measures of concordance within this class are estimated using (classical) empirical copulas and the intrinsic construction via empirical checkerboard copulas, and the estimators’ asymptotic behavior is determined. Building upon the presented inequalities, asymptotic tests are constructed having the potential of being used for detecting whether the underlying dependence structure of a given sample is PMI, which in turn can be used for excluding certain copula families from model building. The excellent performance of the tests is demonstrated in a simulation study and by means of a real-data example.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"200 ","pages":"Article 105259"},"PeriodicalIF":1.6,"publicationDate":"2023-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X23001057/pdfft?md5=f90554c17c7d483e5a17da19024d99eb&pid=1-s2.0-S0047259X23001057-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138439175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-11-18DOI: 10.1016/j.jmva.2023.105250
Alfredo Alegría , Xavier Emery
Multivariate random fields are commonly used in spatial statistics and natural science to model coregionalized variables. In this context, the matrix-valued covariance function plays a central role in capturing their spatial continuity and interdependence. This study aims to contribute to the literature on covariance modeling by proposing new parametric families of isotropic matrix-valued functions exhibiting non-monotonic behaviors, namely hole effects and cross-dimples. The benefit of the proposed models is shown on a bivariate data set consisting of concentrations of airborne particulate matter.
{"title":"Matrix-valued isotropic covariance functions with local extrema","authors":"Alfredo Alegría , Xavier Emery","doi":"10.1016/j.jmva.2023.105250","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105250","url":null,"abstract":"<div><p>Multivariate random fields are commonly used in spatial statistics<span><span> and natural science to model coregionalized variables. In this context, the matrix-valued covariance function<span> plays a central role in capturing their spatial continuity and interdependence. This study aims to contribute to the literature on covariance modeling by proposing new parametric families of isotropic matrix-valued functions exhibiting non-monotonic behaviors, namely hole effects and cross-dimples. The benefit of the proposed models is shown on a </span></span>bivariate data set consisting of concentrations of airborne particulate matter.</span></p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"200 ","pages":"Article 105250"},"PeriodicalIF":1.6,"publicationDate":"2023-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138396036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-11-14DOI: 10.1016/j.jmva.2023.105251
Kento Egashira , Kazuyoshi Yata , Makoto Aoshima
In this study, three asymptotic behaviors of hierarchical clustering are defined and studied with strict conditions under several asymptotic settings, from large samples to high dimensionality, when having two independent populations. We proceed with the current comprehension of the asymptotic properties of hierarchical clustering in high-dimensional, low-sample-size (HDLSS) settings. For high-dimensional data, the asymptotic properties of hierarchical clustering are demonstrated under mild and practical settings, and we present simulation studies and hierarchical clustering performance discussions. Furthermore, hierarchical clustering was theoretically investigated when both the dimension and sample size approach infinity, and we generalized a latent number of populations considering hierarchical clustering in multiclass HDLSS settings.
{"title":"Asymptotic properties of hierarchical clustering in high-dimensional settings","authors":"Kento Egashira , Kazuyoshi Yata , Makoto Aoshima","doi":"10.1016/j.jmva.2023.105251","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105251","url":null,"abstract":"<div><p>In this study, three asymptotic behaviors of hierarchical clustering are defined and studied with strict conditions under several asymptotic settings, from large samples to high dimensionality, when having two independent populations. We proceed with the current comprehension of the asymptotic properties of hierarchical clustering in high-dimensional, low-sample-size (HDLSS) settings. For high-dimensional data, the asymptotic properties of hierarchical clustering are demonstrated under mild and practical settings, and we present simulation studies and hierarchical clustering performance discussions. Furthermore, hierarchical clustering was theoretically investigated when both the dimension and sample size approach infinity, and we generalized a latent number of populations considering hierarchical clustering in multiclass HDLSS settings.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"199 ","pages":"Article 105251"},"PeriodicalIF":1.6,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X23000970/pdfft?md5=8ddd59ad8fdac0f31ad39835b3a16f61&pid=1-s2.0-S0047259X23000970-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134656717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-11-14DOI: 10.1016/j.jmva.2023.105249
Wenqi Lu , Zhongyi Zhu , Rui Li , Heng Lian
In this paper, we consider high-dimensional quantile tensor regression using a general convex decomposable regularizer and analyze the statistical performances of the estimator. The rates are stated in terms of the intrinsic dimension of the estimation problem, which is, roughly speaking, the dimension of the smallest subspace that contains the true coefficient. Previously, convex regularized tensor regression has been studied with a least squares loss, Gaussian tensorial predictors and Gaussian errors, with rates that depend on the Gaussian width of a convex set. Our results extend the previous work to nonsmooth quantile loss. To deal with the non-Gaussian setting, we use the concept of Rademacher complexity with appropriate concentration inequalities instead of the Gaussian width. For the multi-linear nuclear norm penalty, our Orlicz norm bound for the operator norm of a random matrix may be of independent interest. We validate the theoretical guarantees in numerical experiments. We also demonstrate advantage of quantile regression over mean regression, and compare the performance of convex regularization method and nonconvex decomposition method in solving quantile tensor regression problem in simulation studies.
{"title":"Statistical performance of quantile tensor regression with convex regularization","authors":"Wenqi Lu , Zhongyi Zhu , Rui Li , Heng Lian","doi":"10.1016/j.jmva.2023.105249","DOIUrl":"10.1016/j.jmva.2023.105249","url":null,"abstract":"<div><p><span><span>In this paper, we consider high-dimensional quantile<span> tensor regression using a general convex decomposable regularizer and analyze the statistical performances of the estimator. The rates are stated in terms of the intrinsic dimension of the estimation problem, which is, roughly speaking, the dimension of the smallest subspace that contains the true coefficient. Previously, convex regularized tensor regression has been studied with a least squares loss, Gaussian tensorial predictors and Gaussian errors, with rates that depend on the Gaussian width of a convex set. Our results extend the previous work to nonsmooth quantile loss. To deal with the non-Gaussian setting, we use the concept of </span></span>Rademacher<span><span> complexity with appropriate concentration inequalities instead of the Gaussian width. For the multi-linear nuclear norm penalty, our Orlicz norm bound for the operator norm of a random matrix may be of independent interest. We validate the theoretical guarantees in numerical experiments. We also demonstrate advantage of quantile regression over mean regression, and compare the performance of convex </span>regularization method and nonconvex </span></span>decomposition method in solving quantile tensor regression problem in simulation studies.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"200 ","pages":"Article 105249"},"PeriodicalIF":1.6,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135764061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-11-04DOI: 10.1016/j.jmva.2023.105247
Yijun Zuo
The maximum depth estimator (aka depth median) () induced from regression depth (RD) of Rousseeuw and Hubert (1999) is one of the most prevailing estimators in regression. It possesses outstanding robustness similar to the univariate location counterpart. Indeed, can, asymptotically, resist up to 33% contamination without breakdown, in contrast to the 0% for the traditional (least squares and least absolute deviations) estimators (see Van Aelst and Rousseeuw (2000)). The results from Van Aelst and Rousseeuw (2000) are pioneering, yet they are limited to regression-symmetric populations (with a strictly positive density), the -contamination, maximum-bias model, and in asymptotical sense. With a fixed finite-sample size practice, the most prevailing measure of robustness for estimators is the finite-sample breakdown point (FSBP) (Donoho and Huber, 1983). Despite many attempts made in the literature, only sporadic partial results on FSBP for were obtained whereas an exact FSBP for remained open in the last twenty-plus years. Furthermore, is the asymptotic breakdown value (the limit of an increasing sequence of finite-sample breakdown values) relevant in the finite-sample practice? (Or what is the difference between the finite-sample and the limit breakdown values?). Such discussions are yet to be given in the literature. This article addresses the above issues, revealing an intrinsic connection between the regression depth of and the newly obtained exact FSBP. It justifies the employment of as a robust alternative to the traditional estimators and demonstrates the necessity and the merit of using the FSBP in finite-sample real practice.
{"title":"Non-asymptotic robustness analysis of regression depth median","authors":"Yijun Zuo","doi":"10.1016/j.jmva.2023.105247","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105247","url":null,"abstract":"<div><p>The maximum depth estimator (aka depth median) (<span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span>) induced from regression depth (RD) of Rousseeuw and Hubert (1999) is one of the most prevailing estimators in regression. It possesses outstanding robustness similar to the univariate location counterpart. Indeed, <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> can, asymptotically, resist up to 33% contamination without breakdown, in contrast to the 0% for the traditional (least squares and least absolute deviations) estimators (see Van Aelst and Rousseeuw (2000)). The results from Van Aelst and Rousseeuw (2000) are pioneering, yet they are limited to regression-symmetric populations (with a strictly positive density), the <span><math><mi>ϵ</mi></math></span>-contamination, maximum-bias model, and in asymptotical sense. With a fixed finite-sample size practice, the most prevailing measure of robustness for estimators is the finite-sample breakdown point (FSBP) (Donoho and Huber, 1983). Despite many attempts made in the literature, only sporadic partial results on FSBP for <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> were obtained whereas an exact FSBP for <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> remained open in the last twenty-plus years. Furthermore, is the asymptotic breakdown value <span><math><mrow><mn>1</mn><mo>/</mo><mn>3</mn></mrow></math></span> (the limit of an increasing sequence of finite-sample breakdown values) relevant in the finite-sample practice? (Or what is the difference between the finite-sample and the limit breakdown values?). Such discussions are yet to be given in the literature. This article addresses the above issues, revealing an intrinsic connection between the regression depth of <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> and the newly obtained exact FSBP. It justifies the employment of <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> as a robust alternative to the traditional estimators and demonstrates the necessity and the merit of using the FSBP in finite-sample real practice.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"199 ","pages":"Article 105247"},"PeriodicalIF":1.6,"publicationDate":"2023-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X23000933/pdfft?md5=41b0163d4b47acc16c5399dda63160ea&pid=1-s2.0-S0047259X23000933-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91987809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-11-02DOI: 10.1016/j.jmva.2023.105248
Tsung-I Lin , Wan-Lun Wang
Multivariate normal/independent (MNI) distributions contain many renowned heavy-tailed distributions such as the multivariate , multivariate slash, multivariate contaminated normal, multivariate variance-gamma, and multivariate double exponential distributions. A frequent problem encountered in statistical analysis is the occurrence of truncated observations and non-normality such that theoretical moments are required for the estimation of the truncated multivariate normal/independent (TMNI) distributions. This paper is dedicated to deriving explicit expressions for the moments of the TMNI distributions with supports confined within a hyper-rectangle. A Monte Carlo experiment is undertaken to validate to the correctness of the proposed formulae for five selected members of the TMNI distributions. R scripts and data to reproduce the results are available in the GitHub repository.
{"title":"On moments of truncated multivariate normal/independent distributions","authors":"Tsung-I Lin , Wan-Lun Wang","doi":"10.1016/j.jmva.2023.105248","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105248","url":null,"abstract":"<div><p>Multivariate normal/independent (MNI) distributions contain many renowned heavy-tailed distributions such as the multivariate <span><math><mi>t</mi></math></span>, multivariate slash, multivariate contaminated normal, multivariate variance-gamma, and multivariate double exponential distributions. A frequent problem encountered in statistical analysis is the occurrence of truncated observations and non-normality such that theoretical moments are required for the estimation of the truncated multivariate normal/independent (TMNI) distributions. This paper is dedicated to deriving explicit expressions for the moments of the TMNI distributions with supports confined within a hyper-rectangle. A Monte Carlo experiment is undertaken to validate to the correctness of the proposed formulae for five selected members of the TMNI distributions. <span>R</span> scripts and data to reproduce the results are available in the GitHub repository.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"199 ","pages":"Article 105248"},"PeriodicalIF":1.6,"publicationDate":"2023-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X23000945/pdfft?md5=3cb17094e738a982bd8a526ea82d616f&pid=1-s2.0-S0047259X23000945-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91987808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-10-27DOI: 10.1016/j.jmva.2023.105246
Marléne Baumeister , Marc Ditzhaus , Markus Pauly
Multivariate analysis-of-variance (MANOVA) is a well established tool to examine multivariate endpoints. While classical approaches depend on restrictive assumptions like normality and homogeneity, there is a recent trend to more general and flexible procedures. In this paper, we proceed on this path, but do not follow the typical mean-focused perspective. Instead we consider general quantiles, in particular the median, for a more robust multivariate analysis. The resulting methodology is applicable for all kind of factorial designs and shown to be asymptotically valid. Our theoretical results are complemented by an extensive simulation study for small and moderate sample sizes. An illustrative data analysis is also presented.
{"title":"Quantile-based MANOVA: A new tool for inferring multivariate data in factorial designs","authors":"Marléne Baumeister , Marc Ditzhaus , Markus Pauly","doi":"10.1016/j.jmva.2023.105246","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105246","url":null,"abstract":"<div><p>Multivariate analysis-of-variance (MANOVA) is a well established tool to examine multivariate endpoints. While classical approaches depend on restrictive assumptions like normality and homogeneity, there is a recent trend to more general and flexible procedures. In this paper, we proceed on this path, but do not follow the typical mean-focused perspective. Instead we consider general quantiles, in particular the median, for a more robust multivariate analysis. The resulting methodology is applicable for all kind of factorial designs and shown to be asymptotically valid. Our theoretical results are complemented by an extensive simulation study for small and moderate sample sizes. An illustrative data analysis is also presented.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"199 ","pages":"Article 105246"},"PeriodicalIF":1.6,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X23000921/pdfft?md5=1d25d366a55a9ade017b7d42c7b49a4c&pid=1-s2.0-S0047259X23000921-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91987804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-10-21DOI: 10.1016/j.jmva.2023.105245
Ansgar Steland
This paper studies a flexible approach to analyze high-dimensional nonlinear time series of unconstrained dimension based on linear statistics calculated from spectral average statistics of bilinear forms and nonlinear transformations of lag-window (i.e. band-regularized) spectral density matrix estimators. That class of statistics includes, among others, smoothed periodograms, nonlinear statistics such as coherency, long-run-variance estimators and contrast statistics related to factorial effects as special cases. Especially, we introduce the class of nonlinear spectral averages of the spectral density matrix. Having in mind big data settings, we study a sampling design which includes a sparse sampling scheme. Gaussian approximations with optimal rate are derived for nonlinear time series of growing dimension for these frequency domain statistics and the underlying lag-window (cross-) spectral estimator under non-stationarity. For change-testing (self-standardized) CUSUM statistics are examined. Further, a specific wild bootstrap procedure is proposed to estimate critical values. Simulation studies and an application to SP500 financial returns are provided in a supplement to this paper.
{"title":"Flexible nonlinear inference and change-point testing of high-dimensional spectral density matrices","authors":"Ansgar Steland","doi":"10.1016/j.jmva.2023.105245","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105245","url":null,"abstract":"<div><p>This paper studies a flexible approach to analyze high-dimensional nonlinear time series of unconstrained dimension based on linear statistics calculated from spectral average statistics of bilinear forms and nonlinear transformations of lag-window (i.e. band-regularized) spectral density matrix estimators. That class of statistics includes, among others, smoothed periodograms, nonlinear statistics such as coherency, long-run-variance estimators and contrast statistics related to factorial effects as special cases. Especially, we introduce the class of nonlinear spectral averages of the spectral density matrix. Having in mind big data settings, we study a sampling design which includes a sparse sampling scheme. Gaussian approximations with optimal rate are derived for nonlinear time series of growing dimension for these frequency domain statistics and the underlying lag-window (cross-) spectral estimator under non-stationarity. For change-testing (self-standardized) CUSUM statistics are examined. Further, a specific wild bootstrap procedure is proposed to estimate critical values. Simulation studies and an application to SP500 financial returns are provided in a supplement to this paper.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"199 ","pages":"Article 105245"},"PeriodicalIF":1.6,"publicationDate":"2023-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X2300091X/pdfft?md5=df7e5644d46331b672b17462b8020fb3&pid=1-s2.0-S0047259X2300091X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91987805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-10-19DOI: 10.1016/j.jmva.2023.105244
Matteo Farnè, Angela Montanari
This paper provides a comprehensive estimation framework via nuclear norm plus norm penalization for high-dimensional approximate factor models with a sparse residual covariance. The underlying assumptions allow for non-pervasive latent eigenvalues and a prominent residual covariance pattern. In that context, existing approaches based on principal components may lead to misestimate the latent rank. On the contrary, the proposed optimization strategy recovers with high probability both the covariance matrix components and the latent rank and the residual sparsity pattern. Conditioning on the recovered low rank and sparse matrix varieties, we derive the finite sample covariance matrix estimators with the tightest error bound in minimax sense and we prove that the ensuing estimators of factor loadings and scores via Bartlett’s and Thomson’s methods have the same property. The asymptotic rates for those estimators of factor loadings and scores are also provided.
{"title":"Large factor model estimation by nuclear norm plus ℓ1 norm penalization","authors":"Matteo Farnè, Angela Montanari","doi":"10.1016/j.jmva.2023.105244","DOIUrl":"https://doi.org/10.1016/j.jmva.2023.105244","url":null,"abstract":"<div><p>This paper provides a comprehensive estimation framework via nuclear norm plus <span><math><msub><mrow><mi>ℓ</mi></mrow><mrow><mn>1</mn></mrow></msub></math></span> norm penalization for high-dimensional approximate factor models with a sparse residual covariance. The underlying assumptions allow for non-pervasive latent eigenvalues and a prominent residual covariance pattern. In that context, existing approaches based on principal components may lead to misestimate the latent rank. On the contrary, the proposed optimization strategy recovers with high probability both the covariance matrix components and the latent rank and the residual sparsity pattern. Conditioning on the recovered low rank and sparse matrix varieties, we derive the finite sample covariance matrix estimators with the tightest error bound in minimax sense and we prove that the ensuing estimators of factor loadings and scores via Bartlett’s and Thomson’s methods have the same property. The asymptotic rates for those estimators of factor loadings and scores are also provided.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"199 ","pages":"Article 105244"},"PeriodicalIF":1.6,"publicationDate":"2023-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X23000908/pdfft?md5=728de694d0d649b95d2f5a00e75117a5&pid=1-s2.0-S0047259X23000908-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91987803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}